In this Section you will find some lexical resources that we have built for research purposes. Feel free to use any of them for your own research and make sure, if so, to cite the adequate paper.

  • Dictionary of common slang, emoticons and abbreviations used in Social Media. This dictionary contains slag used in English and some Spanish variations. (Download here!).
    If you use this resource, please make sure to cite the following paper:
    [Importancia del lenguaje coloquial y de los símbolos de puntuación en el perfilado de autores. Diana M. Sepúlveda-Barrera, Daniel Martínez-Espino, Esaú Villatoro-Tello and Gabriela Ramírez-De-La-Rosa. In Journal of Research in Computing Science. May 2016.]
  • Dialogue corpora between Mexican teenagers. We collected a total of 304,399 pairs of question-answer examples from a popular social network among Mexican teenagers called ASK. This corpus was used to train and generate an automatic dialogue system. (Download here!). Also, if you want to try our chatbot, the GitHub project is available here.
    If you use this resource, please make sure to cite the following paper:
    [From dialogue corpora to dialogue systems: Generating a chatbot with teenager personality for preventing cyber-pedophilia. Ángel Callejas-Rodríguez, Esaú Villatoro-Tello, Ivan Meza, Gabriela Ramírez-de-la-Rosa,. In LNAI, Proceedings of the 19th International Conference on Text, Speech and Dialogue TSD 2016.]