Using Guarani Verbal Morphology on Guarani-Spanish Machine Translation Experiments

Yanina Borges; Florencia Mercant; Luis Chiruzzo

Authors: Yanina Borges, Florencia Mercant, Luis Chiruzzo
Published in: Procesamiento del Lenguaje Natural, ISSN 1135-5948, No. 66, 2021, pp. 89-98
Language: English
Parallel titles:
- Uso de la Morfología Verbal del Guaraní en Experimentos de Traducción Automática Guaraní-Español
Abstract
  This paper presents the results of a project to build computational tools and resources for processing Guarani, a language that remains under-researched in the NLP community. We developed a baseline machine translation system for the Guarani-Spanish pair, and then ran a series of experiments that tried to improve its quality using morphological information. We focus on the analysis of verbs, the most complex part of speech in Guarani. We report the results of the tools implemented for verb analysis and detection in Guarani, as well as the machine translation experiments carried out on different versions of the corpus augmented with morphological features.
