Promoting Generalized Cross-lingual Question Answering in Few-resource Scenarios via Self-knowledge Distillation

Casimiro Pio Carrino; Carlos Escolano; Luis Gascó; José Adrián Rodríguez Fonollosa

Promoting Generalized Cross-lingual Question Answering in Few-resource Scenarios via Self-knowledge Distillation

Autores: Casimiro Pio Carrino, Carlos Escolano, Luis Gascó, José Adrián Rodríguez Fonollosa
Localización: Procesamiento del lenguaje natural, ISSN 1135-5948, Nº. 75, 2025 (Ejemplar dedicado a: Procesamiento del Lenguaje Natural, Revista nº 75, septiembre de 2025), págs. 65-82
Idioma: inglés
Títulos paralelos:
- Fomentando la Respuesta a Preguntas Cross-Lingüística Generalizada en Escenarios con Pocos Recursos mediante Auto-Destilación del Conocimiento
Enlaces
- Texto completo
Resumen
- español
  Abordamos el desafío de la Transferencia Translingüística Generalizada (G-XLT) en Respuesta a Preguntas extractiva, donde los idiomas de la pregunta y el contexto difieren, un problema particularmente difícil para idiomas con pocos recursos. Trabajando con solo mil muestras paralelas de QA, combinamos el muestreo translingüístico con la autodestilación de conocimiento para regularizar el ajuste fino translingüístico. Introducimos el novedoso coeficiente de Precisión media en k (mAP@k), que mitiga el impacto negativo de predicciones incorrectas durante el entrenamiento y sirve como herramienta de diagnostico que proporciona orientación temprana en el entrenamiento e indicadores confiables del aprendizaje del modelo. Las evaluaciones en los conjuntos de datos MLQA, XQuAD y TyDiQA-GoldP demuestran que nuestro enfoque supera consistentemente el ajuste fino de entropía cruzada estándar del modelo multilingüe mBERT. Nuestro método representa una alternativa prometedora a los enfoques basados en traducción automática, particularmente valiosa para idiomas con pocos recursos donde la calidad de traducción es deficiente, ofreciendo una solución eficiente para la transferencia translingüística en entornos con escasez de datos.
- English
  We address the challenge of Generalized Cross-Lingual Transfer (G-XLT) in extractive Question Answering, where question and context languages differ, a problem particularly difficult for low-resource languages. Working with only a thousand parallel QA samples, we combine cross-lingual sampling with self-knowledge distillation to regularize cross-lingual fine-tuning. We introduce the novel mean Average Precision at k (mAP@k) coefficient, which mitigates the negative impact of incorrect predictions during training and serves as a diagnostic tool providing early training guidance and reliable indicators of model learning. Evaluations on MLQA, XQuAD, and TyDiQA-GoldP datasets demonstrate that our approach consistently outperforms standard cross-entropy fine-tuning of the mBERT multilingual model. Our method represents a promising alternative to machine translation-based approaches, particularly valuable for low-resource languages where translation quality is poor, offering an efficient solution for cross-lingual transfer in data-scarce settings.
Referencias bibliográficas
- Agrawal, P., C. Alberti, F. Huot, J. Maynez, J. Ma, S. Ruder, K. Ganchev, D. Das, and M. Lapata. 2023. QAmeleon: Multilingual QA with only...
- Ahuja, K., H. Diddee, R. Hada, M. Ochieng, K. Ramesh, P. Jain, A. Nambi, T. Ganu, S. Segal, M. Ahmed, K. Bali, and S. Sitaram. 2023. MEGA:...
- Ahuja, S., D. Aggarwal, V. Gumma, I.Watts, A. Sathe, M. Ochieng, R. Hada, P. Jain, M. Ahmed, K. Bali, and S. Sitaram. 2024. MEGAVERSE: Benchmarking...
- Artetxe, M., G. Labaka, and E. Agirre. 2020. Translation artifacts in cross-lingual transfer learning. In Proceedings of the 2020 Conference...
- Artetxe, M., S. Ruder, and D. Yogatama. 2020. On the Cross-lingual Transferability of Monolingual Representations. In Proceedings of the 58th...
- Asai, A., J. Kasai, J. Clark, K. Lee, E. Choi, and H. Hajishirzi. 2021. XOR QA: Crosslingual Open-Retrieval Question Answering. In Proceedings...
- Bornea, M., L. Pan, S. Rosenthal, H. Florian, and A. Sil. 2021. Multilingual Transfer Learning for QA using Translation as Data Augmentation....
- Chen, N., L. Shou, M. Gong, and J. Pei. 2022. From Good to Best: Two-Stage Training for Cross-Lingual Machine Reading Comprehension. Proceedings...
- Clark, J. H., E. Choi, M. Collins, D. Garrette, T. Kwiatkowski, V. Nikolaev, and J. Palomaki. 2020. TyDi QA: A Benchmark for Information-Seeking...
- Conneau, A., K. Khandelwal, N. Goyal, V. Chaudhary, G. Wenzek, F. Guzmán, E. Grave, M. Ott, L. Zettlemoyer, and V. Stoyanov. 2020. Unsupervised...
- Conneau, A. and G. Lample. 2019. Crosslingual Language Model Pretraining. In H. Wallach, H. Larochelle, A. Beygelzimer, F. d. Alché-Buc, E....
- Costa-jussà, M. R., J. Cross, O. C¸ elebi, M. Elbayad, K. Heafield, K. Heffernan, E. Kalbassi, J. Lam, D. Licht, J. Maillard, A. Sun, S. Wang,...
- Cui, Y., W. Che, T. Liu, B. Qin, S. Wang, and G. Hu. 2019. Cross-Lingual Machine Reading Comprehension. In Proceedings of the 2019 Conference...
- Devlin, J., M.-W. Chang, K. Lee, and K. Toutanova. 2019a. BERT: Pretraining of Deep Bidirectional Transformers for Language Understanding....
- Devlin, J., M. Chang, K. Lee, and K. Toutanova. 2019b. BERT: pretraining of deep bidirectional transformers for language understanding. In...
- Dror, R., G. Baumer, S. Shlomov, and R. Reichart. 2018. The hitchhiker’s guide to testing statistical significance in natural language processing....
- Duan, Z., X. Li, Z. Zhang, Z. Li, N. Liu, and J. Wang. 2021. Bridging the Language Gap: Knowledge Injected Multilingual Question Answering....
- Furlanello, T., Z. Lipton, M. Tschannen, L. Itti, and A. Anandkumar. 2018. Born again neural networks. In J. Dy and A. Krause, editors, Proceedings...
- Hahn, S. and H. Choi. 2019. Self-knowledge distillation in natural language processing. In R. Mitkov and G. Angelova, editors, Proceedings...
- Hsu, T.-Y., C.-L. Liu, and H.-y. Lee. 2019. Zero-shot Reading Comprehension by Cross-lingual Transfer Learning with Multi-lingual Language...
- Hu, J., S. Ruder, A. Siddhant, G. Neubig, O. Firat, and M. Johnson. 2020. XTREME: A massively multilingual multi-task benchmark for evaluating...
- Huang, H., Y. Liang, N. Duan, M. Gong, L. Shou, D. Jiang, and M. Zhou. 2019. Unicoder: A Universal Language Encoder by Pre-training with Multiple...
- Huang, H., T. Tang, D. Zhang, X. Zhao, T. Song, Y. Xia, and F. Wei. 2023. Not all languages are created equal in LLMs: Improving multilingual...
- Kingma, D. and J. Ba. 2014. Adam: A method for stochastic optimization. International Conference on Learning Representations, 12.
- Lauscher, A., V. Ravishankar, I. Vulic, and G. Glavas. 2020. From Zero to Hero: On the Limitations of Zero-Shot Language Transfer with Multilingual...
- Lee, C.-H. and H.-Y. Lee. 2019. Cross-Lingual Transfer Learning for Question Answering, July. arXiv:1907.06042 [cs].
- Lewis, P., B. Oguz, R. Rinott, S. Riedel, and H. Schwenk. 2020. MLQA: Evaluating Cross-lingual Extractive Question Answering. In Proceedings...
- Liu, J., Y. Lin, Z. Liu, and M. Sun. 2019. XQA: A cross-lingual open-domain question answering dataset. In Proceedings of the 57th Annual...
- Liu, J., L. Shou, J. Pei, M. Gong, M. Yang, and D. Jiang. 2020. Cross-lingual Machine Reading Comprehension with Language Branch Knowledge...
- Longpre, S., Y. Lu, and J. Daiber. 2021. MKQA: A Linguistically Diverse Benchmark for Multilingual Open Domain Question Answering. Transactions...
- Nooralahzadeh, F., G. Bekoulis, J. Bjerva, and I. Augenstein. 2020. Zero-Shot Cross-Lingual Transfer with Meta Learning. In Proceedings of...
- Rajpurkar, P., J. Zhang, K. Lopyrev, and P. Liang. 2016. SQuAD: 100,000+ questions for machine comprehension of text. In Proceedings of...
- Riabi, A., T. Scialom, R. Keraron, B. Sagot, D. Seddah, and J. Staiano. 2021. Synthetic data augmentation for zero-shot cross-lingual question...
- Roy, U., N. Constant, R. Al-Rfou’, A. Barua, A. Phillips, and Y. Yang. 2020. Lareqa: Language-agnostic answer retrieval from a multilingual...
- Singh, J., B. McCann, N. S. Keskar, C. Xiong, and R. Socher. 2019. XLDA: Cross- Lingual Data Augmentation for Natural Language Inference and...
- Wolf, T., L. Debut, V. Sanh, J. Chaumond, C. Delangue, A. Moi, P. Cistac, T. Rault, R. Louf, M. Funtowicz, J. Davison, S. Shleifer, P. von...
- Xue, L., N. Constant, A. Roberts, M. Kale, R. Al-Rfou, A. Siddhant, A. Barua, and C. Raffel. 2021. mT5: A massively multilingual pre-trained...
- Yang, C., L. Xie, S. Qiao, and A. L. Yuille. 2019. Training deep neural networks in generations: a more tolerant teacher educates better students....
- Yuan, F., L. Shou, X. Bai, M. Gong, Y. Liang, N. Duan, Y. Fu, and D. Jiang. 2020. Enhancing Answer Boundary Detection for Multilingual Machine...
- Zheng, B., L. Dong, S. Huang, W. Wang, Z. Chi, S. Singhal, W. Che, T. Liu, X. Song, and F. Wei. 2021. Consistency regularization for cross-lingual...

Mi Hispadoc

Selección

Opciones de artículo

Seleccionado

Opciones de compartir

Opciones de entorno

Sugerencia / Errata

Acceso de usuarios registrados

Promoting Generalized Cross-lingual Question Answering in Few-resource Scenarios via Self-knowledge Distillation

Mi Hispadoc

Opciones de artículo

Opciones de compartir

Opciones de entorno