Natural Language Processing Faculty Publications

TARJAMAT: Evaluation of Bard and ChatGPT on Machine Translation of Ten Arabic Varieties

Karima Kadaoui, Mohamed bin Zayed University of Artificial Intelligence
Samar M. Magdy, Mohamed bin Zayed University of Artificial Intelligence
Abdul Waheed, Mohamed bin Zayed University of Artificial Intelligence
Md Tawkat Islam Khondaker, The University of British Columbia
Ahmed Oumar El-Shangiti, Mohamed bin Zayed University of Artificial Intelligence
El Moatez Billah Nagoudi, The University of British Columbia
Muhammad Abdul-Mageed, The University of British Columbia

Document Type

Conference Proceeding

Publication Title

ArabicNLP 2023 - 1st Arabic Natural Language Processing Conference, Proceedings

Abstract

Despite the purported multilingual proficiency of instruction-finetuned large language models (LLMs) such as ChatGPT and Bard, the linguistic inclusivity of these models remains insufficiently explored. To address this gap, we present a thorough assessment of the machine translation proficiency of Bard and ChatGPT (encompassing both GPT-3.5 and GPT-4) across ten varieties of Arabic. Our evaluation covers diverse Arabic varieties, including Classical Arabic (CA), Modern Standard Arabic (MSA), and several country-level dialectal variants. Our analysis indicates that LLMs may encounter challenges with dialects for which minimal public datasets exist, but on average are better translators of dialects than existing commercial systems. On CA and MSA, however, instruction-tuned LLMs trail behind commercial systems such as Google Translate. Finally, we undertake a human-centric study to scrutinize the efficacy of the relatively recent model, Bard, in following human instructions during translation tasks. Our analysis reveals that Bard has only a limited ability to align with human instructions in translation contexts. Collectively, our findings underscore that prevailing LLMs remain far from inclusive, with only limited ability to cater for the linguistic and cultural intricacies of diverse communities.

First Page

52

Last Page

75

Publication Date

1-1-2023

Recommended Citation

K. Kadaoui et al., "TARJAMAT: Evaluation of Bard and ChatGPT on Machine Translation of Ten Arabic Varieties," ArabicNLP 2023 - 1st Arabic Natural Language Processing Conference, Proceedings, pp. 52 - 75, Jan 2023.

This document is currently not available here.