TOWARDS AN ARABIC-URDU COMMON LEXICON USING NLP

Zaid Rafi; Sahil Sholla

doi:10.36227/techrxiv.23536284.v1

loading page

TOWARDS AN ARABIC-URDU COMMON LEXICON USING NLP

Zaid Rafi ,
Sahil Sholla

Abstract

Arabic and Urdu are two prominent languages, both these languages share script similarities and exhibit lexical overlap, with certain words being common to both languages. Our objective is to find the intersection of the lexicon of words between these two languages (having similar script and meaning). Surprisingly, there has been limited exploration of this particular area for Arabic and Urdu. To address this gap, we will leverage data manipulation, analysis tools, and NLP methodologies employing specialized NLP libraries.The resultant corpus may hold a great potential for enhancing cross-linguistic communication and fostering a deeper understanding between Arabic and Urdu speakers.