Abstract
Arabic and Urdu are two prominent languages, both these languages share
script similarities and exhibit lexical overlap, with certain words
being common to both languages. Our objective is to find the
intersection of the lexicon of words between these two languages (having
similar script and meaning). Surprisingly, there has been limited
exploration of this particular area for Arabic and Urdu. To address this
gap, we will leverage data manipulation, analysis tools, and NLP
methodologies employing specialized NLP libraries.The resultant corpus
may hold a great potential for enhancing cross-linguistic communication
and fostering a deeper understanding between Arabic and Urdu speakers.