Sindhi transliteration
{{Short description|Ways to convert text between Arabic and Sindhi language}}
Sindhi is a language broadly spoken by the people of the historical Sindh region in the Indian subcontinent. Modern Sindhi is written in an extended Perso-Arabic script in Sindh province of Pakistan{{Cite web|date=2016-10-03|title=Sindhi gains national status|url=http://tribune.com.pk/story/1192917/sindhi-gains-national-status|access-date=2021-07-02|website=The Express Tribune|language=en}} and (formally) in extended-Devanagari by Sindhis in partitioned India.{{cite web |url=http://www.rajbhasha.nic.in/en/languages-included-eighth-schedule-indian-constution |title=Languages Included in the Eighth Schedule of the Indian {{as written|Constu|tion [sic]}} |publisher=Department of Official Language, Ministry of Home Affairs |access-date=2018-04-09}} Historically, Sindhi was written in various forms of Landa scripts and various other Indic scripts.{{Cite web|last=Lakhani|first=Rakesh|date=2013-06-06|title=Forgotten Sindhi Script – Waranki|url=https://hindusofsindh.wordpress.com/2013/06/06/forgotten-sindhi-script-waranki/|access-date=2021-07-02|website=Indigenous Sindhis|language=en}}
Sindhi Transliteration is essential to convert between Arabic and Devanagari so that speakers of both the countries can read the text of each other.{{Cite web|date=2014-09-18|title=Transcending barriers: Software to break down the wall within the Sindhi language|url=http://tribune.com.pk/story/764134/transcending-barriers-software-to-break-down-the-wall-within-the-sindhi-language|access-date=2021-07-02|website=The Express Tribune|language=en}} In modern day, Sindhi script colloquially just refers to the Perso-Arabic script since majority of Sindhis are from Pakistan. It is also important to note that the Sindhi script is not same as the Urdu-Shahmukhi script,{{cite news|title=The Imposition Of Urdu|url=http://nation.com.pk/editorials/10-Sep-2015/the-imposition-of-urdu|access-date=12 September 2015|publisher=NAWAIWAQT GROUP OF NEWSPAPERS|date=September 10, 2015}} hence one cannot use script conversions like Hindi-Urdu Transliteration.
Technically, a direct one-to-one mapping or rule-based script conversion is not possible between Pakistani and Indian Sindhi, majorly since Devanagari is an abugida script and Arabic-Sindhi is an abjad script, and also other constraints like multiple similar characters from Perso-Arabic which map onto a single character in Devanagari.{{Cite journal|last1=Leghari|first1=Mehwish|last2=U Rahman|first2=Mutee|date=2015-10-30|title=Towards Transliteration between Sindhi Scripts Using Roman Script|ssrn=3820433|language=en|journal=Linguistics and Literature Review |volume=1 |issue=2 |pages=95–104|doi=10.32350/llr.12.03|doi-access=free}} Hence it is preferred to use dictionary-based or machine learning-based transliteration between the Sindhi scripts.{{Cite journal|last1=Lehal|first1=Gurpreet Singh|last2=Saini|first2=Tejinder Singh|date=December 2014|title=Sangam: A Perso-Arabic to Indic Script Machine Transliteration Model|url=https://aclanthology.org/W14-5135|language=en-us|pages=232–239}} For colloquial usage in the digital space where writing Sindhi in Latin script is prevalent, Romanisation of Sindhi is used.{{Cite journal|last1=Sodhar|first1=Irum Naz|last2=Jalbani|first2=Akhtar Hussain|last3=Channa|first3=Muhammad Ibrahim|last4=Hakro|first4=Dil Nawaz|date=2021-04-01|title=Romanized Sindhi Rules for Text Communication|url=https://publications.muet.edu.pk/index.php/muetrj/article/view/2082|journal=Mehran University Research Journal of Engineering and Technology|language=en|volume=40|issue=2|pages=298–304|doi=10.22581/muet1982.2102.04|bibcode=2021MURJE..40..298S |issn=2413-7219|doi-access=free}}
In addition to Sindhi, there have been attempts to design Indo-Pakistani transliteration systems for digraphic languages like Punjabi (written in Gurmukhi in East Punjab and Shahmukhi in West Punjab), Saraiki (written in an extended-Shahmukhi script in Saraikistan and unofficially in Sindhi-Devanagari script in India) and Kashmiri (written in extended Perso-Arabic by Kashmiri Muslims and extended-Devanagari by Kashmiri Hindus).{{Cite web|title=Perso-Arabic To Indic Script Transliteration|url=http://sangam.learnpunjabi.org/|access-date=2021-04-07|website=sangam.learnpunjabi.org}}{{Cite web|title=Saraiki - Devanagari Machine Transliteration System - SDMTS|url=http://www.sanlp.org/saraikiMT/index.html|access-date=2021-08-09|website=www.sanlp.org}}
Consonants
The following table provides an approximate one-to-one mapping for modern Sindhi consonants,{{Citation|last=NC|first=Gokul|title=GokulNC/Indic-PersoArabic-Script-Converter|date=2021-05-07|url=https://github.com/GokulNC/Indic-PersoArabic-Script-Converter|access-date=2021-05-28}} especially for computational purposes (lossless script conversion). Note that this direct script conversion will not yield correct spellings, but rather a readable text for both the readers.{{Cite journal|last=Pedersen|first=Thomas T.|title=Sindhi: Transliteration of Non-Roman Scripts|url=https://transliteration.eki.ee/pdf/Sindhi.pdf|journal=Transliteration of Non-Roman Scripts}}
class="wikitable"
|+ Sindhi Consonants ! Sindhi Abjad !! Urdu-ShahmukhiCarefully note the differences. List also includes 4 implosive consonants from Saraiki alphabet (ڳ, ڄ, ݙ, ٻ) !! Roman !! DevanagariAll the consonants with nuqta sign (dots below) denote adaptation for PersoArabic consonants | |||
ڪ | ک | k | क |
ک | کھ | kh | ख |
ق | ق | q | क़ |
خ | خ | k͟h | ख़ |
گ | گ | g | ग |
غ | غ | g͟h | ग़ |
ڳ | ڳ | g̤ | ॻ |
گھ | گھ | gh | घ |
چ | چ | c | च |
ڇ | چھ | ch | छ |
ج | ج | j | ज |
جھ | جھ | jh | झ |
ڄ | ڄ | j̈ | ॼ |
ز | ز | z | ज़ |
ذ | ذ | z | ज़ |
ض | ض | z | ज़ |
ظ | ظ | z | ज़ |
ژ | ژ | zh | झ़ |
ٽ | ٹ | ṭ | ट |
ٺ | ٹھ | ṭh | ठ |
ڏ | ݙ | d̤ | ॾ |
ڊ | ڈ | ḍ | ड |
ڍ | ڈھ | ḍh | ढ |
ڙ | ڑ | ṛ | ड़ |
ڙھ | ڑھ | ṛh | ढ़ |
ت | ت | t | त |
ٿ | تھ | th | थ |
ط | ط | t | त |
د | د | d | द |
ڌ | دھ | dh | ध |
ن | ن | n | न |
ڻ | ݨ | ṇ | ण |
ڱ | ن٘ | ṅ | ङ |
ڃ | ݩ | ñ | ञ |
پ | پ | p | प |
ڦ | پھ | ph | फ |
ف | ف | f | फ़ |
ب | ب | b | ब |
ٻ | ٻ | ḇ | ॿ |
ڀ | بھ | bh | भ |
م | م | m | म |
ي | ی | y | य |
ر | ر | r | र |
ل | ل | l | ल |
ݪ | لؕ | ḷ | ळ |
و | و | v | व |
ش | ش | sh | श |
ݜ | سؕ | ṣh | ष |
س | س | s | स |
ص | ص | s | स |
ث | ث | s | स |
ه | ہ | h | ह |
ح | ح | h | ह |
ۃ | ۃ | h | ह |
ع | ع | ʿ | ʿ |
Single-letter word ligatures
class="wikitable"
! Sindhi !! Urdu !! Roman !! Devanagari | |||
۽ | اَیں | a͠i | ऐं |
۾ | میں | mẽ | में |
Numerals
class="wikitable"
! Usage !! Numeral System | |||||||||||
Pakistan | Arabic numerals | ۰ | ۱ | ۲ | ۳ | ۴ | ۵ | ٦ | ۷ | ۸ | ۹ |
International | Hindu-Arabic | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 |
India | Modern Devanagari | ० | १ | २ | ३ | ४ | ५ | ६ | ७ | ८ | ९ |
Punctuations and symbols
class="wikitable"
! Script !! Period !! Question Mark !! Comma !! Semi-colon !! Slash !! Percent !! End of verse | |||||||
Perso-Arabic | ۔ | ؟ | ، | ؛ | ؍ | ٪ | |
Modern Devanagari | । | ? | , | ; | / | % | ॥ |