Romanization of Khmer

{{Short description|Representation of the Khmer language in Latin alphabets}}

{{Contains special characters|Khmer}}

The romanization of Khmer is a representation of the Khmer (Cambodian) language using letters of the Latin alphabet. This is most commonly done with Khmer proper nouns, such as names of people and geographical names, as in a gazetteer.

Romanization systems for Khmer

Cambodian geographical names are often romanized with a transliteration system, where representations in the Khmer script are mapped regularly to representations in the Latin alphabet (sometimes with some additional diacritics). The results do not always reflect standard Khmer pronunciation, as no special treatment is given to unpronounced letters and irregular pronunciations, although the two registers of Khmer vowel symbols are often taken into account.

When transcription is used, words are romanized based on their pronunciation. However, pronunciation of Khmer can vary by speaker and region. Roman transcription of Khmer is often done ad hoc on Internet forums and chatrooms, the results sometimes being referred to as Khmenglish or Khmerlish. These ad hoc romanizations are usually based on English pronunciations of letters, although they may also be influenced by Khmer spelling (as with the use of s rather than h to represent a final aspirate).

Since some sounds can be represented by more than one symbol in Khmer orthography, it is not generally possible to recover the original Khmer spelling from a pronunciation-based Roman transcription. Even transliteration systems often do not preserve all of the distinctions made in the Khmer script.

Some of the more commonly used romanization systems for Khmer are listed below. For full details of the various systems, see the links given in the External Links section.

=UNGEGN=

The Khmer romanization scheme published by the United Nations Group of Experts on Geographical Names is based on the BGN/PCGN system, described below. It is used for Cambodian geographical names in some recent maps and gazetteers, although the Geographic Department's modified system (see below) has come into use in the country since 1995.[http://www.eki.ee/wgrs/rom1_km.pdf Report on the Current Status of United Nations Romanization Systems for Geographical Names – Khmer], UNGEGN Working Group on Romanization Systems, September 2013 (linked from [http://www.eki.ee/wgrs/ WGRS website]). Correspondences in the UNGEGN system are detailed in the Khmer alphasyllabary article.

=Geographic Department=

The Geographic Department of the Cambodian Ministry of Land Management and Urban Planning has developed a modified version of the UNGEGN system,[https://unstats.un.org/unsd/geoinfo/UNGEGN/docs/8th-uncsgn-docs/inf/8th_UNCSGN_econf.94_INF.30.pdf Geographical Names of the Kingdom of Cambodia], submitted by Cambodia to the 8th UN Conference on the Standardization of Geographical Names, 2002 (also [http://unstats.un.org/unsd/geoinfo/UNGEGN/docs/8th-uncsgn-docs/inf/8th_UNCSGN_econf.94_INF.30_corr1.pdf addendum with corrections]). originally put forward in 1995, and used in the second edition of the Gazetteer of Cambodia in 1996. Further modifications were made in 1997, and the system continues to be used in Cambodia.

The main change made in this system compared with the UNGEGN system is that diacritics on vowels are omitted. Some of the vowels are also represented using different letter combinations.

=BGN/PCGN=

A system used by the United States Board on Geographic Names and the Permanent Committee on Geographical Names for British Official Use, published in 1972. It is based on the modified 1959 Service Géographique Khmer (SGK) system.[https://www.gov.uk/government/uploads/system/uploads/attachment_data/file/320109/Khmer_Romanization_Nov12.pdf Romanization System for Khmer (Cambodian)], BGN/PCGN 1972 System.

=ALA-LC Romanization Tables=

This system (also called Transliteration System for Khmer Script), from the American Library Association and Library of Congress,[https://www.loc.gov/catdir/cpso/roman.html ALA-LC Romanization Tables], Khmer, rev. 2012. romanizes Khmer words using the original Indic values of the Khmer letters, which are often different from their modern values. This can obscure the modern Khmer pronunciation, but the system has the advantage of relative simplicity, and facilitates the etymological reconstruction of Sanskrit and Pali loanwords whose pronunciation may be different in modern Khmer. The system is a modification of that proposed by Lewitz (1969), and was developed by Franklin Huffman of Cornell University and Edwin Bonsack of the Library of Congress for the library cataloguing of publications in Khmer.

Example words written in each romanization system

class=wikitable
rowspan="2"|English

!rowspan="2"|Khmer

!rowspan="2"|Pronunciation

!colspan="3"|Romanization

UNGEGN
{{small|(or BGN/PCGN)}}

!Geographic
Department

!ALA-LC

style="vertical-align:top; text-align:center;"

|Khmer script

|{{lang|km|អក្សរខ្មែរ}}

|[ʔaksɑː kʰmae]

|'âksâr khmêr

|'aksar khmaer

|ʿʹaksar khmaer

style="vertical-align:top; text-align:center;"

|Cambodia

|{{lang|km|កម្ពុជា}}

|[kampuciə]

|Kâmpŭchéa

|Kampuchea

|Kambujā

style="vertical-align:top; text-align:center;"

|centre

|{{lang|km|មណ្ឌល}}

|[mɔnɗɔl], [mŏənɗɔl]

|môndôl

|mondol

|maṇḍal

style="vertical-align:top; text-align:center;"

|brightness

|{{lang|km|ពន្លឺ}}

|[pɔnlɨː]

|pônlœ

|ponlueu

|banlȳ

style="vertical-align:top; text-align:center;"

|peace

|{{lang|km|សន្តិភាព}}

|[sɑntepʰiəp]

|sântĕphéap

|santepheap

|santibhāb

style="vertical-align:top; text-align:center;"

|belief

|{{lang|km|ជំនឿ}}

|[cumnɨə]

|chumnœă

|chumnoea

|jaṃnẏa

style="vertical-align:top; text-align:center;"

|to go

|{{lang|km|ទៅ}}

|[təw]

|tŏu

|tov

|dau

Tables of romanization systems

This chart shows in full the three main systems for the romanization of Khmer: UNGEGN (or BGN/PCGN), Geographic Department and ALA-LC:

=Consonants=

{{color box|#F5E8E4}} 1st series {{color box|#E4F1F5}} 2nd seriesKhmer consonants belong to two classes that dictate the value of dependent vowels.

class=wikitable style="text-align: center;"
colspan="3" | Khmer

! rowspan="2"| UNGEGN
{{small|(or BGN/PCGN)}}

! rowspan="2"| {{small|Geographic
Department}}

! rowspan="2"| ALA-LC

Full
form

! {{small|Subscript
form}}

!IPA

style="background-color:#F5E8E4"

| ក

្ក

|{{IPA|[k]}}

kaKak
style="background-color:#F5E8E4"

| ខ

្ខ

|{{IPA|[kʰ]}}

khaKhakh
style="background-color:#E4F1F5"

| គ

្គ

|{{IPA|[k]}}

GaGog
style="background-color:#E4F1F5"

| ឃ

្ឃ

|{{IPA|[kʰ]}}

GhaGhogh
style="background-color:#E4F1F5"

| ង

្ង

|{{IPA|[ŋ]}}

ṄaṄong
style="background-color:#F5E8E4"

| ច

្ច

|{{IPA|[c]}}

CaCac
style="background-color:#F5E8E4"

| ឆ

្ឆ

|{{IPA|[cʰ]}}

ChaChach
style="background-color:#E4F1F5"

| ជ

្ជ

|{{IPA|[c]}}

JaJoj
style="background-color:#E4F1F5"

| ឈ

្ឈ

|{{IPA|[cʰ]}}

JhaJhojh
style="background-color:#E4F1F5"

| ញ

្ញ

|{{IPA|[ɲ]}}

ÑaÑoñ
style="background-color:#F5E8E4"

| ដ

្ដ

|{{IPA|[ɗ]}}

ṬaṬa
style="background-color:#F5E8E4"

| ឋ

្ឋ

|{{IPA|[tʰ]}}

ṬhaṬhaṭh
style="background-color:#E4F1F5"

| ឌ

្ឌ

|{{IPA|[ɗ]}}

ḌaDo
style="background-color:#E4F1F5"

| ឍ

្ឍ

|{{IPA|[tʰ]}}

ḌhaḌhoḍh
style="background-color:#F5E8E4"

| ណ

្ណ

|{{IPA|[n]}}

ṆaṆa
style="background-color:#F5E8E4"

| ត

្ត

|{{IPA|[t]}}

TaTat
style="background-color:#F5E8E4"

| ថ

្ថ

|{{IPA|[tʰ]}}

ThaThath
style="background-color:#E4F1F5"

| ទ

្ទ

|{{IPA|[t]}}

DaDod
style="background-color:#E4F1F5"

| ធ

្ធ

|{{IPA|[tʰ]}}

DhaDhodh
style="background-color:#E4F1F5"

| ន

្ន

|{{IPA|[n]}}

NaNon
style="background-color:#F5E8E4"

| ប

្ប

|{{IPA|[ɓ], [p]}}

PaPa,BaWhen accompanied by a subscript form, it is romanized as p in the 1st series, although the Khmer diacritical mark {{lang|km|៉}} is generally omitted: {{lang|km|ប្លែង}} → {{Transliteration|km|plaeng}}, {{lang|km|ប្អូន}} →

{{Transliteration|km|p'oun}}, {{lang|km|ប្រាប់}} → {{Transliteration|km|prab}}.

p
style="background-color:#F5E8E4"

| ផ

្ផ

|{{IPA|[pʰ]}}

PhaPhaph
style="background-color:#E4F1F5"

| ព

្ព

|{{IPA|[p]}}

BaBo, po

[Note 2]

| b

style="background-color:#E4F1F5"

| ភ

្ភ

|{{IPA|[pʰ]}}

BhaBhobh
style="background-color:#E4F1F5"

| ម

្ម

|{{IPA|[m]}}

MaMom
style="background-color:#E4F1F5"

| យ

្យ

|{{IPA|[j]}}

YaYoy
style="background-color:#E4F1F5"

| រ

្រ

|{{IPA|[r]}}

RaRor
style="background-color:#E4F1F5"

| ល

្ល

|{{IPA|[l]}}

LaLol
style="background-color:#E4F1F5"

| វ

្វ

|{{IPA|[ʋ]}}

VaVov
style="background-color:#F5E8E4"

| ឝ

្ឝ

|{{IPA|[s]}}

Śashaś
style="background-color:#E4F1F5"

| ឞ

្ឞ

|{{IPA|[s]}}

ṢaSha
style="background-color:#F5E8E4"

| ស

្ស

|{{IPA|[s]}}

SaSas
style="background-color:#F5E8E4"

| ហ

្ហ

|{{IPA|[h]}}

HaHah
style="background-color:#F5E8E4"

| ឡ

|{{IPA|[l]}}

ḶaLa
style="background-color:#F5E8E4"

| អ

្អ

|{{IPA|[ʔ]}}

AAA

=Dependent vowels=

class=wikitable style="text-align: center;"
rowspan="2" | Khmer

! colspan="2"| UNGEGN
{{small|(or BGN/PCGN)}}

! colspan="2"| {{small|Geographic
Department}}

! ALA-LC

A-series

! O-series

! A-series

! O-series

! A-series

◌◌âôaoa
◌់áóaoá
aéaaeaā
ា់, ័◌ă, aea, oaâ
ăakeakà
័យăyoăyaieyăy
ĕĭeii
eiieiiī
œ̆œ̆oeue
œœeuueuȳ
ŏŭouu
ououuū
uououa
aeueuaeueuoe
œăœăoeaoeaẏa
ieieia
ééeee
êêaeeaeae
aieyaieyai
aoouo
auŏuauovau
ុំomŭmomumuṃ
âmumamumaṃ
ាំămŏâmamoamāṃ
ាំងăngeăngangeangāṃng
ăheăhaheahaḥ
ិះĕhĭhehisiḥ
ឹះœ̆hœ̆hoehuehẏḥ
ុះŏhŭhohuhuḥ
េះéhéheheheḥ
ើះaeuheuhaeuheuhoeḥ
ែះêhêhaeheaehaeḥ
ោះaôhŏăhaohuohoaḥ

=Independent vowels=

class=wikitable style="text-align: center;"
Khmer

! UNGEGN
{{small|(or BGN/PCGN)}}

! {{small|Geographic
Department}}

! ALA-LC

âaa
អាaaā
ĕei
eieiī
ŏ, ŭo, uu
o, uou, uū
âuauýu
rœ̆rue
rueu
lœ̆lue
lueu
êaeae
aiaiai
ឱ, ឲaoo
auauau

International Phonetic Alphabet transcription

Various authors have used systems based on the International Phonetic Alphabet (IPA) to transcribe Khmer. One such system is used in the books of Franklin E. Huffman and others;For example, Franklin E. Huffman, Cambodian System of Writing and Beginning Reader with Drills and Glossary, Adam Wood, 1970 ([http://www.pratyeka.org/csw downloadable PDF]). a more recent scheme is that used in J.M. Filippi's 2004 textbook Everyday Khmer or Khmer au quotidien.Jean Michel Filippi, Everyday Khmer, Funan, Phnom Penh , 2004. French edition: Filippi et al., Khmer au quotidien, Librairie You-Feng, 2008. These systems differ in certain respects: for example, Huffman's uses doubling of vowel symbols to indicate long vowels, whereas Filippi's uses the IPA triangular colon vowel length symbol.

Notes

{{reflist|group=note}}

References

{{reflist}}