Wiktionary:ឡាតាំងយានកម្មខ្មែរ
ទាំងនេះគឺជាក្បួនស្ដីពីបម្លែងតួអក្សរក្នុងមេពាក្យខ្មែរ។
ឡាតាំងយានកម្មខ្មែរ
[កែប្រែ]ភាសាខ្មែរសរសេរដោយអក្សរដែលជាអក្សរស្រៈនិស្ស័យតាមផ្អែកលើក្រុមឥណ្ឌូភាសា។ មានវិធីសាស្ត្រជាច្រើនដើម្បីសរសេរប្រែអក្សរខ្មែរទៅជាអក្សរឡាតាំង ដែលគំនូសបំព្រួញដែលឧស្សាហ៍ប្រទះបំផុតគឺជារបស់ ក្រុមអ្នកជំនាញនៃអង្គការសហប្រជាជាតិខាងឈ្មោះភូមិសាស្ត្រ (UNGEGN) គំនូសបំព្រួញផ្នែកភូមិសាស្ត្រដែលផ្អែកលើគំនូសបំព្រួញរបស់ UGEGN គំនូសបំព្រួញ BGN/PCGN និងគំនូសបំព្រួញ ALA-LC ។ គំនូសបំព្រួញទាំងអស់នេះប្រើការលាយចូលគ្នានៃមូលភាពបម្លែងតួអក្សរ និងអក្សរសព្ទ (ជាមួយនឹងសមភាគផ្សេងគ្នានៃការលាយចូលគ្នា) ហើយជាវិបាក វាគឺពិបាកយ៉ាងសមគួរក្នុងការបង្កើតឱ្យមានAll of these schemes use a mix of transcription and transliteration principles (with different proportions of mixing), and as a consequence it is appreciably difficult to algorithmically generate these romanisations in an accurate manner. Monolingual Khmer dictionaries, such as the renowned Chuon Nath Dictionary, traditionally make use of ‘respellings’ to indicate irregularities in pronunciations in a fashion similar to Thai dictionaries, though the use of respellings is not as consistent. The following will attempt to introduce the intricacies of the Khmer script and the romanisations.
Consonants
[កែប្រែ]‘Syllabic configurations’
[កែប្រែ]- a-series = 1st class; o-series = 2nd class.
- Note that the combination of diacritics may not be displayed as desired; please consult the column of examples.
Independent vowels
[កែប្រែ]- Note that words spelt with independent vowels should always have respellings in entries, for example ឩកា should be respelt as អ៊ូកា.
- Also note that the independent vowel ឣ is different from the consonant sign អ. On Wiktionary, only the latter should be used in entries.
Independent vowels |
UN romanization | IPA |
---|---|---|
ឣ | Wiktionary:KM TR/T | /ʔɑʔ/ |
ឤ | Wiktionary:KM TR/T | /ʔa/ |
ឥ | Wiktionary:KM TR/T | /ʔe/ |
ឦ | Wiktionary:KM TR/T | /ʔəj/ |
ឧ | Wiktionary:KM TR/T | /ʔ/ |
ឨ | ||
ឩ | Wiktionary:KM TR/T | /ʔu/ |
ឪ | Wiktionary:KM TR/T | /ʔɨw/ |
ឫ | Wiktionary:KM TR/T | /ʔrɨ/ |
ឬ | Wiktionary:KM TR/T | /ʔrɨː/ |
ឭ | Wiktionary:KM TR/T | /ʔlɨ/ |
ឮ | Wiktionary:KM TR/T | /ʔlɨː/ |
ឯ | Wiktionary:KM TR/T | /ʔeː/ |
ឰ | Wiktionary:KM TR/T | /ʔaj/ |
ឱ, ឲ | Wiktionary:KM TR/T | /ʔaːo/ |
ឳ | Wiktionary:KM TR/T | /ʔaw/ |
Diacritics
[កែប្រែ]Diacritics | Name | Notes |
---|---|---|
ំ | Wiktionary:KM TR/T (និគ្គហិត) | niggahita; nasalizes the inherent vowels and some of the dependent vowels, see anusvara, sometimes used to represent [aɲ] in Sanskrit loanwords |
ះ | Wiktionary:KM TR/T (រះមុខ) | "shining face"; adds final aspiration to dependent or inherent vowels, usually omitted, corresponds to the visarga diacritic, it maybe included as dependent vowel symbol |
ៈ | Wiktionary:KM TR/T (យុគលពិន្ទុ) | yugala bindu ("pair of dots"); adds final glottalness to dependent or inherent vowels, usually omitted |
( ៉) | Wiktionary:KM TR/T (មូសិកទន្ត) | mūsikadanta ("mouse teeth"); used to convert some o-series consonants to the a-series |
៊ | Wiktionary:KM TR/T (ត្រីសព្ទ) | trīsabda; used to convert some a-series consonants to the o-series |
ុ | Wiktionary:KM TR/T (ក្បៀសក្រោម) | also known as bok cəəng (បុកជើង); used in place when the diacritics trəysap and muusekaʾtŏən impede with superscript vowels |
់ | Wiktionary:KM TR/T (បន្តក់) | used to shorten some vowels |
៌ | Wiktionary:KM TR/T (របាទ) répheăk (រេផៈ) |
rapāda, repha; behave similarly to the tŏəndĕəʾkhiət, corresponds to the Devanagari diacritic repha, however it lost its original function which was to represent a vocalic "r" |
៍ | Wiktionary:KM TR/T (ទណ្ឌឃាដ) | daṇḍaghāta; used to render some letters as unpronounced |
៎ | Wiktionary:KM TR/T (កាកបាទ) | kākapāda ("crow's foot"); more a punctuation mark than a diacritic; used in writing to indicate the rising intonation of an exclamation or interjection; often placed on grammatical particles such as /na/, /nɑː/, /nɛː/, /vəːj/, and the feminine response /cah/ |
៏ | Wiktionary:KM TR/T (អស្តា) | denotes stressed intonation in some single-consonant words[១] |
័ | Wiktionary:KM TR/T (សំយោគសញ្ញា) | represents a short inherent vowel in Sanskrit and Pali words; usually omitted |
៑ | Wiktionary:KM TR/T (វិរាម) | a mostly obsolete diacritic, corresponds to the virāma |
្ | Wiktionary:KM TR/T (ជើង) | a.w. coeng; a sign developed by Unicode to input subscript consonants, appearance of this sign varies among fonts |