Phonetic symbols in Unicode
Unicode supports several phonetic scripts and notations through the existing writing systems and the addition of extra blocks with phonetic characters. These phonetic extras are derived of an existing script, usually Latin, Greek or Cyrillic. In Unicode there is no "IPA script". Apart from IPA, extensions to the IPA and obsolete and nonstandard IPA symbols, these blocks also contain characters from the Uralic Phonetic Alphabet and the Americanist Phonetic Alphabet.
Phonetic scripts
The International Phonetic Alphabet (IPA) makes use of letters from other writing systems as most phonetic scripts do. IPA notably uses Latin, Greek and Cyrillic characters. Combining diacritics also adds meaning to the phonetic text. Finally, these phonetic alphabets make use of modifier letters, that are specially constructed for the phonetic meaning. A "modifier letter" is strictly intended not as an independent grapheme but as a modification of the preceding character[1] resulting in a distinct grapheme, notably in the context of the International Phonetic Alphabet. For example, ʰ should not occur on its own but modifies the preceding or following symbol. Thus, tʰ is a single IPA symbol, distinct from t. In practice, however, several of these "modifier letters" are also used as full graphemes, e.g. ʿ as transliterating Semitic ayin or Hawaiian okina, or ˚ transliterating Abkhaz ә.
From IPA to Unicode
Consonants
The following tables indicates the Unicode code point sequences for phonemes as used in the International Phonetic Alphabet. A bold code point indicates that the Unicode chart provides an application note such as "voiced retroflex lateral" for U+026D ɭ LATIN SMALL LETTER L WITH RETROFLEX HOOK (HTML ɭ
). An entry in bold italics indicates the character name itself refers to a phoneme such as U+0298 ʘ LATIN LETTER BILABIAL CLICK (HTML ʘ
)
Bilabial | Labiodental | Dental | Alveolar | Postalveolar | Retroflex | Labial-palatal | ||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Plosive | p 0070 | b 0062 | p̪ 0070 032A | b̪ 0062 032A | t̪ 0074 032A | d̪ 0064 032A | t 0074 |
d 0064 | ʈ 0288 | ɖ 0256 | ||||
Implosive | ɓ̥ 0253 0325 | ɓ 0253 | ɗ̪ 0257 032A | ɗ 0257 | * | |||||||||
Ejective | pʼ 0070 02BC | t̪ʼ 0074 032A 02BC | tʼ 0074 02BC | ʈʼ 0288 02BC | ||||||||||
Nasal | m̥ 006D 0325 | m 006D | ɱ̊ 0271 030A | ɱ 0271 | n̪̊ 006E 032A 030A | n̪ 006E 032A | n̥ 006E 0325 | n 006E | ɳ̊ 0273 030A | ɳ 0273 | ||||
Trill | ʙ 0299 | r̥ 0072 0325 | r 0072 | * | ||||||||||
Tap or Flap | ⱱ̟ 2C71 031F | ɱ 2C71 | ɾ 027E | ɽ 027D | ||||||||||
Lateral flap | ɺ 027A | * | ||||||||||||
Fricative | ɸ 0278 | β 03B2 | f 0066 |
v 0076 | θ 03B8 | ð 00F0 | ɳ 0073 | ɺ 007A | ʃ 0283 | ʒ 0292 | ʂ 0282 | ʐ 0290 | ||
Lateral fricative | ɬ 026C | n 026E | * | |||||||||||
Ejective fricative | sʼ 0073 02BC | ʃʼ 0283 02BC | ||||||||||||
Ejective lateral fricative | ɬʼ 026C 02BC | |||||||||||||
Percussive | ʬ 02AC |
ʭ 02AD |
||||||||||||
Approximant | β̞̊ 03B2 031E 030A | β̞ 03B2 031E | ʋ̥ 028B 0325 | ʋ 028B | ð̞ 00F0 031E | ɹ̥ 0279 0325 | ɹ 0279 | ɻ̊ 027B 030A | ɻ 027B | ɥ̊ 0265 030A | ɥ 0265 | |||
Lateral approximant | l̥ 006C 0325 | ɬ 006C | m 026D | |||||||||||
Click consonant | ʘ 0298 |
ǀ 01C0 |
ǃ 01C3 |
ǃ / ǂ 01C3 / 01C2 |
||||||||||
Lateral click | * | ǁ 01C1 |
Alveolo-palatal | Palatal | Labial-velar | Velar | Uvular | Pharyngeal | Epiglottal | Glottal | |||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Plosive | ȶ 0236 | ȡ 0221 | c 0063 | ɟ 025F | k͡p 006B 0361 0070 | ɡ͡b 0261 0361 0062 | k 006B | ɡ 0261 | ɱ 0071 | b 0262 | ʡ 02A1 | ʔ 0294 | ||||
Implosive | ʄ 0284 | ɠ 0260 | ʛ 029B | |||||||||||||
Ejective | cʼ 0063 02BC | kʼ 006B 02BC | qʼ 0071 02BC | |||||||||||||
Nasal | ȵ 0235 | r 0272 | ŋ͡m 014B 0361 006D | ŋ 014B | t 0274 | |||||||||||
Trill | ʀ 0280 | * | ||||||||||||||
Tap or Flap | * | |||||||||||||||
Lateral flap | * | * | ||||||||||||||
Fricative | ɕ 0255 | ʑ 0291 | ç 0063 0327 | ʝ 029D | ɸ 0078 | c 0263 | χ 03C7 | ʁ 0281 | ħ 0127 | ʕ 0295 | ʜ 029C | ʢ 02A2 | h 0068 | f 0266 | ||
Approximant | j 006A | ʍ 028D | w 0077 | p 0270 | ||||||||||||
Lateral approximant | ȴ 0234 | ʎ 028E | ʟ 029F |
Vowels
The following figures depict the phonetic vowels and their Unicode / UCS code points. Vowels appearing in pairs in the figure to the right indicate rounded and unrounded variations respectively. Again, characters with Unicode names referring to phonemes are indicated by bold text. Those with explicit application notes are indicated by bold italic text. Those from borrowed unchanged from another script (Latin, Greek or Cyrillic) are indicated by italics.
This table represents the phonetic vowel trapezium Before and after a bullet are the unrounded · rounded vowels | |||||
Close | i · ɹ 0069 0079 |
h · ʉ 0268 0289 |
ɯ · u 026F 0075 | ||
Near-close | j · ʏ 026A 028F |
ɪ̈ · ʊ̈ 026A 0308 · 028A 0308 |
· ʊ 028A |
||
Close-mid | ɥ · ø 0065 00F8 |
ɘ · u 0258 0275 |
d · ɯ 0264 006F | ||
Mid | ə 0259 |
||||
Open-mid | ɛ · ɓ 025B 0153 |
ɜ · ɞ 025C 025E |
ʌ · ɔ 028C 0254 | ||
Near-open | æ · 00E6 |
ɐ 0250 |
|||
Open | ɡ · v 0061 0276 |
ɑ · ɒ 0251 0252 | |||
Vowel length marker | ː 02D0 |
Unicode blocks
- Basic Latin (0020–007E), IPA example: Open front unrounded vowel (0061)
- Latin-1 Supplement (00A0–00FF), IPA example: Near-open front unrounded vowel (00E6)
- Latin Extended-A (0100–017F), IPA example: Voiceless pharyngeal fricative (0127)
- Latin Extended-B (0180–024F), IPA example: Tenuis dental click (01C0 0287)
- IPA Extensions (0250–02AF), IPA example: Near-open central vowel (0250)
- Spacing Modifier Letters (02B0–02FF), IPA example: Palatal ejective (0063 02BC)
- Combining Diacritical Marks (0300–036F), IPA example: Near-close central unrounded vowel (026A 0308)
- Greek and Coptic (0370–03FF), IPA example: Voiced bilabial fricative (03B2)
- Latin Extended-C (2C60–2C7F), IPA example: Labiodental flap (2C71)
- Phonetic Extensions (1D00–1D7F)
- Phonetic Extensions Supplement (1D80–1DBF)
- Superscripts and Subscripts (2070–209F)
- Modifier Tone Letters (A700–A71F)
From Unicode blocks to scripts
Phonetical scripts are encoded in six Unicode blocks.
IPA Extensions (U+0250–02AF)
IPA Extensions[1] Official Unicode Consortium code chart (PDF) | ||||||||||||||||
0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F | |
U+025x | ɐ | ɑ | ɒ | ɓ | ɔ | ɕ | ɖ | ɗ | ɘ | ə | ɚ | ɛ | ɜ | ɝ | ɞ | ɟ |
U+026x | ɠ | ɡ | ɢ | ɣ | ɤ | ɥ | ɦ | ɧ | ɨ | ɩ | ɪ | ɫ | ɬ | ɭ | ɮ | ɯ |
U+027x | ɰ | ɱ | ɲ | ɳ | ɴ | ɵ | ɶ | ɷ | ɸ | ɹ | ɺ | ɻ | ɼ | ɽ | ɾ | ɿ |
U+028x | ʀ | ʁ | ʂ | ʃ | ʄ | ʅ | ʆ | ʇ | ʈ | ʉ | ʊ | ʋ | ʌ | ʍ | ʎ | ʏ |
U+029x | ʐ | ʑ | ʒ | ʓ | ʔ | ʕ | ʖ | ʗ | ʘ | ʙ | ʚ | ʛ | ʜ | ʝ | ʞ | ʟ |
U+02Ax | ʠ | ʡ | ʢ | ʣ | ʤ | ʥ | ʦ | ʧ | ʨ | ʩ | ʪ | ʫ | ʬ | ʭ | ʮ | ʯ |
Notes
|
Spacing Modifier Letters (U+02B0–02FF)
The characters in the "Spacing Modifier Letters" block are intended as forming a unity with the preceding letter (which they "modify"). E.g. the character U+02B0 ʰ MODIFIER LETTER SMALL H isn't intended simply as a superscript h (h), but as the mark of aspiration placed after the letter being aspirated, as in pʰ "aspirated voiceless bilabial plosive". The block contains:
- Latin superscript modifier letters: (U+02B0–U+02B8): ʰ aspiration; ʱ breathy voice, murmured; ʲ palatalization; ʳ, ʴ, ʵ, ʶ r-coloring or r-offglides; ʷ labialization; ʸ palatalization, Americanist usage for U+02B2
- Miscellaneous phonetic modifiers: (U+02B9–U+02D7): ʹ ʺ ʻ ʼ ʽ ʾ ʿ ˀ ˁ ˂ ˃ ˄ ˅ ˆ ˇ ˈ ˉ ˊ ˋ ˌ ˍ ˎ ˏ ː ˑ ˒ ˓ ˔ ˕ ˖ ˗
- Spacing clones of diacritics: (U+02D8–U+02DD): ˘ breve; ˙ dot above; ˚ ring above; ˛ ogonek; ˜ small tilde; ˝ double acute accent
- Additions based on 1989 IPA: (U+02DE–U+02E4): ˞ ˟ ˠ ˡ ˢ ˣ ˤ
- Tone letters: (U+02E5–U+02E9): ˥ ˦ ˧ ˨ ˩
- Extended Bopomofo tone marks: U+02EA ˪ YIN DEPARTING TONE MARK; U+02EB ˫ YANG DEPARTING TONE MARK
- IPA modifiers: U+02EC ˬ MODIFIER LETTER VOICING, unaspirated
- Other modifier letters: U+02EE ˮ MODIFIER LETTER DOUBLE APOSTROPHE for Nenets
- Uralic Phonetic Alphabet (UPA) modifiers: (U+02EF–U+02FF): ˯ ˰ ˱ ˲ ˳ ˴ ˵ ˶ ˷ ˸ ˹ ˺ ˻ ˼ ˽ ˾ ˿
Spacing Modifier Letters[1] Official Unicode Consortium code chart (PDF) | ||||||||||||||||
0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F | |
U+02Bx | ʰ | ʱ | ʲ | ʳ | ʴ | ʵ | ʶ | ʷ | ʸ | ʹ | ʺ | ʻ | ʼ | ʽ | ʾ | ʿ |
U+02Cx | ˀ | ˁ | ˂ | ˃ | ˄ | ˅ | ˆ | ˇ | ˈ | ˉ | ˊ | ˋ | ˌ | ˍ | ˎ | ˏ |
U+02Dx | ː | ˑ | ˒ | ˓ | ˔ | ˕ | ˖ | ˗ | ˘ | ˙ | ˚ | ˛ | ˜ | ˝ | ˞ | ˟ |
U+02Ex | ˠ | ˡ | ˢ | ˣ | ˤ | ˥ | ˦ | ˧ | ˨ | ˩ | ˪ | ˫ | ˬ | ˭ | ˮ | ˯ |
U+02Fx | ˰ | ˱ | ˲ | ˳ | ˴ | ˵ | ˶ | ˷ | ˸ | ˹ | ˺ | ˻ | ˼ | ˽ | ˾ | ˿ |
Notes
|
Phonetic Extensions (U+1D00–1D7F)
This block, together with Phonetic Extensions Supplement below, contains:
- Small capitals "ɢ ɪ ɴ ɶ ʀ ʏ ʙ ʜ ʟ"
- Turned small letters "ɐ ɥ ɯ ɹ ɺ ɻ ʇ ʌ ʍ ʎ ʞ ʮ ʯ"
- Extra small capitals "ʁ ʛ ᴀ ᴁ ᴃ ᴄ ᴅ ᴆ ᴇ ᴊ ᴋ ᴌ ᴍ ᴎ ᴏ ᴐ ᴘ ᴙ ᴚ ᴛ ᴜ ᴠ ᴡ ᴢ ᴣ ᴦ ᴧ ᴨ ᴩ ᴪ"
- Letters with palatal hooks "ƫ ᶀ ᶁ ᶂ ᶃ ᶄ ᶅ ᶆ ᶇ ᶈ ᶉ ᶊ ᶋ ᶌ ᶍ ᶎ ᶪ ᶵ"
- Letters with retroflex hooks "ᶏ ᶐ ᶒ ᶓ ᶔ ᶕ ᶖ ᶗ ᶘ ᶙ ᶚ ᶩ ᶯ ᶼ"
Phonetic Extensions[1] Official Unicode Consortium code chart (PDF) | ||||||||||||||||
0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F | |
U+1D0x | ᴀ | ᴁ | ᴂ | ᴃ | ᴄ | ᴅ | ᴆ | ᴇ | ᴈ | ᴉ | ᴊ | ᴋ | ᴌ | ᴍ | ᴎ | ᴏ |
U+1D1x | ᴐ | ᴑ | ᴒ | ᴓ | ᴔ | ᴕ | ᴖ | ᴗ | ᴘ | ᴙ | ᴚ | ᴛ | ᴜ | ᴝ | ᴞ | ᴟ |
U+1D2x | ᴠ | ᴡ | ᴢ | ᴣ | ᴤ | ᴥ | ᴦ | ᴧ | ᴨ | ᴩ | ᴪ | ᴫ | ᴬ | ᴭ | ᴮ | ᴯ |
U+1D3x | ᴰ | ᴱ | ᴲ | ᴳ | ᴴ | ᴵ | ᴶ | ᴷ | ᴸ | ᴹ | ᴺ | ᴻ | ᴼ | ᴽ | ᴾ | ᴿ |
U+1D4x | ᵀ | ᵁ | ᵂ | ᵃ | ᵄ | ᵅ | ᵆ | ᵇ | ᵈ | ᵉ | ᵊ | ᵋ | ᵌ | ᵍ | ᵎ | ᵏ |
U+1D5x | ᵐ | ᵑ | ᵒ | ᵓ | ᵔ | ᵕ | ᵖ | ᵗ | ᵘ | ᵙ | ᵚ | ᵛ | ᵜ | ᵝ | ᵞ | ᵟ |
U+1D6x | ᵠ | ᵡ | ᵢ | ᵣ | ᵤ | ᵥ | ᵦ | ᵧ | ᵨ | ᵩ | ᵪ | ᵫ | ᵬ | ᵭ | ᵮ | ᵯ |
U+1D7x | ᵰ | ᵱ | ᵲ | ᵳ | ᵴ | ᵵ | ᵶ | ᵷ | ᵸ | ᵹ | ᵺ | ᵻ | ᵼ | ᵽ | ᵾ | ᵿ |
Notes
|
Phonetic Extensions Supplement (U+1D80–1DBF)
Phonetic Extensions Supplement[1] Official Unicode Consortium code chart (PDF) | ||||||||||||||||
0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F | |
U+1D8x | ᶀ | ᶁ | ᶂ | ᶃ | ᶄ | ᶅ | ᶆ | ᶇ | ᶈ | ᶉ | ᶊ | ᶋ | ᶌ | ᶍ | ᶎ | ᶏ |
U+1D9x | ᶐ | ᶑ | ᶒ | ᶓ | ᶔ | ᶕ | ᶖ | ᶗ | ᶘ | ᶙ | ᶚ | ᶛ | ᶜ | ᶝ | ᶞ | ᶟ |
U+1DAx | ᶠ | ᶡ | ᶢ | ᶣ | ᶤ | ᶥ | ᶦ | ᶧ | ᶨ | ᶩ | ᶪ | ᶫ | ᶬ | ᶭ | ᶮ | ᶯ |
U+1DBx | ᶰ | ᶱ | ᶲ | ᶳ | ᶴ | ᶵ | ᶶ | ᶷ | ᶸ | ᶹ | ᶺ | ᶻ | ᶼ | ᶽ | ᶾ | ᶿ |
Notes
|
Modifier Tone Letters (U+A700–A71F)
Modifier Tone Letters[1] Official Unicode Consortium code chart (PDF) | ||||||||||||||||
0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F | |
U+A70x | ꜀ | ꜁ | ꜂ | ꜃ | ꜄ | ꜅ | ꜆ | ꜇ | ꜈ | ꜉ | ꜊ | ꜋ | ꜌ | ꜍ | ꜎ | ꜏ |
U+A71x | ꜐ | ꜑ | ꜒ | ꜓ | ꜔ | ꜕ | ꜖ | ꜗ | ꜘ | ꜙ | ꜚ | ꜛ | ꜜ | ꜝ | ꜞ | ꜟ |
Notes
|
Superscripts and Subscripts (U+2070–209F)
Superscripts and Subscripts[1][2][3] Official Unicode Consortium code chart (PDF) | ||||||||||||||||
0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F | |
U+207x | ⁰ | ⁱ | ⁴ | ⁵ | ⁶ | ⁷ | ⁸ | ⁹ | ⁺ | ⁻ | ⁼ | ⁽ | ⁾ | ⁿ | ||
U+208x | ₀ | ₁ | ₂ | ₃ | ₄ | ₅ | ₆ | ₇ | ₈ | ₉ | ₊ | ₋ | ₌ | ₍ | ₎ | |
U+209x | ₐ | ₑ | ₒ | ₓ | ₔ | ₕ | ₖ | ₗ | ₘ | ₙ | ₚ | ₛ | ₜ | |||
Notes
|
See also
References
- ↑ "Spacing modifier letters". Everything2.com. 2002-08-29. Retrieved 2016-01-23.
External links
- links to PDFs of Unicode codes for several phonetic symbol sets.
- PhoTransEdit This free software tool translates English texts into IPA phonetics and also exports transcriptions to HTML decimal code numbers.
- Spacing Modifier Letters (everything2.com)
|