Latin alphabets

A Latin or Roman alphabet is one of many alphabets that use the letters of the Latin script. These letters formed the basis of the original Roman Latin alphabet. Some letters of the Latin script were altered slightly for use in particular languages, although the main letters are largely the same. There were several general types of alterations made to extend the alphabet's uses, depending on the language: diacritics could be added to existing letters; two letters could be fused together into ligatures; additional letters could be inserted; or pairs or triplets of letters could be treated as units (digraphs and trigraphs).

Any additional letters were often given a place in the alphabet by defining an alphabetical order or collation sequence, which can vary between languages. Some of the additions, especially letters which only have diacritics added to them, were not considered distinct letters for this purpose. For example, the French é and the German ö, are not listed separately in their respective alphabet sequences. In some languages, digraphs are included in the collation sequence (e.g. Hungarian CS, Welsh RH).

The ISO basic Latin alphabet includes the 26 basic letters that are included in most alphabets. The International Phonetic Alphabet is also derived mainly from the Latin script.

The tables below summarize and compare some of the alphabets. In this article, the scope of the word "alphabet" is broadened to include letters with tone marks, and other diacritics used to represent a wide range of orthographic traditions, without regard to whether or how they are sequenced in their alphabet or the table.

Usage of the ISO basic Latin alphabet

The Afrikaans, Basque,[4] Breton, Catalan,[6] Czech,[8] Danish,[9] Dutch,[10] English,[36] Estonian, Filipino,[11] Finnish, French,[12] German,[13], Greenlandic, Hungarian,[15] Interlingua, Karakalpak, Kurdish, Modern Latin, Malay, Norwegian,[9] Pan-European, Portuguese, Romanian, Slovak,[24] Spanish,[25] Swedish, Võro, Walloon[27], Xhosa, and Zulu alphabets include all 26 letters, at least in their largest version.

The International Phonetic Alphabet (IPA) includes all 26 letters in their lowercase forms, although g is always single-storey (ɡ) in the IPA and never double-storey ().

List

Reduced usage of the letters of the ISO basic Latin alphabet[1] (A–Z) in various alphabets:
Alphabet A B C D E F G H I J K L M N O P Q R S T U V W X Y Z #
Classical Latin[2] ABCDEFGHI KLMNOPQRST V XYZ23
Albanian[3] ABCDEFGHIJKLMNOPQRSTUV XYZ25
Anglo-Saxon ABCDEFGHI KLMNOPQRSTU XYZ23
Arbëresh ABCDEFGHIJKLMNOPQRSTUV XYZ25
Asturian ABCDEFGHI LMNOPQRSTUV XYZ23
Azeri ABCDEFGHIJKLMNOPQRSTUV XYZ25
Belarusian[5] ABCDEFGHIJKLMNOP RSTUV YZ23
Berber ABCDEFGHIJKLMN QRSTU WXYZ23
Bosnian[7] ABCDEFGHIJKLMNOP RSTUV Z22
British ABCDEFGHI KLMNOPQRST V XYZ23
Chamorro ABCDEFGHI KLMNOP RSTU Y 20
Corsican[31] ABCDEFGHIJ LMNOPQRSTUV Z22
Crimean Tatar ABCDEFGHIJKLMNOPQRSTUV YZ24
Croatian[7] ABCDEFGHIJKLMNOP RSTUV Z22
Dalecarlian AB DEFGHIJKLMNOP RSTUVW Y 22
Esperanto ABCDEFGHIJKLMNOP RSTUV Z22
Faroese AB DEFGHIJKLMNOP RSTUV Y 21
Friulian ABCDEFGHIJ LMNOPQRSTUV Z22
Galician[33] ABCDEFGHI LMNOPQRSTUV X Z22
Gilbertese AB E I K MNO R TU W 12
Guaraní[14] ABCDE GHIJKLMNOP RSTUV Y 18
Hän AB DE GHI KLMNOP RSTU W YZ20
Haitian AB DEFGHIJKLMNOP RST VW YZ22
Hausa[30] ABCDEFGHIJKLMNO RSTU W YZ22
Hawaiian A E HI KLMNOP U W 12
Icelandic AB DEFGHIJKLMNOP RSTUV XY 22
Irish[16] ABCDEFGHI LMNOP RSTU 18
Italian[17] ABCDEFGHI LMNOPQRSTUV Z21
Kashubian ABCDEFGHIJKLMNOP RSTU W YZ23
Latvian[18] ABCDEFGHIJKLMNOP RSTUV Z22
Leonese ABCDEFGHIJ LMNOPQRSTUV XYZ24
Lithuanian[19] ABCDEFGHIJKLMNOP RSTUV YZ23
Malagasy AB DEFGHIJKLMNOP RST V YZ21
Maltese[20] AB DEFGHIJKLMNOPQRSTUVWX Z24
Māori[34] A E GHI K MNOP R TU W 14
Mohawk[35] A E HI K NO RST W Y 12
Northern Sami ABCDEFGHIJKLMNOP RSTUV Z22
Pan-Nigerian ABCDEFGHIJKLMNOP RSTUVW YZ24
Pinyin[32] ABCDEFGHIJKLMNOPQRSTU WXYZ25
Polish[22] ABCDEFGHIJKLMNOP RSTU W YZ23
Traditional Portuguese ABCDEFGHIJ LMNOPQRSTUV X Z23
Romani[29] ABCDEFGHIJKLMNOP RSTUV X Z23
Sardinian ABCDEFGHIJKLMNOP RSTUV XYZ24
Scots Gaelic ABCDEFGHI LMNOP RSTU 18
Serbian[7] ABCDEFGHIJKLMNOP RSTUV Z22
Shona ABCDEFGHIJKLMNOP RSTUVW YZ24
Sicilian ABCDEFGHIJ LMNOPQRSTUV Z22
Slovenian ABCDEFGHIJKLMNOP RSTUV Z22
Somali ABCDEFGHIJKLMNO QRSTU WXY 23
Sorbian ABCDEFGHIJKLMNOP RSTU W YZ23
Swahili ABCDEFGHIJKLMNOP RSTUVW YZ24
Tagalog[11] AB DE GHI KLMNOP RSTU W Y 19
Tetum AB DEFGHIJKLMNOP RSTUV X Z22
Tongan A EFGHI KLMNOP STUV 16
Turkish ABCDEFGHIJKLMNOP RSTUV YZ23
Turkmen AB DEFGHIJKLMNOP RSTU W YZ22
Uzbek[25] ABCDEFGHIJKLMNOPQRSTUV XYZ25
Vietnamese[26] ABCDE GHI KLMNOPQRSTUV XY 22
Volapük ABCDEFGHIJKLMNOP RSTUVWXYZ25
Welsh[28] ABCDEFGHIJ LMNOP RSTU W Y 21
Wolof ABCDEFG IJKLMNOPQRSTU WXY 23
count 6056465560525758604349575960595522585759 564022223843

ISO 646 statistics

The chart above lists a variety of alphabets that do not officially contain all 26 letters of the ISO basic Latin alphabet. In this list, four letters are used by all of them: A, E, I, and N. For each of the 26 basic ISO Latin alphabet letters, the number of alphabets in the list above using it is as follows:

Letter A E I N M O T H R G L S B U D P F K C J Z V Y Q W X
Alphabets 60 60 60 60 59 59 59 58 58 57 57 57 56 56 55 55 52 49 46 43 43 40 38 22 22 22

Note: The I is used in two distinct versions in Turkic languages, dotless (I ı) and dotted (İ i). They are considered different letters, and case conversion must take care to preserve the distinction. Note that Irish traditionally does not write the dot, or tittle, over the small letter i, but the language makes no distinction here if a dot is displayed, so no specific encoding and special case conversion rule is needed like for Turkic alphabets.

Additional letters used in Latin alphabets

Some languages have extended the Latin alphabet with ligatures, modified letters, or digraphs. These symbols are listed below. The characters in the following tables may not all render, depending on which operating system and browser version are used, and the presence or absence of Unicode fonts.

Additional letters by type

Independent letters and ligatures

Additional base letters Æ Ð Ǝ Ə Ɛ Ɣ IJ Ɩ Ŋ Œ Ɔ Ʊ (K‘) (S) [13] Þ Ʋ Ƿ Ȝ ʔ
æɑðǝəɛɣijɩŋœɔʊĸſßþʋƿȝʔ
Danish[9]
Norwegian[9]
Æ
Faroese Æ Ð
Pan-European Æ Ð IJ ŊŒ ĸ ſ ß Þ
Anglo-Saxon Æ Ð Œ Þ Ƿȝ
Icelandic
Norn
Æ Ð Þ
Latin[2]
Celtic British
English[36]
French[12]
Æ Œ
Greenlandic Æ ĸ
German[13] ß
Dalecarlian Ð
Pan-Nigerian Ǝ
Alphabet of Cameroon Æ ƏƐ ŊŒƆ
Alphabet of Benin Ǝ ƐƔ Ŋ Ɔ Ʋ
Alphabet of Burkina Faso Ǝ Ɛ ƖŊ Ɔ Ʋ
Alphabet of Chad ƏƐ Ŋ Ɔ
Alphabet of Côte d'Ivoire Ɛ ƖŊ ƆƱ ʔ
Alphabet of Mali Ǝ ƐƔ Ŋ Ɔ ʔ
Alphabet of Niger Ǝ Ɣ Ŋ
Azeri Ə
Berber ƐƔ
Dutch[10] IJ
Northern Sami Ŋ

Letter–diacritic combinations: connected or overlaid

Modified letters Ą Ɓ Ç Đ Ɗ Ɖ Ę Ȩ Ə̧ Ɛ̧ Ƒ Ħ Į Ɨ Ɨ̧ Ƙ Ł Ɲ Ǫ Ø Ơ Ɔ̧ Ɍ Ş Ţ Ŧ Ų Ư Ʉ Ƴ
ąɓçđɗɖęȩə̧ɛ̧ƒħɦįɨɨ̧ƙłɲǫøơɔ̧ɍşţŧųưʉƴ
Pan-European Ą ÇĐ Ę Ħ Į Ł Ø ŞŢŦŲ
Hän
Navajo
Ą Ę Į Ł Ǫ
Lithuanian[19] Ą Ę Į Ų
Dalecarlian Ą Ę Į Ų
Polish[22] Ą Ę Ł
Kashubian Ą Ł
Pan-Nigerian Ɓ Ɗ Ƙ
Hausa[30] Ɓ Ɗ Ƙ Ƴ
Alphabet of Cameroon Ɓ Ɗ ȨƏ̧Ɛ̧ ƗƗ̧ Ø Ɔ̧ Ƴ
Alphabet of Benin Ɖ Ƒ
Alphabet of Burkina Faso Ɓ Ç Ɗ Ƴ
Alphabet of Chad Ɓ Ɗ Ɨ Ƴ
Alphabet of Mali Ɓ Ɗ Ɲ Ƴ
Alphabet of Niger Ɓ Ɗ Ƙ Ɲ Ɍ Ƴ
Albanian[3]
Arbëresh
Catalan[6]
English[36]
French[12]
Friulian
Portuguese[23]
Walloon[27]
Ç
Azeri
Crimean Tatar
Kurdish
Turkish
Turkmen
Ç Ş
Croatian[7]
Serbian[7]
Đ
Northern Sami Đ Ŧ
Vietnamese[26] Đ Ơ Ư
Maltese[20] Ħ
Belarusian[5]
Sorbian
Ł
Danish[9]
Faroese
Greenlandic
Norn
Norwegian[9]
Ø
Romanian[10] ŞŢ

Other letters in collation order

Note: The tables below are a work in progress. Eventually, table cells with light blue shading will indicate letter forms which do not constitute distinct letters in their associated alphabets. Please help with this task if you have the required linguistic knowledge and technical editing skill.

For the order in which the characters are sorted in each alphabet, see Collating sequence.

Letters derived from A-G

Letter-diacritic combinations (detached) in various Latin alphabets (A–G)
Alphabet Á À Â Ä Ǎ Ă Ā Ã Å Ǻ Ą Æ Ǽ Ǣ Ɓ Ć Ċ Ĉ Č Ç Ď Đ Ɗ Ð É È Ė Ê Ë Ě Ĕ Ē Ę Ǝ Ə Ɛ Ġ Ĝ Ǧ Ğ Ģ Ɣ
áàâäǎăāãåǻąæǽǣɓćċĉčçďđɗðéèėêëěĕēęǝəɛġĝǧğģɣ
Latin[2] ĂĀ Æ ĔĒ
Afrikaans Á ÉÈ ÊË
Albanian[3] Â Ç ÊË
Alemannic ÁÀÂÄ Å ÉÈ Ê
Anglo-Saxon Ā Æ Ǣ Ð Ē
Arbëresh Á Ç É Ë
Asturian Á É
Austro-Bavarian ÁÀÂÄ Å ÉÈ Ê
Azeri Ç Ə Ğ
Belarusian[5] Ć Č
Berber Â Č ƐĠ Ǧ Ɣ
Breton Â É Ê
Cantabrian Á É
Catalan[6] À Ç ÉÈ
Celtic British ĂĀ Æ ĔĒ
Chamorro Å
Corsican[31] À È
Crimean Tatar Â Ç Ğ
Croatian[7] Ć Č Đ
Czech[8] Á Č Ď É Ě
Dalecarlian Ä Å Ą Ð Ę
Danish[9] Å Æ
Dutch[10] ÁÀÂÄ ÉÈ ÊË
English[36] ÀÂÄ Å Æ Ç ÉÈ ÊË
Esperanto Ĉ Ĝ
Estonian Ä
Faroese Á Æ Ð
Filipino[11] ÁÀÂ ÉÈ Ê
Finnish Ä Å
French[12] ÀÂ Æ Ç ÉÈ ÊË
Friulian ÀÂ Ç È Ê
Galician[33] Á É
German[13] Ä
Greenlandic Å Æ
Guaraní[14] Á Ã É
Haitian À È
Hän ÀÂÄǍ Ą È ÊËĚ Ę
Hausa[30] Ɓ Ɗ
Hungarian[15] Á É
Icelandic Á Æ ÐÉ
Irish[16] Á É
Italian[17] À ÉÈ
Kashubian à Ą É Ë
Kurdish Ç Ê
Kurdish (IS) É
Latvian[18] Ā Č Ē Ģ
Leonese Á É
Lithuanian[19] Ą Č Ė Ę
Luxembourgish Ä É Ë
Malagasy ÀÂ È Ê
Maltese[20] À Ċ È Ġ
Māori Ā Ē
Mirandese É
Mohawk ÁÀ ÉÈ
Navajo Á Ą É Ę
Norn Á Å Æ ÐÉ
Northern Sami Á Č Đ
Norwegian[9] ÁÀÂ Å Æ ÉÈ Ê
Pan-European ÁÀÂÄ ĂĀÃÅǺĄÆǼ ĆĊĈČÇĎ Đ ÐÉÈĖÊËĚĔĒĘ ĠĜ ĞĢ
Pan-Nigerian Ɓ Ɗ Ǝ
Piedmontese[37] À ÉÈ Ë
Pinyin[32] ÁÀ Ǎ Ā ÉÈ Ě Ē
Polish[22] Ą Ć Ę
Portuguese[23] ÁÀÂ Ã Ç É Ê
Romani[29] Č
Romanian  Ă
Sardinian ÁÀ Ç ÉÈ
Scots Gaelic À È
Serbian[7] Ć Č Đ
Sicilian ÀÂ È Ê
Slovak[24] Á Ä Č Ď É
Slovenian Č
Sorbian Ć Č Ě
Spanish[25] Á É
Swedish[21] ÁÀ Ä Å ÉÈ
Tetum Á É
Tongan Á Ā É Ē
Turkish Â Ç Ğ
Turkmen Ä Ç
Vietnamese[26] ÁÀÂ Ă Ã Đ ÉÈ Ê
Volapük Ä
Võro Ä
Walloon[27] Â Å Ç ÉÈ Ê
Welsh[28] ÁÀÂÄ ÉÈ ÊË
Wolof À É Ë
Xhosa ÁÀÂÄ ÉÈ ÊË

Letters derived from H-Q

Letter-diacritic combinations (detached) in various Latin alphabets (H–Q)
Alphabet Ĥ Ħ I Í Ì İ Î Ï Ǐ Ĭ Ī Ĩ Į IJ Ĵ Ķ Ƙ (K‘) Ĺ Ļ Ł Ľ Ŀ ʼN Ń Ň Ñ Ņ Ŋ Ó Ò Ô Ö Ǒ Ŏ Ō Õ Ő Ǫ Ø Ǿ Ơ Œ
ĥħıíìiîïǐĭīĩįijĵķƙĸĺļłľŀʼnńňñņŋóòôöǒŏōõőǫøǿơœ
Latin[2] ĬĪ ŎŌ Œ
Afrikaans Í ÎÏ ʼn Ó Ô
Albanian[3] Î Ô
Alemannic ÍÌ Î ÓÒÔÖ
Anglo-Saxon Ī Ō
Arbëresh Í ÓÒ
Asturian Í Ñ Ó
Austro-Bavarian ÍÌ Î ÓÒÔÖ
Azeri ı İ Ö
Basque[4] Ñ
Belarusian[5] Ł Ń
Berber
Breton Î Ñ Ô
Cantabrian Í Ñ Ó
Catalan[6] Í Ï Ŀ ÓÒ
Celtic British ĬĪ ŎŌ Œ
Corsican[31] Ì Ï Ò
Crimean Tatar ı İ Ñ Ö
Czech[8] Í Ň Ó
Dalecarlian Į Ö
Danish[9] Ø
Dutch[10] ÍÌ ÎÏ IJ ÓÒÔÖ
English[36] ÎÏ Ó ÔÖ Œ
Esperanto Ĥ Ĵ
Estonian Ö Õ
Extremaduran Ñ
Fala Í
Faroese Í Ó Ø
Filipino[11] ÍÌ Î Ñ ÓÒÔ
Finnish Ö
French[12] ÎÏ Ô Œ
Friulian Ì Î ÒÔ
Galician[33] Í Ï Ñ Ó
German[13] Ö
Greenlandic Ø
Guaraní[14] Í Ĩ Ñ Ó Õ
Haitian Ò
Hän Ì Î Ǐ Į Ł ÒÔ Ǒ Ǫ
Hausa[30] Ƙ
Hungarian[15] Í Ó Ö Ő
Icelandic Í Ó Ö
Irish[16] Í Ó
Italian[17] Ì Î ÓÒ
Kashubian Ł Ń ÓÒÔ
Kurdish Î
Latvian[18] Ī Ķ Ļ Ņ
Leonese Í Ï Ñ Ó
Lithuanian[19] Į
Malagasy Ì Ñ ÒÔ
Maltese[20] Ħ Ì Ò
Māori Ī Ō
Mohawk ÍÌ ÓÒ
Navajo Í Į Ó Ǫ
Norn Í Ó Ø
Northern Sami Ŋ
Norwegian[9] ÍÌ Î ÓÒÔ Ø
Pan-European Ĥ ĦıÍÌİÎÏ ĬĪĨĮ IJĴĶ ĸĹĻŁĽĿʼnŃ ŇÑŅŊÓÒÔÖ ŎŌÕŐ ØǾ Œ
Pan-Nigerian Ƙ
Piedmontese[37] Ì Ò
Pinyin[32] ÍÌ Ǐ Ī ÓÒ Ǒ Ō
Polish[22] Ł Ń Ó
Portuguese[23] Í Ó Ô Õ
Romanian Î
Sardinian ÍÌ ÓÒ
Scots Gaelic Ì Ò
Sicilian Ì Î ÒÔ
Slovak[24] Í Ĺ Ľ Ň Ó Ô
Sorbian Ł Ń Ó
Spanish[25] Í Ï Ñ Ó
Swedish[21] Ö
Tetum Í Ñ Ó
Tongan Í Ī Ó Ō
Turkish ı İÎ Ö
Turkmen Ň Ö
Vietnamese[26] ÍÌ Ĩ ÓÒÔ Õ Ơ
Volapük Ö
Võro Ö Õ
Walloon[27] Î Ô
Welsh[28] ÍÌ ÎÏ ÓÒÔÖ
Wolof Ñ Ó
Xhosa ÍÌ ÎÏ ÓÒÔÖ

Letters derived from R-Z

Letter-diacritic combinations (detached) in various Latin alphabets (R–Z)
Alphabet Ŕ Ř Ŗ (S) Ś Ŝ Š Ş Ș Ť Ţ Ț Ŧ Þ Ú Ù Û Ü Ǔ Ŭ Ū Ũ Ű Ů Ų Ư Ŵ Ƿ Ý Ŷ Ÿ Ȳ Ƴ Ź Ż Ž
ŕřŗſśŝšşșßťţțŧþúùûüǔŭūũűůųưŵƿýŷÿȳƴźżž
Latin[2] ŬŪ
Afrikaans Ú Û Ý
Albanian[3] Û Ŷ
Alemannic ÚÙÛÜ
Anglo-Saxon Þ Ū Ƿ Ȳ
Arbëresh ÚÙÛ
Asturian Ú Ü
Austro-Bavarian ÚÙÛÜ
Azeri Ş Ü
Basque[4] Ü
Belarusian[5] Ś Š Ŭ Ź Ž
Berber Ř
Breton ÙÛÜ
Cantabrian Ú Ü
Catalan[6] Ú Ü
Celtic British ŬŪ
Corsican[31] Ù
Crimean Tatar Ş Ü
Croatian[7] Š Ž
Czech[8] Ř Š Ť Ú Ů Ý Ž
Dalecarlian Ų
Dutch[10] ÚÙÛÜ
English[36] ÛÜ
Esperanto Ŝ Ŭ
Estonian Š Ü Ž
Faroese Ú Ý
Filipino[11] ÚÙÛ
Finnish Š Ž
French[12] ÙÛÜ Ÿ
Friulian ÙÛ
Galician[33] Ú Ü
German[13] ß Ü
Guaraní[14] Ú Ű Ý
Hän ÙÛ Ǔ Ų
Hausa[30] Ƴ
Hungarian[15] Ú Ü Ű
Icelandic ÞÚ Ý
Irish[16] Ú
Italian[17] Ù
Kashubian Ù Ż
Kurdish Ş Û
Latvian[18] Š Ū Ž
Lithuanian[19] Š Ū Ų Ž
Malagasy
Maltese[20] Ù Ż
Māori Ū
Mirandese Ũ
Norn ÞÚ Ý
Northern Sami Š Ŧ Ž
Norwegian[9] ÚÙÛÜ ÝŶ
Pan-European ŔŘŖſŚŜŠŞ ßŤŢ ŦÞÚÙÛÜ ŬŪŨŰŮŲ Ŵ ÝŶŸ ŹŻŽ
Pan-Nigerian
Piedmontese[37] Ù
Pinyin[32] ÚÙ ÜǓ Ū
Polish[22] Ś ŹŻ
Portuguese[23] Ú
Romani[29] Š Ž
Romanian[10] ŞȘ ŢȚ
Sardinian ÚÙ
Scots Gaelic Ù
Serbian[7] Š Ž
Sicilian ÙÛ
Slovak[24] Ŕ Š Ť Ú Ý Ž
Slovenian Š Ž
Sorbian ŔŘ Ś Š Ź Ž
Spanish[25] Ú Ü
Swedish Ü
Tetum Ú
Tongan Ú Ū
Turkish Ş ÛÜ
Turkmen Ş Ü Ý Ž
Vietnamese[26] ÚÙ Ũ Ư Ý
Volapük Ü
Võro Š Ü Ž
Walloon[27] Û
Welsh[28] ÚÙÛÜ Ŵ ÝŶŸ
Xhosa ÚÙÛÜ

Notes

  1. In classical Latin, the digraphs CH, PH, RH, TH were used in loanwords from Greek, but they were not included in the alphabet. The ligatures Æ, Œ and W, as well as lowercase letters, were added to the alphabet only in Middle Ages. The letters J and U were used as typographical variants of I and V, respectively, roughly until the Enlightenment.
  2. Albanian officially has the digraphs dh, gj, ll, nj, rr, sh, th, xh, zh, which is sufficient to represent the Tosk dialect. The Gheg dialect supplements the official alphabet with 6 nasal vowels, namely â, ê, î, ô, û, ŷ.
  3. Arbëresh apparently requires the digraphs dh, gj, hj, ll, nj, rr, sh, th, xh, zh. Arbëresh has the distinctive hj, which is considered as a letter in its own right.
  4. Basque has several digraphs: dd, ll, rr, ts, tt, tx, tz. The ü, which is pronounced as /ø/, is required for various words in its Zuberoan dialect.
  5. Belarusian also has several digraphs: ch, dz, dź, dž.
  6. Breton also has the digraphs ch, c'h, zh.
  7. Catalan also has a large number of digraphs: dj, gu, ig, ix, ll, l·l, ny, qu, rr, ss, ts, tx, tz.
  8. Corsican has the trigraphs: chj, ghj.
  9. Croatian also has the digraphs: , lj, nj. It can also be written with four tone markers above on top of the vowels. Note that Croatian Latin is the same as Serbian Latin and they both map 1:1 to Serbian Cyrillic, where the three digraphs map to Cyrillic letters џ, љ and њ, respectively. Rarely and non-standardly, digraph dj is used instead of đ (Cyrillic ђ).
  10. Czech also has the digraph ch, which is considered a separate letter and is sorted between h and i. While á, ď, é, ě, í, ň, ó, ť, ú, ů, and ý are considered separate letters, in collation they are treated merely as letters with diacritics. However, č, ř, š, and ž are actually sorted as separate letters.
  11. The Norwegian alphabet is currently identical with the Danish alphabet. C is part of both alphabets and is used in native Danish, but not in native Norwegian. Norwegian and Danish uses é in "én" and more uses, although é is considered a diacritic mark, while å, æ and ø are letters. Q, w, x, z are not used except for names and some foreign words.
  12. The status of ij as a letter, ligature or digraph in Dutch is disputed.
  13. English generally now uses extended Latin letters only in loan words. Rare publication guides may still use the dieresis on words, such as "coöperate", rather than the now-more-common "co-operate" (UK) or "cooperate" (US). For a fuller discussion, see articles branching from Lists of English words of international origin, which was used to determine the diacritics needed for more unambiguous English. However, an é or è is sometimes used in poetry to show that a normally silent vowel is to be pronounced, as in "Blessèd".
  14. Filipino also known as Tagalog also use the digraph ng, even originally with a large tilde that spanned both n and g (as in n͠g) when a vowel follows the digraph. (The use of the tilde over the two letters is now rare).
  15. Uppercase diacritics in French are often thought as being dispensable, while they are obligatory. Many pairs or triplets are read as digraphs or trigraphs depending on context, but are not treated as such lexicographically: consonants ph, (ng), th, gu/gü, qu, ce, ch/(sh/sch), rh; vocal vowels (ee), ai/ay, ei/ey, eu, au/eau, ou; nasal vowels ain/aim, in/im/ein, un/um/eun, an/am, en/em, om/on; the half-consonant -(i)ll-; half-consonant and vowel pairs oi, oin/ouin, ien, ion. When rules that govern the French orthography are not observed, they are read as separate letters, or using an approximating phonology of a foreign language for loan words, and there are many exceptions. In addition, most final consonants are mute (including those consonants that are part of feminine, plural, and conjugation endings).
  16. Galician. The standard of 1982 set also the digraphs gu, qu (both always before e and i), ch, ll, nh and rr. In addition, the standard of 2003 added the grapheme ao as an alternative writing of ó. Although not marked (or forgotten) in the list of digraphs, they are used to represent the same sound, so the sequence ao should be considered as a digraph. Note also that nh represents a velar nasal (not a palatal as in Portuguese) and is restricted only to three feminine words, being either demonstrative or pronoun: unha ('a' and 'one'), algunha ('some') and ningunha ('not one'). The Galician reintegracionismo movement uses it as in Portuguese.
  17. German also retains most original letters in French loan words. Swiss German does not use ß any more. The long s (ſ) was in use until the mid-20th century. Sch is usually not treated like a true trigraph, neither are ch and qu digraphs. Q only appears in the sequence qu, while y is found only (and x almost only) in loan words. The capital ß () is almost never used, ß is replaced with SS when writing all-caps.
  18. Guaraní also uses tilde over e and g (the last one not available precomposed in Unicode), as well as digraphs ch, mb, nd, ng, nt, rr and the glottal stop ' .
  19. Hausa has the digraphs: sh, ts.
  20. Hungarian also has the digraphs: cs, dz, gy, ly, ny, sz, ty, zs; and the trigraph: dzs. Letters á, é, í, ó, ő, ú, and ű are considered separate letters, but are collated as variants of a, e, i, o, ö, u, and ü.
  21. Irish formerly used the dot diacritic in ḃ, ċ, ḋ, ḟ, ġ, ṁ, ṗ, ṡ, ṫ. These have been replaced by the digraphs: bh, ch, dh, fh, gh, mh, ph, sh, th except for in formal instances.
  22. Italian also has the digraphs: ch, gh, gn, gl, sc. J, K, W, X, Y are used in foreign words. X is also used for native words derived from Latin and Greek; J is also used for just a few native words, mainly names of persons (as in Jacopo) or of places (as in Jesolo and Jesi), in which is always pronounced as letter I.
  23. Karakalpak also has the digraphs: ch, sh. A', G', I', N', O', U' are considered as letters. C, F, H, V, X are used in foreign words.
  24. Latvian also has the digraphs: dz, dž, ie. Dz and are occasionally considered separate letters of the alphabet in more archaic examples, which have been published as recently as the 1950s; however, modern alphabets and teachings discourage this due to an ongoing effort to set decisive rules for Latvian and eliminate barbaric words accumulated during the Soviet occupation. The digraph "ie" is never considered a separate letter.
  25. Lithuanian also has the digraphs: ch, dz, dž, ie, uo. However, these are not considered separate letters of the alphabet.
  26. Maltese also has the digraphs: ie, għ.
  27. Māori uses g only in ng digraph. Wh is also a digraph.
  28. Some Mohawk speakers use orthographic i in place of the consonant y. The glottal stop is indicated with an apostrophe and long vowels are written with a colon :.
  29. Piedmontese also uses the letter n- to indicate a velar nasal N-sound (pronounced as the gerundive termination in going), which usually precedes a vowel, as in lun-a [moon].
  30. Pinyin has four tone markers that can go on top of any of the six vowels (a, e, i, o, u, ü); e.g.: macron (ā, ē, ī, ō, ū, ǖ), acute accent (á, é, í, ó, ú, ǘ), caron (ǎ, ě, ǐ, ǒ, ǔ, ǚ), grave accent (à, è, ì, ò, ù, ǜ). It also uses the digraphs: ch, sh, zh.
  31. Polish also has the digraphs: ch, cz, dz, dż, dź, sz, rz.
  32. Portuguese also uses the digraphs ch, lh, nh, rr, ss. The trema on ü was used in Brazilian Portuguese before 2009. Neither the digraphs nor accented letters are considered part of the alphabet.
  33. Romanian normally uses a comma diacritic below the letters s and t (ș, ț), but it is frequently replaced with an attached cedilla below these letters (ş, ţ) due to past lack of standardization.
  34. Romani has the digraphs: čh, dž, kh, ph, th.
  35. Slovak also has the digraphs dz, dž, and ch, which are considered separate letters While á, ä, ď, é, í, ĺ, ň, ó, ô, ŕ, ť, ú, and ý are considered separate letters, in collation they are treated merely as letters with diacritics. However, č, ľ, š, and ž, as well as the digraphs, are actually sorted as separate letters.
  36. Spanish uses several digraphs to represent single sounds: ch, gu (preceding e or i), ll, qu, rr; of these, the digraphs ch and ll were traditionally considered individual letters with their own name (che, elle) and place in the alphabet (after c and l, respectively), but in order to facilitate international compatibility the Royal Spanish Academy decided to cease this practice in 1994 and all digraphs are now collated as combinations of two separate characters. The c-cedilla ç used earlier has been replaced completely by z.
  37. Swedish uses é in well integrated loan words like idé and armé, although é is considered a modified e, while å, ä, ö are letters. á and à are rarely used words. W and z are used in some integrated words like webb and zon. Q, ü, è are used for names only, but exist in Swedish names. For foreign names ó, ë, ñ and more are sometimes used, but usually not. Swedish has many digraphs and some trigraphs. ch, dj, lj, rl, rn, rs, sj, sk, si, ti, sch, skj, stj and others are usually pronounced as one sound.
  38. Uzbek also has the digraphs: ch, ng, sh considered as letters. C used only in digraphs. G', O' and apostrophe (') are considered as letters. These letters have preferred typographical variants: Gʻ, Oʻ and ʼ respectively.
  39. Vietnamese has seven additional base letters: ă â đ ê ô ơ ư. It uses five tone markers that can go on top (or below) any of the 12 vowels (a, ă, â, e, ê, i, o, ô, ơ, u, ư, y); e.g.: grave accent (à, ằ, ầ, è, ề, ì, ò, ồ, ờ, ù, ừ, ỳ), hook above (ả, ẳ, ẩ, ẻ, ể, ỉ, ỏ, ổ, ở, ủ, ử, ỷ), tilde (ã, ẵ, ẫ, ẽ, ễ, ĩ, õ, ỗ, ỡ, ũ, ữ, ỹ), acute accent (á, ắ, ấ, é, ế, í, ó, ố, ớ, ú, ứ, ý), and dot below (ạ, ặ, ậ, ẹ, ệ, ị, ọ, ộ, ợ, ụ, ự, ỵ). It also uses several digraphs and trigraphs  ch, gh, gi, kh, ng, ngh, nh, ph, th, tr  but they are no longer considered letters.
  40. Walloon has the digraphs and trigraphs: ae, ch, dj, ea, jh, oe, oen, oi, sch, sh, tch, xh; the letter x is almost only used in xh digraph, the letter j is almost only used in dj and jh digraphs.
  41. Welsh has the digraphs ch, dd, ff, ng, ll, ph, rh, th. It also occasionally uses circumflexes, diaereses, acute accents and grave accents on its seven vowels (a, e, i, o, u, w, y), but accented characters are not regarded as separate letters of the alphabet.
  42. Xhosa has a large number of digraphs, trigraphs, and even one tetragraph are used to represent various phonemes: bh, ch, dl, dy, dz, gc, gq, gr, gx, hh, hl, kh, kr, lh, mb, mf, mh, nc, ndl, ndz, ng, ng', ngc, ngh, ngq, ngx, nh, nkc, nkq, nkx, nq, nx, ntl, ny, nyh, ph, qh, rh, sh, th, ths, thsh, ts, tsh, ty, tyh, wh, xh, yh, zh. It also occasionally uses acute accents, grave accents, circumflexes, and diaereses on its five vowels (a, e, i, o, u), but accented characters are not regarded as separate letters of the alphabet.

Miscellanea

See also

Footnotes

  1. As defined in ISO/IEC 646 based on ASCII which was based on the 26 letters of the English alphabet and previous telecommunications standards, and used in later ISO standards, see Latin characters in Unicode.

External links

This article is issued from Wikipedia - version of the Monday, April 25, 2016. The text is available under the Creative Commons Attribution/Share Alike but additional terms may apply for the media files.