Jump to content

Latin Extended-B

From Wikipedia, the free encyclopedia
Latin Extended-B
RangeU+0180..U+024F
(208 code points)
PlaneBMP
ScriptsLatin
Major alphabetsAfrica alphabet
Americanist
Khoisan
Pan-Nigerian
Pinyin
Romanian
Assigned208 code points
Unused0 reserved code points
Unicode version history
1.0.0 (1991)113 (+113)
1.1 (1993)148 (+35)
3.0 (1999)178 (+30)
3.2 (2002)179 (+1)
4.0 (2003)183 (+4)
4.1 (2005)194 (+11)
5.0 (2006)208 (+14)
Unicode documentation
Code chart ∣ Web page
Note: Block range was extended by 80 code points in Unicode 1.1 during the unification with ISO 10646.[1][2]

Latin Extended-B is the fourth block (0180-024F) of the Unicode Standard. It has been included since version 1.0, where it was only allocated to the code points 0180-01FF and contained 113 characters. During unification with ISO 10646 for version 1.1, the block range was extended by 80 code points and another 35 characters were assigned. In version 3.0 and later, the last 60 available code points in the block were assigned. Its block name in Unicode 1.0 was Extended Latin.[3]

Character table[edit]

Code Glyph Decimal Description
Non-European and historic Latin
U+0180 ƀ Latin Small Letter B with Stroke
U+0181 Ɓ Latin Capital Letter B with Hook
U+0182 Ƃ Latin Capital Letter B with Top Bar
U+0183 ƃ Latin Small Letter B with Top Bar
U+0184 Ƅ Latin Capital Letter Tone Six
U+0185 ƅ Latin Small Letter Tone Six
U+0186 Ɔ Latin Capital Letter Open O
U+0187 Ƈ Latin Capital Letter C with Hook
U+0188 ƈ Latin Small Letter C with Hook
U+0189 Ɖ Latin Capital Letter African D
U+018A Ɗ Latin Capital Letter D with Hook
U+018B Ƌ Latin Capital Letter D with Top Bar
U+018C ƌ Latin Small Letter D with Top Bar
U+018D ƍ Latin Small Letter Turned Delta
U+018E Ǝ Latin Capital Letter Reversed E
U+018F Ə Latin Capital Letter Schwa
U+0190 Ɛ Latin Capital Letter Open E (= Latin Capital Letter Epsilon)
U+0191 Ƒ Latin Capital Letter F with Hook
U+0192 ƒ Latin Small Letter F with Hook
U+0193 Ɠ Latin Capital Letter G with Hook
U+0194 Ɣ Latin Capital Letter Gamma
U+0195 ƕ Latin Small Letter HV
U+0196 Ɩ Latin Capital Letter Iota
U+0197 Ɨ Latin Capital Letter I with Stroke
U+0198 Ƙ Latin Capital Letter K with Hook
U+0199 ƙ Latin Small Letter K with Hook
U+019A ƚ Latin Small Letter L with Bar
U+019B ƛ Latin Small Letter Lambda with Stroke
U+019C Ɯ Latin Capital Letter Turned M
U+019D Ɲ Latin Capital Letter N with Left Hook
U+019E ƞ Latin Small Letter N with Long Right Leg
U+019F Ɵ Latin Capital Letter O with Middle Tilde
U+01A0 Ơ Latin Capital Letter O with Horn
U+01A1 ơ Latin Small Letter O with Horn
U+01A2 Ƣ Latin Capital Letter OI (= Latin Capital Letter Gha)
U+01A3 ƣ Latin Small Letter OI (= Latin Small Letter Gha)
U+01A4 Ƥ Latin Capital Letter P with Hook
U+01A5 ƥ Latin Small Letter P with Hook
U+01A6 Ʀ Latin Letter YR
U+01A7 Ƨ Latin Capital Letter Tone Two
U+01A8 ƨ Latin Small Letter Tone Two
U+01A9 Ʃ Latin Capital Letter Esh
U+01AA ƪ Latin Letter Reversed Esh Loop
U+01AB ƫ Latin Small Letter T with Palatal Hook
U+01AC Ƭ Latin Capital Letter T with Hook
U+01AD ƭ Latin Small Letter T with Hook
U+01AE Ʈ Latin Capital Letter T with Retroflex Hook
U+01AF Ư Latin Capital Letter U with Horn
U+01B0 ư Latin Small Letter U with Horn
U+01B1 Ʊ Latin Capital Letter Upsilon
U+01B2 Ʋ Latin Capital Letter V with Hook
U+01B3 Ƴ Latin Capital Letter Y with Hook
U+01B4 ƴ Latin Small Letter Y with Hook
U+01B5 Ƶ Latin Capital Letter Z with Stroke
U+01B6 ƶ Latin Small Letter Z with Stroke
U+01B7 Ʒ Latin Capital Letter Ezh
U+01B8 Ƹ Latin Capital Letter Ezh Reversed
U+01B9 ƹ Latin Small Letter Ezh Reversed
U+01BA ƺ Latin Small Letter Ezh with Tail
U+01BB ƻ Latin Letter Two with Stroke
U+01BC Ƽ Latin Capital Letter Tone Five
U+01BD ƽ Latin Small Letter Tone Five
U+01BE ƾ Latin Letter Inverted Glottal Stop with Stroke
U+01BF ƿ Latin Letter Wynn
African letters for clicks
U+01C0 ǀ Latin Letter Dental Click
U+01C1 ǁ Latin Letter Lateral Click
U+01C2 ǂ Latin Letter Alveolar Click
U+01C3 ǃ Latin Letter Retroflex Click
Croatian digraphs matching Serbian Cyrillic letters
U+01C4 DŽ Latin Capital Letter DZ with Caron
U+01C5 Dž Latin Capital Letter D with Small Letter Z with Caron
U+01C6 dž Latin Small Letter DZ with Caron
U+01C7 LJ Latin Capital Letter LJ
U+01C8 Lj Latin Capital Letter L with Small Letter J
U+01C9 lj Latin Small Letter LJ
U+01CA NJ Latin Capital Letter NJ
U+01CB Nj Latin Capital Letter N with Small Letter J
U+01CC nj Latin Small Letter NJ
Pinyin diacritic-vowel combinations
U+01CD Ǎ Latin Capital Letter A with Caron
U+01CE ǎ Latin Small Letter A with Caron
U+01CF Ǐ Latin Capital Letter I with Caron
U+01D0 ǐ Latin Small Letter I with Caron
U+01D1 Ǒ Latin Capital Letter O with Caron
U+01D2 ǒ Latin Small Letter O with Caron
U+01D3 Ǔ Latin Capital Letter U with Caron
U+01D4 ǔ Latin Small Letter U with Caron
U+01D5 Ǖ Latin Capital Letter U with Diaeresis and Macron
U+01D6 ǖ Latin Small Letter U with Diaeresis and Macron
U+01D7 Ǘ Latin Capital Letter U with Diaeresis and Acute
U+01D8 ǘ Latin Small Letter U with Diaeresis and Acute
U+01D9 Ǚ Latin Capital Letter U with Diaeresis and Caron
U+01DA ǚ Latin Small Letter U with Diaeresis and Caron
U+01DB Ǜ Latin Capital Letter U with Diaeresis and Grave
U+01DC ǜ Latin Small Letter U with Diaeresis and Grave
Phonetic and historic letters
U+01DD ǝ Latin Small Letter Turned E
U+01DE Ǟ Latin Capital Letter A with Diaeresis and Macron
U+01DF ǟ Latin Small Letter A with Diaeresis and Macron
U+01E0 Ǡ Latin Capital Letter A with Dot Above and Macron
U+01E1 ǡ Latin Small Letter A with Dot Above and Macron
U+01E2 Ǣ Latin Capital Letter AE with Macron
U+01E3 ǣ Latin Small Letter AE with Macron
U+01E4 Ǥ Latin Capital Letter G with Stroke
U+01E5 ǥ Latin Small Letter G with Stroke
U+01E6 Ǧ Latin Capital Letter G with Caron
U+01E7 ǧ Latin Small Letter G with Caron
U+01E8 Ǩ Latin Capital Letter K with Caron
U+01E9 ǩ Latin Small Letter K with Caron
U+01EA Ǫ Latin Capital Letter O with Ogonek
U+01EB ǫ Latin Small Letter O with Ogonek
U+01EC Ǭ Latin Capital Letter O with Ogonek and Macron (=Latin Capital Letter O with Macron and Ogonek)
U+01ED ǭ Latin Small Letter O with Ogonek and Macron (=Latin Small Letter O with Macron and Ogonek)
U+01EE Ǯ Latin Capital Letter Ezh with Caron
U+01EF ǯ Latin Small Letter Ezh with Caron
U+01F0 ǰ Latin Small Letter J with Caron
U+01F1 DZ Latin Capital Letter DZ
U+01F2 Dz Latin Capital Letter D with Small Letter Z
U+01F3 dz Latin Small Letter DZ
U+01F4 Ǵ Latin Capital Letter G with Acute
U+01F5 ǵ Latin Small Letter G with Acute
U+01F6 Ƕ Latin Capital Letter Hwair
U+01F7 Ƿ Latin Capital Letter Wynn
U+01F8 Ǹ Latin Capital Letter N with Grave
U+01F9 ǹ Latin Small Letter N with Grave
U+01FA Ǻ Latin Capital Letter A with Ring Above and Acute
U+01FB ǻ Latin Small Letter A with Ring Above and Acute
U+01FC Ǽ Latin Capital Letter AE with Acute
U+01FD ǽ Latin Small Letter AE with Acute
U+01FE Ǿ Latin Capital Letter O with Stroke and Acute
U+01FF ǿ Latin Small Letter O with Stroke and Acute
Additions for Slovenian and Croatian
U+0200 Ȁ Latin Capital Letter A with Double Grave
U+0201 ȁ Latin Small Letter A with Double Grave
U+0202 Ȃ Latin Capital Letter A with Inverted Breve
U+0203 ȃ Latin Small Letter A with Inverted Breve
U+0204 Ȅ Latin Capital Letter E with Double Grave
U+0205 ȅ Latin Small Letter E with Double Grave
U+0206 Ȇ Latin Capital Letter E with Inverted Breve
U+0207 ȇ Latin Small Letter E with Inverted Breve
U+0208 Ȉ Latin Capital Letter I with Double Grave
U+0209 ȉ Latin Small Letter I with Double Grave
U+020A Ȋ Latin Capital Letter I with Inverted Breve
U+020B ȋ Latin Small Letter I with Inverted Breve
U+020C Ȍ Latin Capital Letter O with Double Grave
U+020D ȍ Latin Small Letter O with Double Grave
U+020E Ȏ Latin Capital Letter O with Inverted Breve
U+020F ȏ Latin Small Letter O with Inverted Breve
U+0210 Ȑ Latin Capital Letter R with Double Grave
U+0211 ȑ Latin Small Letter R with Double Grave
U+0212 Ȓ Latin Capital Letter R with Inverted Breve
U+0213 ȓ Latin Small Letter R with Inverted Breve
U+0214 Ȕ Latin Capital Letter U with Double Grave
U+0215 ȕ Latin Small Letter U with Double Grave
U+0216 Ȗ Latin Capital Letter U with Inverted Breve
U+0217 ȗ Latin Small Letter U with Inverted Breve
Additions for Romanian
U+0218 Ș Latin Capital Letter S with Comma Below
U+0219 ș Latin Small Letter S with Comma Below
U+021A Ț Latin Capital Letter T with Comma Below
U+021B ț Latin Small Letter T with Comma Below
Miscellaneous additions
U+021C Ȝ Latin Capital Letter Yogh
U+021D ȝ Latin Small Letter Yogh
U+021E Ȟ Latin Capital Letter H with Caron
U+021F ȟ Latin Small Letter H with Caron
U+0220 Ƞ Latin Capital Letter N with Long Right Leg
U+0221 ȡ Latin Small Letter D with Curl
U+0222 Ȣ Latin Capital Letter OU
U+0223 ȣ Latin Small Letter OU
U+0224 Ȥ Latin Capital Letter Z with Hook
U+0225 ȥ Latin Small Letter Z with Hook
U+0226 Ȧ Latin Capital Letter A with Dot Above
U+0227 ȧ Latin Small Letter A with Dot Above
U+0228 Ȩ Latin Capital Letter E with Cedilla
U+0229 ȩ Latin Small Letter E with Cedilla
Additions for Livonian
U+022A Ȫ Latin Capital Letter O with Diaeresis and Macron
U+022B ȫ Latin Small Letter O with Diaeresis and Macron
U+022C Ȭ Latin Capital Letter O with Tilde and Macron
U+022D ȭ Latin Small Letter O with Tilde and Macron
U+022E Ȯ Latin Capital Letter O with Dot Above
U+022F ȯ Latin Small Letter O with Dot Above
U+0230 Ȱ Latin Capital Letter O with Dot Above and Macron
U+0231 ȱ Latin Small Letter O with Dot Above and Macron
U+0232 Ȳ Latin Capital Letter Y with Macron
U+0233 ȳ Latin Small Letter Y with Macron
Additions for Sinology
U+0234 ȴ Latin Small Letter L with Curl
U+0235 ȵ Latin Small Letter N with Curl
U+0236 ȶ Latin Small Letter T with Curl
Miscellaneous addition
U+0237 ȷ Latin Small Letter Dotless J
Additions for Africanist linguistics
U+0238 ȸ Latin Small Letter DB Digraph
U+0239 ȹ Latin Small Letter QP Digraph
Additions for Sencoten
U+023A Ⱥ Latin Capital Letter A with Stroke
U+023B Ȼ Latin Capital Letter C with Stroke
U+023C ȼ Latin Small Letter C with Stroke
U+023D Ƚ Latin Capital Letter L with Bar
U+023E Ⱦ Latin Capital Letter T with Diagonal Stroke
Additions for Africanist linguistics
U+023F ȿ Latin Small Letter S with Swash Tail
U+0240 ɀ Latin Small Letter Z with Swash Tail
Miscellaneous additions
U+0241 Ɂ Latin Capital Letter Glottal Stop
U+0242 ɂ Latin Small Letter Glottal Stop
U+0243 Ƀ Latin Capital Letter B with Stroke
U+0244 Ʉ Latin Capital Letter U Bar
U+0245 Ʌ Latin Capital Letter Turned V
U+0246 Ɇ Latin Capital Letter E with Stroke
U+0247 ɇ Latin Small Letter E with Stroke
U+0248 Ɉ Latin Capital Letter J with Stroke
U+0249 ɉ Latin Small Letter J with Stroke
U+024A Ɋ Latin Capital Letter Q with Hook Tail
U+024B ɋ Latin Small Letter Q with Hook Tail
U+024C Ɍ Latin Capital Letter R with Stroke
U+024D ɍ Latin Small Letter R with Stroke
U+024E Ɏ Latin Capital Letter Y with Stroke
U+024F ɏ Latin Small Letter Y with Stroke

Subheadings[edit]

The Latin Extended-B block contains ten subheadings for groups of characters: Non-European and historic Latin, African letters for clicks, Croatian digraphs matching Serbian Cyrillic letters, Pinyin diacritic-vowel combinations, Phonetic and historic letters, Additions for Slovenian and Croatian, Additions for Romanian, Miscellaneous additions, Additions for Livonian, and Additions for Sinology. The Non-European and historic, African clicks, Croatian digraphs, Pinyin, and the first part of the Phonetic and historic letters were present in Unicode 1.0; additional Phonetic and historic letters were added for version 3.0; and other Phonetic and historic, as well as the rest of the sub-blocks were the characters added for version 1.1.

Non-European and historic Latin[edit]

The Non-European and historic Latin subheading contains the first 64 characters of the block, and includes various variant letters for use in Zhuang, Americanist phonetic transcription, African languages, and other Latin script alphabets. It does not contain any standard letters with diacritics.

African letters for clicks[edit]

The four African letters for clicks are used in Khoisan orthography.

Croatian digraphs matching Serbian Cyrillic letters[edit]

The Croatian digraphs matching Serbian Cyrillic letters are three sets of three case mappings (lower case, upper case, and title case) of Latin digraphs used for compatibility with Cyrillic texts, Serbo-Croatian being a digraphic language.

Pinyin diacritic-vowel combinations[edit]

The 16 Pinyin diacritic-vowel combinations are used to represent the standard Mandarin Chinese vowel sounds with tone marks.

Phonetic and historic letters[edit]

The 35 Phonetic and historic letters are largely various standard and variant Latin letters with diacritic marks.

Additions for Slovenian and Croatian[edit]

The 24 Additions for Slovenian and Croatian are all standard Latin letters with unusual diacritics, like the double grave and inverted breve.

Additions for Romanian[edit]

The Additions for Romanian are 4 characters that were erroneously unified as having a cedilla, when they have a comma below. The conflation of S and T with cedilla vs. comma below continues to plague Romanian language implementation up to the present.[4]

Miscellaneous additions[edit]

The Miscellaneous additions subheading contains 39 characters of various description and origin.

Additions for Livonian[edit]

The Additions for Livonian are 10 letters with diacritics for writing the Livonian language.

Additions for Sinology[edit]

The Additions for Sinology are three lowercase letters with curls used in the study of classical Chinese language.

Additions for Africanist linguistics[edit]

The Additions for Africanist linguistics are two lowercase letter with swash tails used in Africanist linguistics.

Additions for Sencoten[edit]

The Additions for Sencoten are 5 letters with strokes for writing Saanich.

Number of letters[edit]

The following table shows the number of letters in the Latin Extended-B block.

Type of subheading Number of symbols Range of characters
Non-European and historic Latin 64 various letters for use in Zhuang, Americanist phonetic transcription, African languages, and other Latin script alphabets. U+0180 to U+01BF
African letters for clicks Four African letters for clicks are used in Khoisan orthography. U+01C0 to U+01C3
Croatian digraphs matching Serbian Cyrillic letters Three sets of three case mappings (lower case, upper case, and title case) of Latin digraphs used for compatibility with Cyrillic texts. U+01C4 to U+01CC
Pinyin diacritic-vowel combinations Sixteen diacritic-vowel combinations which are used to represent the standard Mandarin Chinese vowel sounds with tone marks. U+01CD to U+01DC
Phonetic and historic letters 35 Phonetic and historic letters which are largely various standard and variant Latin letters with diacritic marks. U+01DD to U+01FF
Additions for Slovenian and Croatian 24 Additions for Slovenian and Croatian are all standard Latin letters with unusual diacritics, like the double grave and inverted breve. U+0200 to U+0217
Additions for Romanian 4 characters that were erroneously unified as having a cedilla, when they have a comma below. U+0218 to U+021B
Miscellaneous additions 14 characters of various description and origin. U+021C to U+0229
Additions for Livonian 10 letters with diacritics for writing the Livonian language. U+022A to U+0233
Additions for Sinology Three lowercase letters with curls used in the study of classical Chinese language. U+0234 to U+0236

Compact table[edit]

Latin Extended-B[1]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+018x ƀ Ɓ Ƃ ƃ Ƅ ƅ Ɔ Ƈ ƈ Ɖ Ɗ Ƌ ƌ ƍ Ǝ Ə
U+019x Ɛ Ƒ ƒ Ɠ Ɣ ƕ Ɩ Ɨ Ƙ ƙ ƚ ƛ Ɯ Ɲ ƞ Ɵ
U+01Ax Ơ ơ Ƣ ƣ Ƥ ƥ Ʀ Ƨ ƨ Ʃ ƪ ƫ Ƭ ƭ Ʈ Ư
U+01Bx ư Ʊ Ʋ Ƴ ƴ Ƶ ƶ Ʒ Ƹ ƹ ƺ ƻ Ƽ ƽ ƾ ƿ
U+01Cx ǀ ǁ ǂ ǃ DŽ Dž dž LJ Lj lj NJ Nj nj Ǎ ǎ Ǐ
U+01Dx ǐ Ǒ ǒ Ǔ ǔ Ǖ ǖ Ǘ ǘ Ǚ ǚ Ǜ ǜ ǝ Ǟ ǟ
U+01Ex Ǡ ǡ Ǣ ǣ Ǥ ǥ Ǧ ǧ Ǩ ǩ Ǫ ǫ Ǭ ǭ Ǯ ǯ
U+01Fx ǰ DZ Dz dz Ǵ ǵ Ƕ Ƿ Ǹ ǹ Ǻ ǻ Ǽ ǽ Ǿ ǿ
U+020x Ȁ ȁ Ȃ ȃ Ȅ ȅ Ȇ ȇ Ȉ ȉ Ȋ ȋ Ȍ ȍ Ȏ ȏ
U+021x Ȑ ȑ Ȓ ȓ Ȕ ȕ Ȗ ȗ Ș ș Ț ț Ȝ ȝ Ȟ ȟ
U+022x Ƞ ȡ Ȣ ȣ Ȥ ȥ Ȧ ȧ Ȩ ȩ Ȫ ȫ Ȭ ȭ Ȯ ȯ
U+023x Ȱ ȱ Ȳ ȳ ȴ ȵ ȶ ȷ ȸ ȹ Ⱥ Ȼ ȼ Ƚ Ⱦ ȿ
U+024x ɀ Ɂ ɂ Ƀ Ʉ Ʌ Ɇ ɇ Ɉ ɉ Ɋ ɋ Ɍ ɍ Ɏ ɏ
Notes
1.^ As of Unicode version 15.1

History[edit]

The following Unicode-related documents record the purpose and process of defining specific characters in the Latin Extended-B block:

See also[edit]

References[edit]

  1. ^ "Unicode character database". The Unicode Standard. Retrieved 2023-07-26.
  2. ^ "Enumerated Versions of The Unicode Standard". The Unicode Standard. Retrieved 2023-07-26.
  3. ^ "3.8: Block-by-Block Charts" (PDF). The Unicode Standard. version 1.0. Unicode Consortium.
  4. ^ Kaplan, Michael. "The history of messing up Romanian on computers". Sorting it all out.