Latin-1 Supplement

{{Confuse|ISO/IEC 8859-1}}

{{For|a list of all Latin characters encoded in Unicode|Latin script in Unicode}}

{{Also|Basic Latin (Unicode block)}}

{{Infobox Unicode block

|blockname = Latin-1 Supplement
{{nobold|1=or}}
C1 Controls and Latin-1 Supplement

|rangestart = 0080

|rangeend = 00FF

|script1 = {{nowrap|Latin (64 char.)}}

|script2 = {{nowrap|Common (64 char.)}}

|symbols = Punctuation
Mathematics
Currency

|alphabets = French
German
Icelandic
Portuguese
Spanish

|1_0_0 = 128

|controls = 33

|sources = ISO/IEC 8859-1

|note = {{cite web|url=https://www.unicode.org/ucd/|title=Unicode character database|work=The Unicode Standard|accessdate=2023-07-26}}{{cite web|url=https://www.unicode.org/versions/enumeratedversions.html|title=Enumerated Versions of The Unicode Standard|work=The Unicode Standard|accessdate=2023-07-26}}

}}

The Latin-1 Supplement (also called C1 Controls and Latin-1 Supplement) is the second Unicode block in the Unicode standard. It encodes the upper range of ISO 8859-1: 80 (U+0080) – FF (U+00FF). C1 Controls (0080–009F) are not graphic. This block ranges from U+0080 to U+00FF, contains 128 characters and includes the C1 controls, Latin-1 punctuation and symbols, 30 pairs of majuscule and minuscule accented Latin characters and 2 mathematical operators.

The C1 Controls and Latin-1 Supplement block has been included in its present form, with the same character repertoire since version 1.0 of the Unicode Standard.{{cite book|title=The Unicode Standard Version 1.0, Volume 1|orig-year=1990|year=1991|publisher=Addison-Wesley Publishing Company, Inc.|isbn=0-201-56788-1}} Its block name in Unicode 1.0 was simply Latin1.{{cite web |url=https://www.unicode.org/versions/Unicode1.0.0/CodeCharts2.pdf |work=The Unicode Standard |version=version 1.0 |title=3.8: Block-by-Block Charts |publisher=Unicode Consortium}}

Character table

class="wikitable"

!Code

!Result

!Description

!Acronym

colspan="4" | C1 Controls
U+0080

|

| Padding Character

PAD
U+0081

|

| High Octet Preset

HOP
U+0082

|

| Break Permitted Here

BPH
U+0083

|

| No Break Here

NBH
U+0084

|

| Index

IND
U+0085

|

| Next Line

NEL
U+0086

|

| Start of Selected Area

SSA
U+0087

|

| End of Selected Area

ESA
U+0088

|

| Character (Horizontal) Tabulation Set

HTS
U+0089

|

| Character (Horizontal) Tabulation with Justification

HTJ
U+008A

|

| Line (Vertical) Tabulation Set

LTS
U+008B

|

| Partial Line Forward (Down)

PLD
U+008C

|

| Partial Line Backward (Up)

PLU
U+008D

|

| Reverse Line Feed (Index)

RI
U+008E

|

| Single-Shift Two

SS2
U+008F

|

| Single-Shift Three

SS3
U+0090

|

| Device Control String

DCS
U+0091

|

| Private Use One

PU1
U+0092

|

| Private Use Two

PU2
U+0093

|

| Set Transmit State

STS
U+0094

|

| Cancel Character

CCH
U+0095

|

| Message Waiting

MW
U+0096

|

| Start of Protected Area

SPA
U+0097

|

| End of Protected Area

EPA
U+0098

|

| Start of String

SOS
U+0099

|

| Single Graphic Character Introducer

SGCI
U+009A

|

| Single Character Introducer

SCI
U+009B

|

| Control Sequence Introducer

CSI
U+009C

|

| String Terminator

ST
U+009D

|

| Operating System Command

OSC
U+009E

|

| Private Message

PM
U+009F

|

| Application Program Command

APC
colspan=4 | Latin-1 Punctuation and Symbols
U+00A0

|  

| Non-breaking space

NBSP
U+00A1

|Inverted exclamation mark

U+00A2

|Cent sign

U+00A3

|Pound sign

U+00A4

|Currency sign

U+00A5

|Yen sign

U+00A6

|Broken bar

U+00A7

|Section sign

U+00A8

|Diaeresis

U+00A9

|Copyright sign

U+00AA

|Feminine ordinal indicator

U+00AB

|Left-pointing double angle quotation mark

U+00AC

|Not sign

U+00AD

|

|Soft hyphen

SHY
U+00AE

|Registered sign

U+00AF

|Macron

U+00B0

|Degree symbol

U+00B1

|Plus-minus sign

U+00B2

|{{not a typo|²}}

|Superscript two

U+00B3

|Superscript three

U+00B4

|{{not a typo|´}}

|Acute accent

U+00B5

|{{not a typo|µ}}

|Micro sign

U+00B6

|Pilcrow sign

U+00B7

|Middle dot

U+00B8

|Cedilla

U+00B9

|Superscript one

U+00BA

|Masculine ordinal indicator

U+00BB

|Right-pointing double angle quotation mark

U+00BC

|Vulgar fraction one quarter

U+00BD

|Vulgar fraction one half

U+00BE

|Vulgar fraction three quarters

U+00BF

|¿

|Inverted question mark

colspan=4 | Letters
U+00C0

|Latin Capital Letter A with grave

U+00C1

|Latin Capital letter A with acute

U+00C2

|Latin Capital letter A with circumflex

U+00C3

|Latin Capital letter A with tilde

U+00C4

|Latin Capital letter A with diaeresis

U+00C5

|Latin Capital letter A with ring above

U+00C6

|Latin Capital letter AE

U+00C7

|Latin Capital letter C with cedilla

U+00C8

|Latin Capital letter E with grave

U+00C9

|Latin Capital letter E with acute

U+00CA

|Latin Capital letter E with circumflex

U+00CB

|Latin Capital letter E with diaeresis

U+00CC

|Latin Capital letter I with grave

U+00CD

|Latin Capital letter I with acute

U+00CE

|Latin Capital letter I with circumflex

U+00CF

|Latin Capital letter I with diaeresis

U+00D0

|Latin Capital letter Eth

U+00D1

|Latin Capital letter N with tilde

U+00D2

|Latin Capital letter O with grave

U+00D3

|Latin Capital letter O with acute

U+00D4

|Latin Capital letter O with circumflex

U+00D5

|Latin Capital letter O with tilde

U+00D6

|Latin Capital letter O with diaeresis

colspan=4 | Mathematical operator
U+00D7

|Multiplication sign

colspan=4 | Letters
U+00D8

|Latin Capital letter O with stroke

U+00D9

|Latin Capital letter U with grave

U+00DA

|Latin Capital letter U with acute

U+00DB

|Latin Capital Letter U with circumflex

U+00DC

|Latin Capital Letter U with diaeresis

U+00DD

|Latin Capital Letter Y with acute

U+00DE

|Latin Capital Letter Thorn

U+00DF

|Latin Small Letter sharp S

U+00E0

|Latin Small Letter A with grave

U+00E1

|Latin Small Letter A with acute

U+00E2

|Latin Small Letter A with circumflex

U+00E3

|Latin Small Letter A with tilde

U+00E4

|Latin Small Letter A with diaeresis

U+00E5

|Latin Small Letter A with ring above

U+00E6

|Latin Small Letter AE

U+00E7

|Latin Small Letter C with cedilla

U+00E8

|Latin Small Letter E with grave

U+00E9

|Latin Small Letter E with acute

U+00EA

|Latin Small Letter E with circumflex

U+00EB

|Latin Small Letter E with diaeresis

U+00EC

|Latin Small Letter I with grave

U+00ED

|Latin Small Letter I with acute

U+00EE

|Latin Small Letter I with circumflex

U+00EF

|Latin Small Letter I with diaeresis

U+00F0

|Latin Small Letter Eth

U+00F1

|Latin Small Letter N with tilde

U+00F2

|Latin Small Letter O with grave

U+00F3

|Latin Small Letter O with acute

U+00F4

|Latin Small Letter O with circumflex

U+00F5

|Latin Small Letter O with tilde

U+00F6

|Latin Small Letter O with diaeresis

colspan=4 | Mathematical operator
U+00F7

|Division sign

colspan=4 | Letters
U+00F8

|Latin Small Letter O with stroke

U+00F9

|Latin Small Letter U with grave

U+00FA

|Latin Small Letter U with acute

U+00FB

|Latin Small Letter U with circumflex

U+00FC

|Latin Small Letter U with diaeresis

U+00FD

|Latin Small Letter Y with acute

U+00FE

|Latin Small Letter Thorn

U+00FF

|ÿ

|Latin Small Letter Y with diaeresis

Subheadings

The C1 Controls and Latin-1 Supplement block has four subheadings within its character collection: C1 controls, Latin-1 Punctuation and Symbols, Letters, and Mathematical operator(s).{{cite web|title=Unicode 6.2 code charts|url=https://www.unicode.org/Public/6.2.0/charts/CodeCharts.pdf|work=The Unicode Standard|accessdate=1 April 2013}}

=C1 controls=

The C1 controls subheading contains 32 supplementary control codes inherited from ISO/IEC 8859-1 and many other 8-bit character standards. The alias names for the C0 and C1 control codes are taken from ISO/IEC 6429:1992.

=Latin-1 punctuation and symbols=

The Latin-1 Punctuation and Symbols subheading contains 32 characters of common international punctuation characters, such as the inverted question and exclamation marks, a middle dot, and symbols such as currency signs, spacing diacritic marks, vulgar fractions, and superscript numbers.

=Letters=

The Letters subheading contains 30 pairs of majuscule and minuscule accented or novel Latin characters for western European languages, and two extra minuscule characters (ß and ÿ) not commonly used as the first letter of words.

=Mathematical operator=

The Mathematical operator subheading is used for the multiplication and division signs.

Number of symbols, letters and control codes

The table below shows the number of letters, symbols and control codes in each of the subheadings in the C1 Controls and Latin-1 Supplement block.

class="wikitable"

!Type of subheading!!Number of symbols!!Range of characters

C1 controls32 control codesU+0080 to U+009F
Latin-1 punctuation and symbols32 punctuation and symbolsU+00A0 to U+00BF
Letters30 pairs of majuscule and minuscule accented Latin characters, and two extra minuscule charactersU+00C0 to U+00D6, U+00D8 to U+00F6 and U+00F8 to U+00FF
Mathematical operatorsThe {{unichar|d7|MULTIPLICATION SIGN}} and {{unichar|f7|DIVISION SIGN}} symbols.U+00D7 and U+00F7

Compact table

{{Unicode chart C1 Controls and Latin-1 Supplement}}

Emoji

The Latin-1 Supplement block contains two emoji:

U+00A9 and U+00AE.{{Cite web|url=https://unicode.org/reports/tr51/|title=UTR #51: Unicode Emoji|publisher=Unicode Consortium|date=2023-09-05}}{{Cite web|url=https://unicode.org/Public/UNIDATA/emoji/emoji-data.txt|title=UCD: Emoji Data for UTR #51|publisher=Unicode Consortium|date=2023-02-01}}

The block has four standardized variants defined to specify emoji-style (U+FE0F VS16) or text presentation (U+FE0E VS15) for the

two emoji, both of which default to a text presentation.{{cite web|url=https://unicode.org/Public/UNIDATA/emoji/emoji-variation-sequences.txt|title=UTS #51 Emoji Variation Sequences | publisher=The Unicode Consortium}}

class="wikitable nounderlines" style="border-collapse:collapse;background:#FFFFFF;font-size:large;text-align:center"

|+style="font-size:small" | Emoji variation sequences

style="background:#F8F8F8;font-size:small"

| style="text-align:right" | U+

00A900AE
style="background:#F8F8F8;font-size:small;text-align:left" | base code point©®
style="background:#F8F8F8;font-size:small;text-align:left" | base+VS15 (text){{Emoji presentation|©|text}}{{Emoji presentation|®|text}}
style="background:#F8F8F8;font-size:small;text-align:left" | base+VS16 (emoji){{Emoji presentation|©}}{{Emoji presentation|®}}

History

The following Unicode-related documents record the purpose and process of defining specific characters in the Latin-1 Supplement block:

{{sticky header}}

class="wikitable collapsible sticky-header"
Version{{nobr|Final code points}}CountL2 IDWG2 IDDocument
rowspan="18" | 1.0.0rowspan="11" | U+0080..009Frowspan="11" | 32{{nobr|X3L2/95-002}}{{Citation|title=PDAM No. 3 to ISO/IEC 10646-1 on coding of C1 controls|date=1994-11-01}}
{{nobr|X3L2/95-028}}N1148{{Citation|title=Nine tables of replies to repeated/extended votes|date=1995-02-22}}
[https://web.archive.org/web/20200215052615/http://std.dkuug.dk/jtc1/sc2/wg2/docs/n1203.txt N1203]{{Citation|title=Unconfirmed minutes of SC2/WG2 Meeting 27, Geneva|date=1995-05-03|first1=V. S.|last1=Umamaheswaran|first2=Mike|last2=Ksar|section=5.3}}
{{nobr|X3L2/95-061}}{{Citation|title=DAM no.3 to ISO/IEC 10646-1 (Coding of C1 controls)|date=1995-06-01}}
N1307{{Citation|title=Table of replies to JTC1 letter ballot on 10646 DAM 3, Coding of C1 Controls, (SC2 N 2666)|date=1996-01-15}}
N1309{{Citation|title=Report and Disposition of Comments on DAM 1, UTF 16 and DAM 2, UTF-8, DAM 3, Coding of C1 Controls, and DAM 4, Removal of Annex G: UTF1|date=1996-01-17|first=Bruce|last=Paterson}}
N1312{{Citation|title=Draft Final Text of 10646 AMD-3, Coding of C1 Controls|date=1996-01-17|first=Bruce|last=Paterson}}
{{nobr|[https://www.unicode.org/L2/L1999/99048.htm L2/99-048]}}{{Citation|title=C1 controls in the code charts|date=1999-02-04|first=V. S.|last=Umamaheswaran}}
{{nobr|[https://www.unicode.org/L2/L1999/99054r.htm L2/99-054R]}}{{Citation|title=Approved Minutes from the UTC/L2 meeting in Palo Alto, February 3-5, 1999|date=1999-06-21|first=Joan|last=Aliprand|section=C1 Controls}}
[https://www.unicode.org/wg2/docs/n3046.pdf N3046]{{Citation|title=Improving formal definition for control characters|date=2006-02-22|first=Michel|last=Suignard}}
{{nobr|[https://www.unicode.org/wg2/docs/n3103.pdf N3103 (pdf],}} [https://www.unicode.org/wg2/docs/n3103.doc doc]){{Citation|title=Unconfirmed minutes of WG 2 meeting 48, Mountain View, CA, USA; 2006-04-24/27|date=2006-08-25|first=V. S.|last=Umamaheswaran|section=M48.33}}
rowspan="7" | U+00A0..00FFrowspan="7" | 96(to be determined)
{{nobr|X3L2/94-077}}[https://web.archive.org/web/20200215052615/http://std.dkuug.dk/jtc1/sc2/wg2/docs/n0994.doc N994]{{Citation|title=ISO/IEC 10646-1 - Proposed Draft Corrigendum 1|date=1994-03-03|first=Mark|last=Davis|author-link=Mark Davis (Unicode)}}
{{nobr|X3L2/94-098}}{{nobr|[https://web.archive.org/web/20200215052615/http://std.dkuug.dk/jtc1/sc2/wg2/docs/n1033.pdf N1033 (pdf],}} [https://web.archive.org/web/20200215052615/http://std.dkuug.dk/jtc1/sc2/wg2/docs/n1033.doc doc]){{Citation|title=Unconfirmed Minutes of ISO/IEC JTC 1/SC 2/WG 2 Meeting 25, Falez Hotel, Antalya, Turkey, 1994-04-18--22|date=1994-06-01|first1=V. S.|last1=Umamaheswaran|first2=Mike|last2=Ksar|section=8.1.15}}
{{nobr|[https://www.unicode.org/L2/L2011/11016.htm L2/11-016]}}{{Citation|title=UTC #126 / L2 #223 Minutes|date=2011-02-15|first=Lisa|last=Moore|section=Correct mistakes in property assignments for super and subscripted letters (B.13.4) [U+00AA, U+00BA]}}
{{nobr|[https://www.unicode.org/L2/L2011/11116.htm L2/11-116]}}{{Citation|title=UTC #127 / L2 #224 Minutes|date=2011-05-17|first=Lisa|last=Moore|section=Consensus 127-C14|quote=Change the general category of to U+00AA FEMININE ORDINAL INDICATOR and U+00BA MASCULINE ORDINAL INDICATOR "Lo" for Unicode 6.1.}}
{{nobr|[https://www.unicode.org/L2/L2011/11261.htm L2/11-261R2]}}{{Citation|title=UTC #128 / L2 #225 Minutes|date=2011-08-16|first=Lisa|last=Moore|section=Consensus 128-C6|quote=Change the general category from "So" to "Po" ... [U+00A7 and U+00B6]}}
{{nobr|[https://www.unicode.org/L2/L2015/15050r-emoji-var-sel.pdf L2/15-050R]}}{{Citation|title=Additional variation selectors for emoji|date=2015-01-29|first1=Mark|last1=Davis|display-authors=etal}}
class="sortbottom"

| colspan="6" | {{reflist|group=lower-alpha|refs=

Proposed code points and characters names may differ from final code points and names

See also [https://www.unicode.org/L2/L2013/13207-emoji.html L2/13-207], [https://www.unicode.org/L2/L2014/14054-emoji-style.pdf L2/14-054], [https://www.unicode.org/L2/L2014/14063-emoji-sheet.pdf L2/14-063], [https://www.unicode.org/L2/L2015/15051-A-text-vs.txt L2/15-051A], [https://www.unicode.org/L2/L2015/15051-B-text-style.html L2/15-051B]

Refer to the history section of the Miscellaneous Symbols and Pictographs block for additional emoji-related documents}}

See also

References