JIS encoding
{{Short description|Collection of Japanese standards for digital character encoding}}
In computing, JIS encoding refers to several Japanese Industrial Standards for encoding the Japanese language.{{cite book|title=Fonts & Encodings|publisher=O'Reilly Media|last=Haralambous|first=Yannis|date=2007|pages=42–44|isbn=9780596102425}} Strictly speaking, the term means either:
- A set of standard coded character sets for Japanese, notably:
- JIS X 0201, the Japanese version of ISO 646 (ASCII) containing the base 7-bit ASCII characters (with some modifications) and 64 half-width katakana characters.
- JIS X 0208, the most common kanji character set containing 6,879 characters, including 6,355 kanji and 524 other characters (one 94 by 94 plane)
- JIS X 0212, a supplement for JIS X 0208 which adds 5,801 kanji, totaling 12,156 kanji (a second 94 by 94 plane)
- JIS X 0213, which extends JIS X 0208 (two planes)
- JIS X 0202 (also known as ISO-2022-JP), a set of encoding mechanisms for sending JIS character data over transmission media that only support 7-bit data.
In practice, "JIS encoding" usually refers to JIS X 0208 character data encoded with JIS X 0202. For instance, the IANA uses the JIS_Encoding
label to refer to JIS X 0202, and the ISO-2022-JP
label to refer to the profile thereof defined by {{IETF RFC|1468}}.{{cite web | url=https://www.iana.org/assignments/character-sets/character-sets.xhtml | publisher=IANA | title=Character Sets}}
Other encoding mechanisms for JIS characters include the Shift JIS encoding and EUC-JP. Shift JIS adds the kanji, full-width hiragana and full-width katakana from JIS X 0208 to JIS X 0201 in a backward compatible way.{{cite book|title=CJKV Information Processing|publisher=O'Reilly Media|last=Lunde|first=Ken|date=2009|pages=262–268|isbn=9780596514471}} Shift JIS is perhaps the most widely used encoding in Japan, as the compatibility with the single-byte JIS X 0201 character set made it possible for electronic equipment manufacturers (such as cash register manufacturers) to offer an upgrade from older cheaper equipment that was not capable of displaying kanji to newer equipment while retaining character-set compatibility.
EUC-JP is used on UNIX systems, where the JIS encodings are incompatible with POSIX standards.
A more recent alternative to JIS coded characters is Unicode (UCS coded characters), particularly in the UTF-8 encoding mechanism.
Encoding comparison
The following table compares the features of the three main encoding schemes for JIS X 0208.
{{JIS X 0208 encoding comparison}}
{{notelist}}
See also
References
{{reflist}}
{{Character encoding}}
{{DEFAULTSORT:Jis Encoding}}