JIS X 0201#Code page 903
{{Short description|Japanese single byte character encoding}}
{{Infobox character encoding
| image = JIS-C-6220.svg
| caption = JIS X 0201 8-bit code page
| name = JIS X 0201
| _nomimecode = 1
| mime = 8-bit: JIS_X0201
7-bit Roman: JIS_C6220-1969-ro
7-bit Kana: JIS_C6220-1969-jp
| alias = JIS C 6220
8-bit: csHalfWidthKatakana
Roman: ISO646-JP, iso-ir-14
Kana: iso-ir-13, x0201-7
| standard = JIS X 0201:1969
| lang = Japanese (basic support), English
| status =
| classification = ISO 646, Extended ISO 646
| prev = Wabun code, JIS C 0803
| next = Shift JIS
| otherrelated = N-byte Hangul code
}}
JIS X 0201, a Japanese Industrial Standard developed in 1969, was the first Japanese electronic character set to become widely used. The character set was initially known as JIS C 6220 before the JIS category reform. Its two forms were a 7-bit encoding or an 8-bit encoding, although the 8-bit form was dominant until Unicode (specifically UTF-8) replaced it. The full name of this standard is 7-bit and 8-bit coded character sets for information interchange ({{lang|ja|7ビット及び8ビットの情報交換用符号化文字集合}}).
The first 96 codes comprise an ISO 646 variant, mostly following ASCII with some differences, while the second 96 character codes represent the phonetic Japanese katakana signs. Since the encoding does not provide any way to express hiragana or kanji, it is only capable of expressing simplified written Japanese. Nevertheless, this simplification can represent the full range of sounds in the language. In the 1970s, this was acceptable for media such as text mode computer terminals, telegrams, receipts, or other electronically handled data.
JIS X 0201 was supplanted by subsequent encodings such as Shift JIS, which combines this standard and JIS X 0208, and later by Unicode.
History
The Comite Consultatif International Telephonique et Telegraphique (CCITT) introduced the International Telegraph Alphabet No.2 (ITA2) code as an international standard, which was the 5-bit Latin encoding. Most countries have their own national standards based on this. In Japan, the Agency of Industrial Science and Technology (AIST) standardized it as the 6-bit character codes of JIS C 0803-1961 (Keyboard layout and codes for teleprinters), which combined with katakana characters. However, it didn't match the industry requirements because the character map was small, and the code layout was impractical. The AIST considered a practical character encoding to replace various codes used in Japan.{{Cite book|title=行政における電子計算機の共同利用に関する調査研究報告書|last=行政管理庁 (The Agency of Administrative Management)|publisher=行政事務機械化研究協会|year=1968|pages=108–113|language=ja|oclc=703804474}}
In 1963, ISO introduced a draft of ISO R 646 (6 and 7-bit coded character sets for information processing interchange). AIST committed the conjunction of ISO R 646 and katakana mapping to the Information Processing Society of Japan (IPSJ). IPSJ formed the code standardization committee. The committee didn't adopt the 6-bit form of ISO's draft because the katakana set couldn't fit into its character map. The early JIS draft mapped small katakana characters next to each of their normal katakana characters. It was considered to be convenient for sorting by Gojūon order (JIS X 0208:1978 chose this ordering). Some committee members criticized it would complicate the mechanic of keyboards which only handled normal katakana characters. The later draft mapped small katakana characters to positions {{mono|0xA7-0xAF}}.
The 1964 ISO draft reserved the positions {{mono|0x24}} and {{mono|0x5c}} for first and second currency symbols to be assigned by each country, but it was considered too dangerous in international communications to use currency symbols that could be localized. The ISO committee had two options that to use a generic currency symbol (¤) or to give the dollar ($) and pound (£) signs permanent assignments. It was agreed that the dollar sign was assigned to position {{mono|0x24}} and the pound sign was to position {{mono|0x23}}. The latter was not required in countries that did not need the pound sign.{{cite web |author-last=Fischer |author-first=Eric N. |date=2000-06-20 |title=The Evolution of Character Codes, 1874–1968 |url=https://archive.org/details/enf-ascii |access-date=2023-11-02 |id=ark:/13960/t07x23w8s}} The JIS committee decided to put the yen sign (¥) in {{mono|0x5c}} (one of national use positions).
JIS C 6220 (Codes for information interchange, 情報交換用符号) was published in 1969. Its number was changed to JIS X 0201 due to the JIS category reform in 1987, and the name was changed to 7-bit and 8-bit coded character sets for information interchange (7ビット及び8ビットの情報交換用符号化文字集合) in the 1990 edition.
The character set of JIS X 0201 had been widely used in Japan. The Nationwide Banking Data Communication System (全国銀行データ通信システム), the largest funds transfer system in Japan, was established in 1973. Transaction messages between banks used a subset of JIS X 0201. The system was used until 2018, and it was replaced by the ZEDI (The Nationwide Banking Electronic Data Interchange System, 全銀EDIシステム) which could handle hiragana and kanji characters.{{Cite web|url=https://tech.nikkeibp.co.jp/it/atcl/column/14/346926/112801223/|title=経理部門の人材不足で悩む会社に朗報、金融EDI「ZEDI」が2018年稼働へ|date=2017-11-30|website=Nikkei X-TECH|access-date=2019-07-24}} In 1978, the JIS C 6226 (JIS X 0208) 2-byte character set was developed to express hiragana and kanji characters. It includes katakana characters, but their codes and layout are different from JIS X 0201. Computer manufacturers developed their own extensions of JIS X 0208 to retain compatibility with JIS X 0201. In 1982, the Microsoft Kanji encoding scheme (Codepage 932 of MS-DOS) and Digital Research's SJC26 (for Japanese CP/M-86) were developed to combine JIS X 0201 single-byte encoding and JIS X 0208 double byte encoding without shift out and shift in characters.{{Cite journal|first=憲正|last=西田|title=Unix風の機能を持ち込んだ日本語MS-DOS 2.0の機能と内部構造|journal=日経エレクトロニクス|language=ja|publisher=Nikkei McGraw-Hill|publication-date=1983-12-19|pages=165–190|issn=0385-1680}} They were called Shift JIS, which became the industrial standard for personal computers.
Implementation details
File:ISO-646-JP-paths.svg) set]]
The first half (Roman set) of JIS X 0201 constitutes a Japanese variant of ISO 646, amounting to ASCII with backslash (\) and tilde (~) replaced by yen (¥) and overline (‾),{{cite web | url=http://www.opengroup.or.jp:80/jvc/cde/ucs-conv-e.html#ch3_1_1 | title=3.1.1 Details of Problems | publisher=The Open Group Japan | work=Problems and Solutions for Unicode and User/Vendor Defined Characters | archive-url=https://web.archive.org/web/19990203115405/http://www.opengroup.or.jp/jvc/cde/ucs-conv-e.html#ch3_1_1 | archive-date=1999-02-03 | url-status=dead | access-date=2019-04-15 }} while the second half (Kana set) consists mainly of katakana. Control characters are specified in JIS X 0211.
In the 7-bit format, the shift out control character {{mono|0x0E}} switches to the Kana set and shift in (0x0F) switches to the Roman set.{{cite iso-ir |number=13 |title=The Japanese KATAKANA graphic set of characters |id-in-title=yes |sponsor=Japanese Industrial Standards Committee |sponsor-link=Japanese Industrial Standards Committee}}{{cite iso-ir |number=14 |title=The Japanese Roman graphic set of characters |id-in-title=yes |sponsor=Japanese Industrial Standards Committee |sponsor-link=Japanese Industrial Standards Committee}} In the 8-bit format, given in the chart below, bytes with the most significant bit set (i.e. {{mono|0x80–0xFF}}) are used for the Kana set and bytes with it unset (i.e. {{mono|0x00–0x7F}}) are used otherwise.
Names used specifically for the 7-bit Roman set include "JISCII",{{citation | url=https://www.ibm.com/support/knowledgecenter/en/ssw_aix_71/com.ibm.aix.nlsgdrf/ibm-943_ibm-932.htm | title=IBM-943 and IBM-932 | publisher=IBM | work=IBM Knowledge Center}} "JIS Roman",{{citation |url=https://developer.apple.com/documentation/coreservices/1433493-conversion_masks/kunicodeforceasciirangemask?language=objc | title=kUnicodeForceASCIIRangeMask | work=Apple Developer Documentation |publisher=Apple Inc}} "ISO646-JP",{{IETF RFC|1345}}{{cite web | url=https://www.iana.org/assignments/character-sets/character-sets.xhtml | publisher=IANA | title=Character Sets}} "JIS C6220-1969-ro", "Japanese-Roman",{{citation |url=http://www.columbia.edu/kermit/csetnames.html |title=Kermit and MIME Character-Set Names |last=da Cruz |first=Frank |date=2010-04-02 |work=Kermit Project |publisher=Columbia University}} "Japan 7-Bit Latin",{{citation| url=https://www-01.ibm.com/software/globalization/cp/cp00895.html | work=IBM Globalization — Code page identifiers | title=CP 00895 | date=9 November 2020 | publisher=IBM}} and "ISO-IR-14", whereas names used specifically for the 7-bit Kana set include "ISO-IR-13", "JIS C6220-1969-jp" and "x0201-7".
The substitution of the yen symbol for backslash can make paths on DOS and Windows-based computers with Japanese support display strangely, like "C:¥Program Files¥", for example.{{cite web | title=When is a backslash not a backslash? | date=2005-09-17 | author=Kaplan, Michael S. | url=http://archives.miloush.net/michkap/archive/2005/09/17/469941.html}} Another similar problem is C programming language's control characters of string literals, like printf("Hello, world.¥n");
.
Codepage layout
The following table is the original 8-bit coded character set of JIS X 0201 (with the kana set indicated by bytes with the high bit set).{{Cite book|title=JIS X 0201-1997|date=1997-02-28|publisher=Japanese Standards Association|pages=17|language=ja}}{{Cite web|last=Unicode Consortium|date=2015-12-02|title=JIS X 0201 (1976) to Unicode 1.1 Table|url=http://unicode.org/Public/MAPPINGS/OBSOLETE/EASTASIA/JIS/JIS0201.TXT|access-date=2021-10-01|website=unicode.org}}
{{chset-table-header1|8-bit form of JIS X 0201-1997}} |
{{chset-left1|0x}}
| colspan="16" rowspan="2" {{chset-cell1|U+0000—U+001F|C0 codes|style=background:#DDD|fn={{efn|name="ref_jis0211"|Control characters are specified in JIS X 0211.}}}} |
{{chset-left1|1x}} |
{{chset-left1|2x}}
|{{chset-ctrl1|U+0020 SPACE| SP }} |{{chset-cell1|U+0021 EXCLAMATION MARK|!}} |{{chset-cell1|U+0022 QUOTATION MARK|"}} |{{chset-cell1|U+0023 NUMBER SIGN|#}} |{{chset-cell1|U+0024 DOLLAR SIGN|$}} |{{chset-cell1|U+0025 PERCENT SIGN|%}} |{{chset-cell1|U+0026 AMPERSAND|&}} |{{chset-cell1|U+0027 APOSTROPHE|'}} |{{chset-cell1|U+0028 LEFT PARENTHESIS|(}} |{{chset-cell1|U+0029 RIGHT PARENTHESIS|)}} |{{chset-cell1|U+002A ASTERISK|*}} |{{chset-cell1|U+002B PLUS SIGN|+}} |{{chset-cell1|U+002C COMMA|,}} |{{chset-cell1|U+002D HYPHEN-MINUS|Plus and minus signs |
}}
|{{chset-cell1|U+002E FULL STOP|.}} |{{chset-cell1|U+002F SOLIDUS|/}} |
{{chset-left1|3x}}
|{{chset-cell1|U+0030 DIGIT ZERO|0}} |{{chset-cell1|U+0031 DIGIT ONE|1}} |{{chset-cell1|U+0032 DIGIT TWO|2}} |{{chset-cell1|U+0033 DIGIT THREE|3}} |{{chset-cell1|U+0034 DIGIT FOUR|4}} |{{chset-cell1|U+0035 DIGIT FIVE|5}} |{{chset-cell1|U+0036 DIGIT SIX|6}} |{{chset-cell1|U+0037 DIGIT SEVEN|7}} |{{chset-cell1|U+0038 DIGIT EIGHT|8}} |{{chset-cell1|U+0039 DIGIT NINE|9}} |{{chset-cell1|U+003A COLON|:}} |{{chset-cell1|U+003B SEMICOLON|;}} |{{chset-cell1|U+003C LESS-THAN SIGN|<}} |{{chset-cell1|U+003D EQUALS SIGN|{{=}}}} |{{chset-cell1|U+003E GREATER-THAN SIGN|>}} |{{chset-cell1|U+003F QUESTION MARK|?}} |
{{chset-left1|4x}}
|{{chset-cell1|U+0040 COMMERCIAL AT|@}} |{{chset-cell1|U+0041 LATIN CAPITAL LETTER A|A}} |{{chset-cell1|U+0042 LATIN CAPITAL LETTER B|B}} |{{chset-cell1|U+0043 LATIN CAPITAL LETTER C|C}} |{{chset-cell1|U+0044 LATIN CAPITAL LETTER D|D}} |{{chset-cell1|U+0045 LATIN CAPITAL LETTER E|E}} |{{chset-cell1|U+0046 LATIN CAPITAL LETTER F|F}} |{{chset-cell1|U+0047 LATIN CAPITAL LETTER G|G}} |{{chset-cell1|U+0048 LATIN CAPITAL LETTER H|H}} |{{chset-cell1|U+0049 LATIN CAPITAL LETTER I|I}} |{{chset-cell1|U+004A LATIN CAPITAL LETTER J|J}} |{{chset-cell1|U+004B LATIN CAPITAL LETTER K|K}} |{{chset-cell1|U+004C LATIN CAPITAL LETTER L|L}} |{{chset-cell1|U+004D LATIN CAPITAL LETTER M|M}} |{{chset-cell1|U+004E LATIN CAPITAL LETTER N|N}} |{{chset-cell1|U+004F LATIN CAPITAL LETTER O|O}} |
{{chset-left1|5x}}
|{{chset-cell1|U+0050 LATIN CAPITAL LETTER P|P}} |{{chset-cell1|U+0051 LATIN CAPITAL LETTER Q|Q}} |{{chset-cell1|U+0052 LATIN CAPITAL LETTER R|R}} |{{chset-cell1|U+0053 LATIN CAPITAL LETTER S|S}} |{{chset-cell1|U+0054 LATIN CAPITAL LETTER T|T}} |{{chset-cell1|U+0055 LATIN CAPITAL LETTER U|U}} |{{chset-cell1|U+0056 LATIN CAPITAL LETTER V|V}} |{{chset-cell1|U+0057 LATIN CAPITAL LETTER W|W}} |{{chset-cell1|U+0058 LATIN CAPITAL LETTER X|X}} |{{chset-cell1|U+0059 LATIN CAPITAL LETTER Y|Y}} |{{chset-cell1|U+005A LATIN CAPITAL LETTER Z|Z}} |{{chset-cell1|U+005B LEFT SQUARE BRACKET|[}} |{{chset-cell1|U+00A5 YEN SIGN|¥|style=background:#FFD}} |{{chset-cell1|U+005D RIGHT SQUARE BRACKET|]}} |{{chset-cell1|U+005E CIRCUMFLEX ACCENT|^}} |{{chset-cell1|U+005F LOW LINE|_}} |
{{chset-left1|6x}}
|{{chset-cell1|U+0060 GRAVE ACCENT|`}} |{{chset-cell1|U+0061 LATIN SMALL LETTER A|a}} |{{chset-cell1|U+0062 LATIN SMALL LETTER B|b}} |{{chset-cell1|U+0063 LATIN SMALL LETTER C|c}} |{{chset-cell1|U+0064 LATIN SMALL LETTER D|d}} |{{chset-cell1|U+0065 LATIN SMALL LETTER E|e}} |{{chset-cell1|U+0066 LATIN SMALL LETTER F|f}} |{{chset-cell1|U+0067 LATIN SMALL LETTER G|g}} |{{chset-cell1|U+0068 LATIN SMALL LETTER H|h}} |{{chset-cell1|U+0069 LATIN SMALL LETTER I|i}} |{{chset-cell1|U+006A LATIN SMALL LETTER J|j}} |{{chset-cell1|U+006B LATIN SMALL LETTER K|k}} |{{chset-cell1|U+006C LATIN SMALL LETTER L|l}} |{{chset-cell1|U+006D LATIN SMALL LETTER M|m}} |{{chset-cell1|U+006E LATIN SMALL LETTER N|n}} |{{chset-cell1|U+006F LATIN SMALL LETTER O|o}} |
{{chset-left1|7x}}
|{{chset-cell1|U+0070 LATIN SMALL LETTER P|p}} |{{chset-cell1|U+0071 LATIN SMALL LETTER Q|q}} |{{chset-cell1|U+0072 LATIN SMALL LETTER R|r}} |{{chset-cell1|U+0073 LATIN SMALL LETTER S|s}} |{{chset-cell1|U+0074 LATIN SMALL LETTER T|t}} |{{chset-cell1|U+0075 LATIN SMALL LETTER U|u}} |{{chset-cell1|U+0076 LATIN SMALL LETTER V|v}} |{{chset-cell1|U+0077 LATIN SMALL LETTER W|w}} |{{chset-cell1|U+0078 LATIN SMALL LETTER X|x}} |{{chset-cell1|U+0079 LATIN SMALL LETTER Y|y}} |{{chset-cell1|U+007A LATIN SMALL LETTER Z|z}} |{{chset-cell1|U+007B LEFT CURLY BRACKET|{}} |{{chset-cell1|U+007C VERTICAL LINE|{{pipe}}}} |{{chset-cell1|U+007D RIGHT CURLY BRACKET|Braces (punctuation) |
|{{chset-cell1|U+203E OVERLINE|‾|style=background:#FFD}}
|{{chset-ctrl1|U+007F DELETE|DEL}}
|-
|{{chset-left1|8x}}
| colspan="16" rowspan="2" {{chset-cell1|U+0080—U+009F|C1 codes or Empty Block|style=background:#DDD|fn={{efn|name="ref_jis0211"}}}}
|-
|{{chset-left1|9x}}
|-
|{{chset-left1|Ax}}
|{{chset-cell1|||style=background:#DDD}}
|{{chset-cell1|U+FF61 HALFWIDTH IDEOGRAPHIC FULL STOP|。}}
|{{chset-cell1|U+FF62 HALFWIDTH LEFT CORNER BRACKET|「}}
|{{chset-cell1|U+FF63 HALFWIDTH RIGHT CORNER BRACKET|」}}
|{{chset-cell1|U+FF64 HALFWIDTH IDEOGRAPHIC COMMA|、}}
|{{chset-cell1|U+FF65 HALFWIDTH KATAKANA MIDDLE DOT|・}}
|{{chset-cell1|U+FF66 HALFWIDTH KATAKANA LETTER WO|ヲ}}
|{{chset-cell1|U+FF67 HALFWIDTH KATAKANA LETTER SMALL A|ァ}}
|{{chset-cell1|U+FF68 HALFWIDTH KATAKANA LETTER SMALL I|ィ}}
|{{chset-cell1|U+FF69 HALFWIDTH KATAKANA LETTER SMALL U|ゥ}}
|{{chset-cell1|U+FF6A HALFWIDTH KATAKANA LETTER SMALL E|ェ}}
|{{chset-cell1|U+FF6B HALFWIDTH KATAKANA LETTER SMALL O|ォ}}
|{{chset-cell1|U+FF6C HALFWIDTH KATAKANA LETTER SMALL YA|ャ}}
|{{chset-cell1|U+FF6D HALFWIDTH KATAKANA LETTER SMALL YU|ュ}}
|{{chset-cell1|U+FF6E HALFWIDTH KATAKANA LETTER SMALL YO|ョ}}
|{{chset-cell1|U+FF6F HALFWIDTH KATAKANA LETTER SMALL TU|ッ}}
|-
|{{chset-left1|Bx}}
|{{chset-cell1|U+FF70 HALFWIDTH KATAKANA-HIRAGANA PROLONGED SOUND MARK|ー}}
|{{chset-cell1|U+FF71 HALFWIDTH KATAKANA LETTER A|ア}}
|{{chset-cell1|U+FF72 HALFWIDTH KATAKANA LETTER I|イ}}
|{{chset-cell1|U+FF73 HALFWIDTH KATAKANA LETTER U|ウ}}
|{{chset-cell1|U+FF74 HALFWIDTH KATAKANA LETTER E|エ}}
|{{chset-cell1|U+FF75 HALFWIDTH KATAKANA LETTER O|オ}}
|{{chset-cell1|U+FF76 HALFWIDTH KATAKANA LETTER KA|カ}}
|{{chset-cell1|U+FF77 HALFWIDTH KATAKANA LETTER KI|キ}}
|{{chset-cell1|U+FF78 HALFWIDTH KATAKANA LETTER KU|ク}}
|{{chset-cell1|U+FF79 HALFWIDTH KATAKANA LETTER KE|ケ}}
|{{chset-cell1|U+FF7A HALFWIDTH KATAKANA LETTER KO|コ}}
|{{chset-cell1|U+FF7B HALFWIDTH KATAKANA LETTER SA|サ}}
|{{chset-cell1|U+FF7C HALFWIDTH KATAKANA LETTER SI|シ}}
|{{chset-cell1|U+FF7D HALFWIDTH KATAKANA LETTER SU|ス}}
|{{chset-cell1|U+FF7E HALFWIDTH KATAKANA LETTER SE|セ}}
|{{chset-cell1|U+FF7F HALFWIDTH KATAKANA LETTER SO|ソ}}
|-
|{{chset-left1|Cx}}
|{{chset-cell1|U+FF80 HALFWIDTH KATAKANA LETTER TA|タ}}
|{{chset-cell1|U+FF81 HALFWIDTH KATAKANA LETTER TI|チ}}
|{{chset-cell1|U+FF82 HALFWIDTH KATAKANA LETTER TU|ツ}}
|{{chset-cell1|U+FF83 HALFWIDTH KATAKANA LETTER TE|テ}}
|{{chset-cell1|U+FF84 HALFWIDTH KATAKANA LETTER TO|ト}}
|{{chset-cell1|U+FF85 HALFWIDTH KATAKANA LETTER NA|ナ}}
|{{chset-cell1|U+FF86 HALFWIDTH KATAKANA LETTER NI|ニ}}
|{{chset-cell1|U+FF87 HALFWIDTH KATAKANA LETTER NU|ヌ}}
|{{chset-cell1|U+FF88 HALFWIDTH KATAKANA LETTER NE|ネ}}
|{{chset-cell1|U+FF89 HALFWIDTH KATAKANA LETTER NO|ノ}}
|{{chset-cell1|U+FF8A HALFWIDTH KATAKANA LETTER HA|ハ}}
|{{chset-cell1|U+FF8B HALFWIDTH KATAKANA LETTER HI|ヒ}}
|{{chset-cell1|U+FF8C HALFWIDTH KATAKANA LETTER HU|フ}}
|{{chset-cell1|U+FF8D HALFWIDTH KATAKANA LETTER HE|ヘ}}
|{{chset-cell1|U+FF8E HALFWIDTH KATAKANA LETTER HO|ホ}}
|{{chset-cell1|U+FF8F HALFWIDTH KATAKANA LETTER MA|マ}}
|-
|{{chset-left1|Dx}}
|{{chset-cell1|U+FF90 HALFWIDTH KATAKANA LETTER MI|ミ}}
|{{chset-cell1|U+FF91 HALFWIDTH KATAKANA LETTER MU|ム}}
|{{chset-cell1|U+FF92 HALFWIDTH KATAKANA LETTER ME|メ}}
|{{chset-cell1|U+FF93 HALFWIDTH KATAKANA LETTER MO|モ}}
|{{chset-cell1|U+FF94 HALFWIDTH KATAKANA LETTER YA|ヤ}}
|{{chset-cell1|U+FF95 HALFWIDTH KATAKANA LETTER YU|ユ}}
|{{chset-cell1|U+FF96 HALFWIDTH KATAKANA LETTER YO|ヨ}}
|{{chset-cell1|U+FF97 HALFWIDTH KATAKANA LETTER RA|ラ}}
|{{chset-cell1|U+FF98 HALFWIDTH KATAKANA LETTER RI|リ}}
|{{chset-cell1|U+FF99 HALFWIDTH KATAKANA LETTER RU|ル}}
|{{chset-cell1|U+FF9A HALFWIDTH KATAKANA LETTER RE|レ}}
|{{chset-cell1|U+FF9B HALFWIDTH KATAKANA LETTER RO|ロ}}
|{{chset-cell1|U+FF9C HALFWIDTH KATAKANA LETTER WA|ワ}}
|{{chset-cell1|U+FF9D HALFWIDTH KATAKANA LETTER N|ン}}
|{{chset-cell1|U+FF9E HALFWIDTH KATAKANA VOICED SOUND MARK|゙}}
|{{chset-cell1|U+FF9F HALFWIDTH KATAKANA SEMI-VOICED SOUND MARK|゚}}
|-
|{{chset-left1|Ex}}
|{{chset-cell1|||style=background:#DDD}}
|{{chset-cell1|||style=background:#DDD}}
|{{chset-cell1|||style=background:#DDD}}
|{{chset-cell1|||style=background:#DDD}}
|{{chset-cell1|||style=background:#DDD}}
|{{chset-cell1|||style=background:#DDD}}
|{{chset-cell1|||style=background:#DDD}}
|{{chset-cell1|||style=background:#DDD}}
|{{chset-cell1|||style=background:#DDD}}
|{{chset-cell1|||style=background:#DDD}}
|{{chset-cell1|||style=background:#DDD}}
|{{chset-cell1|||style=background:#DDD}}
|{{chset-cell1|||style=background:#DDD}}
|{{chset-cell1|||style=background:#DDD}}
|{{chset-cell1|||style=background:#DDD}}
|{{chset-cell1|||style=background:#DDD}}
|-
|{{chset-left1|Fx}}
|{{chset-cell1|||style=background:#DDD}}
|{{chset-cell1|||style=background:#DDD}}
|{{chset-cell1|||style=background:#DDD}}
|{{chset-cell1|||style=background:#DDD}}
|{{chset-cell1|||style=background:#DDD}}
|{{chset-cell1|||style=background:#DDD}}
|{{chset-cell1|||style=background:#DDD}}
|{{chset-cell1|||style=background:#DDD}}
|{{chset-cell1|||style=background:#DDD}}
|{{chset-cell1|||style=background:#DDD}}
|{{chset-cell1|||style=background:#DDD}}
|{{chset-cell1|||style=background:#DDD}}
|{{chset-cell1|||style=background:#DDD}}
|{{chset-cell1|||style=background:#DDD}}
|{{chset-cell1|||style=background:#DDD}}
|{{chset-cell1|||style=background:#DDD}}
|}
{{legend|#FFD|Differences from ASCII}}
=As part of Shift JIS=
Following is the mapping used for JIS X 0201 as part of Shift JIS,{{cite web|url=
https://icu4c-demos.unicode.org/icu-bin/convexp?conv=ibm-943_P130-1999|title=ibm-943_P130-1999|work=ICU Demonstration - Converter Explorer|publisher=International Components for Unicode}}{{cite web|url=https://www.unicode.org/Public/MAPPINGS/VENDORS/APPLE/JAPANESE.TXT|title=JAPANESE.TXT: Map (external version) from Mac OS Japanese encoding to Unicode 2.1 and later.|author=Apple, Inc|author-link=Apple, Inc|date=2005-04-05|orig-year=1995-04-15|publisher=Unicode Consortium}} i.e. showing the 8-bit form of JIS X 0201, and mapping the Katakana characters to the Halfwidth and Fullwidth Forms block (which in turn derives its half width kana layout from JIS X 0201).
{{chset-table-header1|Single-byte character set of Shift JIS}} | ||||||||||||||||
{{chset-left1|0x}}
|{{chset-cell1 | |style=background:#DDD}}
|{{chset-cell1 | |style=background:#DDD}}
|{{chset-cell1 | |style=background:#DDD}}
|{{chset-cell1 | |style=background:#DDD}}
|{{chset-cell1 | |style=background:#DDD}}
|{{chset-cell1 | |style=background:#DDD}}
|{{chset-cell1 | |style=background:#DDD}}
|{{chset-cell1 | |style=background:#DDD}}
|{{chset-cell1 | |style=background:#DDD}}
|{{chset-cell1 | |style=background:#DDD}}
|{{chset-cell1 | |style=background:#DDD}}
|{{chset-cell1 | |style=background:#DDD}}
|{{chset-cell1 | |style=background:#DDD}}
|{{chset-cell1 | |style=background:#DDD}}
|{{chset-cell1 | |style=background:#DDD}}
|{{chset-cell1 | |style=background:#DDD}} |
{{chset-left1|1x}}
|{{chset-cell1 | |style=background:#DDD}}
|{{chset-cell1 | |style=background:#DDD}}
|{{chset-cell1 | |style=background:#DDD}}
|{{chset-cell1 | |style=background:#DDD}}
|{{chset-cell1 | |style=background:#DDD}}
|{{chset-cell1 | |style=background:#DDD}}
|{{chset-cell1 | |style=background:#DDD}}
|{{chset-cell1 | |style=background:#DDD}}
|{{chset-cell1 | |style=background:#DDD}}
|{{chset-cell1 | |style=background:#DDD}}
|{{chset-cell1 | |style=background:#DDD}}
|{{chset-cell1 | |style=background:#DDD}}
|{{chset-cell1 | |style=background:#DDD}}
|{{chset-cell1 | |style=background:#DDD}}
|{{chset-cell1 | |style=background:#DDD}}
|{{chset-cell1 | |style=background:#DDD}} |
{{chset-left1|2x}}
|{{chset-ctrl1|U+0020 SPACE| SP }} |{{chset-cell1|U+0021 EXCLAMATION MARK|!}} |{{chset-cell1|U+0022 QUOTATION MARK|"}} |{{chset-cell1|U+0023 NUMBER SIGN|#}} |{{chset-cell1|U+0024 DOLLAR SIGN|$}} |{{chset-cell1|U+0025 PERCENT SIGN|%}} |{{chset-cell1|U+0026 AMPERSAND|&}} |{{chset-cell1|U+0027 APOSTROPHE|'}} |{{chset-cell1|U+0028 LEFT PARENTHESIS|(}} |{{chset-cell1|U+0029 RIGHT PARENTHESIS|)}} |{{chset-cell1|U+002A ASTERISK|*}} |{{chset-cell1|U+002B PLUS SIGN|+}} |{{chset-cell1|U+002C COMMA|,}} |{{chset-cell1|U+002D HYPHEN-MINUS|Plus and minus signs | ||||||||||||||||
}}
|{{chset-cell1|U+002E FULL STOP|.}} |{{chset-cell1|U+002F SOLIDUS|/}} | ||||||||||||||||
{{chset-left1|3x}}
|{{chset-cell1|U+0030 DIGIT ZERO|0}} |{{chset-cell1|U+0031 DIGIT ONE|1}} |{{chset-cell1|U+0032 DIGIT TWO|2}} |{{chset-cell1|U+0033 DIGIT THREE|3}} |{{chset-cell1|U+0034 DIGIT FOUR|4}} |{{chset-cell1|U+0035 DIGIT FIVE|5}} |{{chset-cell1|U+0036 DIGIT SIX|6}} |{{chset-cell1|U+0037 DIGIT SEVEN|7}} |{{chset-cell1|U+0038 DIGIT EIGHT|8}} |{{chset-cell1|U+0039 DIGIT NINE|9}} |{{chset-cell1|U+003A COLON|:}} |{{chset-cell1|U+003B SEMICOLON|;}} |{{chset-cell1|U+003C LESS-THAN SIGN|<}} |{{chset-cell1|U+003D EQUALS SIGN|{{=}}}} |{{chset-cell1|U+003E GREATER-THAN SIGN|>}} |{{chset-cell1|U+003F QUESTION MARK|?}} | ||||||||||||||||
{{chset-left1|4x}}
|{{chset-cell1|U+0040 COMMERCIAL AT|@}} |{{chset-cell1|U+0041 LATIN CAPITAL LETTER A|A}} |{{chset-cell1|U+0042 LATIN CAPITAL LETTER B|B}} |{{chset-cell1|U+0043 LATIN CAPITAL LETTER C|C}} |{{chset-cell1|U+0044 LATIN CAPITAL LETTER D|D}} |{{chset-cell1|U+0045 LATIN CAPITAL LETTER E|E}} |{{chset-cell1|U+0046 LATIN CAPITAL LETTER F|F}} |{{chset-cell1|U+0047 LATIN CAPITAL LETTER G|G}} |{{chset-cell1|U+0048 LATIN CAPITAL LETTER H|H}} |{{chset-cell1|U+0049 LATIN CAPITAL LETTER I|I}} |{{chset-cell1|U+004A LATIN CAPITAL LETTER J|J}} |{{chset-cell1|U+004B LATIN CAPITAL LETTER K|K}} |{{chset-cell1|U+004C LATIN CAPITAL LETTER L|L}} |{{chset-cell1|U+004D LATIN CAPITAL LETTER M|M}} |{{chset-cell1|U+004E LATIN CAPITAL LETTER N|N}} |{{chset-cell1|U+004F LATIN CAPITAL LETTER O|O}} | ||||||||||||||||
{{chset-left1|5x}}
|{{chset-cell1|U+0050 LATIN CAPITAL LETTER P|P}} |{{chset-cell1|U+0051 LATIN CAPITAL LETTER Q|Q}} |{{chset-cell1|U+0052 LATIN CAPITAL LETTER R|R}} |{{chset-cell1|U+0053 LATIN CAPITAL LETTER S|S}} |{{chset-cell1|U+0054 LATIN CAPITAL LETTER T|T}} |{{chset-cell1|U+0055 LATIN CAPITAL LETTER U|U}} |{{chset-cell1|U+0056 LATIN CAPITAL LETTER V|V}} |{{chset-cell1|U+0057 LATIN CAPITAL LETTER W|W}} |{{chset-cell1|U+0058 LATIN CAPITAL LETTER X|X}} |{{chset-cell1|U+0059 LATIN CAPITAL LETTER Y|Y}} |{{chset-cell1|U+005A LATIN CAPITAL LETTER Z|Z}} |{{chset-cell1|U+005B LEFT SQUARE BRACKET|[}} |{{chset-cell1|U+00A5 YEN SIGN|¥|style=background:#FFD}} |{{chset-cell1|U+005D RIGHT SQUARE BRACKET|]}} |{{chset-cell1|U+005E CIRCUMFLEX ACCENT|^}} |{{chset-cell1|U+005F LOW LINE|_}} | ||||||||||||||||
{{chset-left1|6x}}
|{{chset-cell1|U+0060 GRAVE ACCENT|`}} |{{chset-cell1|U+0061 LATIN SMALL LETTER A|a}} |{{chset-cell1|U+0062 LATIN SMALL LETTER B|b}} |{{chset-cell1|U+0063 LATIN SMALL LETTER C|c}} |{{chset-cell1|U+0064 LATIN SMALL LETTER D|d}} |{{chset-cell1|U+0065 LATIN SMALL LETTER E|e}} |{{chset-cell1|U+0066 LATIN SMALL LETTER F|f}} |{{chset-cell1|U+0067 LATIN SMALL LETTER G|g}} |{{chset-cell1|U+0068 LATIN SMALL LETTER H|h}} |{{chset-cell1|U+0069 LATIN SMALL LETTER I|i}} |{{chset-cell1|U+006A LATIN SMALL LETTER J|j}} |{{chset-cell1|U+006B LATIN SMALL LETTER K|k}} |{{chset-cell1|U+006C LATIN SMALL LETTER L|l}} |{{chset-cell1|U+006D LATIN SMALL LETTER M|m}} |{{chset-cell1|U+006E LATIN SMALL LETTER N|n}} |{{chset-cell1|U+006F LATIN SMALL LETTER O|o}} | ||||||||||||||||
{{chset-left1|7x}}
|{{chset-cell1|U+0070 LATIN SMALL LETTER P|p}} |{{chset-cell1|U+0071 LATIN SMALL LETTER Q|q}} |{{chset-cell1|U+0072 LATIN SMALL LETTER R|r}} |{{chset-cell1|U+0073 LATIN SMALL LETTER S|s}} |{{chset-cell1|U+0074 LATIN SMALL LETTER T|t}} |{{chset-cell1|U+0075 LATIN SMALL LETTER U|u}} |{{chset-cell1|U+0076 LATIN SMALL LETTER V|v}} |{{chset-cell1|U+0077 LATIN SMALL LETTER W|w}} |{{chset-cell1|U+0078 LATIN SMALL LETTER X|x}} |{{chset-cell1|U+0079 LATIN SMALL LETTER Y|y}} |{{chset-cell1|U+007A LATIN SMALL LETTER Z|z}} |{{chset-cell1|U+007B LEFT CURLY BRACKET|{}} |{{chset-cell1|U+007C VERTICAL LINE|{{pipe}}}} |{{chset-cell1|U+007D RIGHT CURLY BRACKET|Braces (punctuation) |
|{{chset-cell1|U+203E OVERLINE|‾|style=background:#FFD}}
|{{chset-cell1|||style=background:#DDD}}
|-
|{{chset-left1|8x}}
|{{chset-cell1|||style=background:#DDD}}
|style="background-color: #DFD;"|
|style="background-color: #DFD;"|
|style="background-color: #DFD;"|
|style="background-color: #DFD;"|
|style="background-color: #DFD;"|
|style="background-color: #DFD;"|
|style="background-color: #DFD;"|
|style="background-color: #DFD;"|
|style="background-color: #DFD;"|
|style="background-color: #DFD;"|
|style="background-color: #DFD;"|
|style="background-color: #DFD;"|
|style="background-color: #DFD;"|
|style="background-color: #DFD;"|
|style="background-color: #DFD;"|
|-
|{{chset-left1|9x}}
|style="background-color: #DFD;"|
|style="background-color: #DFD;"|
|style="background-color: #DFD;"|
|style="background-color: #DFD;"|
|style="background-color: #DFD;"|
|style="background-color: #DFD;"|
|style="background-color: #DFD;"|
|style="background-color: #DFD;"|
|style="background-color: #DFD;"|
|style="background-color: #DFD;"|
|style="background-color: #DFD;"|
|style="background-color: #DFD;"|
|style="background-color: #DFD;"|
|style="background-color: #DFD;"|
|style="background-color: #DFD;"|
|style="background-color: #DFD;"|
|-
|{{chset-left1|Ax}}
|{{chset-cell1|||style=background:#DDD}}
|{{chset-cell1|U+FF61 HALFWIDTH IDEOGRAPHIC FULL STOP|。}}
|{{chset-cell1|U+FF62 HALFWIDTH LEFT CORNER BRACKET|「}}
|{{chset-cell1|U+FF63 HALFWIDTH RIGHT CORNER BRACKET|」}}
|{{chset-cell1|U+FF64 HALFWIDTH IDEOGRAPHIC COMMA|、}}
|{{chset-cell1|U+FF65 HALFWIDTH KATAKANA MIDDLE DOT|・}}
|{{chset-cell1|U+FF66 HALFWIDTH KATAKANA LETTER WO|ヲ}}
|{{chset-cell1|U+FF67 HALFWIDTH KATAKANA LETTER SMALL A|ァ}}
|{{chset-cell1|U+FF68 HALFWIDTH KATAKANA LETTER SMALL I|ィ}}
|{{chset-cell1|U+FF69 HALFWIDTH KATAKANA LETTER SMALL U|ゥ}}
|{{chset-cell1|U+FF6A HALFWIDTH KATAKANA LETTER SMALL E|ェ}}
|{{chset-cell1|U+FF6B HALFWIDTH KATAKANA LETTER SMALL O|ォ}}
|{{chset-cell1|U+FF6C HALFWIDTH KATAKANA LETTER SMALL YA|ャ}}
|{{chset-cell1|U+FF6D HALFWIDTH KATAKANA LETTER SMALL YU|ュ}}
|{{chset-cell1|U+FF6E HALFWIDTH KATAKANA LETTER SMALL YO|ョ}}
|{{chset-cell1|U+FF6F HALFWIDTH KATAKANA LETTER SMALL TU|ッ}}
|-
|{{chset-left1|Bx}}
|{{chset-cell1|U+FF70 HALFWIDTH KATAKANA-HIRAGANA PROLONGED SOUND MARK|ー}}
|{{chset-cell1|U+FF71 HALFWIDTH KATAKANA LETTER A|ア}}
|{{chset-cell1|U+FF72 HALFWIDTH KATAKANA LETTER I|イ}}
|{{chset-cell1|U+FF73 HALFWIDTH KATAKANA LETTER U|ウ}}
|{{chset-cell1|U+FF74 HALFWIDTH KATAKANA LETTER E|エ}}
|{{chset-cell1|U+FF75 HALFWIDTH KATAKANA LETTER O|オ}}
|{{chset-cell1|U+FF76 HALFWIDTH KATAKANA LETTER KA|カ}}
|{{chset-cell1|U+FF77 HALFWIDTH KATAKANA LETTER KI|キ}}
|{{chset-cell1|U+FF78 HALFWIDTH KATAKANA LETTER KU|ク}}
|{{chset-cell1|U+FF79 HALFWIDTH KATAKANA LETTER KE|ケ}}
|{{chset-cell1|U+FF7A HALFWIDTH KATAKANA LETTER KO|コ}}
|{{chset-cell1|U+FF7B HALFWIDTH KATAKANA LETTER SA|サ}}
|{{chset-cell1|U+FF7C HALFWIDTH KATAKANA LETTER SI|シ}}
|{{chset-cell1|U+FF7D HALFWIDTH KATAKANA LETTER SU|ス}}
|{{chset-cell1|U+FF7E HALFWIDTH KATAKANA LETTER SE|セ}}
|{{chset-cell1|U+FF7F HALFWIDTH KATAKANA LETTER SO|ソ}}
|-
|{{chset-left1|Cx}}
|{{chset-cell1|U+FF80 HALFWIDTH KATAKANA LETTER TA|タ}}
|{{chset-cell1|U+FF81 HALFWIDTH KATAKANA LETTER TI|チ}}
|{{chset-cell1|U+FF82 HALFWIDTH KATAKANA LETTER TU|ツ}}
|{{chset-cell1|U+FF83 HALFWIDTH KATAKANA LETTER TE|テ}}
|{{chset-cell1|U+FF84 HALFWIDTH KATAKANA LETTER TO|ト}}
|{{chset-cell1|U+FF85 HALFWIDTH KATAKANA LETTER NA|ナ}}
|{{chset-cell1|U+FF86 HALFWIDTH KATAKANA LETTER NI|ニ}}
|{{chset-cell1|U+FF87 HALFWIDTH KATAKANA LETTER NU|ヌ}}
|{{chset-cell1|U+FF88 HALFWIDTH KATAKANA LETTER NE|ネ}}
|{{chset-cell1|U+FF89 HALFWIDTH KATAKANA LETTER NO|ノ}}
|{{chset-cell1|U+FF8A HALFWIDTH KATAKANA LETTER HA|ハ}}
|{{chset-cell1|U+FF8B HALFWIDTH KATAKANA LETTER HI|ヒ}}
|{{chset-cell1|U+FF8C HALFWIDTH KATAKANA LETTER HU|フ}}
|{{chset-cell1|U+FF8D HALFWIDTH KATAKANA LETTER HE|ヘ}}
|{{chset-cell1|U+FF8E HALFWIDTH KATAKANA LETTER HO|ホ}}
|{{chset-cell1|U+FF8F HALFWIDTH KATAKANA LETTER MA|マ}}
|-
|{{chset-left1|Dx}}
|{{chset-cell1|U+FF90 HALFWIDTH KATAKANA LETTER MI|ミ}}
|{{chset-cell1|U+FF91 HALFWIDTH KATAKANA LETTER MU|ム}}
|{{chset-cell1|U+FF92 HALFWIDTH KATAKANA LETTER ME|メ}}
|{{chset-cell1|U+FF93 HALFWIDTH KATAKANA LETTER MO|モ}}
|{{chset-cell1|U+FF94 HALFWIDTH KATAKANA LETTER YA|ヤ}}
|{{chset-cell1|U+FF95 HALFWIDTH KATAKANA LETTER YU|ユ}}
|{{chset-cell1|U+FF96 HALFWIDTH KATAKANA LETTER YO|ヨ}}
|{{chset-cell1|U+FF97 HALFWIDTH KATAKANA LETTER RA|ラ}}
|{{chset-cell1|U+FF98 HALFWIDTH KATAKANA LETTER RI|リ}}
|{{chset-cell1|U+FF99 HALFWIDTH KATAKANA LETTER RU|ル}}
|{{chset-cell1|U+FF9A HALFWIDTH KATAKANA LETTER RE|レ}}
|{{chset-cell1|U+FF9B HALFWIDTH KATAKANA LETTER RO|ロ}}
|{{chset-cell1|U+FF9C HALFWIDTH KATAKANA LETTER WA|ワ}}
|{{chset-cell1|U+FF9D HALFWIDTH KATAKANA LETTER N|ン}}
|{{chset-cell1|U+FF9E HALFWIDTH KATAKANA VOICED SOUND MARK|゙}}
|{{chset-cell1|U+FF9F HALFWIDTH KATAKANA SEMI-VOICED SOUND MARK|゚}}
|-
|{{chset-left1|Ex}}
|style="background-color: #DFD;"|
|style="background-color: #DFD;"|
|style="background-color: #DFD;"|
|style="background-color: #DFD;"|
|style="background-color: #DFD;"|
|style="background-color: #DFD;"|
|style="background-color: #DFD;"|
|style="background-color: #DFD;"|
|style="background-color: #DFD;"|
|style="background-color: #DFD;"|
|style="background-color: #DFD;"|
|style="background-color: #DFD;"|
|style="background-color: #DFD;"|
|style="background-color: #DFD;"|
|style="background-color: #DFD;"|
|style="background-color: #DFD;"|
|-
|{{chset-left1|Fx}}
|style="background-color: #DFD;"|
|style="background-color: #DFD;"|
|style="background-color: #DFD;"|
|style="background-color: #DFD;"|
|style="background-color: #DFD;"|
|style="background-color: #DFD;"|
|style="background-color: #DFD;"|
|style="background-color: #DFD;"|
|style="background-color: #DFD;"|
|style="background-color: #DFD;"|
|style="background-color: #DFD;"|
|style="background-color: #DFD;"|
|style="background-color: #DFD;"|
|{{chset-cell1|||style=background:#DDD}}
|{{chset-cell1|||style=background:#DDD}}
|{{chset-cell1|||style=background:#DDD}}
|}
{{legend|#DFD|First bytes of double-byte Shift JIS characters.}}
= Alternative mapping of katakana =
The basic ISO-2022-JP profile does not permit the Kana set of JIS X 0201, only the Roman set and JIS X 0208 (although ISO 2022 / JIS X 0202 itself permits it). Accordingly, when converting JIS X 0201 katakana (or Unicode half-width kana, which use the same layout) to ISO-2022-JP, the following mapping or transformation is often used.{{refn|The WHATWG Encoding Standard, for instance, uses it as a transformation when encoding Unicode half-width kana data to ISO-2022-JP.{{refn|{{cite web|title=12.2.2. ISO-2022-JP encoder|url=https://encoding.spec.whatwg.org/#ref-for-index-iso-2022-jp-katakana%E2%91%A1|work=Encoding Standard|last=van Kesteren|first=Anne|author-link=Anne van Kesteren|publisher=WHATWG|date=2019-02-11}}}}}} This allows the kana to be converted to JIS X 0208.
In theory, this mapping is equally correct, as JIS X 0201 itself does not specify display width, although in practice (and especially in duospaced environments) JIS X 0201 is used for half-width katakana.
For ease of comparison with the chart above, the mapping is shown below over the JIS X 0201 katakana encoding and with the high bit set.
{{chset-table-header1|JIS X 0208 compatible mappings for JIS X 0201 katakana{{cite web|title=Index ISO-2022-JP Katakana|url=https://encoding.spec.whatwg.org/index-iso-2022-jp-katakana.txt|work=Encoding Standard|last=van Kesteren|first=Anne|author-link=Anne van Kesteren|publisher=WHATWG|date=2018-01-06}}}} | |
{{chset-left1|Ax}}
|{{chset-cell1 | |style=background:#DDD}}
|{{chset-cell1|U+3002 IDEOGRAPHIC FULL STOP|。 |
|{{chset-cell1|U+300C LEFT CORNER BRACKET|「|}}
|{{chset-cell1|U+300D RIGHT CORNER BRACKET|」|}}
|{{chset-cell1|U+3001 IDEOGRAPHIC COMMA|、|}}
|{{chset-cell1|U+30FB KATAKANA MIDDLE DOT|・|}}
|{{chset-cell1|U+30F2 KATAKANA LETTER WO|ヲ|}}
|{{chset-cell1|U+30A1 KATAKANA LETTER SMALL A|ァ|}}
|{{chset-cell1|U+30A3 KATAKANA LETTER SMALL I|ィ|}}
|{{chset-cell1|U+30A5 KATAKANA LETTER SMALL U|ゥ|}}
|{{chset-cell1|U+30A7 KATAKANA LETTER SMALL E|ェ|}}
|{{chset-cell1|U+30A9 KATAKANA LETTER SMALL O|ォ|}}
|{{chset-cell1|U+30E3 KATAKANA LETTER SMALL YA|ャ|}}
|{{chset-cell1|U+30E5 KATAKANA LETTER SMALL YU|ュ|}}
|{{chset-cell1|U+30E7 KATAKANA LETTER SMALL YO|ョ|}}
|{{chset-cell1|U+30C3 KATAKANA LETTER SMALL TU|ッ|}}
|-
|{{chset-left1|Bx}}
|{{chset-cell1|U+30FC KATAKANA-HIRAGANA PROLONGED SOUND MARK|ー|}}
|{{chset-cell1|U+30A2 KATAKANA LETTER A|ア|}}
|{{chset-cell1|U+30A4 KATAKANA LETTER I|イ|}}
|{{chset-cell1|U+30A6 KATAKANA LETTER U|ウ|}}
|{{chset-cell1|U+30A8 KATAKANA LETTER E|エ|}}
|{{chset-cell1|U+30AA KATAKANA LETTER O|オ|}}
|{{chset-cell1|U+30AB KATAKANA LETTER KA|カ|}}
|{{chset-cell1|U+30AD KATAKANA LETTER KI|キ|}}
|{{chset-cell1|U+30AF KATAKANA LETTER KU|ク|}}
|{{chset-cell1|U+30B1 KATAKANA LETTER KE|ケ|}}
|{{chset-cell1|U+30B3 KATAKANA LETTER KO|コ|}}
|{{chset-cell1|U+30B5 KATAKANA LETTER SA|サ|}}
|{{chset-cell1|U+30B7 KATAKANA LETTER SI|シ|}}
|{{chset-cell1|U+30B9 KATAKANA LETTER SU|ス|}}
|{{chset-cell1|U+30BB KATAKANA LETTER SE|セ|}}
|{{chset-cell1|U+30BD KATAKANA LETTER SO|ソ|}}
|-
|{{chset-left1|Cx}}
|{{chset-cell1|U+30BF KATAKANA LETTER TA|タ|}}
|{{chset-cell1|U+30C1 KATAKANA LETTER TI|チ|}}
|{{chset-cell1|U+30C4 KATAKANA LETTER TU|ツ|}}
|{{chset-cell1|U+30C6 KATAKANA LETTER TE|テ|}}
|{{chset-cell1|U+30C8 KATAKANA LETTER TO|ト|}}
|{{chset-cell1|U+30CA KATAKANA LETTER NA|ナ|}}
|{{chset-cell1|U+30CB KATAKANA LETTER NI|ニ|}}
|{{chset-cell1|U+30CC KATAKANA LETTER NU|ヌ|}}
|{{chset-cell1|U+30CD KATAKANA LETTER NE|ネ|}}
|{{chset-cell1|U+30CE KATAKANA LETTER NO|ノ|}}
|{{chset-cell1|U+30CF KATAKANA LETTER HA|ハ|}}
|{{chset-cell1|U+30D2 KATAKANA LETTER HI|ヒ|}}
|{{chset-cell1|U+30D5 KATAKANA LETTER HU|フ|}}
|{{chset-cell1|U+30D8 KATAKANA LETTER HE|ヘ|}}
|{{chset-cell1|U+30DB KATAKANA LETTER HO|ホ|}}
|{{chset-cell1|U+30DE KATAKANA LETTER MA|マ|}}
|-
|{{chset-left1|Dx}}
|{{chset-cell1|U+30DF KATAKANA LETTER MI|ミ|}}
|{{chset-cell1|U+30E0 KATAKANA LETTER MU|ム|}}
|{{chset-cell1|U+30E1 KATAKANA LETTER ME|メ|}}
|{{chset-cell1|U+30E2 KATAKANA LETTER MO|モ|}}
|{{chset-cell1|U+30E4 KATAKANA LETTER YA|ヤ|}}
|{{chset-cell1|U+30E6 KATAKANA LETTER YU|ユ|}}
|{{chset-cell1|U+30E8 KATAKANA LETTER YO|ヨ|}}
|{{chset-cell1|U+30E9 KATAKANA LETTER RA|ラ|}}
|{{chset-cell1|U+30EA KATAKANA LETTER RI|リ|}}
|{{chset-cell1|U+30EB KATAKANA LETTER RU|ル|}}
|{{chset-cell1|U+30EC KATAKANA LETTER RE|レ|}}
|{{chset-cell1|U+30ED KATAKANA LETTER RO|ロ|}}
|{{chset-cell1|U+30EF KATAKANA LETTER WA|ワ|}}
|{{chset-cell1|U+30F3 KATAKANA LETTER N|ン|}}
|{{chset-cell1|U+309B KATAKANA-HIRAGANA VOICED SOUND MARK|゛|fn={{efn|Gets mapped to correspond to the JIS X 0208 character (mapped to U+309B), not compatibility normalization (which would be U+3099, the combining version).{{cite web|title=5. Indexes|url=https://encoding.spec.whatwg.org/#index-iso-2022-jp-katakana|work=Encoding Standard|last=van Kesteren|first=Anne|author-link=Anne van Kesteren|publisher=WHATWG|date=2019-02-11}}}}}}
|{{chset-cell1|U+309C KATAKANA-HIRAGANA SEMI-VOICED SOUND MARK|゜|fn={{efn|Gets mapped to correspond to the JIS X 0208 character (mapped to U+309C), not compatibility normalization (which would be U+309A, the combining version).}}}}
|-
|}
Variants and extensions
=Shift JIS=
{{main|Shift JIS}}
=IBM's implementations=
Code page 897 is IBM's implementation of the 8-bit form of JIS X 0201. It includes several additional graphical characters in the C0 control characters area, and the code points in question may be used as control characters or graphical characters depending on the context,{{cite web | url=http://www-01.ibm.com/software/globalization/cp/cp00897.html | title=Code page identifiers - CP 00897 | publisher=IBM | work=IBM Globalization | archive-url=https://web.archive.org/web/20160317053427/http://www-01.ibm.com/software/globalization/cp/cp00897.html | archive-date=2016-03-17 | url-status=dead}} similarly in concept to OEM-US, but with different graphical characters. The C0 rows are shown below. IBM also designate pure 8-bit JIS X 0201 without these control code replacements as Code page 1139.{{cite web |archive-url=https://web.archive.org/web/20150708052353/http://www-03.ibm.com/systems/resources/systems_i_software_globalization_pdf_cp01139z.pdf |url=http://www-03.ibm.com/systems/resources/systems_i_software_globalization_pdf_cp01139z.pdf |archive-date=2015-07-08 |url-status=dead |title=Code Page 01139 |publisher=IBM |access-date=2021-10-22 }} Another variant, including a smaller subset of these C0 replacement graphics (including only the box drawing characters in {{mono|0x01–06, 0x10, 0x15–17 and 0x19}} and the line/arrow characters in {{mono|0x1B–1F}}), but using a different style of up-arrow ({{unichar|21E7}}) at {{mono|0x1C}}, is designated Code page 1086.{{cite web |archive-url=https://web.archive.org/web/20150708055629/http://www-03.ibm.com/systems/resources/systems_i_software_globalization_pdf_cp01086z.pdf |url=http://www-03.ibm.com/systems/resources/systems_i_software_globalization_pdf_cp01086z.pdf |archive-date=2015-07-08 |url-status=dead |title=Code Page 01086 |publisher=IBM |access-date=2021-10-22 }}
{{chset-table-header1|Code page 897, rows 0x00 and 0x10 only{{refn|1=Graphics are listed per CP00897.pdf and CP00897.txt provided by IBM.{{cite web | url=https://public.dhe.ibm.com/software/globalization/gcoc/attachments/CP00897.pdf | title=CP00897.pdf | publisher=IBM}}{{cite web | url=https://public.dhe.ibm.com/software/globalization/gcoc/attachments/CP00897.txt | title=CP00897.txt | publisher=IBM}} Controls are listed, in absence of graphical function or where they differ from ASCII, per the ibm-943_P130-1999 codec provided by IBM to International Components for Unicode{{cite web | url=http://demo.icu-project.org/icu-bin/convexp?conv=ibm-943_P130-1999 | title = Converter Explorer - ibm-943_P130-1999 | work = ICU Demonstration | publisher = International Components for Unicode}} (IBM-943 is a Code page 897 superset).{{cite web | url=http://www-01.ibm.com/software/globalization/ccsid/ccsid943.html | title=Coded character set identifiers - CCSID 943 | publisher=IBM | work=IBM Globalization | archive-url=https://web.archive.org/web/20160315110642/http://www-01.ibm.com/software/globalization/ccsid/ccsid943.html | archive-date=2016-03-15}} SUB is assigned to 0x7F.}}}} |
{{chset-left1|0x}}
|{{chset-ctrl1|U+0000 NULL|NUL}} |{{chset-cell1|U+2554 BOX DRAWINGS DOUBLE DOWN AND RIGHT|╔}} |{{chset-cell1|U+2557 BOX DRAWINGS DOUBLE DOWN AND LEFT|╗}} |{{chset-cell1|U+255A BOX DRAWINGS DOUBLE UP AND RIGHT|╚}} |{{chset-cell1|U+255D BOX DRAWINGS DOUBLE UP AND LEFT|╝}} |{{chset-cell1|U+2551 BOX DRAWINGS DOUBLE VERTICAL|║}} |{{chset-cell1|U+2550 BOX DRAWINGS DOUBLE HORIZONTAL|═}} |{{chset-cell1|U+FFEC HALFWIDTH DOWNWARDS ARROW|↓}} |{{chset-ctrl1|U+0008 BACKSPACE|BS}} |{{chset-cell1|U+FFEE HALFWIDTH WHITE CIRCLE|○}} |{{chset-ctrl1|U+000A LINE FEED (LF)|LF}} |{{chset-cell1|U+303F IDEOGRAPHIC HALF FILL SPACE|〿}} |{{chset-ctrl1|U+000C FORM FEED (FF)|FF}} |{{chset-ctrl1|U+000D CARRIAGE RETURN (CR)|CR}} |{{chset-cell1|U+FFED HALFWIDTH BLACK SQUARE|■}} |{{chset-cell1|U+263C WHITE SUN WITH RAYS|☼}} |
{{chset-left1|1x}}
|{{chset-cell1|U+256C BOX DRAWINGS DOUBLE VERTICAL AND HORIZONTAL|╬}} |{{chset-ctrl1|U+0011 DEVICE CONTROL ONE|DC1}} |{{chset-cell1|U+2195 UP DOWN ARROW|↕}} |{{chset-ctrl1|U+0013 DEVICE CONTROL THREE|DC3}} |{{chset-cell1|U+2593 DARK SHADE|▓}} |{{chset-cell1|U+2569 BOX DRAWINGS DOUBLE UP AND HORIZONTAL|╩}} |{{chset-cell1|U+2566 BOX DRAWINGS DOUBLE DOWN AND HORIZONTAL|╦}} |{{chset-cell1|U+2563 BOX DRAWINGS DOUBLE VERTICAL AND LEFT|╣}} |{{chset-ctrl1|U+0018 CANCEL|CAN}} |{{chset-cell1|U+2560 BOX DRAWINGS DOUBLE VERTICAL AND RIGHT|╠}} |{{chset-cell1|U+2591 LIGHT SHADE|░}} |{{chset-cell1|U+21B5 DOWNWARDS ARROW WITH CORNER LEFTWARDS|↵}} |{{chset-cell1|U+FFEA HALFWIDTH UPWARDS ARROW|↑}} |{{chset-cell1|U+FFE8 HALFWIDTH FORMS LIGHT VERTICAL|│}} |{{chset-cell1|U+FFEB HALFWIDTH RIGHTWARDS ARROW|→}} |{{chset-cell1|U+FFE9 HALFWIDTH LEFTWARDS ARROW|←}} |
IBM also implements the 7-bit Roman set of JIS X 0201 as Code page 895{{cite web | url=https://public.dhe.ibm.com/software/globalization/gcoc/attachments/CP00895.pdf | title=CP00895.pdf | publisher=IBM}} and the 7-bit Kana set as Code page 896 for use as ISO 2022 or EUC-JP code-sets. Code page 896, in addition to standard JIS X 0201 assignments, defines five additional assignments, shown below. Although use of these extended characters is not permitted by the associated CCSID 896,{{cite web | url=https://www-01.ibm.com/software/globalization/ccsid/ccsid896.html | title=Coded character set identifiers - CCSID 896 | publisher=IBM | work=IBM Globalization | archive-url=https://web.archive.org/web/20160326215214/http://www-01.ibm.com/software/globalization/ccsid/ccsid896.html | archive-date=2016-03-26 | url-status=dead}} they are permitted by the alternative CCSID 4992.{{cite web | url=https://www-01.ibm.com/software/globalization/ccsid/ccsid4992.html | title=Coded character set identifiers - CCSID 4992 | publisher=IBM | work=IBM Globalization | archive-url=https://web.archive.org/web/20160327022259/http://www-01.ibm.com/software/globalization/ccsid/ccsid4992.html | archive-date=2016-03-27 | url-status=dead}}
{{chset-table-header1|Code page 896, row 0x60 only{{cite web | url=https://public.dhe.ibm.com/software/globalization/gcoc/attachments/CP00896.pdf | title=CP00896.pdf | publisher=IBM}}}} | |||||||||||
{{chset-left1|6x}}
|{{chset-cell1|U+00A2 CENT SIGN|¢}} |{{chset-cell1|U+00A3 POUND SIGN|£}} |{{chset-cell1|U+00AC NOT SIGN|¬}} |{{chset-cell1|U+005C REVERSE SOLIDUS|\}} |{{chset-cell1|U+007E TILDE|~}} |{{chset-cell1 | |style=background:#DDD}}
|{{chset-cell1 | |style=background:#DDD}}
|{{chset-cell1 | |style=background:#DDD}}
|{{chset-cell1 | |style=background:#DDD}}
|{{chset-cell1 | |style=background:#DDD}}
|{{chset-cell1 | |style=background:#DDD}}
|{{chset-cell1 | |style=background:#DDD}}
|{{chset-cell1 | |style=background:#DDD}}
|{{chset-cell1 | |style=background:#DDD}}
|{{chset-cell1 | |style=background:#DDD}}
|{{chset-cell1 | |style=background:#DDD}} |
IBM's Code page 1041 is an extended version of Code page 897, encoding these five IBM extended{{citation | mode=cs1 | chapter-url=https://public.dhe.ibm.com/as400/products/clientaccess/win32/files/globalization/Japanese_EUC.pdf | chapter=11.2 - IBM Extended SBCS Set | title=IBM Japanese Graphic Character Set for Extended UNIX Code (EUC) | pages=315 | publisher=IBM}} characters in alternative locations which are compatible with Shift JIS (respectively {{mono|0x80, 0xA0, 0xFD, 0xFE and 0xFF}}).{{cite web | url=https://public.dhe.ibm.com/software/globalization/gcoc/attachments/CP01041.pdf | title=CP01041.pdf | publisher=IBM}} Code page 911, another extended 8-bit JIS X 0201 implementation (which uses the same C0 replacement graphics as Code page 1086) encodes the pound (sterling) sign (£) at {{mono|0xE1}}, similarly to Code page 896 with the eight bit set, but differs by encoding the cent sign (¢) at {{mono|0xE2}} and the not-sign (¬) at {{mono|0xE3}}.{{cite web |archive-url=https://web.archive.org/web/20150708023547/http://www-03.ibm.com/systems/resources/systems_i_software_globalization_pdf_cp00911z.pdf |url=http://www-03.ibm.com/systems/resources/systems_i_software_globalization_pdf_cp00911z.pdf |archive-date=2015-07-08 |url-status=dead |title=Code Page 00911 |publisher=IBM |access-date=2021-10-22 }}
{{anchor|Code page 903}}IBM's Code page 903 is encoded for use as the single byte component of certain simplified Chinese character encodings,{{cite web | url=https://www-01.ibm.com/software/globalization/cp/cp00903.html | title=Code page identifiers - CP 903 | publisher=IBM | work=IBM Globalization | archive-url=https://web.archive.org/web/20160317081123/http://www-01.ibm.com/software/globalization/cp/cp00903.html | archive-date=2016-03-17 | url-status=dead}} accompanying the ASCII-based Code page 904 used with traditional Chinese encodings.{{cite web | url=http://www-01.ibm.com/software/globalization/ccsid/ccsid904.html | title=Coded character set identifiers - CCSID 904 | publisher=IBM | work=IBM Globalization |archive-url=https://web.archive.org/web/20160327035829/http://www-01.ibm.com/software/globalization/ccsid/ccsid904.html |archive-date=2016-03-27 |url-status=dead}}{{cite web | url=https://public.dhe.ibm.com/software/globalization/gcoc/attachments/CP00904.pdf | title=CP00904.pdf | publisher=IBM}} Despite this, Code page 903 follows ISO 646-JP / the Roman half of JIS X 0201, in that it replaces the ASCII backslash {{mono|0x5C}} (rather than the ASCII dollar sign {{mono|0x24}} as in GB 1988 / ISO 646-CN) with the yen/yuan sign. It also uses the same C0 replacement graphics as code page 897.{{cite web | url=https://public.dhe.ibm.com/software/globalization/gcoc/attachments/CP00903.pdf | title=CP00903.pdf | publisher=IBM}} Code page 1042 extends code page 903 with the pound (sterling) sign at {{mono|0x80}}, and the not-sign, backslash and tilde at their Code page 1041 locations.{{cite web |archive-url=https://web.archive.org/web/20150708115828/http://www-03.ibm.com/systems/resources/systems_i_software_globalization_pdf_cp01042z.pdf |url=https://public.dhe.ibm.com/software/globalization/gcoc/attachments/CP01042.pdf |archive-date=2015-07-08 |url-status=live |title=Code Page 01042 |publisher=IBM}}
=Others=
File:PC-8001 character set.png|NEC PC-8001 (1979) character set as rendered in the 8×8 pixel font
File:NEC-C-6220-paths.svg|NEC variant used on the PC98 series.
Footnotes
{{notelist}}
References
{{Reflist}}
External links
- [http://www.cqpub.co.jp/interface/toku/2002/200212/img/zu5.gif Diagram of JIS X 0201 (as 7-bit code sets)]
{{character encoding}}