Views User Guides > Foundation > Internationalization > Reference: Encoding Listings
Reference: Encoding Listings
The following encodings are supported by Views:
*US-ASCII
*ISO-8859-1 (Latin1)
*ISO-8859-2 (Latin2)
*ISO-8859-3 (Latin3)
*ISO-8859-4 (Latin4)
*ISO-8859-5 (LatinCyrillic)
*ISO-8859-6 (LatinArabic)
*ISO-8859-7 (LatinGreek)
*ISO-8859-8 (LatinHebrew)
*ISO-8859-9 (Latin5)
*ISO-8859-10 (Latin6)
*ISO-8859-11 (LatinThai)
*ISO-8859-13 (Latin7)
*ISO-8859-14 (Latin8)
*SO-8859-15 (Latin9)
*EUC-JP
*Shift_JIS
*EUC-KR
*GB2312
*Big5
*Big5-HKSCS
*EUC-TW
*hp-roman8
*IBM850
*windows-1250
*windows-1251
*windows-1252
*windows-1253
*windows-1254
*windows-1255
*windows-1256
*windows-1257
*windows-1258
*windows-874
*windows-949
*UTF-8
ISO-8859-1
Latin1 covers most West European languages, such as:
*Afrikaans (af)
*Albanian (sq)
*Basque (eu)
*Catalan (ca)
*Danish (da)
*Dutch (nl)
*English (en)
*Faroese (fo)
*Finnish (fi)
*French (fr)
*Galician (gl)
*German (de)
*Icelandic (is)
*Irish (ga)
*Italian (it)
*Norwegian (no)
*Portuguese (pt)
*Scottish (gd)
*Spanish (es)
*Swedish (sv)
ISO-8859-2
Latin2 covers the languages of Central and Eastern Europe:
*Croatian (hr),
*Czech (cs),
*Hungarian (hu),
*Polish (pl),
*Romanian (ro),
*Slovak (sk),
*Slovenian (sl)
ISO-8859-3
Latin3 is popular with authors of Esperanto (eo), Maltese (mt), and it covered Turkish before the introduction of Latin5.
ISO-8859-4
Latin4 introduced letters for Estonian, Baltic languages, Latvian and Lithuanian, Greenlandic and Lappish. It is an incomplete precursor of Latin6.
ISO-8859-5
With these Cyrillic letters you can type Bulgarian (bg), Byelorussian (be), Macedonian (mk), Russian (ru), Serbian (sr) and Ukrainian (uk).
ISO-8859-6
This is the Arabic (ar) alphabet.
Note: This version of Views does not support bidirectional text.
ISO-8859-7
This is modern Greek (el).
ISO-8859-8
This is Hebrew (iw).
Note: This version of Views does not support bidirectional text.
ISO-8859-9
Latin5 replaces the rarely needed Icelandic letters in Latin1 with the Turkish (tr) ones.
ISO-8859-10
Latin6 rearranged Latin4, added the last missing Inuit (Greenlandic Eskimo) and non-Skolt Sami (Lappish) letters, and reintroduced the rarely Icelandic letters to cover the entire Nordic area:
*Estonian (et)
*Lapp
*Latvian (lv)
*Lithuanian (lt)
Skolt Sami still needs a few more accents.
ISO-8859-11
To cover the Thai language. On UNIX systems, this is similar to the tis620 encoding.
ISO-8859-13
To cover the Baltic Rim. Latin7 is going to cover the Baltic Rim and re-establish the Latvian (lv) support lost in Latin6 and may introduce the local quotation marks. It resembles WinBaltic, that is, windows-1257.
ISO-8859-14
To cover Celtic. Latin8 adds the last Gaelic and Welsh (cy) letters to Latin1 to cover all Celtic languages.
ISO-8859-15
Similar to Latin1 with euro and oe ligature. The new Latin9 nicknamed Latin0 aims to update Latin1 by replacing the less needed symbols ¨´¸ with forgotten French and Finnish letters and placing the U+20AC Euro sign in the cell =A4 of the former international currency sign ¤.
EUC-JP
Extended UNIX Code for Japanese.
Standardized by UNIX International and UNIX Systems Laboratories Pacific. Uses ISO 2022 rules to select:
*code set 0: JIS Roman (a single 7-bit byte set)
*code set 1: JIS X0208-1990 (a double 8-bit byte set) restricted to A0-FF in both bytes
*code set 2: Half Width Katakana (a single 7-bit byte set) requiring SS2 as the character prefix
*code set 3: JIS X0212-1990 (a double 7-bit byte set) restricted to A0-FF in both bytes requiring SS3 as the character prefix
Shift_JIS
A Microsoft code that extends csHalfWidthKatakana to include kanji by adding a second byte when the value of the first byte is in the ranges 81-9F or E0-EF.
EUC-KR (KS C 5861-1992)
Extended UNIX Code for Korean.
GB2312
Multibyte encoding standardized by the People’s Republic of China.
Big5
Multibyte encoding standardized by Taiwan
Big5-HKSCS
Hong-Kong Supplementary Character Set
EUC-TW (cns11643)
Extended UNIX Code for Traditional Chinese
hp-roman8
HP specific
IBM850
IBM specific
windows-1250
Windows 3.1 Eastern European languages.
windows-1251
Windows 3.1 Cyrillic
windows-1252
Windows 3.1 US (ANSI)
windows-1253
Windows 3.1 Greek
windows-1254
Windows 3.1 Turkish
Note: This version of Views is only supported with the Views BIDI version.
windows-1255
Hebrew
Note: This version of Views does not support bidirectional text.
windows-1256
Arabic
Note: This version of Views is only supported with the Views BIDI version.
windows-1257
Baltic
windows-1258
Vietnamese
windows-874
Thai
windows-949
Korean (Wansung)
UTF-8
Unicode UTF-8
Published date: 05/24/2022
Last modified date: 02/24/2022