international foreign language codes - ifls-edu.com

21
INTERNATIONAL FOREIGN LANGUAGE CODES List of ISO 639-1 codes - Content taken from Wikipedia. ISO 639 is a standardized nomenclature used to classify languages. Each language is assigned a two-letter (639-1) and three-letter (639-2 and 639-3) lowercase abbreviation, amended in later versions of the nomenclature. This table lists all of: ISO 639-1: two-letter codes, one per language for ISO 639 macrolanguage And some of: ISO 639-2/T: three-letter codes, for the same languages as 639-1 ISO 639-2/B: three-letter codes, mostly the same as 639-2/T, but with some codes derived from English names rather than native names of languages (in the following table, these differing codes are highlighted in boldface) ISO 639-3: three-letter codes, the same as 639-2/T for languages, but with distinct codes for each variety of an ISO 639 macrolanguage Note: Colors on the leftmost column represent the language family mentioned in the second column. Sl. No. Language Family ISO Language Name Native Name 639- 1 639- 2/T 639- 2/B 639-3 Notes 1 Northwest Caucasian Abkhazian аҧсуа бызшәа, аҧсшәа ab abk abk abk also known as Abkhaz 2 Afro-Asiatic Afar Afaraf aa aar aar aar 3 Indo- European Afrikaans Afrikaans af afr afr afr 4 Niger–Congo Akan Akan ak aka aka aka + 2 macrolanguage, Twi is tw/twi, Fanti is fat 5 Indo- European Albanian Shqip sq sqi alb sqi + 4 macrolanguage, "Albanian Phylozone" in 639- 6 6 Afro-Asiatic Amharic አማርኛ am amh amh amh 7 Afro-Asiatic Arabic العربيةar ara ara ara + 29 macrolanguage, Standard Arabic is arb 8 Indo- European Aragonese aragonés an arg arg arg 9 Indo- European Armenian Հայերեն hy hye arm hye also known as Հայերէն; ISO 639-3 code hye is for Eastern Armenian, hyw is for Western Armenian, and xcl is for Classical Armenian 10 Indo- European Assamese অসমীয as asm asm asm

Upload: others

Post on 26-Nov-2021

26 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: INTERNATIONAL FOREIGN LANGUAGE CODES - ifls-edu.com

INTERNATIONAL FOREIGN LANGUAGE CODES

List of ISO 639-1 codes - Content taken from Wikipedia.

ISO 639 is a standardized nomenclature used to classify languages. Each language is assigned a two-letter (639-1) and three-letter (639-2 and 639-3) lowercase abbreviation, amended in later versions of the nomenclature.

This table lists all of:

• ISO 639-1: two-letter codes, one per language for ISO 639 macrolanguage

And some of:

• ISO 639-2/T: three-letter codes, for the same languages as 639-1 • ISO 639-2/B: three-letter codes, mostly the same as 639-2/T, but with some codes derived from English names rather than native names of languages (in

the following table, these differing codes are highlighted in boldface) • ISO 639-3: three-letter codes, the same as 639-2/T for languages, but with distinct codes for each variety of an ISO 639 macrolanguage

Note: Colors on the leftmost column represent the language family mentioned in the second column.

Sl. No.

Language Family

ISO Language Name

Native Name 639-

1 639-2/T

639-2/B

639-3 Notes

1 Northwest Caucasian

Abkhazian аҧсуа бызшәа, аҧсшәа

ab abk abk abk also known as Abkhaz

2 Afro-Asiatic Afar Afaraf aa aar aar aar

3 Indo-European

Afrikaans Afrikaans af afr afr afr

4 Niger–Congo Akan Akan ak aka aka aka +

2

macrolanguage, Twi is tw/twi, Fanti is fat

5 Indo-European

Albanian Shqip sq sqi alb sqi + 4

macrolanguage, "Albanian Phylozone" in 639-6

6 Afro-Asiatic Amharic አማርኛ am amh amh amh

7 Afro-Asiatic Arabic العربية ar ara ara ara +

29

macrolanguage, Standard Arabic is arb

8 Indo-European

Aragonese aragonés an arg arg arg

9 Indo-European

Armenian Հայերեն hy hye arm hye

also known as Հայերէն; ISO 639-3 code hye is for Eastern Armenian, hyw is for Western Armenian, and xcl is for Classical Armenian

10 Indo-European

Assamese অসমীয়া as asm asm asm

Page 2: INTERNATIONAL FOREIGN LANGUAGE CODES - ifls-edu.com

11 Northeast Caucasian

Avaric авар мацӀ, магӀарул мацӀ

av ava ava ava also known as Avar

12 Indo-European

Avestan avesta ae ave ave ave ancient

13 Aymaran Aymara aymar aru ay aym aym aym +

2 macrolanguage

14 Turkic Azerbaijani azərbaycan dili, تۆرکجه

az aze aze aze +

2 macrolanguage

15 also known as Azeri

16

17 Niger–Congo Bambara bamanankan bm bam bam bam

18 Turkic Bashkir башҡорт теле ba bak bak bak

19 Language isolate

Basque euskara, euskera

eu eus baq eus

20 Indo-European

Belarusian беларуская мова

be bel bel bel

21 Indo-European

Bengali ব়াাংল়া bn ben ben ben also known as Bangla

22 Indo-European

Bihari languages

भोजपुरी bh bih bih

collective language code for Bhojpuri, Magahi, and Maithili

23 Creole Bislama Bislama bi bis bis bis

Language formed from English and Ni-Vanuatu, with some French influence.

24 Indo-European

Bosnian bosanski jezik bs bos bos bos

25 Indo-European

Breton brezhoneg br bre bre bre

26 Indo-European

Bulgarian български език bg bul bul bul

27 Sino-Tibetan Burmese ဗမာစာ my mya bur mya also known as Myanmar

28 Indo-European

Catalan, Valencian

català, valencià ca cat cat cat

29 Austronesian Chamorro Chamoru ch cha cha cha

30 Northeast Caucasian

Chechen нохчийн мотт ce che che che

31 Niger–Congo Chichewa, Chewa, Nyanja

chiCheŵa, chinyanja

ny nya nya nya

32 Sino-Tibetan Chinese

中文

(Zhōngwén), 汉

语, 漢語

zh zho chi zho +

16 macrolanguage

33 Turkic Chuvash чӑваш чӗлхи cv chv chv chv

34 Indo-European

Cornish Kernewek kw cor cor cor

Page 3: INTERNATIONAL FOREIGN LANGUAGE CODES - ifls-edu.com

35 Indo-European

Corsican corsu, lingua corsa

co cos cos cos

36 Algonquian Cree ᓀᐦᐃᔭᐍᐏᐣ cr cre cre cre +

6 macrolanguage

37 Indo-European

Croatian hrvatski jezik hr hrv hrv hrv

38 Indo-European

Czech čeština, český jazyk

cs ces cze ces

39 Indo-European

Danish dansk da dan dan dan

40 Indo-European

Divehi, Dhivehi, Maldivian

dv div div div ދިވެހިި

41 Indo-European

Dutch, Flemish Nederlands, Vlaams

nl nld dut nld

Flemish is not to be confused with the closely related West Flemish which is referred to as Vlaams (Dutch for "Flemish") in ISO 639-3 and has the ISO 639-3 code vls

42 Sino-Tibetan Dzongkha རྫོང་ཁ dz dzo dzo dzo

43 Indo-European

English English en eng eng eng

44 Constructed Esperanto Esperanto eo epo epo epo constructed, initiated from L.L. Zamenhof, 1887

45 Uralic Estonian eesti, eesti keel et est est est +

2 macrolanguage

46 Niger–Congo Ewe Eʋegbe ee ewe ewe ewe

47 Indo-European

Faroese føroyskt fo fao fao fao

48 Austronesian Fijian vosa Vakaviti fj fij fij fij

49 Uralic Finnish suomi, suomen kieli

fi fin fin fin

50 Indo-European

French français, langue française

fr fra fre fra

51 Niger–Congo Fulah Fulfulde, Pulaar, Pular

ff ful ful ful + 9 macrolanguage, also known as Fula

52 Indo-European

Galician Galego gl glg glg glg

53 Kartvelian Georgian ქართული ka kat geo kat

54 Indo-European

German Deutsch de deu ger deu

55 Indo-European

Greek, Modern (1453–)

ελληνικά el ell gre ell

56 Tupian Guarani Avañe'ẽ gn grn grn grn +

5 macrolanguage

Page 4: INTERNATIONAL FOREIGN LANGUAGE CODES - ifls-edu.com

57 Indo-European

Gujarati ગુજરાતી gu guj guj guj

58 Creole Haitian, Haitian Creole

Kreyòl ayisyen ht hat hat hat

59 Afro-Asiatic Hausa (Hausa) َوُسَ ha hau hau hau ه

60 Afro-Asiatic Hebrew עברית he heb heb heb

Modern Hebrew. Code changed in 1989 from original ISO 639:1988, iw.[1]

61 Niger–Congo Herero Otjiherero hz her her her

62 Indo-European

Hindi हिन्दी, हििंदी hi hin hin hin

63 Austronesian Hiri Motu Hiri Motu ho hmo hmo hmo

64 Uralic Hungarian magyar hu hun hun hun

65 Constructed

Interlingua (International Auxiliary Language Association)

Interlingua ia ina ina ina

constructed by International Auxiliary Language Association

66 Austronesian Indonesian Bahasa Indonesia

id ind ind ind

Covered by macrolanguage ms/msa. Changed in 1989 from original ISO 639:1988, in.[1]

67 Constructed Interlingue, Occidental

(originally:) Occidental, (after WWII:) Interlingue

ie ile ile ile

constructed by Edgar de Wahl, first published in 1922

68 Indo-European

Irish Gaeilge ga gle gle gle

69 Niger–Congo Igbo Asụsụ Igbo ig ibo ibo ibo

70 Eskimo–Aleut Inupiaq Iñupiaq, Iñupiatun

ik ipk ipk ipk +

2 macrolanguage

71 Constructed Ido Ido io ido ido ido

constructed by De Beaufront, 1907, as variation of Esperanto

72 Indo-European

Icelandic Íslenska is isl ice isl

73 Indo-European

Italian Italiano it ita ita ita

74 Eskimo–Aleut Inuktitut ᐃᓄᒃᑎᑐᑦ iu iku iku iku +

2 macrolanguage

75 Japonic Japanese 日本語 (にほん

ご) ja jpn jpn jpn

76 Austronesian Javanese ꦧꦱꦗꦮ, Basa jv jav jav jav

Page 5: INTERNATIONAL FOREIGN LANGUAGE CODES - ifls-edu.com

Jawa

77 Eskimo–Aleut Kalaallisut, Greenlandic

kalaallisut, kalaallit oqaasii

kl kal kal kal

78 Dravidian Kannada ಕನ್ನ ಡ kn kan kan kan

79 Nilo-Saharan Kanuri Kanuri kr kau kau kau +

3 macrolanguage

80 Indo-European

Kashmiri कश्मीरी, ي ks kas kas kas كشمير

81 Turkic Kazakh қазақ тілі kk kaz kaz kaz

82 Austroasiatic Central Khmer

ខ្មែរ,

ខមមរភាសា,

ភាសាខ្មែរ

km khm khm khm also known as Khmer or Cambodian

83 Niger–Congo Kikuyu, Gikuyu Gĩkũyũ ki kik kik kik

84 Niger–Congo Kinyarwanda Ikinyarwanda rw kin kin kin

85 Turkic Kirghiz, Kyrgyz Кыргызча, Кыргыз тили

ky kir kir kir

86 Uralic Komi коми кыв kv kom kom kom +

2 macrolanguage

87 Niger–Congo Kongo Kikongo kg kon kon kon +

3 macrolanguage

88 Koreanic Korean 한국어 ko kor kor kor

89 Indo-European

Kurdish Kurdî, کوردی ku kur kur kur +

3 macrolanguage

90 Niger–Congo Kuanyama, Kwanyama

Kuanyama kj kua kua kua

91 Indo-European

Latin latine, lingua latina

la lat lat lat ancient

92 Indo-European

Luxembourgish, Letzeburgesch

Lëtzebuergesch lb ltz ltz ltz

93 Niger–Congo Ganda Luganda lg lug lug lug

94 Indo-European

Limburgan, Limburger, Limburgish

Limburgs li lim lim lim

95 Niger–Congo Lingala Lingála ln lin lin lin

96 Tai–Kadai Lao ພາສາລາວ lo lao lao lao

97 Indo-European

Lithuanian lietuvių kalba lt lit lit lit

98 Niger–Congo Luba-Katanga Kiluba lu lub lub lub also known as Luba-Shaba

99 Indo-European

Latvian latviešu valoda lv lav lav lav + 2 macrolanguage

100 Indo-European

Manx Gaelg, Gailck gv glv glv glv

101 Indo-European

Macedonian македонски јазик

mk mkd mac mkd

102 Austronesian Malagasy fiteny malagasy mg mlg mlg mlg +

11 macrolanguage

Page 6: INTERNATIONAL FOREIGN LANGUAGE CODES - ifls-edu.com

103 Austronesian Malay Bahasa Melayu, بهاس ملايو

ms msa may msa +

36

macrolanguage, Standard Malay is zsm, Indonesian is id/ind

104 Dravidian Malayalam മലയാളം ml mal mal mal

105 Afro-Asiatic Maltese Malti mt mlt mlt mlt

106 Austronesian Māori te reo Māori mi mri mao mri also known as Māori

107 Indo-European

Marathi मराठी mr mar mar mar also known as Marāṭhī

108 Austronesian Marshallese Kajin M̧ajeļ mh mah mah mah

109 Mongolic Mongolian Монгол хэл mn mon mon mon +

2 macrolanguage

110 Austronesian Nauru Dorerin Naoero na nau nau nau also known as Nauruan

111 Dené–Yeniseian

Navajo, Navaho Diné bizaad nv nav nav nav

112 Niger–Congo North Ndebele isiNdebele nd nde nde nde also known as Northern Ndebele

113 Indo-European

Nepali नेपाली ne nep nep nep +

2 macrolanguage

114 Niger–Congo Ndonga Owambo ng ndo ndo ndo

115 Indo-European

Norwegian Bokmål

Norsk Bokmål nb nob nob nob Covered by macrolanguage no/nor

116 Indo-European

Norwegian Nynorsk

Norsk Nynorsk nn nno nno nno Covered by macrolanguage no/nor

117 Indo-European

Norwegian Norsk no nor nor nor +

2

macrolanguage, Bokmål is nb/nob, Nynorsk is nn/nno

118 Sino-Tibetan Sichuan Yi, Nuosu

ꆈꌠ꒿ Nuosuhxop

ii iii iii iii Standard form of Yi languages

119 Niger–Congo South Ndebele isiNdebele nr nbl nbl nbl also known as Southern Ndebele

120 Indo-European

Occitan occitan, lenga d'òc

oc oci oci oci

121 Algonquian Ojibwa ᐊᓂᔑᓈᐯᒧᐎᓐ oj oji oji oji + 7 macrolanguage, also known as Ojibwe

122 Indo-European

Church Slavic, Old Slavonic, Church Slavonic, Old Bulgarian, Old Church Slavonic

ѩзыкъ словѣньскъ

cu chu chu chu ancient, in use by Orthodox Church

123 Afro-Asiatic Oromo Afaan Oromoo om orm orm orm +

4 macrolanguage

Page 7: INTERNATIONAL FOREIGN LANGUAGE CODES - ifls-edu.com

124 Indo-European

Oriya ଓଡ଼ିଆ or ori ori ori + 2 macrolanguage, also known as Odia

125 Indo-European

Ossetian, Ossetic

ирон æвзаг os oss oss oss

126 Indo-European

Punjabi, Panjabi

ਪੰਜਾਬੀ, پنجاب pa pan pan pan

127 Indo-European

Pali पाहल, पाहि pi pli pli pli ancient, also known as Pāli

128 Indo-European

Persian فارسی fa fas per fas + 2 macrolanguage, also known as Farsi

129 Indo-European

Polish język polski, polszczyzna

pl pol pol pol

130 Indo-European

Pashto, Pushto پښتو ps pus pus pus +

3 macrolanguage

131 Indo-European

Portuguese Português pt por por por

132 Quechuan Quechua Runa Simi, Kichwa

qu que que que +

43 macrolanguage

133 Indo-European

Romansh Rumantsch Grischun

rm roh roh roh

134 Niger–Congo Rundi Ikirundi rn run run run also known as Kirundi

135 Indo-European

Romanian, Moldavian, Moldovan

Română, Moldovenească

ro ron rum ron

The identifiers mo and mol are deprecated, leaving ro and ron (639-2/T) and rum (639-2/B) the current language identifiers to be used for the variant of the Romanian language also known as Moldavian and Moldovan in English and moldave in French. The identifiers mo and mol will not be assigned to different items, and recordings using these identifiers will not be invalid.

136 Indo- Russian русский ru rus rus rus

Page 8: INTERNATIONAL FOREIGN LANGUAGE CODES - ifls-edu.com

European

137 Indo-European

Sanskrit सिंसृ्कतम्, sa san san san ancient

138 Indo-European

Sardinian sardu sc srd srd srd +

4 macrolanguage

139 Indo-European

Sindhi हसन्धी, ،سنڌي

سندھیsd snd snd snd

140 Uralic Northern Sami Davvisámegiella se sme sme sme

141 Austronesian Samoan gagana fa'a Samoa

sm smo smo smo

142 Creole Sango yângâ tî sängö sg sag sag sag

143 Indo-European

Serbian српски језик sr srp srp srp

The ISO 639-2/T code srp deprecated the ISO 639-2/B code scc[2]

144 Indo-European

Gaelic, Scottish Gaelic

Gàidhlig gd gla gla gla

145 Niger–Congo Shona chiShona sn sna sna sna

146 Indo-European

Sinhala, Sinhalese

සිංහල si sin sin sin

147 Indo-European

Slovak Slovenčina, Slovenský jazyk

sk slk slo slk

148 Indo-European

Slovenian Slovenski jezik, Slovenščina

sl slv slv slv also known as Slovene

149 Afro-Asiatic Somali Soomaaliga, af Soomaali

so som som som

150 Niger–Congo Southern Sotho Sesotho st sot sot sot

151 Indo-European

Spanish, Castilian

Español es spa spa spa

152 Austronesian Sundanese Basa Sunda su sun sun sun

153 Niger–Congo Swahili Kiswahili sw swa swa swa +

2 macrolanguage

154 Niger–Congo Swati SiSwati ss ssw ssw ssw also known as Swazi

155 Indo-European

Swedish Svenska sv swe swe swe

156 Dravidian Tamil தமிழ் ta tam tam tam

157 Dravidian Telugu తెలుగు te tel tel tel

158 Indo-European

Tajik тоҷикӣ, toçikī, تاجیکی

tg tgk tgk tgk

159 Tai–Kadai Thai ไทย th tha tha tha

160 Afro-Asiatic Tigrinya ትግርኛ ti tir tir tir

161 Sino-Tibetan Tibetan བྫོད་ཡགི bo bod tib bod also known as Standard Tibetan

162 Turkic Turkmen Türkmen, Түркмен

tk tuk tuk tuk

Page 9: INTERNATIONAL FOREIGN LANGUAGE CODES - ifls-edu.com

163 Austronesian Tagalog Wikang Tagalog tl tgl tgl tgl Note: Filipino (Pilipino) has the code fil

164 Niger–Congo Tswana Setswana tn tsn tsn tsn

165 Austronesian Tonga (Tonga Islands)

Faka Tonga to ton ton ton also known as Tongan

166 Turkic Turkish Türkçe tr tur tur tur

167 Niger–Congo Tsonga Xitsonga ts tso tso tso

168 Turkic Tatar татар теле, tatar tele

tt tat tat tat

169 Niger–Congo Twi Twi tw twi twi twi Covered by macrolanguage ak/aka

170 Austronesian Tahitian Reo Tahiti ty tah tah tah

One of the Reo Mā`ohi (languages of French Polynesia)[3]

171 Turkic Uighur, Uyghur ,ئۇيغۇرچەUyghurche

ug uig uig uig

172 Indo-European

Ukrainian Українська uk ukr ukr ukr

173 Indo-European

Urdu اردو ur urd urd urd

174 Turkic Uzbek Oʻzbek, Ўзбек, أۇزبېك

uz uzb uzb uzb +

2 macrolanguage

175 Niger–Congo Venda Tshivenḓa ve ven ven ven

176 Austroasiatic Vietnamese Tiếng Việt vi vie vie vie

177 Constructed Volapük Volapük vo vol vol vol constructed

178 Indo-European

Walloon Walon wa wln wln wln

179 Indo-European

Welsh Cymraeg cy cym wel cym

180 Niger–Congo Wolof Wollof wo wol wol wol

181 Indo-European

Western Frisian Frysk fy fry fry fry also known as Frisian

182 Niger–Congo Xhosa isiXhosa xh xho xho xho

183 Indo-European

Yiddish ייִדיש yi yid yid yid +

2

macrolanguage. Changed in 1989 from original ISO 639:1988, ji.[1]

184 Niger–Congo Yoruba Yorùbá yo yor yor yor

185 Tai–Kadai Zhuang, Chuang

Saɯ cueŋƅ, Saw cuengh

za zha zha zha +

16 macrolanguage

186 Niger–Congo Zulu isiZulu zu zul zul zul

Code for the Representation of the Names of Languages. From ISO 639,

Page 10: INTERNATIONAL FOREIGN LANGUAGE CODES - ifls-edu.com

revised 1989.

ISO 639. Code for the representation of the names of languages.

[2001-08-29] Note: See now the updated document "Language Identifiers in the Markup Context."

Note. See also related materials:

• NISO 3-character language codes (Z39.53-1994) [unofficial]

• Bibliography entry for ISO 639:1988 (E/F). Code for the Representation of Names of Languages

• Bibliography entry for ISO CD 639/2:1991. Code for the Representation of Names of Languages: alpha-3 Code

[December 20, 1997] Updated to reflect six changes from the "1989" revision by the ISO 639 Registration Authority; see the note at the end of the document itemizing the edits. The partial listing of ISO 639 two-character codes is supplied here will supplement the shorter lists given in Martin Bryan (SGML: An Author's Guide to the Standard Generalized Markup Language, 92-93) and Eric van Herwijnen (Practical SGML, 67-68). The two-character language codes of ISO 639 are relevant to SGML encoding in two respects. First, the SGML standard (ISO 8879) itself specifies that declaration of public text language should be given using the language code(s) from ISO 639; see ISO 8879-1986(E) page 36, section 10.2.2.3. Second, the WSD (Writing System Declaration) implemented in the Text Encoding Initiative uses the [two-character] language code of ISO 639 (as amended) as a language.code attribute of the nat.language declaration, specifying the language in which the WSD is written. The information on 2-character language codes summarized below has been taken from ISO 639 Code for the representation of the names of languages. First edition, 1988-04-01. Reference number: ISO 639: 1988 (E/F). iii + 17 pages. ISO 639:1988 is a technical revision of ISO 639: 1967, prepared by Technical Committee ISO/TC 37. The language codes are listed in ISO 639 with lowercase letters, but are given here in uppercase, as recommended for use as SGML tag names for "public text language." See ISO 8879 section 10.2.2.3: "the 'public text language' must be a two-character name, entered with upper-case letters." ISO 639 contains much other information about the use of language symbols, registration of new symbols, etc. The language codes of ISO 639 are said to be "devised primarily for use in terminology, lexicography and linguistics, but they may be used for any application requiring the expression of languages in coded form." The registration authority for ISO 639 is given as Infoterm, Österreiches Normungsinstitut (ON), Postfach 130, A-1021 Vienna, AUSTRIA. The two-character language codes of ISO 639 are recognized as being inadequate for use as SGML language attributes when tagging text, viz, for use as global lang attributes attached to any element to identify the language of the text element or a language shift. On lang as a global attribute, see the TEI Guidelines, page 45, section 3.2.1. In principle, there should be nothing wrong with tagging language using SGML elements rather than attributes, if the encoder has principled reasons for not using attributes (e.g., indexing engines which read simple tags but not SGML attributes). But the two-character codes of ISO 639 are neither sufficiently mnemonic nor complete for the world's languages: whereas ISO 639 supplies codes for only about 136 languages, the Ethnologue published by the Summer Institute of Linguistics identifies over 6100 languages (see Ethnologue: Languages of the World, ed. Barbara Grimes. 11th edition. Dallas, TX: Summer Institute of Linguistics, 1988). A revision of ISO 639 completed late 1990 is described as supplying 3-character language codes (following MARC 3-character language codes in part), based upon the code sequence of the American National Standard (ANSI Z39.53). This draft will be circulated for worldwide review in 1991. It remains to be seen whether these new ISO 639 3-character codes qualify mnemonically for use in SGML tagging and if the set is complete. Provisionally, and as a convenience, the set of 3-character MARC language codes are supplied in this appendix. Where they are mnemonic, unique and adequately distinguish dialectical variants, it would seem permissible to use them for lang attribute values or as language tags. ISO 639 CODES ALPHABETIC BY LANGUAGE NAME (ENGLISH SPELLING)

Page 11: INTERNATIONAL FOREIGN LANGUAGE CODES - ifls-edu.com

Changes made December 20, 1997, based upon information in the following note from a member of the W3C HTML group:

LANGUAGE NAME CODE LANGUAGE FAMILY

ABKHAZIAN AB IBERO-CAUCASIAN

AFAN (OROMO) OM HAMITIC

AFAR AA HAMITIC

AFRIKAANS AF GERMANIC

ALBANIAN SQ INDO-EUROPEAN (OTHER)

AMHARIC AM SEMITIC

ARABIC AR SEMITIC

ARMENIAN HY INDO-EUROPEAN (OTHER)

ASSAMESE AS INDIAN

AYMARA AY AMERINDIAN

AZERBAIJANI AZ TURKIC/ALTAIC

BASHKIR BA TURKIC/ALTAIC

BASQUE EU BASQUE

BENGALI;BANGLA BN INDIAN

BHUTANI DZ ASIAN

BIHARI BH INDIAN

BISLAMA BI [not given]

BRETON BR CELTIC

BULGARIAN BG SLAVIC

BURMESE MY ASIAN

BYELORUSSIAN BE SLAVIC

CAMBODIAN KM ASIAN

CATALAN CA ROMANCE

CHINESE ZH ASIAN

CORSICAN CO ROMANCE

CROATIAN HR SLAVIC

CZECH CS SLAVIC

DANISH DA GERMANIC

DUTCH NL GERMANIC

ENGLISH EN GERMANIC

ESPERANTO EO INTERNATIONAL AUX.

ESTONIAN ET FINNO-UGRIC

FAROESE FO GERMANIC

FIJI FJ OCEANIC/INDONESIAN

FINNISH FI FINNO-UGRIC

FRENCH FR ROMANCE

FRISIAN FY GERMANIC

GALICIAN GL ROMANCE

Page 12: INTERNATIONAL FOREIGN LANGUAGE CODES - ifls-edu.com

GEORGIAN KA IBERO-CAUCASIAN

GERMAN DE GERMANIC

GREEK EL LATIN/GREEK

GREENLANDIC KL ESKIMO

GUARANI GN AMERINDIAN

GUJARATI GU INDIAN

HAUSA HA NEGRO-AFRICAN

HEBREW HE SEMITIC [*Changed 1989 from original ISO 639:1988, IW]

HINDI HI INDIAN

HUNGARIAN HU FINNO-UGRIC

ICELANDIC IS GERMANIC

INDONESIAN ID OCEANIC/INDONESIAN [*Changed 1989 from original ISO 639:1988, IN]

INTERLINGUA IA INTERNATIONAL AUX.

INTERLINGUE IE INTERNATIONAL AUX.

INUKTITUT IU [ ]

INUPIAK IK ESKIMO

IRISH GA CELTIC

ITALIAN IT ROMANCE

JAPANESE JA ASIAN

JAVANESE JV OCEANIC/INDONESIAN

KANNADA KN DRAVIDIAN

KASHMIRI KS INDIAN

KAZAKH KK TURKIC/ALTAIC

KINYARWANDA RW NEGRO-AFRICAN

KIRGHIZ KY TURKIC/ALTAIC

KURUNDI RN NEGRO-AFRICAN

KOREAN KO ASIAN

KURDISH KU IRANIAN

LAOTHIAN LO ASIAN

LATIN LA LATIN/GREEK

LATVIAN;LETTISH LV BALTIC

LINGALA LN NEGRO-AFRICAN

LITHUANIAN LT BALTIC

MACEDONIAN MK SLAVIC

MALAGASY MG OCEANIC/INDONESIAN

MALAY MS OCEANIC/INDONESIAN

MALAYALAM ML DRAVIDIAN

MALTESE MT SEMITIC

MAORI MI OCEANIC/INDONESIAN

MARATHI MR INDIAN

MOLDAVIAN MO ROMANCE

MONGOLIAN MN [not given]

Page 13: INTERNATIONAL FOREIGN LANGUAGE CODES - ifls-edu.com

NAURU NA [not given]

NEPALI NE INDIAN

NORWEGIAN NO GERMANIC

OCCITAN OC ROMANCE

ORIYA OR INDIAN

PASHTO;PUSHTO PS IRANIAN

PERSIAN (farsi) FA IRANIAN

POLISH PL SLAVIC

PORTUGUESE PT ROMANCE

PUNJABI PA INDIAN

QUECHUA QU AMERINDIAN

RHAETO-ROMANCE RM ROMANCE

ROMANIAN RO ROMANCE

RUSSIAN RU SLAVIC

SAMOAN SM OCEANIC/INDONESIAN

SANGHO SG NEGRO-AFRICAN

SANSKRIT SA INDIAN

SCOTS GAELIC GD CELTIC

SERBIAN SR SLAVIC

SERBO-CROATIAN SH SLAVIC

SESOTHO ST NEGRO-AFRICAN

SETSWANA TN NEGRO-AFRICAN

SHONA SN NEGRO-AFRICAN

SINDHI SD INDIAN

SINGHALESE SI INDIAN

SISWATI SS NEGRO-AFRICAN

SLOVAK SK SLAVIC

SLOVENIAN SL SLAVIC

SOMALI SO HAMITIC

SPANISH ES ROMANCE

SUNDANESE SU OCEANIC/INDONESIAN

SWAHILI SW NEGRO-AFRICAN

SWEDISH SV GERMANIC

TAGALOG TL OCEANIC/INDONESIAN

TAJIK TG IRANIAN

TAMIL TA DRAVIDIAN

TATAR TT TURKIC/ALTAIC

TELUGU TE DRAVIDIAN

THAI TH ASIAN

TIBETAN BO ASIAN

TIGRINYA TI SEMITIC

Page 14: INTERNATIONAL FOREIGN LANGUAGE CODES - ifls-edu.com

TONGA TO OCEANIC/INDONESIAN

TSONGA TS NEGRO-AFRICAN

TURKISH TR TURKIC/ALTAIC

TURKMEN TK TURKIC/ALTAIC

TWI TW NEGRO-AFRICAN

UIGUR UG [ ]

UKRAINIAN UK SLAVIC

URDU UR INDIAN

UZBEK UZ TURKIC/ALTAIC

VIETNAMESE VI ASIAN

VOLAPUK VO INTERNATIONAL AUX.

WELSH CY CELTIC

WOLOF WO NEGRO-AFRICAN

XHOSA XH NEGRO-AFRICAN

YIDDISH YI GERMANIC [*Changed 1989 from original ISO 639:1988, JI]

YORUBA YO NEGRO-AFRICAN

ZHUANG ZA [ ]

ZULU ZU NEGRO-AFRICAN

ISO 639 CODES SORTED BY LANGUAGE CODE

LANGUAGE NAME CODE LANGUAGE FAMILY

AFAR AA HAMITIC

ABKHAZIAN AB IBERO-CAUCASIAN

AFRIKAANS AF GERMANIC

AMHARIC AM SEMITIC

ARABIC AR SEMITIC

ASSAMESE AS INDIAN

AYMARA AY AMERINDIAN

AZERBAIJANI AZ TURKIC/ALTAIC

BASHKIR BA TURKIC/ALTAIC

BYELORUSSIAN BE SLAVIC

BULGARIAN BG SLAVIC

BIHARI BH INDIAN

BISLAMA BI [not given]

BENGALI;BANGLA BN INDIAN

TIBETAN BO ASIAN

BRETON BR CELTIC

CATALAN CA ROMANCE

CORSICAN CO ROMANCE

Page 15: INTERNATIONAL FOREIGN LANGUAGE CODES - ifls-edu.com

CZECH CS SLAVIC

WELSH CY CELTIC

DANISH DA GERMANIC

GERMAN DE GERMANIC

BHUTANI DZ ASIAN

GREEK EL LATIN/GREEK

ENGLISH EN GERMANIC

ESPERANTO EO INTERNATIONAL AUX.

SPANISH ES ROMANCE

ESTONIAN ET FINNO-UGRIC

BASQUE EU BASQUE

PERSIAN (farsi) FA IRANIAN

FINNISH FI FINNO-UGRIC

FIJI FJ OCEANIC/INDONESIAN

FAROESE FO GERMANIC

FRENCH FR ROMANCE

FRISIAN FY GERMANIC

IRISH GA CELTIC

SCOTS GAELIC GD CELTIC

GALICIAN GL ROMANCE

GUARANI GN AMERINDIAN

GUJARATI GU INDIAN

HAUSA HA NEGRO-AFRICAN

HEBREW HE SEMITIC [*Changed 1989 from original ISO 639:1988, IW]

HINDI HI INDIAN

CROATIAN HR SLAVIC

HUNGARIAN HU FINNO-UGRIC

ARMENIAN HY INDO-EUROPEAN (OTHER)

INTERLINGUA IA INTERNATIONAL AUX.

INTERLINGUE IE INTERNATIONAL AUX.

INUPIAK IK ESKIMO

INDONESIAN ID OCEANIC/INDONESIAN [*Changed 1989 from original ISO 639:1988, IN]

ICELANDIC IS GERMANIC

ITALIAN IT ROMANCE

INUKTITUT IU [ ]

JAPANESE JA ASIAN

JAVANESE JV OCEANIC/INDONESIAN

GEORGIAN KA IBERO-CAUCASIAN

KAZAKH KK TURKIC/ALTAIC

GREENLANDIC KL ESKIMO

CAMBODIAN KM ASIAN

KANNADA KN DRAVIDIAN

KOREAN KO ASIAN

KASHMIRI KS INDIAN

KURDISH KU IRANIAN

KIRGHIZ KY TURKIC/ALTAIC

LATIN LA LATIN/GREEK

LINGALA LN NEGRO-AFRICAN

Page 16: INTERNATIONAL FOREIGN LANGUAGE CODES - ifls-edu.com

LAOTHIAN LO ASIAN

LITHUANIAN LT BALTIC

LATVIAN;LETTISH LV BALTIC

MALAGASY MG OCEANIC/INDONESIAN

MAORI MI OCEANIC/INDONESIAN

MACEDONIAN MK SLAVIC

MALAYALAM ML DRAVIDIAN

MONGOLIAN MN [not given]

MOLDAVIAN MO ROMANCE

MARATHI MR INDIAN

MALAY MS OCEANIC/INDONESIAN

MALTESE MT SEMITIC

BURMESE MY ASIAN

NAURU NA [not given]

NEPALI NE INDIAN

DUTCH NL GERMANIC

NORWEGIAN NO GERMANIC

OCCITAN OC ROMANCE

AFAN (OROMO) OM HAMITIC

ORIYA OR INDIAN

PUNJABI PA INDIAN

POLISH PL SLAVIC

PASHTO;PUSHTO PS IRANIAN

PORTUGUESE PT ROMANCE

QUECHUA QU AMERINDIAN

RHAETO-ROMANCE RM ROMANCE

KURUNDI RN NEGRO-AFRICAN

ROMANIAN RO ROMANCE

RUSSIAN RU SLAVIC

KINYARWANDA RW NEGRO-AFRICAN

SANSKRIT SA INDIAN

SINDHI SD INDIAN

SANGHO SG NEGRO-AFRICAN

SERBO-CROATIAN SH SLAVIC

SINGHALESE SI INDIAN

SLOVAK SK SLAVIC

SLOVENIAN SL SLAVIC

SAMOAN SM OCEANIC/INDONESIAN

SHONA SN NEGRO-AFRICAN

SOMALI SO HAMITIC

ALBANIAN SQ INDO-EUROPEAN (OTHER)

SERBIAN SR SLAVIC

SISWATI SS NEGRO-AFRICAN

SESOTHO ST NEGRO-AFRICAN

SUNDANESE SU OCEANIC/INDONESIAN

SWEDISH SV GERMANIC

SWAHILI SW NEGRO-AFRICAN

TAMIL TA DRAVIDIAN

Page 17: INTERNATIONAL FOREIGN LANGUAGE CODES - ifls-edu.com

TELUGU TE DRAVIDIAN

TAJIK TG IRANIAN

THAI TH ASIAN

TIGRINYA TI SEMITIC

TURKMEN TK TURKIC/ALTAIC

TAGALOG TL OCEANIC/INDONESIAN

SETSWANA TN NEGRO-AFRICAN

TONGA TO OCEANIC/INDONESIAN

TURKISH TR TURKIC/ALTAIC

TSONGA TS NEGRO-AFRICAN

TATAR TT TURKIC/ALTAIC

TWI TW NEGRO-AFRICAN

UIGUR UG [ ]

UKRAINIAN UK SLAVIC

URDU UR INDIAN

UZBEK UZ TURKIC/ALTAIC

VIETNAMESE VI ASIAN

VOLAPUK VO INTERNATIONAL AUX.

WOLOF WO NEGRO-AFRICAN

XHOSA XH NEGRO-AFRICAN

YIDDISH YI GERMANIC [*Changed 1989 from original ISO 639:1988, JI]

YORUBA YO NEGRO-AFRICAN

ZHUANG ZA [ ]

CHINESE ZH ASIAN

ZULU ZU NEGRO-AFRICAN

ISO 639 LANGUAGE CODES SORTED BY LANGUAGE GROUP

LANGUAGE NAME CODE LANGUAGE FAMILY

AYMARA AY AMERINDIAN

GUARANI GN AMERINDIAN

QUECHUA QU AMERINDIAN

BHUTANI DZ ASIAN

BURMESE MY ASIAN

CAMBODIAN KM ASIAN

CHINESE ZH ASIAN

JAPANESE JA ASIAN

KOREAN KO ASIAN

LAOTHIAN LO ASIAN

THAI TH ASIAN

TIBETAN BO ASIAN

VIETNAMESE VI ASIAN

LATVIAN;LETTISH LV BALTIC

LITHUANIAN LT BALTIC

Page 18: INTERNATIONAL FOREIGN LANGUAGE CODES - ifls-edu.com

BASQUE EU BASQUE

BRETON BR CELTIC

IRISH GA CELTIC

SCOTS GAELIC GD CELTIC

WELSH CY CELTIC

KANNADA KN DRAVIDIAN

MALAYALAM ML DRAVIDIAN

TAMIL TA DRAVIDIAN

TELUGU TE DRAVIDIAN

GREENLANDIC KL ESKIMO

INUPIAK IK ESKIMO

ESTONIAN ET FINNO-UGRIC

FINNISH FI FINNO-UGRIC

HUNGARIAN HU FINNO-UGRIC

AFRIKAANS AF GERMANIC

DANISH DA GERMANIC

DUTCH NL GERMANIC

ENGLISH EN GERMANIC

FAROESE FO GERMANIC

FRISIAN FY GERMANIC

GERMAN DE GERMANIC

ICELANDIC IS GERMANIC

NORWEGIAN NO GERMANIC

SWEDISH SV GERMANIC

YIDDISH YI GERMANIC [*Changed 1989 from original ISO 639:1988, JI]

AFAN (OROMO) OM HAMITIC

AFAR AA HAMITIC

SOMALI SO HAMITIC

ABKHAZIAN AB IBERO-CAUCASIAN

GEORGIAN KA IBERO-CAUCASIAN

ASSAMESE AS INDIAN

BENGALI;BANGLA BN INDIAN

BIHARI BH INDIAN

GUJARATI GU INDIAN

HINDI HI INDIAN

KASHMIRI KS INDIAN

MARATHI MR INDIAN

NEPALI NE INDIAN

ORIYA OR INDIAN

Page 19: INTERNATIONAL FOREIGN LANGUAGE CODES - ifls-edu.com

PUNJABI PA INDIAN

SANSKRIT SA INDIAN

SINDHI SD INDIAN

SINGHALESE SI INDIAN

URDU UR INDIAN

ALBANIAN SQ INDO-EUROPEAN (OTHER)

ARMENIAN HY INDO-EUROPEAN (OTHER)

ESPERANTO EO INTERNATIONAL AUX.

INTERLINGUA IA INTERNATIONAL AUX.

INTERLINGUE IE INTERNATIONAL AUX.

VOLAPUK VO INTERNATIONAL AUX.

KURDISH KU IRANIAN

PASHTO;PUSHTO PS IRANIAN

PERSIAN (farsi) FA IRANIAN

TAJIK TG IRANIAN

GREEK EL LATIN/GREEK

LATIN LA LATIN/GREEK

HAUSA HA NEGRO-AFRICAN

KINYARWANDA RW NEGRO-AFRICAN

KURUNDI RN NEGRO-AFRICAN

LINGALA LN NEGRO-AFRICAN

SANGHO SG NEGRO-AFRICAN

SESOTHO ST NEGRO-AFRICAN

SETSWANA TN NEGRO-AFRICAN

SHONA SN NEGRO-AFRICAN

SISWATI SS NEGRO-AFRICAN

SWAHILI SW NEGRO-AFRICAN

TSONGA TS NEGRO-AFRICAN

TWI TW NEGRO-AFRICAN

WOLOF WO NEGRO-AFRICAN

XHOSA XH NEGRO-AFRICAN

YORUBA YO NEGRO-AFRICAN

ZULU ZU NEGRO-AFRICAN

FIJI FJ OCEANIC/INDONESIAN

INDONESIAN ID OCEANIC/INDONESIAN [*Changed 1989 from original ISO 639:1988, IN]

JAVANESE JV OCEANIC/INDONESIAN

MALAGASY MG OCEANIC/INDONESIAN

MALAY MS OCEANIC/INDONESIAN

MAORI MI OCEANIC/INDONESIAN

SAMOAN SM OCEANIC/INDONESIAN

SUNDANESE SU OCEANIC/INDONESIAN

TAGALOG TL OCEANIC/INDONESIAN

Page 20: INTERNATIONAL FOREIGN LANGUAGE CODES - ifls-edu.com

TONGA TO OCEANIC/INDONESIAN

CATALAN CA ROMANCE

CORSICAN CO ROMANCE

FRENCH FR ROMANCE

GALICIAN GL ROMANCE

ITALIAN IT ROMANCE

MOLDAVIAN MO ROMANCE

OCCITAN OC ROMANCE

PORTUGUESE PT ROMANCE

RHAETO-ROMANCE RM ROMANCE

ROMANIAN RO ROMANCE

SPANISH ES ROMANCE

AMHARIC AM SEMITIC

ARABIC AR SEMITIC

HEBREW HE SEMITIC [*Changed 1989 from original ISO 639:1988, IW]

MALTESE MT SEMITIC

TIGRINYA TI SEMITIC

BULGARIAN BG SLAVIC

BYELORUSSIAN BE SLAVIC

CROATIAN HR SLAVIC

CZECH CS SLAVIC

MACEDONIAN MK SLAVIC

POLISH PL SLAVIC

RUSSIAN RU SLAVIC

SERBIAN SR SLAVIC

SERBO-CROATIAN SH SLAVIC

SLOVAK SK SLAVIC

SLOVENIAN SL SLAVIC

UKRAINIAN UK SLAVIC

AZERBAIJANI AZ TURKIC/ALTAIC

BASHKIR BA TURKIC/ALTAIC

KAZAKH KK TURKIC/ALTAIC

KIRGHIZ KY TURKIC/ALTAIC

TATAR TT TURKIC/ALTAIC

TURKISH TR TURKIC/ALTAIC

TURKMEN TK TURKIC/ALTAIC

UZBEK UZ TURKIC/ALTAIC

BISLAMA BI [not given]

MONGOLIAN MN [not given]

NAURU NA [not given] "In 1989, the ISO 639 Registration Authority changed a number of codes as follows (the quote is taken from RFC 1766):

Page 21: INTERNATIONAL FOREIGN LANGUAGE CODES - ifls-edu.com

: The following codes have been added in 1989 (nothing later): ug : (Uigur), iu (Inuktitut, also called Eskimo), za (Zhuang), he (Hebrew, : replacing iw), yi (Yiddish, replacing ji), and id (Indonesian, : replacing in)." Hence these changes in the listings above (assignment of UIGUR, INUKTITUT and ZHUANG to a 'LANGUAGE FAMILY' to be determined): HEBREW HE SEMITIC (3 occurrences, replacing IW with HE) YIDDISH YI GERMANIC (3 occurrences, replacing JI with YI) INDONESIAN ID OCEANIC/INDONESIAN (3 occurrences, replacing IN with ID) UIGUR UG [ ] (2 occurrences added) INUKTITUT IU [ ] (2 occurrences added) ZHUANG ZA [ ] (2 occurrences added) Additional Note 2001-08-29 The provisional/draft (informative) "Annex B" in ISO 639-1:2001 (FDIS) offers these clarifications: From: http://www.rtt.org/ISO/TC37/SC2/WG1/639/639-1-FDIS-x-2001-02-09.htm Changes from ISO 639:1988 to ISO 639-1:2001 This annex lists all languages that have been added since the publication of ISO 639:1988. Modifications to the names of the languages are not included. Three language identifiers were changed in 1989. The changes were Publicized, but they have not been included in printed versions of ISO 639. These changes are: The identifier for Hebrew was changed from "iw" to "he". The identifier for Indonesian was changed from "in" to "id". The identifier for Yiddish was changed from "ji" to "yi". In addition, ISO 639:1988 contains one error. The identifier for Javanese is rendered as "jw" in table 1, while it is correctly given as "jv" in the other tables.