Tocharian languages

Tocharian (a.k.a. Agnean-Kuchean)
	Tocharian B manuscript, c. 7th century AD
Native to	Agni, Kucha, Turfan and Krorän
Region	Tarim Basin
Ethnicity	Tocharians
Extinct	9th century AD
Language family	Indo-European; Tocharian (a.k.a. Agnean-Kuchean)
Early form	Proto-Tocharian
Dialects	Agnean (Tocharian A); Kuchean (Tocharian B); Kroränian (Tocharian C)
Writing system	Brahmi script (Tocharian alphabet); Manichaean script
Language codes
ISO 639-3	Either: xto – Tocharian A; txb – Tocharian B
Linguist List	xto – Tocharian A; txb – Tocharian B
Glottolog	tokh1241

The Tocharian (sometimes Tokharian) languages (/təˈkɛəriən/ or /təˈkɑːriən/), also known as Arśi-Kuči, Agnean-Kuchean or Kuchean-Agnean, are an extinct branch of the Indo-European language family spoken by inhabitants of the Tarim Basin, the Tocharians.^[3] They are known from manuscripts dating from the 5th to the 8th century AD, which were found in oasis cities on the northern edge of the Tarim Basin (now part of Xinjiang in northwest China) and the Lop Desert. The discovery of this language family in the early 20th century contradicted the formerly prevalent idea of an east–west division of the Indo-European language family on the centum–satem isogloss, and prompted reinvigorated study of the family. Mistakenly identifying the authors with the Tokharoi people of ancient Bactria (Tokharistan), early authors called these languages "Tocharian". This naming has remained, although the names Agnean and Kuchean have been proposed as a replacement.^[4]^[3]

The documents record two closely related languages, called Tocharian A (also East Tocharian, Agnean or Turfanian) and Tocharian B (West Tocharian or Kuchean). The subject matter of the texts suggests that Tocharian A was more archaic and used as a Buddhist liturgical language, while Tocharian B was more actively spoken in the entire area from Turfan in the east to Tumshuq in the west. A body of loanwords and names found in Prakrit documents from the Lop Nor basin has been dubbed Tocharian C (Kroränian). A claimed find of ten Tocharian C texts written in Kharoṣṭhī script has been discredited.^[5]

The oldest extant manuscripts in Tocharian B are now dated to the 5th or even late 4th century AD, making Tocharian a language of Late Antiquity contemporary with Gothic, Classical Armenian and Primitive Irish.^[6]

Discovery and significance

Indo-European migrations, with the location of the Afanasievo culture (genetically identical to the Yamnaya culture of the Pontic steppes) and their probable Tocharian descendants.^[7]

The existence of the Tocharian languages and alphabet was not even suspected until archaeological exploration of the Tarim Basin by Aurel Stein in the early 20th century brought to light fragments of manuscripts in an unknown language, dating from the 6th to 8th centuries AD.^[8]

It soon became clear that these fragments were actually written in two distinct but related languages belonging to a hitherto unknown branch of Indo-European, now known as Tocharian:

Tocharian A (Agnean or East Tocharian; natively ārśi) of Qarašähär (ancient Agni, Chinese Yanqi) and Turpan (ancient Turfan and Xočo), and
Tocharian B (Kuchean or West Tocharian) of Kucha and Tocharian A sites.

The geographical spread of Indo-European languages

Prakrit documents from 3rd-century Krorän and Niya on the southeast edge of the Tarim Basin contain loanwords and names that appear to come from a closely related language, referred to as Tocharian C.^[1]

The discovery of Tocharian upset some theories about the relations of Indo-European languages and revitalized their study. In the 19th century, it was thought that the division between centum and satem languages was a simple west–east division, with centum languages in the west. The theory was undermined in the early 20th century by the discovery of Hittite, a centum language in a relatively eastern location, and Tocharian, which was a centum language despite being the easternmost branch. The result was a new hypothesis, following the wave model of Johannes Schmidt, suggesting that the satem isogloss represents a linguistic innovation in the central part of the Proto-Indo-European home range, and the centum languages along the eastern and the western peripheries did not undergo that change.^[9]

Several scholars identify the ancestors of the Tocharians with the Afanasievo culture of South Siberia (c. 3300–2500 BCE), an early eastern offshoot of the steppe cultures of the Don-Volga area that later became the Yamnayans.^[10]^[11]^[12] Under this scenario, Tocharian speakers would have migrated into the Tarim Basin from the north at some later point. On this basis, Michaël Peyrot argues that several of the most striking typological peculiarities of Tocharian are rooted in prolonged contact between Proto-Tocharian and an early stage of Proto-Samoyedic in South Siberia. Among other things, this might explain the merger of all three stop series (e.g. *t, *d, *dʰ > *t), which must have produced a large number of homonyms, as well as the development of an agglutinative case system.^[13]
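The effect of the stop merger on homonymy can be illustrated with a small sketch. It applies only the dental rule quoted above (*t, *d, *dʰ > *t). The function name `merge_stops` and the input syllables are invented for illustration and are not attested or reconstructed forms.

```python
# Toy model of the dental stop merger *t, *d, *dʰ > *t.
# Only the dental series cited in the text is modelled; other places of
# articulation would need analogous entries (an assumption, not shown here).
DENTAL_MERGER = {
    "dʰ": "t",  # voiced aspirated stop
    "d": "t",   # plain voiced stop
}


def merge_stops(form: str) -> str:
    """Rewrite every dental stop in `form` as plain voiceless t."""
    # Handle the two-character sequence "dʰ" before the single "d",
    # so that the aspiration mark is not left behind.
    for source in sorted(DENTAL_MERGER, key=len, reverse=True):
        form = form.replace(source, DENTAL_MERGER[source])
    return form


# Schematic, invented syllables that differ only in the initial stop.
toy_forms = ["ta", "da", "dʰa"]
merged = {form: merge_stops(form) for form in toy_forms}

for before, after in merged.items():
    print(f"*{before} > *{after}")
```

Three inputs that were distinct before the change share a single output after it, which is the sense in which the merger creates homonyms.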

Most scholars reject Walter Bruno Henning's proposed link to Gutian, a language spoken on the Iranian plateau in the 22nd century BC and known only from personal names.^[14]

Tocharian probably died out after 840 when the Uyghurs, expelled from Mongolia by the Kyrgyz, moved into the Tarim Basin.^[1] The theory is supported by the discovery of translations of Tocharian texts into Uyghur.

Some modern Chinese words may ultimately derive from a Tocharian or related source, e.g. Old Chinese *mjit (蜜; mì) "honey", from Proto-Tocharian *ḿət(ə) (where *ḿ is palatalized; cf. Tocharian B mit), cognate with Proto-Slavonic medǔ "honey" and English mead.^[15]

Names

Tocharian royal family (king, queen and young blond-haired prince), Kizil, Cave 17 (entrance wall, lower left panel). Hermitage Museum.^[16]^[17]^[18]^[19]

A colophon to a Buddhist manuscript in Old Turkic from 800 AD states that it was translated from Sanskrit via a twγry language. In 1907, Emil Sieg and Friedrich W. K. Müller guessed that this referred to the newly discovered language of the Turpan area.^[20] Sieg and Müller, reading this name as toxrï, connected it with the ethnonym Tócharoi (Ancient Greek: Τόχαροι, Ptolemy VI, 11, 6, 2nd century AD), itself taken from Indo-Iranian (cf. Old Persian tuxāri-, Khotanese ttahvāra, and Sanskrit tukhāra), and proposed the name "Tocharian" (German Tocharisch). Ptolemy's Tócharoi are often associated by modern scholars with the Yuezhi of Chinese historical accounts, who founded the Kushan empire.^[21]^[22] It is now clear that these people actually spoke Bactrian, an Eastern Iranian language, rather than the language of the Tarim manuscripts, so the term "Tocharian" is considered a misnomer.^[23]^[24]^[25]

Nevertheless, it remains the standard term for the language of the Tarim Basin manuscripts.^[2]^[26]

In 1938, Walter Henning found the term "four twγry" used in early 9th-century manuscripts in Sogdian, Middle Iranian and Uyghur. He argued that it referred to the region on the northeast edge of the Tarim, including Agni and Karakhoja but not Kucha. He thus inferred that the colophon referred to the Agnean language.^[27]^[28]

Although the term twγry or toxrï appears to be the Old Turkic name for the Tocharians, it is not found in Tocharian texts.^[2] The apparent self-designation ārśi appears in Tocharian A texts. Tocharian B texts use the adjective kuśiññe, derived from kuśi or kuči, a name also known from Chinese and Turkic documents.^[2] The historian Bernard Sergent compounded these names to coin an alternative term Arśi-Kuči for the family, recently revised to Agni-Kuči,^[29] but this name has not achieved widespread usage.

Writing system

Tocharian B inscription from the Kizil Caves, in the Tocharian version of the Brahmi script, reading:

