Tocharian languages

Tocharian
	Tocharian B manuscript, c. 7th century AD
Native to	Agni, Kucha, Turfan and Krorän
Region	Tarim Basin of China, and Central Asia
Ethnicity	Tocharians
Extinct	9th century AD
Language family	Indo-European Tocharian;
Dialects	Agnean (Tocharian A) Kuchean (Tocharian B) Kroränian (Tocharian C)
Writing system	Brahmi script (Tocharian alphabet); Manichaean script
Language codes
ISO 639-3	Either:; xto – Tocharian A; txb – Tocharian B
Linguist list	xto Tocharian A
	txb Tocharian B
Glottolog	tokh1241
	This article contains IPA phonetic symbols. Without proper rendering support, you may see question marks, boxes, or other symbols instead of Unicode characters.

Tocharian, also spelled Tokharian (/təˈkɛəriən/ or /təˈkɑːriən/；Chinese: 吐火羅語), is an extinct branch of the Indo-European language family, known from manuscripts dating from the 6th to the 8th century AD found in oasis cities on the northern edge of the Tarim Basin (now part of Xinjiang in northwest China). The discovery of these languages in the early 20th century contradicted the formerly prevalent idea of an east–west division of Indo-European language family on the centum–satem isogloss, and contributed to re-invigorated study of the family. The term "Tocharian", based on an identification with the Bactrian Tokharoi of classical sources, is now generally considered a misnomer, but has become customary.

The documents record two closely related languages, called Tocharian A ("East Tocharian", Agnean or Turfanian) and Tocharian B ("West Tocharian" or Kuchean). The subject matter of the texts suggests that Tocharian A was more archaic and used as a Buddhist liturgical language, while Tocharian B was more actively spoken in the entire area from Turfan in the east to Tumshuq in the west. A body of loanwords and names found in Prakrit documents have been dubbed Tocharian C (Kroränian). These languages became extinct after Turkic Uyghur tribes expanded into the Tarim Basin in the 9th century AD during the Islamicisation and Turkicisation of Xinjiang.

Discovery and significance

Centum and Satem languages: Centum languages (green, blue, violet, yellow, brown) and Satem languages (red, orange)

"Tocharian donors", 6th-century AD fresco, Qizil, Tarim Basin. These frescoes are associated with annotations in Tocharian and Sanskrit made by their painters.

The existence of the Tocharian languages and alphabet was not even suspected until archaeological exploration of the Tarim basin by Aurel Stein in the early 20th century brought to light fragments of manuscripts in an unknown language, dating from the 6th to 8th centuries AD.^[4]

It soon became clear that these fragments were actually written in two distinct but related languages belonging to a hitherto unknown branch of Indo-European, now known as Tocharian:

Tocharian A (Agnean or East Tocharian; natively ārśi) of Qarašähär (ancient Agni, Chinese Yanqi) and Turpan (ancient Turfan and Xočo); and
Tocharian B (Kuchean or West Tocharian) of Kucha and Tocharian A sites.

Prakrit documents from 3rd-century Krorän on the southeast edge of the Tarim Basin contain loanwords and names that appear to come from another variety of Tocharian, dubbed Tocharian C.^[1]

The discovery of Tocharian upset some theories about the relations of Indo-European languages and revitalized their study. In the 19th century, it was thought that the division between Centum and Satem languages was a simple west–east division, with Centum languages in the west.

This theory was undermined in the early 20th century by the discovery of Hittite, a Centum language in a relatively eastern location, and Tocharian, the easternmost branch but nonetheless of the Centum type. The result was a new hypothesis, following the "wave" theory of Johannes Schmidt, suggesting that the Satem isogloss represents a linguistic innovation in the central part of the Proto-Indo-European home range, while the Centum languages along the eastern and western peripheries did not undergo that change.^[5]

Tocharian probably died out after 840 when the Uyghurs, expelled from Mongolia by the Kyrgyz, moved into the Tarim Basin.^[1] This theory is supported by the discovery of translations of Tocharian texts into Uyghur. During Uyghur rule, the peoples mixed with the Uyghurs to produce much of the modern population of what is now Xinjiang.

Names

A colophon to a Buddhist manuscript in Old Turkish from 800 AD states that it was translated from Sanskrit via a twγry language. In 1907, Emil Sieg and Friedrich W. K. Müller guessed that this referred to the newly discovered language of the Turpan area.^[6] In 1938, Walter Henning found the term "four twγry" used in early 9th-century manuscripts in Sogdian, Middle Iranian and Uighur. He argued that it referred to the region on the northeast edge of the Tarim, including Agni and Karakhoja but not Kucha. He thus inferred that the colophon referred to the Agnean language.^[7]^[8]

Sieg and Müller, reading this name as toxrï, connected it with the ethnonym Tócharoi (Ancient Greek: Τόχαροι, Ptolemy VI, 11, 6, 2nd century AD), itself taken from Indo-Iranian (cf. Old Persian tuxāri-, Khotanese ttahvāra, and Sanskrit tukhāra), and proposed the name "Tocharian" (German Tocharisch). Ptolemy's Tócharoi are often associated by modern scholars with the Yuezhi of Chinese historical accounts, who founded the Kushan empire.^[9]^[10] It is now clear that these people actually spoke Bactrian, an Eastern Iranian language, rather than the language of the Tarim manuscripts, so the term "Tocharian" is considered a misnomer.^[11]^[12]^[13] Nevertheless, it remains the standard term for the language of the Tarim Basin manuscripts.^[2]^[14]

Although the term twγry or toxrï appears to be the Old Turkic name for the Tocharians, it is not found in Tocharian texts.^[2] The apparent self-designation ārśi appears in Tocharian A texts. Tocharian B texts use the adjective kuśiññe, derived from kuśi or kuči, a name also known from Chinese and Turkic documents.^[2] The historian Bernard Sergent compounded these names to coin an alternative term Arśi-Kuči for the family, recently revised to Agni-Kuči,^[15] but this name has not achieved widespread usage.

Writing system

Wooden tablet with an inscription showing Tocharian B in its Brahmic form. Kucha, China, 5th–8th century (Tokyo National Museum)

Tocharian is documented in manuscript fragments, mostly from the 8th century (with a few earlier ones) that were written on palm leaves, wooden tablets and Chinese paper, preserved by the extremely dry climate of the Tarim Basin. Samples of the language have been discovered at sites in Kucha and Karasahr, including many mural inscriptions.

Most of the script in Tocharian was a derivative of the Brahmi alphabetic syllabary (abugida) and is referred to as slanting Brahmi. However a smaller amount was written in the Manichaean script in which Manichaean texts were recorded.^[16]^[17] It soon became apparent that a large proportion of the manuscripts were translations of known Buddhist works in Sanskrit and some of them were even bilingual, facilitating decipherment of the new language. Besides the Buddhist and Manichaean religious texts, there were also monastery correspondence and accounts, commercial documents, caravan permits, medical and magical texts, and one love poem.

In 1998, Chinese linguist Ji Xianlin published a translation and analysis of fragments of a Tocharian Maitreyasamiti-Nataka discovered in 1974 in Yanqi.^[18]^[19]^[20]

Tocharian A and B

Cities of the ancient Tarim Basin

◈ Turfan

◈ Agni

◈ Kucha

◈ Krorän

Tocharian A and B are significantly different, although it is unclear whether they were mutually intelligible. A common Proto-Tocharian language must precede the attested languages by several centuries, probably dating to the 1st millennium BC.

Tocharian A is found only in the eastern part of the Tocharian-speaking area, and all extant texts are of a religious nature. Tocharian B, however, is found throughout the range and in both religious and secular texts. As a result, it has been suggested that Tocharian A was a liturgical language, no longer spoken natively, while Tocharian B was the spoken language of the entire area.^[1] (On the other hand, it is possible that the lack of a secular corpus in Tocharian A is simply an accident, due to the smaller distribution of the language and the fragmentary preservation of Tocharian texts in general.)

The hypothesized relationship of Tocharian A and B as liturgical and spoken forms, respectively, is sometimes compared with the relationship between Latin and the modern Romance languages, or Classical Chinese and Mandarin. However, in both of these latter cases the liturgical language is the linguistic ancestor of the spoken language, whereas no such relationship holds between Tocharian A and B. In fact, from a phonological perspective Tocharian B is significantly more conservative than Tocharian A, and serves as the primary source for reconstructing Proto-Tocharian. Only Tocharian B preserves the following Proto-Tocharian features: stress distinctions, final vowels, diphthongs, and o vs. e distinction. In turn, the loss of final vowels in Tocharian A has led to the loss of certain Proto-Tocharian categories still found in Tocharian B, e.g. the vocative case and some of the noun, verb and adjective declensional classes.

In terms of declensional and conjugational endings, the two languages have tended to innovate in divergent ways, with neither clearly simpler than the other. For example, both languages show significant innovations in the present active indicative endings but in radically different ways, so that only the second-person singular ending is directly cognate between the two languages, and in most cases, neither variant is directly cognate with the corresponding Proto-Indo-European (PIE) form. The agglutinative secondary case endings in the two languages likewise stem from different sources, showing parallel development of the secondary case system after the Proto-Tocharian period. Likewise, some of the verb classes show independent origins, e.g. the class II preterite, which uses reduplication in Tocharian A (possibly from the reduplicated aorist) but long PIE ē in Tocharian B (possibly from the long-vowel perfect found in Latin lēgī, fēcī, etc.).^[2]

Phonology

Phonetically, Tocharian is a "centum" Indo-European language, meaning that it merges the palatovelar consonants (*ḱ, *ǵ, *ǵʰ) of Proto Indo-European with the plain velars (*k, *g, *gʰ). Centum languages are mostly found in western and southern Europe (Greek, Italic, Celtic, Germanic). In that sense, Tocharian (to some extent like the Greek and the Anatolian languages) seems to have been an isolate in the "satem" (i.e. palatovelar to sibilant) phonetic regions of Indo-European-speaking populations. The discovery of Tocharian contributed to doubts that Proto-Indo-European had originally split into western and eastern branches.^[21]^[22]

Vowels

	Front	Central	Back
Close	i /i/	ä /ɨ/	u /u/
Mid	e /e/	a /ə/	o /o/
Open		ā /a/

Note that, although both Tocharian A and Tocharian B have the same set of vowels, they often do not correspond to each other. For example, the sound a did not occur in Proto-Tocharian. Tocharian B a is derived from former stressed ä or unstressed ā (reflected unchanged in Tocharian A), while Tocharian A a stems from Proto-Tocharian /ɛ/ or /ɔ/ (reflected as /e/ and /o/ in Tocharian B), and Tocharian A e and o stem largely from monophthongization of former diphthongs (still present in Tocharian B).

Diphthongs

	Closer component is front	Closer component is back
Opener component is unrounded	ai /əi/	au /əu/ āu /au/
Opener component is rounded	oy /oi/

Diphthongs occur in Tocharian B only.

Consonants

The following table lists the reconstructed phonemes in Tocharian along with their standard transcription. Because Tocharian is written in an alphabet used originally for Sanskrit and its descendants, the transcription of the sounds is directly based on the transcription of the corresponding Sanskrit sounds. The Tocharian alphabet also has letters representing all of the remaining Sanskrit sounds, but these appear only in Sanskrit loanwords and are not thought to have had distinct pronunciations in Tocharian. There is some uncertainty as to actual pronunciation of some of the letters, particularly those representing palatalized obstruents (see below).

	Bilabial	Alveolar	Alveolo-palatal	Palato-alveolar?	Palatal	Velar	Labialized velar
Plosive	p /p/	t /t/	c /tɕ/?²			k /k/
Affricate		ts /ts/
Fricative		s /s/	ś /ɕ/	ṣ /ʃ/?³
Nasal	m /m/	n ṃ /n/¹			ñ /ɲ/	ṅ /ŋ/⁴
Trill		r /r/
Approximant					y /j/		w /w/
Lateral approximant		l /l/			ly /ʎ/

/n/ is transcribed by two different letters in the Tocharian alphabet depending on position. Based on the corresponding letters in Sanskrit, these are transcribed ṃ (word-finally, including before certain clitics) and n (elsewhere); but it should be noted that ṃ represents /n/, not /m/.
The sound written c is thought to correspond to a palatal stop /c/ in Sanskrit. The Tocharian pronunciation /tɕ/ is suggested by the common occurrence of the cluster śc, but the exact pronunciation cannot be determined with certainty.
The sound written ṣ corresponds to retroflex sibilant /ʂ/ in Sanskrit, but it seems more likely to have been a palato-alveolar sibilant /ʃ/ (as in English "ship"), because it derives from a palatalized /s/.^[23]
The sound ṅ /ŋ/ occurs only before k, or in some clusters where a k has been deleted between consonants. It is clearly phonemic because sequences nk and ñk also exist (from syncope of a former ä between them).

Proto-Tocharian

Vowels

Proto-Tocharian shows radical changes in its vowels from Proto-Indo-European (PIE). Length distinctions eventually disappeared, but prior to that all pairs of long and short vowels had become distinct in quality, and thus have different outcomes. Many pairs of PIE vowels are distinguished in Tocharian only by the occurrence or non-occurrence of palatalization. For example, PIE o and ē both evolved into Proto-Tocharian ë (possibly /ɛ/), but PIE ē palatalized the preceding consonant, and left a y when no consonant preceded, while neither of these occurs with PIE o.

Reconstructing the changes between PIE and Proto-Tocharian vowels is fraught with difficulty, and as a result there are a large number of disagreements among different researchers. The basic problems are:

Tocharian A and B are relatively sparsely attested.
The extensive mergers of PIE stop consonants lead to many ambiguities in the potential etymologies underlying particular Tocharian words.
The radical restructuring of the vowel system means that few potential sound changes can be rejected as unreasonable, no matter how unlikely they may look on the surface.
A large amount of analogical change has occurred in both the nominal and verbal systems, making it difficult to identify which changes are regular.

Historically, the evolution of the Tocharian vowels was the last part of the diachronic phonology to be understood. In 1938, George S. Lane remarked of Tocharian that "the vocalism so far has defied almost every attempt that has been made to bring it to order",^[24] and as late as 1945 still asserted: "That the subject [of palatalization] is a confused and difficult one is generally recognized—but so are most of the problems of Tocharian phonology."^[25] However, rapid progress towards understanding the evolution of the vocalic system, and with it the phonology as a whole, occurred during the period of approximately 1948–1960, beginning with Sieg and Siegling (1949).^[26] By 1960, the system was well-enough understood that Krause and Thomas's seminal work of that year^[27] is still considered one of the most important Tocharian grammatical handbooks.^[28]

Despite the apparent equivalence between the Tocharian A and B vowel systems, in fact a number of vowels are not cognate between the two varieties, and Proto-Tocharian had a different vowel system from either. For example, Tocharian A a reflects a merger of two Proto-Tocharian vowels that are distinguished in Tocharian B as e and o, while Tocharian B a reflects a stress-based variant of either Proto-Tocharian ā or ä, while Tocharian A preserves original ā and ä regardless of the position of stress.

As a general rule, Tocharian B reflects the Proto-Tocharian vowel system more faithfully than Tocharian A, which includes a number of changes not found in Tocharian B, e.g. monophthongization of diphthongs, loss of all absolutely final vowels, loss of ä in open syllables, and epenthesis of ä to break up difficult clusters (esp. word-finally) that resulted from vowel losses.

The following table describes a typical minimal reconstruction of Late Proto-Tocharian, which includes all vowels that are generally accepted by Tocharian scholars:^[2]

	Front	Central	Back
High	i /i/	ä /ɨ/	u /u/
Mid	e /e/?		ọ /o/
Low		ā /a/	o /ɔ/

The following table describes a "maximal" reconstruction of Proto-Tocharian, following Ringe (1996):^[23]

	Front	Central	Back
High	(i = ǝy)	ǝ /ɨ/	(u = ǝw)
Mid	ẹ /e/	ë /ǝ/	ọ /o/
Low	e /ɛ/	ā /a/	o /ɔ/

Some of the differences between the "minimal" and "maximal" systems are primarily notational: Ringe's *ǝ = standard *ä, and Ringe (along with many other researchers) reconstruct Proto-Tocharian surface *[i] and *[u] as underlying *äy, *äw (*ǝy, *ǝw in Ringe's notation). However, Ringe reconstructs three vowels *ë, *e, *ẹ in place of the single vowel *e in the minimal system. The primary distinction is between *ë < PIE *o, which is assumed to be a central rather than a front vowel because it does not trigger palatalization, and *e < PIE *ē, which does trigger palatalization. Other than palatalization effects, both vowels are reflected identically in both Tocharian A and B, and hence a number of researchers project the merger back to Proto-Tocharian. However, some umlaut processes are thought to have operated differently on the two vowels, and as a result Ringe (as well as Adams^[29] and some other scholars) prefer to distinguish the two in Proto-Tocharian. Ringe's *ẹ vowel, a higher vowel than *e, is fairly rare and appears as i in Tocharian B but e in Tocharian A. This vowel does trigger palatalization, and is thought by Ringe to stem primarily from PIE *oy and from loanwords. In general, Ringe's Proto-Tocharian reconstruction reflects an earlier stage than the one described by many researchers.

Some scholars use a different notation from what is given above: e.g. æ or ë in place of the e of the minimal system, å or ɔ in place of the o of the minimal system, and ǝ in place of ä.

The following table shows the changes from Proto-Indo-European (PIE) to Proto-Tocharian (PToch) and on to Tocharian B (TB) and Tocharian A (TA), using the notation of the "minimal" system above:^[2]

Late PIE	Early PToch	Late PToch	Tocharian B	Tocharian A	Example
a, Ḥ	ā	ā	ā́; a	ā	PIE h₂ént-o- "in front" > ánt-o- > PToch ā́ntë > TB ānte, TA ānt* PIE pḥ₂tḗr "father" > PToch pācer > TB pācer, TA pācar
e	yä	+ä	+á; +ä, +—	+ä, +—	PIE (h₁)eḱwo- "horse" > PToch yä́kwë* > TB yakwe PIE gʷén-eh₂ "woman" > gʷén-ā > PToch śä́nɔ > TB śana, TA śäṃ PIE kʷetwór-es "four" > PToch śätwë́rä > TB śtwer, TA śtwar
i	ä, yä	(+)ä	(+)á; (+)ä, (+)—	(+)ä, (+)—	PIE wiso-^{[citation needed]} "poison" > PToch wä́së > TB wase, TA wäs
o	ë	e	e	a	Late PIE *okʷs "eye" > PToch ëk > TB ek, TA ak
u	u	ä	á; ä, —	ä, —	PIE h₁rudʰ-ró- "red" > PToch rä́trë* > TB ratre, TA rtär (< rtr) PIE dhugh₂-tēr "daughter" > PToch täkā́cër > TB tkācer, TA ckācar*^[30]
ā	o	o	o	a	Late PIE bhrā́tēr "brother" > PToch prɔcër > TB procer, TA pracar
ē	ye	+e	+e	+a	PIE h₂wéh₁nt-o- "wind" > wēnt-o- > PToch w'entë > TB yente, TA want*
ō	ā	ā	ā́; a	ā	PIE ṇ-ǵneh₃-tih₂ "ignorant (Latin* ignōtus) > ṇ-ǵnō-tyḤ > PToch* āknā́tsā > TB aknātsa
ih₁	yä? ī?	+ä? i?	+á; +ä, +—? i?	+ä, +—? i?
ih₂, ih₃	yā	+ā	+ā́; +a	+ā
uh₁	ū?	u?	u?	u?
uh₂, uh₃	wā? ū?	wā? u?	wā́;wa? u?	wā? u?
ai	āi	āi	ai	e
ei	yäi	+äi	+i	+i
oi	ëi	ei	ai	e
au	āu	āu	au	o
eu	yäu	+äu	+u	+u
ou	ëu	eu	e_u > au	o
Ṛ, ṚH	äR	äR	áR; äR, R	äR, R	PIE wḷkʷo- "wolf" > PToch wä́lkʷë > TB walkwe PIE pṛh₂wo- "anterior" (cf. Sanskrit pū́rva-) > PToch pä́rwë > TB parwe, TA pärwa-t
	ā…ë́	ɔ…ɔ́	o…o	a…a
	ä(…o/ọ/u)	u	u	u
	äu(…ọ)	ọ	o	o
	ë́(…o/ọ/u)	ọ	o	o	PIE h₁óḱtōu "eight" > Early PToch ë́ktu > óktu > PToch óktä > TB okt, TA okät
	u(…o/ọ/u)	ọ	o	o
	ë́(…ā)	ā	ā́; a	ā

Notes:

The last six lines indicate umlaut processes that operated during the Proto-Tocharian period (see below).
A + sign indicates palatalization. When following a consonant, that consonant is palatalized; otherwise, a y appears. Before PIE i, consonants other than dentals are not necessarily palatalized (researchers differ on what exactly happened); this is indicated as (+).
A — sign indicates no vowel; this results from deletion of ä in open syllables.
Tocharian B reflects Proto-Tocharian central vowels (ä, ā) differently depending on whether they bore stress or not. This is indicated in the table above: stressed ä ā > a ā while unstressed ä ā > ä a (i.e. Tocharian B a reflects either stressed ä or unstressed ā). An additional complication is that unstressed ä is deleted in open syllables. See below.

Proto-Tocharian had phonemic stress, although its position varies depending on the researcher. Many researchers project the Tocharian B stress that is recoverable from ā~a and a~ä alternations back to Proto-Tocharian. For the most part, this stress does not reflect PIE stress. Rather, most bisyllabic words have initial stress, and trisyllabic and longer words usually have stress on the second syllable. A number of multisyllabic words in Tocharian B appear to indicate that more than one syllable was stressed; it is thought that these reflect clitics or affixes that still behaved phonologically as separate words in Proto-Tocharian. Ringe,^[23] however, prefers to project the PIE stress unchanged into Proto-Tocharian, and assumes that the radically different system seen in Tocharian B evolved within the separate history of that language.

The outcome of the PIE sequences *iH and *uH when not followed by a vowel is disputed. It is generally agreed that *ih₂ became Proto-Tocharian *yā; a similar change occurred in Ancient Greek. It is also usually accepted that *ih₃ likewise became Proto-Tocharian *yā, although it is unclear whether this reflects a direct change *ih₃ > *yā /ya/ or a change *ih₃ > *yō /yo:/ > *yā /ya/ (echoing a similar change in Ancient Greek), since PIE *ō is generally thought to have become Proto-Tocharian *ā (which was not a long vowel). The outcomes of all other sequences are much less clear. A number of etymologies appear to indicate a parallel change *uh₂ > *wā, but some also appear to indicate a change *uh₂ > *ū > *u. Ringe^[23] demonstrates that all the occurrences of *wā can potentially be explained as due to analogy, and prefers to postulate a general sound change *uH > *ū > *u following the normal outcome of *uH in other languages, but a number of other researchers (e.g. Krause and Slocum^[2]) prefer to see *uh₂ > *wā as the regular sound change. The outcome of *ih₁ is likewise disputed, with Ringe similarly preferring a regular change *ih₁ > *ī > i while others postulate a regular change *ih₁ > *ye > *yä. As elsewhere, the main difficulty is that, relative to other Indo-European languages, Tocharian is sparsely attested and was subject to a particularly large number of analogical changes.

A number of umlaut processes occurred in the Proto-Tocharian period, which tended to increase the number of rounded vowels. Vowel rounding also resulted from the influence of nearby labiovelars, although this occurred after the Proto-Tocharian period, with differing results in Tocharian A and B, generally with more rounding in Tocharian A (e.g. PIE *gʷṃ- "come" > PToch *kʷäm- > Tocharian A kum- but Tocharian B käm-).

Vowel deletion and insertion

Tocharian A deletes all Proto-Tocharian final vowels, as well as all instances of Proto-Tocharian ä in open syllables (which appears to include vowels followed by Cr and Cl sequences). When this produces impossible consonant sequences, these are rectified by vocalizing w and y into u and i, if possible; otherwise, an epenthetic ä is inserted. Note that most consonant sequences are tolerated word-initially, including unexpected cases like rt-, ys- and lks-. Example: PIE h₁rudhros (Greek erythros) > PToch rä́tre > Toch A *rtr > rtär.

Tocharian B deletes only unstressed ä in open syllables, and leaves all other vowels alone. Hence PIE h₁rudhros > PToch rä́tre > Toch B ratre. If necessary, impossible consonant sequences are rectified as in Tocharian A.

Consonants

The following are the main changes between PIE and Proto-Tocharian:

Centum change: PIE "palatals" merge with PIE "plain velars".
Loss of PIE d (but not other dentals) in a number of words when not followed by a PIE front vocalic.
Loss of contrastive voicing and aspiration, resulting in (e.g.) the merger of PIE k, g, gh.
Palatalization of all consonants before PIE e, ē, y and sometimes i, producing a number of new phonemes (c, ś, ṣ, ts, ñ, ly).
Loss of final consonants other than r, including total loss of certain final clusters (e.g. nts).

The extant Tocharian languages appear to reflect essentially the same consonant system as in Proto-Tocharian, except in a couple of cases:

A new phoneme ṅ /ŋ/ eventually developed. This was originally an allophone of n before k, but became phonemic when vowel losses resulted in instances of nk and ñk contrasting with ṅk, and occasional loss of k between consonants resulted in instances of ṅ not before k.
The Proto-Tocharian labiovelar *kʷ eventually merged with k in both Tocharian A and B. However, this clearly post-dated Proto-Tocharian, because the former *kʷ often rounded adjacent vowels prior to its loss, in a way that differed between A and B (e.g. Proto-Tocharian *kʷäm- "come" > A kum- but B käm-). In addition, according to Ringe,^[31] Proto-Tocharian *kʷ is sometimes retained in Tocharian B when directly preceding a voiceless consonant, particularly in western dialect texts.

Unlike in most centum languages, Proto-Tocharian maintained separate outcomes of PIE *kʷ and *ḱw.^[31] The latter is still reflected as kw in Tocharian B, e.g. yakwe "horse" < PIE *eḱwos.

Palatalization

Palatalization was a very important process operating in Proto-Tocharian. Palatalization appears to have operated very early, prior to almost all of the vowel changes that took place between PIE and Proto-Tocharian. Palatalization occurred before PIE e, ē, y and sometimes i; specifically, PIE i triggered palatalization of dentals but generally not of velars or labials. (According to Ringe,^[23] lack of palatalization before PIE i was actually due to early change of i > wǝ after certain sounds.) Palatalization, or lack thereof, is the only way to distinguish PIE e and i in Tocharian, and the primary way of distinguishing certain other pairs of PIE vowels, e.g. e vs. u and ē vs. o. Palatalization appeared to have operated in two stages, an earlier one that affected only the sequences ty and dhy, and a later more general one — or at least, the result of palatalization of t and dh before y is different from palatalization before e, ē and i, while other consonants do not show such a dual outcome. (A similar situation occurred in the history of Proto-Greek and Proto-Romance.) Certain sound changes occurred prior to palatalization:^[2]

Some changes involving PIE dentals; see below.
The development of PIE iH into y + vowel. (Note that iH develops to ī in most cases in all other Indo-European languages.)

The following chart shows the outcome of palatalization:^[2]

Result of palatalization of PIE consonants
PIE	Proto-Tocharian	Tocharian B	Tocharian A	Example
p, b, bh	p'	py; pyä > pi	p	PIE penkʷe "five" > PToch p'äñśä* (?) > Toch B piś, Toch A päñ
k, g, gh; kʷ, gʷ, gʷh	ś			PIE gʷén-eh₂ "woman" > gʷén-ā > PToch śä́nɔ > TB śana, TA śäṃ PIE kʷetwór-es "four" > PToch śätwë́rä > TB śtwer, TA śtwar
t, dh	c			PIE dhugh₂-tēr "daughter" > PToch täkā́cër > Toch B tkācer (Toch A ckācar < *tkācar by assimilation)
ty, dhy	ts			PIE wleh₂-nt-ih₂ "queen" > PToch wlāntyā > wlāntsā > Toch B lāntsa*
d > dz > ts	ś			PIE dekṃ(t) "ten" > PToch dzäkä(n) > śä́kä > Toch B śak, Toch A śäk*
s	ṣ
m	m'	my; myä > mi	m	PIE medhu "honey" > PToch m'ätu > m'ätä > Toch B mit*
n	ñ /ɲ/
l	ly /ʎ/			PIE léuk-os "light" > PToch l'äukë > Toch B lyuke
r	r'	r	r
w	w'	y	w	PIE h₂weh₁-nt-o- "wind" > PToch w'ëntë* > Toch B yente, Toch A want

Dental consonants

The outcomes of the PIE dentals in Tocharian, and in particular PIE *d, are complex and difficult to explain. Palatalization sometimes produces c, sometimes ts, sometimes ś, and in some words when a front vocalic does not follow, PIE *d (but not other dentals) is lost entirely, e.g. e.g. Toch AB or "wood" < PIE *doru and Toch B ime "thought" < PToch *w'äimë < PIE *weid-mo-. Many occurrences of c and ts can be explained by the differing effects of a following y vs. a front vowel (see above), but a number of difficult cases remain.

Most researchers agree that some of the PIE dentals are reflected differently from others — contrary to the situation with all other PIE stops. This in turn suggests that some sound changes must have operated on particular dentals, but not others, prior to the general loss of contrastive voicing and aspiration. There is a large amount of disagreement over what exactly the relevant sound changes were, due to the relatively small number of extant forms involved, the operation of analogy, and disagreement over particular etymologies, including both the PIE roots and ablaut forms involved. Ringe suggests the following changes, in approximate order:^[23]

Grassmann's Law, which triggers the change dh > d when another aspirated consonant occurs later in a word (and which also operated in Greek and Indo-Iranian).
The change d > dz, which occurred after Grassmann's Law if it existed.
Loss of contrastive voicing and aspiration (which may have occurred after palatalization).
Palatalization. The new sound ts palatalizes to ś; this explains cases like Toch B śak, Toch A śäk "ten" < PIE *dekṃ(t).
Loss of ts before consonants.

Even with this explanation, a lot of words don't have the expected outcomes and require appeal to analogy. For example, the assumption of Grassmann's Law helps to explain only two words, both verbs, in which PIE *dh shows up as ts; and in both of these words, palatalization to ś might have been expected, because the present-tense forms both begin with PIE *dhe-. Ringe needs to appeal to an analogical depalatalization, based on other forms of the verb with different ablaut patterns in which palatalization was not triggered. This assumption is reasonable, because a lot of other verbs also show analogical depalatalization; but nonetheless, it is rather slender evidence, and it is not surprising that other researchers have proposed different assumptions (e.g. that PToch *tsä- is the expected outcome of PIE *dhe-, with no operation of Grassmann's Law).

Likewise, the loss of PIE *d in Toch AB or "wood" < PIE *doru is not explainable by these rules, because it is lost before a vowel rather than a consonant. Ringe again assumes analogy: In this case, the PIE conjugation was nominative *doru, genitive *dreus, and Ringe assumes that the normal loss of *d in the genitive before *r was carried over into the nominative. Again, not all researchers accept this. For example, Krause and Slocum, while accepting the remainder of Ringe's sound changes involving PIE *d, suggest instead that loss eventually occurred before Proto-Tocharian rounded vowels and before Proto-Tocharian ë (from PIE o), as well as before nasals and possibly other consonants.^[2]

Morphology

Nouns

Tocharian has completely re-worked the nominal declension system of Proto-Indo-European.^[32] The only cases inherited from the proto-language are nominative, genitive, accusative, and (in Tocharian B only) vocative; in Tocharian the old accusative is known as the oblique case. In addition to these primary cases, however, each Tocharian language has six cases formed by the addition of an invariant suffix to the oblique case — although the same six cases are not the same in each language, and the suffixes are largely non-cognate. For example, the Tocharian word yakwe (Toch B), yuk (Toch A) "horse" < PIE *eḱwos is declined as follows:^[2]

Case	Tocharian B			Tocharian A
Case	Suffix	Singular	Plural	Suffix	Singular	Plural
Nominative	—	yakwe	yakwi	—	yuk	yukañ
Vocative	—	yakwa	—	—	—	—
Genitive	—	yäkwente	yäkweṃtsi	—	yukes	yukāśśi
Oblique	—	yakwe	yakweṃ	—	yuk	yukas
Instrumental	—	—	—	-yo	yukyo	yukasyo
Perlative	-sa	yakwesa	yakwentsa	-ā	yukā	yukasā
Comitative	-mpa	yakwempa	yakweṃmpa	-aśśäl	yukaśśäl	yukasaśśäl
Allative	-ś(c)	yakweś(c)	yakweṃś(c)	-ac	yukac	yukasac
Ablative	-meṃ	yakwemeṃ	yakweṃmeṃ	-äṣ	yukäṣ	yukasäṣ
Locative	-ne	yakwene	yakweṃne	-aṃ	yukaṃ	yukasaṃ
Causative	-ñ	yakweñ	yakweṃñ	—	—	—

The Tocharian A instrumental case rarely occurs with humans.

When referring to humans, the oblique singular of most adjectives and of some nouns is marked in both varieties by an ending -(a)ṃ, which also appears in the secondary cases. An example is eṅkwe (Toch B), oṅk (Toch A) "man", which belongs to the same declension as above, but has oblique singular eṅkweṃ (Toch B), oṅkaṃ (Toch A), and corresponding oblique stems eṅkweṃ- (Toch B), oṅkn- (Toch A) for the secondary cases. This is thought to stem from the generalization of n-stem adjectives as an indication of determinative semantics, seen most prominently in the weak adjective declension in the Germanic languages (where it cooccurs with definite articles and determiners), but also in Latin and Greek n-stem nouns (especially proper names) formed from adjectives, e.g. Latin Catō (genitive Catōnis) literally "the sly one"^{[citation needed]} < catus "sly", Greek Plátōn literally "the broad-shouldered one" < platús "broad".^[2]

Verbs

In contrast, the verb verbal conjugation system is quite conservative.^[33] The majority of Proto-Indo-European verbal classes and categories are represented in some manner in Tocharian, although not necessarily with the same function.^[34] Some examples: athematic and thematic present tenses, including null-, -y-, -sḱ-, -s-, -n- and -nH- suffixes as well as n-infixes and various laryngeal-ending stems; o-grade and possibly lengthened-grade perfects (although lacking reduplication or augment); sigmatic, reduplicated, thematic and possibly lengthened-grade aorists; optatives; imperatives; and possibly PIE subjunctives.

In addition, most PIE sets of endings are found in some form in Tocharian (although with significant innovations), including thematic and athematic endings, primary (non-past) and secondary (past) endings, active and mediopassive endings, and perfect endings. Dual endings are still found, although they are rarely attested and generally restricted to the third person. The mediopassive still reflects the distinction between primary -r and secondary -i, effaced in most Indo-European languages. Both root and suffix ablaut is still well-represented, although again with significant innovations.

Classes

A given verb belongs to one of a large number of classes, according to its conjugation. As in Sanskrit, Ancient Greek and (to a lesser extent) Latin, there are independent sets of classes in the indicative present, subjunctive, perfect, imperative, and to a limited extent optative and imperfect, and there is no general correspondence among the different sets of classes, meaning that each verb must be specified using a number of principal parts.

Present indicative

The most complex system is the present indicative, consisting of 12 classes, 8 thematic and 4 athematic, with distinct sets of thematic and athematic endings. The following classes occur in Tocharian B (some are missing in Tocharian A):

I: Athematic without suffix < PIE root athematic.
II: Thematic without suffix < PIE root thematic.
III: Thematic with PToch suffix *-ë-. Mediopassive only. Apparently reflecting consistent PIE o theme rather than the normal alternating o/e theme.
IV: Thematic with PToch suffix *-ɔ-. Mediopassive only. Same PIE origin as previous class, but diverging within Proto-Tocharian.
V: Athematic with PToch suffix *-ā-, likely from either PIE verbs ending in a syllabic laryngeal or PIE derived verbs in *-eh₂- (but extended to other verbs).
VI: Athematic with PToch suffix *-nā-, from PIE verbs in *-nH-.
VII: Athematic with infixed nasal, from PIE infixed nasal verbs.
VIII: Thematic with suffix -s-, possibly from PIE -sḱ-?
IX: Thematic with suffix -sk- < PIE -sḱ-.
X: Thematic with PToch suffix *-näsk/nāsk- (evidently a combination of classes VI and IX).
XI: Thematic in PToch suffix *-säsk- (evidently a combination of classes VIII and IX).
XII: Thematic with PToch suffix *-(ä)ññ- < either PIE *-n-y- (denominative to n-stem nouns) or PIE *-nH-y- (deverbative from PIE *-nH- verbs).

Palatalization of the final root consonant occurs in the 2nd singular, 3rd singular, 3rd dual and 2nd plural in thematic classes II and VIII-XII as a result of the original PIE thematic vowel e.

Subjunctive

The subjunctive likewise has 12 classes, denoted i through xii. Most are conjugated identically to the corresponding indicative classes; indicative and subjunctive are distinguished by the fact that a verb in a given indicative class will usually belong to a different subjunctive class.

In addition, four subjunctive classes differ from the corresponding indicative classes, two "special subjunctive" classes with differing suffixes and two "varying subjunctive" classes with root ablaut reflecting the PIE perfect.

Special subjunctives:

iv: Thematic with suffix i < PIE -y-, with consistent palatalization of final root consonant. Tocharian B only, rare.
vii: Thematic (not athematic, as in indicative class VII) with suffix ñ < PIE -n- (palatalized by thematic e, with palatalized variant generalized).

Varying subjunctives:

i: Athematic without suffix, with root ablaut reflecting PIE o-grade in active singular, zero-grade elsewhere. Derived from PIE perfect.
v: Identical to class i but with PToch suffix *-ā-, originally reflecting laryngeal-final roots but generalized.

Preterite

The preterite has 6 classes:

I: The most common class, with a suffix ā < PIE Ḥ (i.e. roots ending in a laryngeal, although widely extended to other roots). This class shows root ablaut, with original e-grade (and palatalization of the initial root consonant) in the active singular, contrasting with zero-grade (and no palatalization) elsewhere.
II: This class has reduplication in Tocharian A (possibly reflecting the PIE reduplicated aorist). However, Tocharian B has a vowel reflecting long PIE ē, along with palatalization of the initial root consonant. There is no ablaut in this class.
III: This class has a suffix s in the 3rd singular active and throughout the mediopassive, evidently reflecting the PIE sigmatic aorist. Root ablaut occurs between active and mediopassive. A few verbs have palatalization in the active along with s in the 3rd singular, but no palatalization and no s in the mediopassive, along with no root ablaut (the vowel reflects PToch ë). This suggests that, for these verbs in particular, the active originates in the PIE sigmatic aorist (with s suffix and ē vocalism) while the mediopassive stems from the PIE perfect (with o vocalism).
IV: This class has suffix ṣṣā, with no ablaut. Most verbs in this class are causatives.
V: This class has suffix ñ(ñ)ā, with no ablaut. Only a few verbs belong to this class.
VI: This class, which has only two verbs, is derived from the PIE thematic aorist. As in Greek, this class has different endings from all the others, which partly reflect the PIE secondary endings (as expected for the thematic aorist).

All except preterite class VI have a common set of endings that stem from the PIE perfect endings, although with significant innovations.

Imperative

The imperative likewise shows 6 classes, with a unique set of endings, found only in the second person, and a prefix beginning with p-. This prefix usually reflects Proto-Tocharian *pä- but unexpected connecting vowels occasionally occur, and the prefix combines with vowel-initial and glide-initial roots in unexpected ways. The prefix is often compared with the Slavic perfective prefix po-, although the phonology is difficult to explain.

Classes i through v tend to co-occur with preterite classes I through V, although there are many exceptions. Class vi is not so much a coherent class as an "irregular" class with all verbs not fitting in other categories. The imperative classes tend to share the same suffix as the corresponding preterite (if any), but to have root vocalism that matches the vocalism of a verb's subjunctive. This includes the root ablaut of subjunctive classes i and v, which tend to co-occur with imperative class i.

Optative and imperfect

The optative and imperfect have related formations. The optative is generally built by adding i onto the subjunctive stem. Tocharian B likewise forms the imperfect by adding i onto the present indicative stem, while Tocharian A has 4 separate imperfect formations: usually ā is added to the subjunctive stem, but occasionally to the indicative stem, and sometimes either ā or s is added directly onto the root. The endings differ between the two languages: Tocharian A uses present endings for the optative and preterite endings for the imperfect, while Tocharian B uses the same endings for both, which are a combination of preterite and unique endings (the latter used in the singular active).

Endings

As suggested by the above discussion, there are a large number of sets of endings. The present-tense endings come in both thematic and athematic variants, although they are related, with the thematic endings generally reflecting a theme vowel (PIE e or o) plus the athematic endings. There are different sets for the preterite classes I through V; preterite class VI; the imperative; and in Tocharian B, in the singular active of the optative and imperfect. Furthermore, each set of endings comes with both active and mediopassive forms. The mediopassive forms are quite conservative, directly reflecting the PIE variation between -r in the present and -i in the past. (Most other languages with the mediopassive have generalized one of the two.)

The present-tense endings are almost completely divergent between Tocharian A and B. The following shows the thematic endings, with their origin:

Thematic present active indicative endings
	Original PIE	Tocharian B		Tocharian A		Notes
	Original PIE	PIE source	Actual form	PIE source	Actual form	Notes
1st sing	*-o-h₂	*-o-h₂ + PToch -u	-āu	*-o-mi	-am	*-mi < PIE athematic present
2nd sing	*-e-si	*-e-th₂e?	-'t	*-e-th₂e	-'t	-th₂e < PIE perfect; previous consonant palatalized; Tocharian B form should be -'ta*
3rd sing	*-e-ti	*-e-nu	-'(ä)ṃ	*-e-se	-'ṣ	-nu < PIE nu "now"; previous consonant palatalized
1st pl	*-o-mos?	*-o-mō?	-em(o)	*-o-mes + V	-amäs
2nd pl	*-e-te	*-e-tē-r + V	-'cer	*-e-te	-'c	*-r < PIE mediopassive?; previous consonant palatalized
3rd pl	*-o-nti	*-o-nt	-eṃ	*-o-nti	-eñc < *-añc	*-o-nt < PIE secondary ending

Comparison to other Indo-European languages

English	Tocharian A	Tocharian B	Ancient Greek	Sanskrit	Latin	Gothic	Old Irish	Proto-Indo-European
Tocharian vocabulary (sample)
one	sas	ṣe	heĩs, hen	sa(kṛ́t)	semel	simle	samail	PToch sems ← sḗm
two	wu	wi	duo	dvā́	duo	twái	dá	*duoh₁
three	tre	trai	treis	tráyas	trēs	þreis	trí	*tréi̯es
four	śtwar	śtwer	téssares	catvā́ras, catúras	quattuor	fidwōr	cethair	*kʷetu̯óres
five	päñ	piś	pénte	páñca	quīnque	fimf	cóic	*pénkʷe
six	ṣäk	ṣkas	héx	ṣáṣ	sex	saihs	sé	*su̯éḱs
seven	ṣpät	ṣukt	heptá	saptá	septem	sibun	secht	*septḿ̥
eight	okät	okt	oktṓ	aṣṭáu, aṣṭá	octō	ahtau	ocht	*h₃eḱtéh₃(u)
nine	ñu	ñu	ennéa	náva	novem	niun	noí	*h₁néun̥
ten	śäk	śak	déka	dáśa	decem	taihun	deich	*déḱm̥t
hundred	känt	kante	hekatón	śatām	centum	hund	cét	*ḱm̥tóm
father	pācar	pācer	patḗr	pitṛ	pater	fadar	athair	*ph₂tḗr
mother	mācar	mācer	mḗtēr	mātṛ	mater	mōdar	máthair	*méh₂tēr
brother	pracar	procer	phrā́tēr^{[* 1]}	bhrātṛ	frāter	brōþar	bráthair	*bʰréh₂tēr
sister	ṣar	ṣer	héor^{[* 1]}	svásṛ	soror	swistar	siur	*swésōr
horse	yuk	yakwe	híppos	áśva-	equus	aiƕs	ech	*h₁éḱwos
cow	ko	keu	boũs	gaúṣ	bōs^{[* 2]}	(OE cū)	bó	gʷeh₃us ~ gʷh₃eum̥
voice	vak	vek	épos^{[* 1]}	vāk	vōx	(Du gewag)^{[* 1]}	foccul^{[* 1]}	*u̯ṓkʷs
name	ñom	ñem	ónoma	nāman-	nōmen	namō	ainmm	*h₃néh₃-m̥n
to milk	mālkā	mālkant	amélgein	—	mulgēre	miluks	bligid (MIr)	*h₂melǵ-ei̯e

↑ ^1.0 ^1.1 ^1.2 ^1.3 ^1.4 Cognate, with shifted meaning
↑ Borrowed cognate, not native.

In traditional Indo-European studies, no hypothesis of a closer genealogical relationship of the Tocharian languages has been widely accepted by linguists. However, lexicostatistical and glottochronological approaches suggest the Anatolian languages, including Hittite, may be the closest relatives of Tocharian.^[35]^[36]^[37] As an example, the same Proto-Indo-European root *h₂wrg(h)- (but not a common suffixed formation) can be reconstructed to underlie the words for 'wheel': Tocharian A wärkänt B yerkwanto and Hittite ḫūrkis.

References

↑ ^1.0 ^1.1 ^1.2 ^1.3 Lua error in package.lua at line 80: module 'strict' not found.
↑ ^2.00 ^2.01 ^2.02 ^2.03 ^2.04 ^2.05 ^2.06 ^2.07 ^2.08 ^2.09 ^2.10 ^2.11 ^2.12 ^2.13 Lua error in package.lua at line 80: module 'strict' not found.
↑ Lua error in package.lua at line 80: module 'strict' not found.
↑ Deuel, Leo. 1970. Testaments of Time, ch. XXI, pp. 425–455. Baltimore, Pelican Books. Orig. publ. Knopf, NY, 1965.
↑ Renfrew (1990), pp. 107–108.
↑ Mallory & Mair (2000), pp. 280–281.
↑ Henning (1938), pp. 559–561.
↑ Hansen (2012), pp. 71–72.
↑ Mallory & Mair (2000), pp. 281.
↑ Beckwith (2009), pp. 380–383.
↑ Adams (1988), pp. 1–4.
↑ Lua error in package.lua at line 80: module 'strict' not found.
↑
- Henning (1949), p. 161: "At the same time we can now finally dispose of the name "Tokharian". This misnomer has been supported by three reasons, all of them now discredited."
- Henning (1938), p. 564: "...a misnomer, also, is 'tocharisch', which, in any case, should be replaced by 'tuγrisch'."
- Henning (1938), pp. 547–548: "On the other hand, it must be examined if there is any possibility of the reverse development: *tχuār from older *tuχār. Such a change could have taken place either in the original language of the Tokharians which is completely unknown, or in the indigenous language of Bactria, undoubtedly Iranian, probably adopted by the immigrants, which, however, is equally unknown (cf. Bailey, Ttaugara, pp. 892 sq.). As we have to deal with unknown languages nothing, in fact, can be proved or disproved."
↑ Lua error in package.lua at line 80: module 'strict' not found.
↑ Lua error in package.lua at line 80: module 'strict' not found.
↑ Daniels (1996), p. 531.
↑ Campbell (2000), p. 1666.
↑ "Fragments of the Tocharian", Andrew Leonard, How the World Works, Salon.com, January 29, 2008
↑ Lua error in package.lua at line 80: module 'strict' not found.
↑ Lua error in package.lua at line 80: module 'strict' not found.
↑ Renfrew (1990), p. 107.
↑ Baldi, Philip The Foundations of Latin (1999), pg 39
↑ ^23.0 ^23.1 ^23.2 ^23.3 ^23.4 ^23.5 Ringe, Donald A. (1996). On the Chronology of Sound Changes in Tocharian: Volume I: From Proto-Indo-European to Proto-Tocharian. New Haven, CT: American Oriental Society.
↑ Lane, George S. "Problems of Tocharian Phonology". Language, Vol. 14, No. 1 (Jan. – Mar., 1938), pp. 20–38.
↑ Lane, George S. "The Tocharian Palatalization (I)". Language, Vol. 21, No. 1 (Jan. – Mar., 1945), pp. 18–26.
↑ Sieg, Emil and Wilhelm Siegling (1949), Tocharische Sprachreste, Sprache B. Heft 1. Gottingen: Vandenhoeck & Ruprecht.
↑ Krause & Thomas (1960).
↑ Ringe, Donald A, Jr. (1990). Review of Tocharian Historical Phonology and Morphology by Douglas Q. Adams. Language, Vol. 66, No. 2 (Jun., 1990).
↑ Douglas Adams, "Ablaut and Umlaut in the Tocharian Vowel System", Journal of the American Oriental Society, Vol. 98, No. 4 (Oct. – Dec., 1978), pp. 446- 450.
↑ *ckāc- < *tkāc- by assimilation.
↑ ^31.0 ^31.1 Donald A. Ringe Jr, "Review of 'Tocharian Historical Phonology and Morphology' by Douglas Q. Adams", Language, Vol. 66, No. 2 (Jun., 1990), pp. 400–408.
↑ Beekes (1995), p. 92.
↑ Beekes (1995), p. 20.
↑ Douglas Q. Adams, "On the Development of the Tocharian Verbal System", Journal of the American Oriental Society, Vol. 98, No. 3 (Jul. – Sep., 1978), pp. 277- 288.
↑ Holm, Hans J. (2008). "The Distribution of Data in Word Lists and its Impact on the Subgrouping of Languages", In: Christine Preisach, Hans Burkhardt, Lars Schmidt-Thieme, Reinhold Decker (Editors): Data Analysis, Machine Learning, and Applications. Proc. of the 31st Annual Conference of the German Classification Society (GfKl), University of Freiburg, March 7–9, 2007. Springer-Verlag, Heidelberg-Berlin.
↑ Václav Blažek (2007), "From August Schleicher to Sergej Starostin; On the development of the tree-diagram models of the Indo-European languages". Journal of Indo-European Studies 35 (1&2): 82–109.
↑ Remco Bouckaert, Philippe Lemey, Michael Dunn, Simon J. Greenhill, Alexander V. Alekseyenko, Alexei J. Drummond, Russell D. Gray, Marc A. Suchard, Quentin D. Atkinson (2012). "Mapping the Origins and Expansion of the Indo-European Language Family". Science 337 (6097): 957–960.

Sources

Lua error in package.lua at line 80: module 'strict' not found.
Lua error in package.lua at line 80: module 'strict' not found.
Lua error in package.lua at line 80: module 'strict' not found.
Lua error in package.lua at line 80: module 'strict' not found.
Carling, Gerd (2009). Dictionary and Thesaurus of Tocharian A. Volume 1: a-j. (in collaboration with Georges-Jean Pinault and Werner Winter), Wiesbaden, Harrassowitz Verlag, ISBN 978-3-447-05814-8.
Lua error in package.lua at line 80: module 'strict' not found.
Lua error in package.lua at line 80: module 'strict' not found.
Lua error in package.lua at line 80: module 'strict' not found.
Lua error in package.lua at line 80: module 'strict' not found.
Lua error in package.lua at line 80: module 'strict' not found.
Levi, Sylvain (1913). "Tokharian Pratimoksa Fragment". The Journal of the Royal Asiatic Society of Great Britain and Ireland, pp. 109–120.
Lua error in package.lua at line 80: module 'strict' not found.
Malzahn, Melanie (ed.) (2007). Instrumenta Tocharica. Heidelberg: Carl Winter Universitätsverlag, ISBN 978-3-8253-5299-8.
Pinault, Georges-Jean (2008). Chrestomathie tokharienne. Textes et grammaire, Leuven-Paris, Peeters (Collection linguistique publiée par la Société de Linguistique de Paris, t. XCV), ISBN 978-90-429-2168-9.
Lua error in package.lua at line 80: module 'strict' not found.
Ringe, Donald A. (1996). On the Chronology of Sound Changes in Tocharian: Volume I: From Proto-Indo-European to Proto-Tocharian. New Haven, CT: American Oriental Society.
Schmalsteig, William R. (1974). "Tokharian and Baltic." Lituanus. v. 20, no. 3.
Winter, Werner (1998). "Tocharian." In Ramat, Giacalone Anna and Paolo Ramat (eds). The Indo-European languages, 154–168. London: Routledge, ISBN 978-0-415-06449-1.

External links

Conjugation tables for Tocharian A and B
Tocharian alphabet (from Omniglot)
TITUS: Tocharian alphabet, Tocharian A manuscripts from the Berlin Turfan Collection
Mark Dickens, 'Everything you always wanted to know about Tocharian'
Tocharian-to-English dictionary with nearly 200 words
Tocharian Online from the University of Texas at Austin
Online dictionary of Tocharian B with over 4000 entries.^{[dead link]}
Online dictionary of Tocharian B, based upon D. Q. Adams's A Dictionary of Tocharian B (1999)
Tocharian B Swadesh list (From Wiktionary)
Comprehensive Edition of Tocharian Manuscripts, Universität Wien, with images, transcriptions and (in many cases) translations and other information.
Lua error in package.lua at line 80: module 'strict' not found. Transcriptions of Tocharian A manuscripts.
Lua error in package.lua at line 80: module 'strict' not found.

[cognate-35] 1.0 ^1.1 ^1.2 ^1.3 ^1.4 Cognate, with shifted meaning

[borrowed-36] Borrowed cognate, not native.

[mallory-expedition-1] 1.0 ^1.1 ^1.2 ^1.3 Lua error in package.lua at line 80: module 'strict' not found.

[Tocharian_Online-2] 2.00 ^2.01 ^2.02 ^2.03 ^2.04 ^2.05 ^2.06 ^2.07 ^2.08 ^2.09 ^2.10 ^2.11 ^2.12 ^2.13 Lua error in package.lua at line 80: module 'strict' not found.

[Glottolog-3] Lua error in package.lua at line 80: module 'strict' not found.

[4] Deuel, Leo. 1970. Testaments of Time, ch. XXI, pp. 425–455. Baltimore, Pelican Books. Orig. publ. Knopf, NY, 1965.

[FOOTNOTERenfrew1990107.E2.80.93108-5] Renfrew (1990), pp. 107–108.

[FOOTNOTEMalloryMair2000280.E2.80.93281-6] Mallory & Mair (2000), pp. 280–281.

[FOOTNOTEHenning1938559.E2.80.93561-7] Henning (1938), pp. 559–561.

[FOOTNOTEHansen201271.E2.80.9372-8] Hansen (2012), pp. 71–72.

[FOOTNOTEMalloryMair2000281-9] Mallory & Mair (2000), pp. 281.

[FOOTNOTEBeckwith2009380.E2.80.93383-10] Beckwith (2009), pp. 380–383.

[FOOTNOTEAdams19881.E2.80.934-11] Adams (1988), pp. 1–4.

[12] Lua error in package.lua at line 80: module 'strict' not found.

[13] 

Henning (1949), p. 161: "At the same time we can now finally dispose of the name "Tokharian". This misnomer has been supported by three reasons, all of them now discredited."

Henning (1938), p. 564: "...a misnomer, also, is 'tocharisch', which, in any case, should be replaced by 'tuγrisch'."

Henning (1938), pp. 547–548: "On the other hand, it must be examined if there is any possibility of the reverse development: *tχuār from older *tuχār. Such a change could have taken place either in the original language of the Tokharians which is completely unknown, or in the indigenous language of Bactria, undoubtedly Iranian, probably adopted by the immigrants, which, however, is equally unknown (cf. Bailey, Ttaugara, pp. 892 sq.). As we have to deal with unknown languages nothing, in fact, can be proved or disproved."

[16] Henning (1949), p. 161: "At the same time we can now finally dispose of the name "Tokharian". This misnomer has been supported by three reasons, all of them now discredited."

[17] Henning (1938), p. 564: "...a misnomer, also, is 'tocharisch', which, in any case, should be replaced by 'tuγrisch'."

[18] Henning (1938), pp. 547–548: "On the other hand, it must be examined if there is any possibility of the reverse development: *tχuār from older *tuχār. Such a change could have taken place either in the original language of the Tokharians which is completely unknown, or in the indigenous language of Bactria, undoubtedly Iranian, probably adopted by the immigrants, which, however, is equally unknown (cf. Bailey, Ttaugara, pp. 892 sq.). As we have to deal with unknown languages nothing, in fact, can be proved or disproved."

[14] Lua error in package.lua at line 80: module 'strict' not found.

[15] Lua error in package.lua at line 80: module 'strict' not found.

[FOOTNOTEDaniels1996531-16] Daniels (1996), p. 531.

[FOOTNOTECampbell20001666-17] Campbell (2000), p. 1666.

[18] "Fragments of the Tocharian", Andrew Leonard, How the World Works, Salon.com, January 29, 2008

[19] Lua error in package.lua at line 80: module 'strict' not found.

[20] Lua error in package.lua at line 80: module 'strict' not found.

[FOOTNOTERenfrew1990107-21] Renfrew (1990), p. 107.

[baldi-22] Baldi, Philip The Foundations of Latin (1999), pg 39

[ringe-proto-tocharian-23] 23.0 ^23.1 ^23.2 ^23.3 ^23.4 ^23.5 Ringe, Donald A. (1996). On the Chronology of Sound Changes in Tocharian: Volume I: From Proto-Indo-European to Proto-Tocharian. New Haven, CT: American Oriental Society.

[24] Lane, George S. "Problems of Tocharian Phonology". Language, Vol. 14, No. 1 (Jan. – Mar., 1938), pp. 20–38.

[25] Lane, George S. "The Tocharian Palatalization (I)". Language, Vol. 21, No. 1 (Jan. – Mar., 1945), pp. 18–26.

[26] Sieg, Emil and Wilhelm Siegling (1949), Tocharische Sprachreste, Sprache B. Heft 1. Gottingen: Vandenhoeck & Ruprecht.

[FOOTNOTEKrauseThomas1960-27] Krause & Thomas (1960).

[28] Ringe, Donald A, Jr. (1990). Review of Tocharian Historical Phonology and Morphology by Douglas Q. Adams. Language, Vol. 66, No. 2 (Jun., 1990).

[29] Douglas Adams, "Ablaut and Umlaut in the Tocharian Vowel System", Journal of the American Oriental Society, Vol. 98, No. 4 (Oct. – Dec., 1978), pp. 446- 450.

[30] *ckāc- < *tkāc- by assimilation.

[ringe-adams-31] 31.0 ^31.1 Donald A. Ringe Jr, "Review of 'Tocharian Historical Phonology and Morphology' by Douglas Q. Adams", Language, Vol. 66, No. 2 (Jun., 1990), pp. 400–408.

[FOOTNOTEBeekes199592-32] Beekes (1995), p. 92.

[FOOTNOTEBeekes199520-33] Beekes (1995), p. 20.

[34] Douglas Q. Adams, "On the Development of the Tocharian Verbal System", Journal of the American Oriental Society, Vol. 98, No. 3 (Jul. – Sep., 1978), pp. 277- 288.

[37] Holm, Hans J. (2008). "The Distribution of Data in Word Lists and its Impact on the Subgrouping of Languages", In: Christine Preisach, Hans Burkhardt, Lars Schmidt-Thieme, Reinhold Decker (Editors): Data Analysis, Machine Learning, and Applications. Proc. of the 31st Annual Conference of the German Classification Society (GfKl), University of Freiburg, March 7–9, 2007. Springer-Verlag, Heidelberg-Berlin.

[38] Václav Blažek (2007), "From August Schleicher to Sergej Starostin; On the development of the tree-diagram models of the Indo-European languages". Journal of Indo-European Studies 35 (1&2): 82–109.

[39] Remco Bouckaert, Philippe Lemey, Michael Dunn, Simon J. Greenhill, Alexander V. Alekseyenko, Alexei J. Drummond, Russell D. Gray, Marc A. Suchard, Quentin D. Atkinson (2012). "Mapping the Origins and Expansion of the Indo-European Language Family". Science 337 (6097): 957–960.

[1]

[2]

[3]

[4]

[5]

[6]

[7]

[8]

[9]

[10]

[11]

[12]

[13]

[14]

[15]

[16]

[17]

[18]

[19]

[20]

[21]

[22]

[23]

[24]

[25]

[26]

[27]

[28]

[29]

[30]

[31]

[32]

[33]

[34]

[* 1]

[* 2]

[35]

[36]

[37]

Tocharian languages

Contents

Discovery and significance

Names

Writing system

Tocharian A and B

Phonology

Vowels

Diphthongs

Consonants

Proto-Tocharian

Vowels

Vowel deletion and insertion

Consonants

Palatalization

Dental consonants

Morphology

Nouns

Verbs

Categories

Classes

Present indicative

Subjunctive

Preterite

Imperative

Optative and imperfect

Endings

Comparison to other Indo-European languages

See also

References

Sources

External links

Navigation menu

Search