Peoples and Languages of Nuristān

The following ethnolinguistic overview is a revision of a paper given at the 3rd Himalayan Languages Symposium, University of California at Santa Barbara, on 18 July 1997. The symposium was supported with funds from the Wenner-Gren Foundation.

Return to: Peoples of Nuristān Map | Top | Topics

Last modified 10 June. 1998

Copyright © 1997, 1998 by Richard F. Strand


An Overview of The Nuristâni Languages

by

Richard F. Strand


The five languages of Nuristân are spoken by about 70,000 people in the Hindu-Kush mountains of northeastern Afghanistan. Nuristânis inhabit the drainages of three main rivers: the Alingar in the west, the Pech-Pârun in the center, and the LanDai Sin in the east. The latter two flow into the Kunar-Chitral River. I list the languages here in a north-to-south order.

Transcriptional notes:

Vowels: a represents the unmarked vowel in each language. In groups 1, 2, and 3 it is a high central vowel, while in groups 4 and 5 it is a low central vowel. a contrasts with â, which is articulated with the jaw more open. Nasalized vowels are indicated by a following ~. Stress is indicated by ' before a vowel in groups 1 and 2, where it is phonemic. In the remaining groups (including Tregâmi?) stress falls automatically on the last syllable of a stem.

Consonants: Retroflex (apico-postalveolar) consonants are indicated by CAPITAL letters: T, D, C, J, S, Z, R, N, and ñ. R represents a retroflex approximant, as opposed to a tapped r. N is a retroflex nasal stop in groups 1 and 2, but a retroflex nasal flap in the other groups (again, Tregâmi is unclear). ñ is a nasalized R. Palatal (lamino-alveolar) consonants include the affricates c and j and the spirant $. ç is a dental affricate; z is a dental affricate in groups 1 and 2, and a lamino-dental spirant in the other groups.

  1. vâs'i vari ('Vâsi Language') is spoken in upper, middle, and lower dialects by the vâs'i people of the Pârun Valley.
  2. kâmk'ata mumkSt'a viri ('Kom, Kâta, Mumo, and Kshto Language') is spoken by the kât'a, mum'o, kSto, and kom peoples who range across the breadth of Nuristân. The Kâta speak two main dialects, Western and Eastern kât'aviri. Western Kâtaviri is spoken in Râmg'al, as they call the upper Alingar Basin, in the kt'ivi Valley off the upper Pech, and in the p'aRuk Valley off the upper LanDai Sin. Each of these regions has its own subdialect. Eastern Kâtaviri is spoken in the upper LanDai Sin and in a few villages across the border in Pakistan's Chitral District. The Kom and the Kshto speak another dialect, kâmv'iri or kSt'aviri, depending on which ethnic group one addresses; the Kom inhabit the lower LanDai Sin and upper Kunar basins, with some Kom spilling over into southern Chitral. Three client peoples, the jâ$'i, the bini'o, and the jâmc'o, live within Kom territory and speak Kâmviri. The Kshto inhabit one village surrounded by Kom. The Mumo speak a transitional dialect, mumv'iri, between Kâmviri and Eastern Kâtaviri in the middle LanDai Sin Valley.
  3. âSkuNu vîri is spoken by the âSku~ people in separate dialects in the bâzâigal, kolâtâ~, and titin Valleys off the middle Alingar, and by the saNu and gRâmsaNâ peoples of the middle Pech Valley, each with their own dialect.
  4. kalaSa alâ is spoken by the kalaSa people of the Vaygal Valley off the lower Pech and to the east in the veligal and d'u~gul Valleys off the middle Kunar. A major dialect division separates the varjan people of the upper Vaygal Valley from the cima-ni$ey people of the lower valley. Minor dialect divisions separate each village, especially among the Varjan, where the chief subdialect is that of the vay people of the village of vaygal.
  5. tregâmi is spoken in three villages of the Tregâm Valley off the lower Pech.

On the basis of shared features, Groups 1 and 2 form a northern cluster, opposed to a southern cluster consisting of Groups 3, 4 and 5. However, there is much overlapping of dialect traits.

The phylogenetic position of these languages within Indo-European was disputed by earlier scholars, until Georg Morgenstierne of Oslo University established that they formed a third branch of Indo-Iranian, in between Indo-Aryan and Iranian. I can add little to his view, except to flesh out a best-guess scenario for the history of these languages, based on native traditions, recent archeological syntheses of scholars like Marija Gimbutas, and my own observations of phonetics in the field.

Around the middle of the fourth millennium B.C., some 800 years after the first Indo-European peoples expanded out of their Volga Basin homeland into Europe, new waves of horse-mounted tribesmen who called themselves Aryas expanded south and east around the Caspian Sea from the Volga Basin, driving other Indo-European speaking peoples before them. Those Aryas who spread south into the region between the Caspian and Black Seas bumped up against the Caucasus Range, which for some fifteen hundred years served as their southern border. These southern Aryas were the precursors of the Indo-Aryan speakers of the Indian Subcontinent. By the beginning of the third millennium the remaining Aryas to the north had expanded to the west around the Black Sea, into Europe's Danube basin, and to the east around the Caspian Sea, into the basin of the Amu Darya. These were the precursors of today's Iranian speakers. Ahead of the eastern Iranian expansion were driven the precursors of the Nuristānis.

The earliest Aryas distinguished themselves expressively through the processes of harshening, apical suppression, and prognathizing. Harshening lowers the spectral frequency of a fricative by apicalizing and retroflexing it. Apical suppression holds the tongue's apex firmly fixed to the back of the lower teeth as the default articulation; only during retroflex, dental, and perhaps labiovelar sounds, does the apex leave its suppressed position. This process produces the palatalization of velars next to front vowels. Prognathizing is the process of jutting out the jaw to present a belligerent demeanor. Combined with apical suppression, it moves the blade of the tongue into contact with the upper teeth, so that palatal affricates become dental and apicodental sounds become laminodental. This process is also responsible for the change of postconsonantal v to a stop in may languages of the region. These three processes have acted periodically in the history of the Aryan languages up to today, as any phonetician who has observed the harshening and belligerent prognathizing of Kunar Afghans can attest.

One way to account for the early differentiation of proto-Nuristāni from the rest of the Aryan group is to assume that the Aryas first initiated a harshening of s after u. The proto-Nuristānis did not participate in that process, because they were beyond the influence of Aryan at that time. Thus, the word for 'mouse' is mus'a in Kātaviri but muS'a in a neighboring Indo-Aryan language like Gāhwar bāti.


Summary of Nuristani Developments:
1st Palatalization
'dog'
*kuon-'e-
*cuon'e-
*cuan'e-
*ēuan'e-
*ēuan'a-
*ēun'a-
*ē'auna-
*ē'aun-
*ē'un-
 ē'u~
'ten'
*d'ekm
*d'eka
*d'eca
*d'eēa
*d'aēa
*d'aē
 d'uē
'knee'
*g'enu-
*j'enu-
*j'źnu-
*z'źnu-
*z'ānu-
*z'ān-
*z'ā~-
 z'o~
'snow'
*ghim'o-
*jhim'o-
*jhim'a-
*jim'a-
*zim'a-
*z'ima-
 zim

Later, when the Proto-Nuristānis were within the Aryan sphere, both groups underwent apical suppression, which caused the plain velar stops to become palatal affricates, so that, for example, PIE *kuon- 'dog' became *cuon-, *d'ekm 'ten' became *d'eca, *genu- 'knee' became *j'źnu-, and *ghi-m'o- 'snow' became *jhim'o-. This process also caused the laminalization of s after i, as in *ni$anna- 'seated', and it may also have been involved in the change of Indo-European schwa (vocalic H) to i.

Other assimilations and further harshening changed s to S after r and k throughout the region. Later all harshened s's, that is, those after u, i, r, and k, merged to become $ in Iranian and S in Indo-Aryan. In proto-Nuristāni $ after i (as in vi$ 'poison'; cf. Sanskrit viS'a-) and S after r and k remain apart, while s after u remains unharshened.

Throughout the Aryan sphere there was, after the first palatalization, a loss of lip-rounding accompanying the labiovelar stops and the vowel o; the labiovelars kw, gw, and ghw became k, g, and gh, and o merged with a.

At this point the pronunciation of the palatal affricates diverged. The Indo-Aryans dropped the affrication of the voiceless palatal, producing a palatal spirant, so that *cuna- became Sanskrit $una- 'dog' and *d'eca became d'a$a 'ten', while the voiced palatals remained (*j'źnu- 'knee', later j'ānu-; *jhim'o- 'snow', later him'a-). The Iranians prognathized to produce dental affricates, and the proto-Nuristānis, being under Iranian influence, followed suit, so that *cuna- became KalaSa alā ēu~ 'dog' (through strengthened *c'āuna-), *d'eca became Kāmviri duē 'ten', *j'źnu- 'knee' became Kāmviri zo~, and *jhim'o- 'snow' became Kāmviri zim.

2nd Palatalization
'migrate'
*kwel'e-
*kel'e-
*cel'e-
*cal'a-
 cāl'a-
'five'
*p'enkwe
*p'enke
*p'ence
*p'anca
*p'anc
*p'a~c
*p'u~c
 p'uc
'alive'
*gwiHu'o-
*gwi:v'o-
*gi:v'o-
*ji:v'a-
*jiv'a-
*jü'a-
 j'üa-
'smite'
*ghw'ene-
*gh'ene-
*jh'ene-
*jh'ana-
*jan'a-
*jaN'a-
*jāN'a-
*jāń'a-
 j'āńa-

Throughout the Aryan-speaking region the default apical suppression continued, producing a second palatalization of the remaining velars (from labiovelars) before palatal sounds (i and e). In Indo-Aryan the resulting voiced affricates j and jh, as in the development of PIE *gwiH-u'o- 'alive' > *giHu'o- > Skt. ji:v'a- and PIE *ghwene- 'smite' > *ghene- > *jhene- > Skt. h'ana-, were identical to those of the previous palatalization, while the new voiceless affricate c, as in *kwele- 'migrate away' > *cele > Skt. c'ala-, contrasted with the spirant $ from the earlier palatalization. In both Iranian and proto-Nuristāni, the older dental affricates ē and z remained fully distinct from the new palatal affricates c and j, as in Kāmviri cāl'a- 'migrate', jüa- 'be alive', and jāń'a- 'kill'.

A final innovation occurred throughout the Aryan-speaking region: the loss of palatality of the vowel e, merging it with a.

In the Iranian region, a further innovation, the loss of voiced aspiration, also spread to the proto-Nuristānis; but they escaped subsequent Iranian laxing innovations, such as the changes of s to h, of stops to spirants before other stops, of voiceless aspirates to spirants, and the deaffrication of the dental affricates.

By the beginning of the second millennium B.C. we find the Indo-Aryans spreading south over the Caucasus, perhaps under pressure from their Iranian cousins to the north. By the middle of that millennium they had mostly left their formative region. They mingled with Caucasian Hurrians to form the kingdom of Mitanni in the west, and they spread east across the northern Iranian Plateau, through Khorāsān and the Helmand basin and up into the Kabul River basin. At that time they began to collide with Iranians coming through Khorāsān from the north, and by the beginning of the first millennium B.C. the Iranians had prevailed and obliterated the Indo-Aryans from Khorāsān westward. The proto-Nuristānis escaped further Iranization by moving east from Khorāsān into Indo-Aryan territory.

At this point we begin to get traditional accounts of their origins. According to Kom tradition, their route took them through the Helmand Valley to Kandahār, and then up to Kābul, Kāpisā, and finally down the Kābul River to Kāmā, at the confluence of the Kābul and Kunar Rivers. They probably followed that route first to escape advancing Iranians, and later to escape advancing Muslim armies that began their conquest of the region at the start of the 8th century A.D. They remained in Indo-Aryan-speaking territory at least from early Middle Indo-Aryan times, and participated in many of the major linguistic developments of the northwest Middle-Indo-Aryan region. A large portion of the Nuristāni vocabulary may be traced back to Indo-Aryan borrowings.

At the beginning of the 11th century Afghan expansion forced the Nuristānis into their present region. Islamic Turkish and Afghan holy warriors from Ghazni swept through the Kābul Valley, including Kāmā, on their way to India. They tried to convert the Hindu "infidels" of the region to Islam; those who refused fled south to the Safed Koh and north to the Hindu Kush for refuge. The Nuristānis were driven part way up the Kunar and then north over the Kunar-Pech watershed, leapfrogging the Indo-Aryan speaking Degans who were settled in the lowlands of the Kunar basin. The Nuristāni refugees probably bypassed the Indo-Aryans of the lower Pech and Vaygal Valleys, to settle in more sparsely inhabited areas. The Kāta, KSto, and Mumo ended up in the region where the Ktivi and Pārun Rivers converge to form the Pech. One Kāta account claims that they displaced an indigenous people, the jā$'i, about whom more later. The Vāsi, being less powerful, ended up in the high and less desirable Pārun Valley, while the ĀSkuNu speakers established themselves above the Pech's bend and over into the Alingar basin. The Vay expelled the inhabitants of what is now the village of Vaygal at the head of the Vaygal Valley. Those new refugees went east to the upper Kunar, where they live today, speaking Gahwār bāti. The Kom apparently split up; one group went as far as lower Rāmgal, while another group went to highlands further up the Kunar.

The invasion of the Ghaznavids was the first of a series of violent encroachments by Afghans into the lowlands of the Kunar, Pech, and Alingar valleys. This period of violent encroachment lasted into the 16th century, and displaced both Indo-Aryan and Nuristāni speakers. Encroachment by Afghans into the region continues in diminished intensity even until today; but with increasing intensity, contact with Afghans is transforming the lexicons of the Nuristāni languages through the introduction of the Perso-Arabic vocabulary of Islam.

During the millennium that the Nuristānis have lived in their remote valleys of the Hindu Kush, they have had chaotic relations with one another, and there has been much shifting and mixing of populations. One area of refuge was the middle and upper LanDai Sin Valley: an early split among the Kāta sent the speakers of Eastern Kātaviri there, and the KSto, Binio, Mumo, and Jāmco also sought refuge there. The Kom left Rāmgal and settled between the people of SaNu and Ktivi. There they became troublemakers and were driven out. They too fled to the middle LanDai Sin, where they encountered the Jā$i and the Binio. They expropriated the Jā$i lands in the lower LanDai Sin, down to the Kunar, where they apparently met up with their distant kinsmen living in the highlands across the Kunar. They also took most of the land that the KSto had settled.

Another refuge area was Tregām. A group of GRāmsaNā people invaded the lower Vaygal Valley to become the Cima-Ni$ey; they expelled the PrejvRe~ inhabitants into Tregām. Another portion of the population of Tregām were exiles from Ktivi.

By the time I started my fieldwork in Nuristan in the late sixties, the internecine fighting of earlier centuries had mostly disappeared, and any hostilities were aimed at Afghan and Gujar interlopers. When that largest of interlopers, the Soviet Union, set up a communist regime in Afghanistan, the Nuristānis were the first to revolt. For almost twenty years now, the societies of Nuristān have been scarred by war, and I fear that the old internecine chaos may be on the rise. The languages, for the time being, remain vital; and their vocabularies have acquired a new Persian layer that allows them to interface well with the modern, external world of war and technology.

Let me turn now to a brief typology of these languages, which will be based on their cognitive geometry. I note at the outset that Vāsi vari is highly aberrant vis-ą-vis the other languages, both in its phonological and morphological development.

Like most languages of ancient Eurasia, the Nuristāni languages first depict the geometric relationships of the objects of discourse, and then they depict a change in the objects. In "traditional" terms, they are SOV languages.

All objects in the cognitive field have a location, which is specified by components of distance and direction. Distance is measured by dichotomous divisions of space into internal vs. external spheres. Such divisions generate a three-way contrast of distance between archetypical, perceptual, and conceptual.

Nouns at archetypical distance represent internal archetypical objects, traditionally labelled as indefinite, and thus do not appear with deictics. An example is Kāmviri

         g'o gaē. 'Give me a cow' (any cow),

with just the basic form of the noun.

Nouns at the remaining distances represent external objects, traditionally labelled definite, and thus require deictics to locate them in space, unless they are nouns with implied location, like nurist'on.

Nouns at perceptual distance represent objects internal to the speaker's current perceptual sphere. Such objects are again specified as internal or proximal vs. external or distal, as in Kāmviri

         ina g'oa gaē. 'Give me this cow.'

vs.

         i·a g'oa gaē. 'Give me that cow.'


Nouns at conceptual distance represent non-perceived objects, and are therefore external to the speaker's perceptual sphere; thus Kāmviri

          āska g'oa gaē. 'Give me that (unseen) cow.'

With plural nouns conceptual distance is foreshortened, so that plurals appear only with perceptual deictics, whether unseen or not.

In addition to distance specification, ĀshkuNu Viri and Vāsi Vari allow directional pointers to be prefixed to deictics.

At least one object in the cognitive field stands in the foreground as the subject. Other non-archetypical objects may be backgrounded, as indicated by an oblique case suffix on the noun. Several postpositions and one or two prepositions may stand with an oblique-case noun to indicate regions or direction relative to the noun's object referent.

The overview of Kāmviri verbs that I presented earlier today [see abstract] is representative of the systems for the other languages, with some exceptions in Vāsi Vari. Finite verb forms are built on the ancient present active participle, the past passive participle, or, less ubiquitously, on the present stem, followed by a subject pointer. A participial base may be followed by a closely compounded form of *āsa- 'is' with subject pointer, or by an enclitic form of *āsa- with subject pointer. The forms of *āsa- serve to place the change further away on a distance scale analogous to the one for nouns. Here such distance is temporal, allowing discriminations between present vs. imperfect, or vivid vs. resultant past, or imaginative vs. resultative.

For transitive verbs built on the present stem (including the present participle), the agent stands in the foreground as subject, and an external patient is backgrounded by an oblique case marker. For transitive verbs built on the past passive participle, the situation is reversed; the focus is on the end-point of the verbal change, so that the patient stands in the foreground and the agent is backgrounded. This latter configuration implies that the speaker is looking back into time, with the final state of the verbal change being closer than the initial state. This typical "split-ergative" system is not found, however, in Vāsi Vari. There past forms are built on a participle that has an Iranian look, ending in -k, and definite patients are always backgrounded.

At least three modes are represented in verb stems, as were noted earlier today for Kāmviri; these include a real vs. an imperative vs. an imaginative mode.

Numerous non-finite forms exist besides the participles on which finite forms are built. There are conjunctive participles, which depict an agent-focus change prior to the change of the finite verb. There are infinitives, which depict a change as a temporal goal. Of interest are the forms that occur with verbs of motion. One of these depicts a change as a goal of motion, as in Kāmviri

          vāll'oań gu·sa. 'He went to call on someone'.

The other depicts a change concurrent with motion; an example is Kāmviri

          kān'am 'oaso. 'He came laughing'.


A salient feature of these languages is the depiction of imbedded images, which are represented by subordinate finite clauses. They depict the internal states or compulsions of an oblique-case experiencer, as in Kāmviri

          'i~a 'oata bo. 'I'm hungry'

or

          'i~a i'e~ sta_āsa. 'I have to go'.

With postverbal particles, they depict past or hypothetical scenes, as in

         'o~ē g'um_to, t'ü ā·k'i n'ā_āsa$.
          'When I went, you weren't there'

or

        'o~ē g'um bo, t'ü di ielo$. 'If I go, then you should go, too'.

With the postposed conjunctive participle (e.g., Kāmviri kti) of the verb 'make', in the sense of 'said', they depict quotations and causes, as in

          'i~a i'e~ sta_āsa kti gija_kāRo. 'He said that he has to go'

or

        gātr'a bi·sam_kti n'ā go. 'He didn't go because he was tired'.


Finally, I should mention these languages' unique system of directional pointers, which I will discuss in detail tomorrow morning [see abstract].

There is still much to be discovered about the languages of Nuristān. The pioneering research of Morgenstierne on these languages has been followed up with more concentrated field studies by only a handful of other scholars, including Georg Buddruss of Mainz University on Vāsi Vari, Aleksandr Griunberg of Leningrad and Jan Mohammad, a native speaker at the University of Arizona, on Western Kātaviri, and myself and my mentor Qāzi Ghulāmullāh on Kāmviri. Unfortunately, the prospects for further research in the homelands of these languages are eclipsed by the hostilities still current in Afghanistan.


Return to: Peoples of Nuristān Map | Top | Topics