Nuristâni Etymological Lexicon

Presented here are approximately 1,800 etymologies of words and grammatical formatives from the five Nuristâni languages. Included in the data are words that I recorded from Nuristâni speakers of Kâmk′ata-vari (dialects of Kâmv′iri and Kât′a-vari of Kt′ivi), Âṣkuňu-Saňu-vi:ri (Saňu-vi:ri dialect of Wâmâ), Kalaṣa-alâ (Nišei-alâ dialect), and a few from Vâs′i-vari (dialects of Ṣup′u and Uṣ′üt). I have also included the lexicons recorded by Georg Morgenstierne (1949) and Georg Buddruss (2005) from Vâsi-vari (dialects of Uṣ′üt and Seć) and the short Tregâmi lexicon (dialect of Gambir) recorded by Georg Morgenstierne (1952).

The majority of etymologies given here were first proposed by Morgenstierne in his various publications, later collected in Turner (1966). A few are due to Buddruss, appearing in Turner (1966) and Buddruss (2005). New etymologies proposed by Strand are indicated by a plus sign (+) in the comments following the Nuristâni forms.

Entries in the lexicon may be found using the Phonemic Index to the right. Click on a phoneme's symbol in the table. (If the Semantic Index appears to the right, click on "Alphabetic Index" at the top of the Index pane to show the Phonemic Index. You may have to click on "Nûristânî Etymological Lexicon" again at the bottom of the Lexicon pane to synchronize the selected choices.)

Entries under each phoneme are alphabetized according to the phoneme's position of articulation and subsequently by its degree of vocal-tract stricture. Vowels precede oral consonants, which precede pharyngeal and laryngeal phonemes. The alphabetic order appears at the top of each phoneme's page.

An etymological entry contains the following parts:

  1. The head entry, consisting of
    1. An abbreviation of the name of the earliest source language. Source languages fall into ten stages in the development of the Nuristâni languages, as shown in the following table:
      StageLanguages
      1EurAs = Eurasiatic
      2PIE = Proto-Indo-European
      3PAr = Proto-Aryan (Proto-Indo-Iranian)
      4OIA = Old Indo-Aryan
      5 MIA = Middle Indo-Aryan
      Ir = Iranian (unspecified language)
      Middle Prs = Middle Persian
      Turk = Turkic (unspecified language)
      Mongol = Mongolian
      6PNur = Proto- (or Common) Nuristani
      7Prs = Persian
      8Psht = Pashto
      Kal-mun = Kalaṣa-mun
      Khowàr = Khowàr
      Dåmiâ = Dåmiâ-båṣa
      9Arb = Arabic
      10Eng = English
    2. The entry word or root-form from the source language.
    3. The English gloss of the source-language entry
    4. A reference to the source of the etymology, if any. The reference contains the entry forms and citation or page numbers from the lexica of the following sources:
  2. Any intermediate-stage forms in the development of the modern Nuristâni forms. Modern Nuristâni forms derive from PIE sources via the following temporal trajectories: In the lexicon intermediate-stage forms are distinguished by indentation on subsequent lines.
  3. The Nuristâni forms that correspond to (descend from) the head word. The entry for each form contains:
    1. A Nuristâni dialect code followed by a source code. The following table lists the Nuristâni dialect codes found in this lexicon:
      Vâsʹi vari Kâmkʹata vari Âṣkuňu-Saňu vi:ri Kalaṣa-alâ Tregâmi
      Vâsi.u. = dialect of Uṣüt
      Vâsi.z. = dialect of Zumu
      Vâsi.üć. = dialect of Üćü
      Vâsi.s. = dialect of Sʹeć
      Vâsi.ṣu. = dialect of Ṣupʹu
      Kmkt.ktv = Kâtʹa vari of Ktʹivi
      Kmkt.km. = Kâmvʹiri
      Aṣk.s. = dialect of Saňu Kal.n = dialect of Nišeigrâm
      Kal.v = dialect of Vägal
      Kal.a = dialect of Amešdeš
      Treg.g = dialect of Gambir
      Source codes indicate the field researcher who recorded the Nuristâni form: S = Richard F. Strand, M = Georg Morgenstierne, B = Georg Buddruss, L=Wolfgang Lentz [via Morgenstierne (1949], LSI = Linguistic Survey of India [via Morgenstierne (1949]
    2. The Nuristâni form itself
    3. The part-of-speech code for the form
    4. The form's English gloss
    5. Comments on the etymology. A plus sign (+) indicates that the Nuristâni form should be added to those listed under the indicated entry in Turner (1966).

Language codes and entry forms are colored to match the colors of the language and dialect names found in the table "Phylogenetic Classification of the Region's Languages" on this website.


Notes on the Developmental Stages:

  1. Eurasiatic. Greenberg (2000), building on the work of Russian researchers, posited a Eurasiatic linguistic superphylum that includes the major linguistic families of northern Eurasia. The time of such linguistic unity must have preceded the last glaciation, some 30,000 years ago.
    The Eurasian entries in this lexicon are single-phoneme items that include t, r, l, n, m, i, k, and s. At this stage in linguistic development these sounds were apparently produced in conjunction with conventional hand and mouth gestures that modeled the spatial relationships of the objects of discourse. Even today many such gestures accompany speech in the region of Nuristân. Based on such currently used gestures, I have posited hypothetical gestures as the "translations" of the Eurasiatic forms given here.
    The various proposed PIE descendants of the Eurasiatic forms are listed under the Eurasiatic head entry. Many PIE descendant forms that have been treated separately in standard references can be united gesturally. For example, the "sideways-wagging hand with pointing index finger" represented by n may have indicated dispersed or indefinite location, active objects, or "erasure" to indicate negation, just as it does today.
  2. Proto-Indo-European. The transcription of reconstructed PIE head forms mostly follows Pokorny's, with the following exceptions: The transcription here maintains the traditionally reconstructed distinction between three types of PIE dorsal stops: "palatal" (dorso-palatals: , , and ĝʱ), "plain" (dorso-velars: k, g, and ), and "labialized" (labialized dorso-velars: , , and gʷʱ).
  3. Proto-Aryan. Reconstructed Proto-Aryan forms represent a stage after these developmental steps: At this stage there is a contrast between dorso-velar (k, g, and ), dorso-palatal (, , and ĝʱ), and lamino-alveolar stops (č, ǰ, and ǰʱ). This stage precedes two subsequent steps in the development of the individual branches of the Aryan languages:
  4. Old Indo Aryan. OIA forms are those of Turner (1966), with the addition of hyphens to indicate morpheme boundaries. Long vowels are transcribed as vowel + : to indicate their PIE origin as vowel + laryngeal, except where the OIA reflex â represents a PIE ablaut-grade vowel ē, ō, or ā. I have substituted ɦ and ʱ (after consonants) for the traditionally written h of OIA, to emphasize their posterior voicing. The traditional OIA or Sanskrit transcription of the lamino-alveolar spirant as ś is replaced by the more contemporary symbol š.
    Note that all OIA forms listed in the lexicon are loanwords into ancient Nuristâni and do not represent direct lineal descendants of PIE or PAr forms in Nuristâni. If an OIA form is the same as a reconstructed PAr form, the PAr form rather than the OIA form is assumed to be the precursor of the modern Nuristâni forms.
  5. Proto-Nuristâni. Proto-Nuristâni forms represent a common Nuristâni stage before the subsequent raising and rounding of accented vowels, before the assimilation of n before stops, and before the subsequent development of r and in the modern Nuristâni languages.