CLDF dataset derived from Blum et al.'s "A phylolinguistic classification of the Quechua language family" from 2023
If you use these data, please cite
- the original source
Blum, Frederic, Carlos Barrientos, Adriano Ingunza & Zoe Poirier. 2023. A phylolinguistic classification of the Quechua language family. INDIANA - Anthropological Studies on Latin America and the Caribbean 40(1). 29–54. DOI: https://doi.org/10.18441/IND.V40I1.29-54.
- the derived dataset using the DOI of the particular released version you were using
This dataset is licensed under a CC-BY-4.0 license
Conceptlists in Concepticon:
This dataset features a 150-item concept list (concepticon link here) for languages of three Andean families: Quechua, Aymara, and Uru-Chipaya. In the current version, the dataset covers 42 Quechua varieties (three of them from colonial times), five Aymara languages, and three languages of the Uru-Chipaya family. The data were collected from a variety of published sources as well as a couple of individual contributions. All forms have been converted to IPA and segmented to enable further processing in both traditional and computational research. Even so, the phonological data should be treated with caution: we did no fieldwork of our own, and the phonological information varies considerably from source to source. We greatly appreciate any contributions that fill gaps in our dataset or add phonological information.
Furthermore, this dataset is part of an ongoing research project which seeks to re-evaluate the internal classification of Quechua and was first presented in a talk at the RE(E)LA conference in September 2021. A journal article on this topic has been accepted for publication in INDIANA in mid-2023.
- Varieties: 50 (linked to 39 different Glottocodes)
- Concepts: 150 (linked to 150 different Concepticon concept sets)
- Lexemes: 7,518
- Sources: 28
- Synonymy: 1.05
- Cognacy: 7,518 cognates in 1,036 cognate sets (478 singletons)
- Cognate Diversity: 0.12
- Invalid lexemes: 0
- Tokens: 33,692
- Segments: 92 (0 BIPA errors, 0 CLTS sound class errors, 92 CLTS modified)
- Inventory size (avg): 32.80
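
The cognate diversity figure can be related to the other counts in this list. The sketch below assumes one common definition, (cognate sets − concepts) / (lexemes − concepts); this README does not state which formula was used, so treat the function `cognate_diversity` and its formula as illustrative. With the counts reported above, this definition does reproduce the listed value.

```python
def cognate_diversity(n_cognate_sets: int, n_lexemes: int, n_concepts: int) -> float:
    """Assumed definition: (cognate sets - concepts) / (lexemes - concepts).

    0.0 means every concept has exactly one cognate set;
    1.0 means every lexeme forms its own cognate set.
    """
    if n_lexemes <= n_concepts:
        raise ValueError("need more lexemes than concepts")
    return (n_cognate_sets - n_concepts) / (n_lexemes - n_concepts)


# Counts taken from the statistics listed in this README.
diversity = cognate_diversity(n_cognate_sets=1036, n_lexemes=7518, n_concepts=150)
print(f"{diversity:.2f}")  # prints 0.12
```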
Name | GitHub user | Description | Role |
---|---|---|---|
Frederic Blum | @FredericBlum | maintainer | Author, Editor |
Carlos B. Ugarte | @MuffinLinwist | maintainer | Author, Editor |
Adriano Ingunza | @BadBatched | maintainer | Author |
Zoe Poirier | @zrpm | maintainer | Author |
Mattis List | @LinguList | maintainer | Editor |
The following CLDF datasets are available in cldf:
- CLDF Wordlist at cldf/cldf-metadata.json
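
As a rough illustration of how the wordlist might be inspected, the following sketch counts forms per variety and tallies sound segments using only the Python standard library. It assumes that the forms table sits next to the metadata file as `cldf/forms.csv` and uses the default CLDF column names `Language_ID` and `Segments`, with segments separated by spaces and `+` marking morpheme boundaries; check `cldf/cldf-metadata.json` for the actual file and column names. The helper `summarize_forms` is hypothetical and not part of any CLDF tooling.

```python
import csv
from collections import Counter


def summarize_forms(forms_csv):
    """Return (forms per language, segment counts) for a CLDF forms table.

    Assumes the columns `Language_ID` and `Segments` exist and that
    `Segments` holds space-separated sound segments.
    """
    forms_per_language = Counter()
    segment_counts = Counter()
    with open(forms_csv, newline="", encoding="utf-8") as handle:
        for row in csv.DictReader(handle):
            forms_per_language[row["Language_ID"]] += 1
            # Skip "+" on the assumption that it marks a morpheme boundary.
            segment_counts.update(
                seg for seg in row["Segments"].split() if seg != "+"
            )
    return forms_per_language, segment_counts


# Example call (path is an assumption, see above):
# per_language, segments = summarize_forms("cldf/forms.csv")
# print(len(per_language), "varieties,", len(segments), "distinct segments")
```

For anything beyond quick inspection, a dedicated CLDF reader that resolves column names from the metadata file is likely the safer choice.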