lingtypology
: Download typological databases’ datalingtypology
provides an ability to download data from these typological databases
All database function names have identical structure: database_name.feature. All functions have as first argument feature
. All functions create dataframe with column language
that can be used in map.feature()
function. It should be noted that all functions cut out the data that can’t be maped, so if you want to prevent functions from this behaviour set argument na.rm
to FALSE
.
The names of the WALS features can be typed in a lower case. This function preserves coordinates from WALS, so you can map coordinates from the WALS or use coordinates from lingtypology
.
<- wals.feature(c("1a", "20a")) df
Don't forget to cite a source (modify in case of using individual chapters):
Dryer, Matthew S. & Haspelmath, Martin (eds.) 2013. The World Atlas of Language Structures Online. Leipzig: Max Planck Institute for Evolutionary Anthropology.
(Available online at https://wals.info/, Accessed on 2022-02-12.)
@book{wals,
address = {Leipzig},
editor = {Matthew S. Dryer and Martin Haspelmath},
publisher = {Max Planck Institute for Evolutionary Anthropology},
title = {WALS Online},
url = {https://wals.info/},
year = {2013}
}
head(df)
map.feature(df$language,
features = df$`1a`,
latitude = df$latitude,
longitude = df$longitude,
label = df$language,
title = "Consonant Inventories")
The AUTOTYP features are listed on the GitHub page. You can use more human way with spaces. You can also use a module names in order to download all variables in a module (e. g. Gender
):
<- autotyp.feature(c('Has Gender', 'Has Numeral Classifiers')) df
Don't forget to cite a source:
Bickel, Balthasar, Nichols, Johanna, Zakharko, Taras, Witzlack-Makarevich, Alena, Hildebrandt, Kristine, Rießler, Michael, Bierkandt, Lennart, Zúñiga, Fernando & Lowe, John B. 2022. The AUTOTYP database (v1.0.0). https://doi.org/10.5281/zenodo.5931509
@misc{AUTOTYP,
author = {
Bickel, Balthasar and
Nichols, Johanna and
Zakharko, Taras and
Witzlack-Makarevich, Alena and
Hildebrandt, Kristine and
Rießler, Michael and
Bierkandt, Lennart and
Zúñiga, Fernando and
Lowe, John B
},
doi = {10.5281/zenodo.5931509},
title = {The AUTOTYP database (v1.0.0)},
url = {https://doi.org/10.5281/zenodo.5931509},
year = {2022}
}
head(df)
map.feature(df$language,
features = df$HasNumeralClassifiers,
label = df$language_for_lingtypology,
title = "Presence of Numeral Classifiers")
Warning: There is no coordinates for languages Nubian, Kanuri-Kanembu, Mixe,
Uradhic, Berber, West Futuna, Mari
I used only four features from PHOIBLE: the number of phonemes, the number of consonants, the number of tones and the number of vowels. If you need only a set of them, just specify it in the features
argument. Since there is a lot of doubling information in the PHOIBLE database, there is an argument source
.
<- phoible.feature() df
Don't forget to cite a source:
Moran, Steven & McCloy, Daniel & Wright, Richard (eds.) 2014. PHOIBLE Online. Leipzig: Max Planck Institute for Evolutionary Anthropology. (Available online at https://phoible.org/, Accessed on ...)
A BibTeX entry for LaTeX users is
@book{phoible,
address = {Leipzig},
editor = {Steven Moran and Daniel McCloy and Richard Wright},
publisher = {Max Planck Institute for Evolutionary Anthropology},
title = {PHOIBLE Online},
url = {https://phoible.org/},
year = {2014}
}
head(df)
The AfBo database has a lot of features that distinguish affix functions, but again you can use a bare function without any arguments to download the whole database. There will be no difference in time, since this function downloads the whole database to your PC. The main destinction is that this database provides recipient and donor languages, so other column names should be used.
<- afbo.feature(c("adjectivizer", "adverbializer")) df
Don't forget to cite a source:
Seifart, Frank. 2013. AfBo: A world-wide survey of affix borrowing. Leipzig: Max Planck Institute for Evolutionary Anthropology. (Available online at https://afbo.info/, Accessed on ...)
A BibTeX entry for LaTeX users is
@book{afbo,
address = {Leipzig},
editor = {Frank Seifart},
publisher = {Max Planck Institute for Evolutionary Anthropology},
title = {AfBo: A world-wide survey of affix borrowing},
url = {https://afbo.info/},
year = {2013}
}
head(df)
map.feature(df$Recipient.name,
features = df$adjectivizer,
label = df$Recipient.name,
title = "Borrowed adjectivizer affixes")
The SAILS database provide a lot of features, so the function work with their ids:
<- sails.feature(features = "ics10") df
Don't forget to cite a source (modify in case of using individual chapters):
Muysken, Pieter, Harald HammarstrÖm, Olga Krasnoukhova, Neele MÜller, Joshua Birchall, Simon van de Kerke, Loretta O'Connor, Swintha Danielsen, Rik van Gijn & George Saad. 2016. South American Indigenous Language Structures (SAILS) Online. Leipzig: Online Publication of the Max Planck Institute for Evolutionary Anthropology. (Available at https://sails.clld.org/)
head(df)
map.feature(df$language,
features = df$ics10_description,
longitude = df$longitude,
latitude = df$latitude,
label = df$language,
title = "Are there numeral classifiers?")
The ABVD database is a lexical database, so it is different from clld databases. First of all, ABVD has its own language classification ids. The information about the same language from different sources can be received from these database different ids. So I select several languages and map them coloring by word with the meaning ‘hand’.
<- abvd.feature(c(292, 7)) df
Don't forget to cite a source:
Greenhill, S.J., Blust. R, & Gray, R.D. (2008). The Austronesian Basic Vocabulary Database: From Bioinformatics to Lexomics. Evolutionary Bioinformatics, 4:271-283.
head(df)
<- df[df$word == "hand",]
new_df map.feature(new_df$language,
features = new_df$item,
label = new_df$language)
uralex.feature
downloads data from UraLex basic vocabulary dataset. Original language names are stored in the language
variable. Converted language names for map.feature
are stored in the language2
variable.
<- uralex.feature() df
Don't forget to cite a source:
Kaj Syrjänen, Jyri Lehtinen, Outi Vesakoski, Mervi de Heer, Toni Suutari, Michael Dunn, Urho Määttä & Unni-Päivä Leino (2018). UraLex basic vocabulary dataset.
<- df[df$uralex_mng == "crush",]
df
map.feature(df$language2,
label = df$item,
title = "crush")
Warning: Language Inari Sami is absent in our version of the Glottolog database.
Did you mean Inari Saami?
Warning: Language Kildin Sami is absent in our version of the Glottolog
database. Did you mean Kildin Saami?
Warning: Language Northern Sami is absent in our version of the Glottolog
database. Did you mean Northern Sani?
Warning: Language Northern Sami is absent in our version of the Glottolog
database. Did you mean Northern Sani?
Warning: Language Pite Sami is absent in our version of the Glottolog database.
Did you mean Pite Saami?
Warning: Language Skolt Sami is absent in our version of the Glottolog database.
Did you mean Skolt Saami?
Warning: Language Southern Sami is absent in our version of the Glottolog
database. Did you mean Southern Sama, Southern Samo, Southern Sani?
Warning: Language Southern Sami is absent in our version of the Glottolog
database. Did you mean Southern Sama, Southern Samo, Southern Sani?
Warning: Language Ume Sami is absent in our version of the Glottolog database.
Did you mean Ume Saami?
Warning: Language Voro (Uralic) is absent in our version of the Glottolog
database. Did you mean Karo (Brazil), Kamas (Uralic)?
Warning: Language Voro (Uralic) is absent in our version of the Glottolog
database. Did you mean Karo (Brazil), Kamas (Uralic)?
Warning: Language Inari Sami is absent in our version of the Glottolog database.
Did you mean Inari Saami?
Warning: Language Kildin Sami is absent in our version of the Glottolog
database. Did you mean Kildin Saami?
Warning: Language Northern Sami is absent in our version of the Glottolog
database. Did you mean Northern Sani?
Warning: Language Northern Sami is absent in our version of the Glottolog
database. Did you mean Northern Sani?
Warning: Language Pite Sami is absent in our version of the Glottolog database.
Did you mean Pite Saami?
Warning: Language Skolt Sami is absent in our version of the Glottolog database.
Did you mean Skolt Saami?
Warning: Language Southern Sami is absent in our version of the Glottolog
database. Did you mean Southern Sama, Southern Samo, Southern Sani?
Warning: Language Southern Sami is absent in our version of the Glottolog
database. Did you mean Southern Sama, Southern Samo, Southern Sani?
Warning: Language Ume Sami is absent in our version of the Glottolog database.
Did you mean Ume Saami?
Warning: Language Voro (Uralic) is absent in our version of the Glottolog
database. Did you mean Karo (Brazil), Kamas (Uralic)?
Warning: Language Voro (Uralic) is absent in our version of the Glottolog
database. Did you mean Karo (Brazil), Kamas (Uralic)?
Warning: Language Inari Sami is absent in our version of the Glottolog database.
Did you mean Inari Saami?
Warning: Language Kildin Sami is absent in our version of the Glottolog
database. Did you mean Kildin Saami?
Warning: Language Northern Sami is absent in our version of the Glottolog
database. Did you mean Northern Sani?
Warning: Language Northern Sami is absent in our version of the Glottolog
database. Did you mean Northern Sani?
Warning: Language Pite Sami is absent in our version of the Glottolog database.
Did you mean Pite Saami?
Warning: Language Skolt Sami is absent in our version of the Glottolog database.
Did you mean Skolt Saami?
Warning: Language Southern Sami is absent in our version of the Glottolog
database. Did you mean Southern Sama, Southern Samo, Southern Sani?
Warning: Language Southern Sami is absent in our version of the Glottolog
database. Did you mean Southern Sama, Southern Samo, Southern Sani?
Warning: Language Ume Sami is absent in our version of the Glottolog database.
Did you mean Ume Saami?
Warning: Language Voro (Uralic) is absent in our version of the Glottolog
database. Did you mean Karo (Brazil), Kamas (Uralic)?
Warning: Language Voro (Uralic) is absent in our version of the Glottolog
database. Did you mean Karo (Brazil), Kamas (Uralic)?
Warning: There is no coordinates for languages Inari Sami, Kildin Sami, Northern
Sami, Pite Sami, Skolt Sami, Southern Sami, Ume Sami, Voro (Uralic)