Skip to content

For most SIL dialect intelligibility surveys, wordlists are collected on paper (the batteries last forever!), using the International Phonetic Alphabet. For analysis, these are then often entered in text files (in Unicode, to preserve the IPA symbols). If the surveyor wants to get fancy, the wordlists might be stored in a spreadsheet. In order t…

License

Notifications You must be signed in to change notification settings

euangeleo/wordlist-preprocess

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

wordlist-preprocess

For most SIL dialect intelligibility surveys, wordlists are collected on paper (the batteries last forever!) in IPA, and are then stored in text files. If the surveyor wants to get fancy, they might be stored in a spreadsheet. In order to feed them into a particular tool for analysis (in this case, the version of RuG/L04 string edit distance calculator current in 2008), they need to be in an ASCII-only phonetic alphabet (like XSAMPA), and they need to be in the right format. This Perl script takes wordlists in Unicode IPA and converts them to XSAMPA, in a format that can be used with RuG/L04.

About

For most SIL dialect intelligibility surveys, wordlists are collected on paper (the batteries last forever!), using the International Phonetic Alphabet. For analysis, these are then often entered in text files (in Unicode, to preserve the IPA symbols). If the surveyor wants to get fancy, the wordlists might be stored in a spreadsheet. In order t…

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published