For most SIL dialect intelligibility surveys, wordlists are collected on paper (the batteries last forever!) in IPA, and are then stored in text files. If the surveyor wants to get fancy, they might be stored in a spreadsheet. In order to feed them into a particular tool for analysis (in this case, the version of RuG/L04 string edit distance calculator current in 2008), they need to be in an ASCII-only phonetic alphabet (like XSAMPA), and they need to be in the right format. This Perl script takes wordlists in Unicode IPA and converts them to XSAMPA, in a format that can be used with RuG/L04.
-
Notifications
You must be signed in to change notification settings - Fork 0
For most SIL dialect intelligibility surveys, wordlists are collected on paper (the batteries last forever!), using the International Phonetic Alphabet. For analysis, these are then often entered in text files (in Unicode, to preserve the IPA symbols). If the surveyor wants to get fancy, the wordlists might be stored in a spreadsheet. In order t…
License
euangeleo/wordlist-preprocess
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
About
For most SIL dialect intelligibility surveys, wordlists are collected on paper (the batteries last forever!), using the International Phonetic Alphabet. For analysis, these are then often entered in text files (in Unicode, to preserve the IPA symbols). If the surveyor wants to get fancy, the wordlists might be stored in a spreadsheet. In order t…
Resources
License
Stars
Watchers
Forks
Releases
No releases published
Packages 0
No packages published