smithnlp/wichi-parse

Wichí Parser

Instructions for how to update, compile, and use the foma-based morphological parser for Wichí.

1. Make changes to wichi.lexc and/or wichi.foma

2. Recompile the transducer and save it to a binary file

$ foma
foma[0]: source wichi.foma
foma[1]: save stack wichi.bin
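
The interactive session above can also be scripted, which is handy when the transducer is rebuilt often. The following is a minimal sketch, not part of this repository: the helper names `build_foma_command` and `recompile` are hypothetical, and it assumes your foma build accepts the `-e` (execute a command at startup) and `-s` (stop and exit) options; check `foma -h` to confirm.

```python
import subprocess
from pathlib import Path


def build_foma_command(script="wichi.foma", binary="wichi.bin"):
    """Return the argument list that replays the interactive session."""
    return [
        "foma",
        "-e", f"source {script}",      # same as typing `source wichi.foma`
        "-e", f"save stack {binary}",  # same as typing `save stack wichi.bin`
        "-s",                          # exit after the commands above have run
    ]


def recompile(script="wichi.foma", binary="wichi.bin"):
    """Run foma and raise if it fails or the binary file was not written."""
    subprocess.run(build_foma_command(script, binary), check=True)
    if not Path(binary).is_file():
        raise FileNotFoundError(binary)
    return Path(binary)
```

Calling `recompile()` from the repository root should leave a fresh `wichi.bin` next to `wichi.foma`. The `-s` flag is placed last on the assumption that foma handles its options in order.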

3. Process Wichí text files, saving the results to corresponding output files.

For batch processing:

$ for each in manuscripts/*.txt; do python3 morphoprocess.py "$each"; done

For one file:

$ python3 morphoprocess.py manuscripts/inputfile.txt
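
To spot-check the saved transducer on a handful of words without going through morphoprocess.py, one option is foma's `flookup` utility. The sketch below is an illustration only and does not describe how morphoprocess.py works internally. The function names are hypothetical, and it assumes `flookup` prints one tab-separated `word<TAB>analysis` line per analysis, with `+?` for words it cannot analyze.

```python
import subprocess


def parse_flookup_output(text):
    """Map each input word to the list of analyses flookup returned for it."""
    analyses = {}
    for line in text.splitlines():
        if not line.strip():
            continue  # blank lines separate the results for different words
        word, sep, analysis = line.partition("\t")
        if not sep:
            continue  # skip anything that is not a word<TAB>analysis line
        entry = analyses.setdefault(word, [])
        if analysis != "+?":  # `+?` is assumed to mean "no analysis found"
            entry.append(analysis)
    return analyses


def analyze(words, binary="wichi.bin"):
    """Send one word per line to flookup and parse what comes back."""
    result = subprocess.run(
        ["flookup", binary],
        input="\n".join(words) + "\n",
        capture_output=True,
        text=True,
        check=True,
    )
    return parse_flookup_output(result.stdout)
```

Words that come back with an empty list are the ones the current rules do not cover yet, which makes them a natural starting point for the next round of edits to wichi.lexc or wichi.foma.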

TODO

  • create repo with all current work
  • adjust the output of morphoprocess.py (pandas dataframe, pickled)
  • write foma rules tuned to Laureano's data

LINKS
