-
Notifications
You must be signed in to change notification settings - Fork 19
Meeting record for 15th Dec 2021
Anubhab Chakraborty edited this page Dec 15, 2021
·
1 revision
- Dictionary
- Corpus
- writable:
- JATS metadata (front) - searchable
- search results
- extracted objects (images)
- writable:
- unsupervised
- phrase extraction (YAKE),
- PKE - multiple unsupervised tools
- supervised
- dictionary-based
- string matching - ?library routines
- fuzzy
- lexemes / lemmatise
- capitalization
- stemming
- workflow (commandline)
- entity in context / w3c annotation The key target is to get unsupervised and dictionary-based searching for our new interns.
- pypi / installation
- sectioning and sentences
- searching
probably need to add a stateful `AmiImage to manage conversions TextBox now displays boxes on text . We'll need user testing AmiArrow has been started.
- pygetpapers
- ami-search and corpus manipulation (section, glob, delete, filter) (docanalysis + pyami) -> "results" maybe Excel, Pandas, CSV, etc.
- downstream tools (python, display, analysis and ML) -> Pandas