Simple project in IR
The focus of this project is the Non-factoid Question-Answering task. Within such a setting, a user may post questions in natural language and answers should be informative enough to satisfy the users information need
usage: java -jar yourJarName.jar [[option1] [option2]]
-i,--index Reindex documents
-l,--lambda Create appropriate lambda for language model,
depends on median size of document in corpus
-q,--create-questions Create questions file from corpus json file
-i,--index Reindex documents
Without this option index will not be created
-l,--lambda Create appropriate lambda for language model,
depends on median size of document in corpus
Lambda parameter calculated by this formula:
-q,--create-questions Create questions file from corpus json file
This option needed for lambda parameter calculation, in case that there is no corpus of questions available. This requires regular .json file with corpus collection.
Can be found here https://louiscyphre.github.io/lucene-oxygen/
Additionally, it can be found here (same files) https://github.com/louiscyphre/lucene-oxygen/tree/master/docs