Introduction

This repository provides code that converts the unannotated zipped XML datasets for SemEval 2016 and 2017 English Task 3, Subtask A into a single dataset in JSON.

Usage

Use the program as follows:

$ pip install -r requirements.txt
$ python __main__.py
 79% [........................................................               ] 17186816 / 21555267

The resulting dataset will reside in the result.json file.
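
Once the conversion has finished, a quick check of the output can confirm that the file is valid JSON. The sketch below is a minimal example and not part of this repository: the helper name `describe_dataset` is hypothetical, and because the schema of result.json is not described here, it only reports the top-level type and the number of top-level items.

```python
import json
from pathlib import Path


def describe_dataset(path="result.json"):
    """Load a JSON file and return (top-level type name, number of top-level items).

    Hypothetical helper: it assumes nothing about the schema of the file.
    """
    with Path(path).open(encoding="utf-8") as f:
        data = json.load(f)
    # Containers report their length; any scalar counts as a single item.
    size = len(data) if isinstance(data, (list, dict)) else 1
    return type(data).__name__, size


if __name__ == "__main__":
    # Only inspect the file if the conversion has already produced it.
    if Path("result.json").exists():
        kind, size = describe_dataset()
        print(f"result.json holds a {kind} with {size} top-level items")
```

Running the script in the directory that contains result.json prints a one-line summary; if the file is missing, it prints nothing.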

References

Cite the following papers in your publications whenever you use this resource:

@InProceedings{nakov-EtAl:2016:SemEval,
  author    = {Nakov, Preslav  and  M\`{a}rquez, Llu\'{i}s  and  Magdy, Walid  and  Moschitti, Alessandro  and  Glass, Jim  and  Randeree, Bilal},
  title     = {{SemEval}-2016 Task 3: Community Question Answering},
  booktitle = {Proceedings of the 10th International Workshop on Semantic Evaluation},
  series    = {SemEval '16},
  month     = {June},
  year      = {2016},
  address   = {San Diego, California},
  publisher = {Association for Computational Linguistics},
}

@InProceedings{SemEval-2017:task3,
  author    = {Nakov, Preslav  and  Hoogeveen, Doris  and  M\`{a}rquez, Llu\'{i}s  and  Moschitti, Alessandro  and  Mubarak, Hamdy  and  Baldwin, Timothy  and  Verspoor, Karin},
  title     = {{SemEval}-2017 Task 3: Community Question Answering},
  booktitle = {Proceedings of the 11th International Workshop on Semantic Evaluation},
  series    = {SemEval '17},
  month     = {August},
  year      = {2017},
  address   = {Vancouver, Canada},
  publisher = {Association for Computational Linguistics},
}
