GitHub - ymcui/cmrc2019: A Sentence Cloze Dataset for Chinese Machine Reading Comprehension (CMRC 2019)

This repository contains the data for The Third Evaluation Workshop on Chinese Machine Reading Comprehension (CMRC 2019). We will present our paper at COLING 2020,

Title: A Sentence Cloze Dataset for Chinese Machine Reading Comprehension
Authors: Yiming Cui, Ting Liu, Ziqing Yang, Zhipeng Chen, Wentao Ma, Wanxiang Che, Shijin Wang, Guoping Hu
Link: https://arxiv.org/abs/2004.03116
Venue: COLING 2020

Open Challenge Leaderboard (New!)

Keep track of the latest state-of-the-art systems on CMRC 2019 dataset. https://ymcui.github.io/cmrc2019/

Submission Guidelines

If you would like to test your model on the hidden test and challenge set, please follow the instructions on how to submit your model via CodaLab worksheet. https://worksheets.codalab.org/worksheets/0xe856b40d21de45bf898cd1d3c5135afe

Directory Guide

baseline: a Chinese BERT-based simple baseline system
eval: contains official evaluation script
data: contains offical evaluation data
sample_submission: sample submission for codalab competition platform (trial_rand_submission.zip is a randomly generated prediction file, trial_submission.zip is the BERT baseline prediction file)

Baseline System

We provide a BERT-based baseline system for participants (check baseline directory for more info).

Results on other sets will be annouced later.

QAC: Question-Level Accuracy

PAC: Passage-Level Accuracy

Data	Passage #	Query #	QAC	PAC	Fake Candidates	Availability
Trial Data	139	1,504	71.941%	28.776%	No	Public
Train Data	9,638	100,009	N/A	N/A	No	Public
Development Data	300	3,053	70.586%	13.333%	Yes	Public
Qualifying Data	500	5,081	70.01%	8.20%	Yes	Semi-Hidden
Test Data	-	-	-	-	Yes	Hidden

International Standard Language Resource Number (ISLRN)

ISLRN: 813-010-842-493-2

http://www.islrn.org/resources/resources_info/8624/

Reference

If you wish to use our data in your research, please cite our paper:

@inproceeding={cui-etal-2020-cmrc2019,
  title={A Sentence Cloze Dataset for Chinese Machine Reading Comprehension},
  author={Cui, Yiming and Liu, Ting and Yang, Ziqing and Chen, Zhipeng and Ma, Wentao and Che, Wanxiang and Wang, Shijin and Hu, Guoping},
  booktitle = 	"Proceedings of the 28th International Conference on Computational Linguistics (COLING 2020)",
  year={2020}
}

Organization Committee

Host: Chinese Information Processing Society of China (CIPS)
Organizer: Joint Laboratory of HIT and iFLYTEK Research (HFL)
Sponsor: iFLYTEK Co., Ltd. and iFLYTEK Research (Hebei)

Evaluation Co-Chairs

Ting Liu, Harbin Institute of Technology
Yiming Cui, Joint Laboratory of HIT and iFLYTEK Research

Official HFL WeChat Account

Follow Joint Laboratory of HIT and iFLYTEK Research (HFL) on WeChat.

Contact us

Any problems? Feel free to concat us.
Email: cmrc2019 [aT] 126 [DoT] com
Forum: CodaLab Competition Forum
CMRC 2019 Official Website (中文)：https://cmrc2019.hfl-rc.com/
CMRC 2019 Official Website (English)：https://cmrc2019.hfl-rc.com/english/

Name		Name	Last commit message	Last commit date
Latest commit History 43 Commits
baseline		baseline
data		data
eval		eval
sample_submission		sample_submission
.gitignore		.gitignore
LICENSE.txt		LICENSE.txt
README.md		README.md
banner.png		banner.png
qqgroup.png		qqgroup.png
qrcode.jpg		qrcode.jpg

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Open Challenge Leaderboard (New!)

Submission Guidelines

Directory Guide

Baseline System

International Standard Language Resource Number (ISLRN)

Reference

Organization Committee

Evaluation Co-Chairs

Official HFL WeChat Account

Contact us

About

Releases

Packages

Languages

License

ymcui/cmrc2019

Folders and files

Latest commit

History

Repository files navigation

Open Challenge Leaderboard (New!)

Submission Guidelines

Directory Guide

Baseline System

International Standard Language Resource Number (ISLRN)

Reference

Organization Committee

Evaluation Co-Chairs

Official HFL WeChat Account

Contact us

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages