Memoire

This is the git corresponding to Jeremie Bogaert's master thesis. Some of the appendix, judged as useful but too big to be included in the submission are included here. You can find:

The LDA html file that helps to find the different topics
The database used during the human evaluation in the OriginalText and Generated/ files
The url used to collect the news in urlCategory3.txt and urlCategory4.txt
The database used during the automated evaluation. The original text are in the databaseNorig.txt, while the generated ones are in the databaseNgen.jsonl. N is the category number, between 1 and 4.
The code used to crawl the data via commoncrawl

If the participants to the human experiment allow me to share their answers here, it will also be done in the future.

Name		Name	Last commit message	Last commit date
Latest commit History 16 Commits
Generated1		Generated1
Generated2		Generated2
Generated3		Generated3
Generated4		Generated4
OriginalText		OriginalText
Bogaert_52871500_2021_appendix1.html		Bogaert_52871500_2021_appendix1.html
README.md		README.md
crawler.py		crawler.py
database1gen.jsonl		database1gen.jsonl
database1ori.txt		database1ori.txt
database2gen.jsonl		database2gen.jsonl
database2ori.txt		database2ori.txt
database3gen.jsonl		database3gen.jsonl
database3ori.txt		database3ori.txt
database4gen.jsonl		database4gen.jsonl
database4ori.txt		database4ori.txt
urlCategory3.txt		urlCategory3.txt
urlCategory4.txt		urlCategory4.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Memoire

About

Releases

Packages

Languages

jebogaert/Memoire

Folders and files

Latest commit

History

Repository files navigation

Memoire

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages