Skip to content

Commit

Permalink
Change img embedding links in README
Browse files Browse the repository at this point in the history
  • Loading branch information
ben-tinc committed Feb 1, 2022
1 parent f4933cb commit 4e02959
Showing 1 changed file with 3 additions and 2 deletions.
5 changes: 3 additions & 2 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -7,7 +7,7 @@ Perform Named Entity Recognition (NER) on french novels from the roman18 corpus

The roman18-corpus of about 100 eighteenth-century French novels in full text is analysed via SpaCy’s named entity recognition. Named entity recognition (NER) is a popular information retrieval technique “to identify and segment named entities and classify or categorize them under various predefined classes” (Sarkar, 2019).

![Named entity recognition](https://raw.githubusercontent.com/MiMoText/NER_novels/main/img/ner_diderot.PNG?token=GHSAT0AAAAAABQ73GKISHNBWIT3UUYUDTJYYPSUEWA)
![Named entity recognition](https://github.com/MiMoText/NER_novels/blob/main/img/ner_diderot.PNG?raw=true)

Within the French language package of SpaCy one can distinguish the following types of named entities: LOC, PER, MISC and ORG entities. The five most common “LOC” (location) entities within each novel and their numerical occurences per text are extracted.

Expand All @@ -16,7 +16,8 @@ Within the French language package of SpaCy one can distinguish the following ty

The script which performs the NER and its technical documentation is under `scripts/`. The latest results of this script are stored under `raw_results/`, although this is configurable. These results still need manual corrections and supplements to be suitable for import into the [MiMoText Wikibase](https://github.com/MiMoText/roman18). The latest version of this is under `edited_for_import/`. In this folder there is also a file named `edits_openrefine.txt`. This has been generated by [OpenRefine](https://docs.openrefine.org/) which is used for semi-automatic reconciliation and includes all the editing steps in this tool.

![OpenRefine](https://raw.githubusercontent.com/MiMoText/NER_novels/main/img/OpenRefine.PNG?token=GHSAT0AAAAAABQ73GKJQBWOLMWEW7PIZ3FMYPSUEJA)
![OpenRefine](https://github.com/MiMoText/NER_novels/blob/main/img/OpenRefine.PNG?raw=true)


## Licence

Expand Down

0 comments on commit 4e02959

Please sign in to comment.