Skip to content

Commit

Permalink
Added changelog
Browse files Browse the repository at this point in the history
  • Loading branch information
Christopher Bryant committed Aug 12, 2018
1 parent e9425f8 commit 1ba32f9
Show file tree
Hide file tree
Showing 2 changed files with 22 additions and 4 deletions.
22 changes: 22 additions & 0 deletions changelog.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,22 @@
# Changelog

This document contains descriptions of all the significant changes made to ERRANT since its release.

## 10-08-18

Added support for multiple annotators in `parallel_to_m2.py`.
Before: `python3 parallel_to_m2.py -orig <orig_file> -cor <cor_file> -out <out_file>`
After: `python3 parallel_to_m2.py -orig <orig_file> -cor <cor_file1> [<cor_file2> ...] -out <out_file>`
This is helpful if you have multiple annotations for the same orig file.

## 17-12-17

In November, spaCy changed significantly when it became version 2.0.0. Although we have not tested ERRANT with this new version, the main change seemed to be a slight increase in performance (pos tagging and parsing etc.) at a significant cost to speed. Consequently, we still recommend spaCy 1.9.0 for use with ERRANT.

## 22-11-17

ERRANT would sometimes run into memory problems if sentences were long and very different. We hence changed the default alignment from breadth-first to depth-first. This bypassed the memory problems, made ERRANT faster and barely affected results.

## 10-05-17

ERRANT v1.0 released.
4 changes: 0 additions & 4 deletions readme.md
Original file line number Diff line number Diff line change
Expand Up @@ -34,8 +34,6 @@ Currently, we only support Python 3. It is safest to install everything in a cle

spaCy is a natural language processing (NLP) toolkit available here: https://spacy.io/.

UPDATE 17/12/17: In early November, spaCy underwent significant changes when it became version 2.0.0. Although we have not tested ERRANT with this new version of spaCy, the main difference seems to be a slight increase in performance at a significant cost to speed. As such, we currently recommend the slightly older spaCy v1.9.0 for use with ERRANT.

It can be installed for Python 3 as follows:
```
pip3 install -U spacy==1.9.0
Expand Down Expand Up @@ -89,8 +87,6 @@ All these scripts also have additional advanced command line options which can b

In terms of speed, automatic edit extraction is the bottleneck. As a guideline, it takes roughly 10 seconds (including loading times) to extract and classify the edits in 100 sentences on an Intel Core i5-6600 @ 3.30GHz machine. In contrast, it takes just 0.2 seconds to classify the edits in the same 100 sentences if the edit boundaries are already known. Bear in mind that these figures are only a rough estimate and runtime actually depends on how different the original and corrected sentences are and how many edits they contain.

UPDATE 22/11/17: When sentences were long and very different, ERRANT would sometimes run into memory problems. We fixed this by changing the default alignment behaviour from breadth-first to depth-first. Experiments showed this barely affects the results and we even saw improvements. It should also make ERRANT faster.

# Edit Extraction

For more information about the edit extraction phase of annotation, we refer the reader to the following paper:
Expand Down

0 comments on commit 1ba32f9

Please sign in to comment.