Bump version to 2.2.2
Change-Id: I081b0ae75df07a9baa837e1a6cb6046e5e1109d4
diff --git a/CHANGELOG.md b/CHANGELOG.md
index f802a85..4a78e7c 100644
--- a/CHANGELOG.md
+++ b/CHANGELOG.md
@@ -1,5 +1,11 @@
# Changelog
+## 2.2.2
+
+* Bug fix: a single quotation mark at the beginning of a word
+ is no longer interpreted as a beginning of an omission, but as quotation mark token.
+* dependencies updated
+
## 2.2.1
* "du." is no longer treated as an abbreviation.
diff --git a/Readme.md b/Readme.md
index d78badd..4cf429b 100644
--- a/Readme.md
+++ b/Readme.md
@@ -35,7 +35,7 @@
#### Split English text into tokens
```
-$ echo "It's working." | java -jar target/KorAP-Tokenizer-2.2.0.9000-standalone.jar -l en
+$ echo "It's working." | java -jar target/KorAP-Tokenizer-2.2.2-standalone.jar -l en
It
's
working
@@ -44,7 +44,7 @@
#### Split French text into tokens and sentences
```
$ echo "C'est une phrase. Ici, il s'agit d'une deuxième phrase." \
- | java -jar target/KorAP-Tokenizer-2.2.0.9000-standalone.jar -s -l fr
+ | java -jar target/KorAP-Tokenizer-2.2.2-standalone.jar -s -l fr
C'
est
une
@@ -69,7 +69,7 @@
In order to end a text, flush the output and reset the character position, an EOT character (0x04) can be used.
```
$ echo -n -e 'This is a text.\x0a\x04\x0aAnd this is another text.\n\x04\n' |\
- java -jar target/KorAP-Tokenizer-2.2.0.9000-standalone.jar --positions
+ java -jar target/KorAP-Tokenizer-2.2.2-standalone.jar --positions
This
is
a
@@ -87,7 +87,7 @@
#### Print token and sentence offset
```
echo -n -e ' This ist a start of a text. And this is a sentence!!! But what the hack????\x0a\x04\x0aAnd this is another text.' |\
- java -jar target/KorAP-Tokenizer-2.2.0.9000-standalone.jar --no-tokens --positions --sentence-boundaries
+ java -jar target/KorAP-Tokenizer-2.2.2-standalone.jar --no-tokens --positions --sentence-boundaries
1 5 6 9 10 11 12 17 18 20 21 22 23 27 27 28 29 32 33 37 38 40 41 42 43 51 51 54 55 58 59 63 64 67 68 72 72 76
1 28 29 54 55 76
0 3 4 8 9 11 12 19 20 24 24 25
diff --git a/pom.xml b/pom.xml
index 96b19f1..47f4d01 100644
--- a/pom.xml
+++ b/pom.xml
@@ -6,7 +6,7 @@
<groupId>groupId</groupId>
<artifactId>KorAP-Tokenizer</artifactId>
- <version>2.2.1</version>
+ <version>2.2.2</version>
<properties>
<project.build.sourceEncoding>UTF-8</project.build.sourceEncoding>