Add some more information to the Readme Change-Id: Ia47ff941de2748587d5f3df5422e6f61c36ddc31

commit: e4f36137e03974c9dfeb2f496b7e8e558ea468a1 [log] [tgz]
author: Akron <nils@diewald-online.de> Mon Mar 07 11:48:41 2022 +0100
committer: Akron <nils@diewald-online.de> Mon Mar 07 11:48:41 2022 +0100
tree: 76044a889375077669e888fc074b0498c9fbd017
parent: b040897b226b57cc84a628e884807f2cfb437f51 [diff]
diff --git a/Readme.md b/Readme.md
index 97b868e..8bb4e0d 100644
--- a/Readme.md
+++ b/Readme.md

@@ -25,9 +25,66 @@
 $ docker run --rm -it \
   -v ${PWD}/benchmarks:/euralex/benchmarks \
   -v ${PWD}/corpus:/euralex/corpus \
-  korap/euralex22 benchmarks/benchmark.pl
+  korap/euralex22 benchmarks/[BENCHMARK-SCRIPT]
 ```
 
+The supported benchmark scripts are:
+
+## `benchmark.pl`
+
+Performance measurements of the tools. See the tools section for some
+remarks to take into account.
+
+
+## `empirist.pl`
+
+To run the empirist test suite, you need to download the empirist
+gold standard corpus and tooling first and extract it into
+the corpus directory.
+
+```shell
+$ wget https://sites.google.com/site/empirist2015/home/shared-task-data/empirist_gold_cmc.zip
+$ unzip empirist_gold_cmc.zip -d corpus
+
+$ wget https://sites.google.com/site/empirist2015/home/shared-task-data/empirist_gold_web.zip
+$ unzip empirist_gold_web.zip -d corpus
+```
+
+Quality measurements based on EmpiriST 2015.
+
+
+# Tools
+
+## Waste
+- Tokenization
+
+## OpenNLP
+- Tokenization
+
+## TreeTagger
+- Tokenization
+
+## JTok
+- Tokenization
+
+## SynTok
+- Tokenization
+
+## SoMaJo
+- Tokenization
+
+## Stanford CoreNLP
+- Tokenization
+
+All tools are run using [pipelining](https://stanfordnlp.github.io/CoreNLP/pipeline.html),
+which obviously introduces some overhead, that needs to be taken into account.
+
+## KorAP-Tokenizer
+- Tokenization + Sentence Splitting
+
+## Datok
+- Tokenization + Sentence Splitting
+
 
 # Licenses
 
@@ -46,3 +103,5 @@
 ```shell
 docker run --privileged -v
 ```
+
+# Literature
commit	e4f36137e03974c9dfeb2f496b7e8e558ea468a1	[log] [tgz]
author	Akron <nils@diewald-online.de>	Mon Mar 07 11:48:41 2022 +0100
committer	Akron <nils@diewald-online.de>	Mon Mar 07 11:48:41 2022 +0100
tree	76044a889375077669e888fc074b0498c9fbd017
parent	b040897b226b57cc84a628e884807f2cfb437f51 [diff]