diff --git a/Readme.md b/Readme.md
index 850409e..3527ca1 100644
--- a/Readme.md
+++ b/Readme.md
@@ -1,6 +1,6 @@
 # Language model training data version of the [20th Century in Basic Terms](https://www.ids-mannheim.de/lexik/pb1/woerter-medien-und-gesellschaft/politisch-soziale-grundbegriffe-grosser-reichweite-und-dauer/) project corpus.
 
-## Corpus definition
+## Corpus Definition
 
 The corpus is a virtual sub-corpus of the German Reference Corpus DeReKo (DeReKo-2022-I) (IDS 2022, Kupietz et al. 2010, 2018), containing the following parts:
 
@@ -29,7 +29,7 @@
 T91/MAI 1991.05 Beide Male fällt auch auf , daß niemand festgenommen wurde .
 ```
 
-### Construct KorAP corpus queries based on the data
+### Construct KorAP Corpus Queries Based on a Data Line
 
 For the example above:
 
@@ -38,18 +38,17 @@
 image.png
 
 
-## Software used
+## Software Used
 
-The archive `20CBT.tsv.bz2` was generated using the script `extract-shuffled-sentences.sh` provided here which uses the [korapxm2conllu](https://github.com/KorAP/KorAP-XML-CoNLL-U) tool for generating the one sentence per line format tsv-format with metadata, using the following command:
+The file `20CBT.tsv.bz2` was generated using the script `extract-shuffled-sentences.sh` provided [here](https://korap.ids-mannheim.de/gerrit/plugins/gitiles/IDS-Mannheim/20CBT/+/refs/heads/master/extract-shuffled-sentences.sh), which uses the [korapxml2conllu](https://github.com/KorAP/KorAP-XML-CoNLL-U) tool to generate the one-sentence-per-line TSV format with metadata, via the following command:
 
 ```bash
 korapxml2conllu -m '<textSigle>([^<.]+)' -m '<creatDate>([^<]{4,7})' --word2vec $corpus > $dest
 ```
 
-## License of the data
+## License of the Data
 
-The corpus contains copyrighted and licensed material. Therefore, although the sentences are shuffled in random order, the corpus may only be shared among members of the project [Das 20. Jahrhundert in Grundbegriffen](https://www.zfl-berlin.org/projekt/das-20-jahrhundert-in-grundbegriffen.html), funded by the Leibniz Association 2022-2024, for text and data mining purposes in accordance with the TDM exception of the German Copyright Act (§ 60d UrhG.) and must be deleted upon completion of the project.
-
+The corpus contains copyrighted and licensed material. Therefore, although the sentences are shuffled in random order, the corpus may only be shared among members of the project [Das 20. Jahrhundert in Grundbegriffen](https://www.zfl-berlin.org/projekt/das-20-jahrhundert-in-grundbegriffen.html), funded by the Leibniz Association 2022-2024, for text and data mining purposes in accordance with the TDM exception of the German Copyright Act (§ 60d UrhG) and must be deleted upon completion of the project.
 
 ## References
 
