| commit | 8759751218f33f35c7f588c51b38cd5ddfbb102e | |
|---|---|---|
| author | Marc Kupietz <kupietz@ids-mannheim.de> | Mon Apr 15 18:31:34 2024 +0200 |
| committer | Marc Kupietz <kupietz@ids-mannheim.de> | Mon Apr 15 18:31:34 2024 +0200 |
| tree | 656a137e821943bb4959d7c4eaa3d6f66a8912cc | |
| parent | 958df03b91fc1a7411583de604ba485fbaf4618b | |
Improve korapxml2krill performance
make -j $(nproc) target/dnb18.i5.xml YY=18
Prerequisite: KorAP-XML-CoNLL-U
make -j $(nproc) target/dnb23.zip YY=23
Install the prerequisite korap/conllu2treetagger and korap/conllu2spacy Docker images if they are not already present:
docker image inspect korap/conllu2treetagger:latest || curl -Ls 'https://gitlab.ids-mannheim.de/KorAP/CoNLL-U-Treetagger/-/jobs/artifacts/master/raw/conllu2treetagger.xz?job=build-docker-image' | docker load
docker image inspect korap/conllu2spacy:latest || curl -Ls https://corpora.ids-mannheim.de/tools/conllu2spacy.tar.xz | docker load
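The check-then-load pattern above can be wrapped in a small helper so that the inspect output stays quiet and each image is handled the same way. This is a minimal sketch, not part of the repository: the function name ensure_image and its two parameters are assumptions made for illustration.

```shell
# Sketch: load a Docker image from a URL only if it is not already present.
# ensure_image is a hypothetical helper name, not something this repository provides.
ensure_image() {
  image="$1"   # image reference, e.g. korap/conllu2spacy:latest
  url="$2"     # URL of a (possibly compressed) `docker save` archive
  if docker image inspect "$image" >/dev/null 2>&1; then
    # Image already exists locally, so nothing is downloaded.
    echo "present: $image"
  else
    echo "loading: $image"
    # Stream the archive straight into `docker load`.
    curl -Ls "$url" | docker load
  fi
}
```

Called as ensure_image korap/conllu2spacy:latest https://corpora.ids-mannheim.de/tools/conllu2spacy.tar.xz, it should behave like the second command above, with the difference that the JSON printed by docker image inspect is suppressed.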
Make annotations:
make -j $(nproc) target/dnb20.marmot-malt.zip target/dnb20.spacy.zip target/dnb20.tree_tagger.zip YY=20
Build all KorAP targets, up to the deployable index:
make -j $(nproc) all YY=23
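Since the year is selected through the YY variable, several years could be built one after another with a simple loop. The following is only a sketch under that assumption: build_years is a hypothetical helper name, and whether consecutive builds for different years can share one working tree is not confirmed by the source.

```shell
# Sketch: run `make all` once per two-digit year given as an argument.
# build_years is a hypothetical helper, not a target or script from this repository.
build_years() {
  for yy in "$@"; do
    # Stop at the first failing year so errors are not buried by later builds.
    make -j "$(nproc)" all YY="$yy" || return 1
  done
}
```

For example, build_years 22 23 would run the command above for YY=22 and then for YY=23.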
2024-04-10
make YY=22 to select 2022
2024-03-24
2024-03-18
make deploy to install new index and restart local KorAP@DNB instance (also available as ci target)
show-server-logs and show-server-status make targets to monitor the local KorAP@DNB instance
2024-03-17
make all to build all targets, including the index
2024-03-16
2024-03-15: DNB test data added
2024-03-08: example EPUB and I5 files added in the folder test/resources/, taken from the DeReKo KJL corpus (Christiane F.; Kai Hermann; Horst Rieck: Wir Kinder vom Bahnhof Zoo); do not distribute (copyrighted data)