commit | 9dc1000ef896e2613583a327aee178bb7ca198d6 | |
---|---|---|
author | bansp <bansp@o2.pl> | Tue May 17 22:33:34 2022 +0200 |
committer | bansp <bansp@o2.pl> | Tue May 17 22:33:34 2022 +0200 |
tree | 63485725fd40c45877b3929fe8642e25d6c40d98 | |
parent | d1bf1db691dc95bad6580596298430b68c1195ad | |
Begin the switch from text.xml to ann_segmentation.xml; for now, data.xml is properly created (whitespace and tokenization alternatives). A lot of code cleanup has not yet happened.

Change-Id: Ib8ea509971adff46946fc803e053f6389ec49f2d
Tools for converting NKJP-XML format to KorAP-XML
The test suite is based on xspec. To install xspec, please follow the Installation Guide. Ensure that either xspec.bat or xspec.sh is available on the command line.
To run the test suite, execute:
$ xspec.sh test/nkjp2korap.xspec
The generated report is then available in test/xspec/nkjp2korap-result.html.
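The two steps above (check that the launcher is on the command line, then run the suite) can be wrapped in a small shell helper. This is only a sketch, not part of the repository: the function name run_tests is hypothetical, and it assumes a POSIX shell with xspec.sh (rather than xspec.bat) as the launcher. The test file and report paths are the ones named above.

```shell
# Hypothetical helper (not shipped with this package): run the XSpec suite
# only if the xspec.sh launcher can be found on PATH.
run_tests() {
  if ! command -v xspec.sh >/dev/null 2>&1; then
    echo "xspec.sh not found on PATH; install xspec first" >&2
    return 1
  fi
  # Run the suite; point at the report only if the run succeeded.
  xspec.sh test/nkjp2korap.xspec &&
    echo "Report: test/xspec/nkjp2korap-result.html"
}
```

Calling run_tests from the repository root fails early with a readable message when xspec is missing, instead of a bare "command not found".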
Copyright (c) 2021, Leibniz Institute for the German Language, Mannheim, Germany
This package is developed as part of the KorAP Corpus Analysis Platform at the Leibniz Institute for the German Language (IDS).
It is published under the BSD-2 License.
Contributions are very welcome!
Your contributions should ideally be committed via our Gerrit server to facilitate reviewing (see Gerrit Code Review - A Quick Introduction if you are not familiar with Gerrit). However, we are also happy to accept comments and pull requests via GitHub.
Please note that, unless you explicitly state otherwise, any contribution intentionally submitted for inclusion into this software shall – like this software itself – be under the BSD-2 License.