blob: cc16d48b2318dc654045789b0a46447145e28af0 [file] [log] [blame]
2.3.3 2022-03-28
- Load KorAP-Tokenizer only on request.
2.3.2 2022-03-23
- Do not reference metadata.xml
- Remove schema references from header files.
- Improve test suite for unability to use
KorAP-Tokenizer.
2.3.1 2022-01-14 Release
- Improve script handling of broken data
- Improve handling of unknown header types
- Check for valid sigles to avoid broken directories
- Introduce exclusivity for inline tokens handling.
- Use single dash for STDIN.
- Update KorAP-Tokenizer to v2.2.2 (single quote, "du." bug fixes)
2.2.0 2021-08-26 Release
- Remove unnecessary branch in recursive call
- Support inline-structures parameter
- Introduce --base-foundry, --data-file, and --header-file parameters
- Introduce --tokens-file parameter
- Introduce --skip-inline-tokens parameter
- Minor cleanups and improvements
- Introduce --skip-inline-tags parameter
- Introduce KorAP::XML::TEI::Inline class
- Introduce --skip-inline-token-annotations parameter
- Deprecate KORAPXMLTEI_INLINE environment variable
in favor of --skip-inline-token-annotations
1.0.0 2021-02-18 Release
- -s option added that uses sentence boundaries
provided by the KorAP tokenizer (-tk)
- Tokenizer invocation comments removed from KorAP XML output
- Indentation of </span> tags fixed
- Character entities used in DeReKo are automatically
replaced by their corresponding characters
- Resources defined in Makefile
- Fixed possible IO deadlock with KorAP tokenizer
- Simplified debugging by combining with X::C::T line numbers
- Support inline-tokens parameter
- Move verbose code documentation to trailing
script section
0.03 2021-01-12
- Update KorAP-Tokenizer to released 2.0 version
- Improve test suite for recent version
of Mojolicious.
0.02 2020-11-27
- Update KorAP-Tokenizer to v2.0.0.
- Switch input encoding based on XML
processing instruction.
- Fix handling of UTF-8 in sigles.
0.01 2020-09-28
- Initial release to GitHub.