blob: d3252c4498ad92c09cbed02ebcabee4c112bee42 [file] [log] [blame]
- -s option added that uses sentence boundaries provided by the KorAP tokenizer (-tk)
- tokenizer invocation comments removed from KorAP XML output
- indentation of </span> tags fixed
- character entities that used in DeReKo are automatically replaced by their corresponding characters
0.03 2021-01-12
- Update KorAP-Tokenizer to released 2.0 version
- Improve test suite for recent version
of Mojolicious.
0.02 2020-11-27
- Update KorAP-Tokenizer to v2.0.0.
- Switch input encoding based on XML
processing instruction.
- Fix handling of UTF-8 in sigles.
0.01 2020-09-28
- Initial release to GitHub.