blob: d3252c4498ad92c09cbed02ebcabee4c112bee42 [file] [log] [blame]
Marc Kupietz985da0c2021-02-15 19:29:50 +01001 - -s option added that uses sentence boundaries provided by the KorAP tokenizer (-tk)
Marc Kupietzed0505f2021-02-16 16:40:12 +01002 - tokenizer invocation comments removed from KorAP XML output
Marc Kupietz400044c2021-02-16 16:44:21 +01003 - indentation of </span> tags fixed
Marc Kupietz8a954e52021-02-16 22:03:07 +01004 - character entities that used in DeReKo are automatically replaced by their corresponding characters
Akronf7084c42021-01-07 10:25:22 +010050.03 2021-01-12
Marc Kupietzb505d442021-01-06 16:40:29 +01006 - Update KorAP-Tokenizer to released 2.0 version
Akronf7084c42021-01-07 10:25:22 +01007 - Improve test suite for recent version
8 of Mojolicious.
9
Marc Kupietz44b1f252020-11-26 16:31:40 +0100100.02 2020-11-27
Akronf7084c42021-01-07 10:25:22 +010011 - Update KorAP-Tokenizer to v2.0.0.
Akroneaa96232020-10-15 17:06:15 +020012 - Switch input encoding based on XML
13 processing instruction.
Marc Kupietz44b1f252020-11-26 16:31:40 +010014 - Fix handling of UTF-8 in sigles.
Akroneaa96232020-10-15 17:06:15 +020015
Akron0c41ab32020-09-29 07:33:33 +0200160.01 2020-09-28
17 - Initial release to GitHub.