Marc Kupietz | 985da0c | 2021-02-15 19:29:50 +0100 | - Add -s option, which uses the sentence boundaries provided by the KorAP tokenizer (-tk).
Marc Kupietz | ed0505f | 2021-02-16 16:40:12 +0100 | - Remove tokenizer invocation comments from KorAP XML output.
Akron | f7084c4 | 2021-01-07 10:25:22 +0100 | 0.03 2021-01-12
Marc Kupietz | b505d44 | 2021-01-06 16:40:29 +0100 | - Update KorAP-Tokenizer to the released version 2.0.
Akron | f7084c4 | 2021-01-07 10:25:22 +0100 | - Improve test suite for recent version of Mojolicious.

Marc Kupietz | 44b1f25 | 2020-11-26 16:31:40 +0100 | 0.02 2020-11-27
Akron | f7084c4 | 2021-01-07 10:25:22 +0100 | - Update KorAP-Tokenizer to v2.0.0.
Akron | eaa9623 | 2020-10-15 17:06:15 +0200 | - Switch input encoding based on XML processing instruction.
Marc Kupietz | 44b1f25 | 2020-11-26 16:31:40 +0100 | - Fix handling of UTF-8 in sigles.

Akron | 0c41ab3 | 2020-09-29 07:33:33 +0200 | 0.01 2020-09-28
| | | - Initial release to GitHub.