Marc Kupietz | 985da0c | 2021-02-15 19:29:50 +0100 | [diff] [blame] | 1 | - -s option added that uses sentence boundaries provided by the KorAP tokenizer (-tk) |
Marc Kupietz | ed0505f | 2021-02-16 16:40:12 +0100 | [diff] [blame] | 2 | - tokenizer invocation comments removed from KorAP XML output |
Marc Kupietz | 400044c | 2021-02-16 16:44:21 +0100 | [diff] [blame] | 3 | - indentation of </span> tags fixed |
Marc Kupietz | e955ecc | 2021-02-17 17:42:01 +0100 | [diff] [blame^] | 4 | - character entities used in DeReKo are automatically replaced by their corresponding characters |
| 5 | - resources defined in Makefile |
Akron | f7084c4 | 2021-01-07 10:25:22 +0100 | [diff] [blame] | 6 | 0.03 2021-01-12 |
Marc Kupietz | b505d44 | 2021-01-06 16:40:29 +0100 | [diff] [blame] | 7 | - Update KorAP-Tokenizer to released 2.0 version |
Akron | f7084c4 | 2021-01-07 10:25:22 +0100 | [diff] [blame] | 8 | - Improve test suite for recent version |
| 9 | of Mojolicious. |
| 10 | |
Marc Kupietz | 44b1f25 | 2020-11-26 16:31:40 +0100 | [diff] [blame] | 11 | 0.02 2020-11-27 |
Akron | f7084c4 | 2021-01-07 10:25:22 +0100 | [diff] [blame] | 12 | - Update KorAP-Tokenizer to v2.0.0. |
Akron | eaa9623 | 2020-10-15 17:06:15 +0200 | [diff] [blame] | 13 | - Switch input encoding based on XML |
| 14 | processing instruction. |
Marc Kupietz | 44b1f25 | 2020-11-26 16:31:40 +0100 | [diff] [blame] | 15 | - Fix handling of UTF-8 in sigles. |
Akron | eaa9623 | 2020-10-15 17:06:15 +0200 | [diff] [blame] | 16 | |
Akron | 0c41ab3 | 2020-09-29 07:33:33 +0200 | [diff] [blame] | 17 | 0.01 2020-09-28 |
| 18 | - Initial release to GitHub. |