- eed4cb1 Fix possible IO deadlocks with KorAP tokenizer by Marc Kupietz · 3 years, 10 months ago
- 8a954e5 Automatically replace entities with their corresponding characters by Marc Kupietz · 3 years, 10 months ago
- 44b1f25 Fix handling of utf-characters in sigles by Marc Kupietz · 4 years ago
- eaa9623 Switch input encoding based on XML processing instruction by Akron · 4 years, 2 months ago
- 54e363c Improve i5 template testing by Akron · 4 years, 4 months ago
- 1c5ce15 change utf8_encode and utf8_decode by Peter Harders · 4 years, 4 months ago
- 854a115 bugfixing Conservative.pm by Peter Harders · 4 years, 4 months ago
- 71f072b Bugfix: intern tokenization by Peter Harders · 4 years, 5 months ago
- eac374d Separate dummy tokenization from main script with minimal changes by Akron · 4 years, 5 months ago
- e913908 added tagged version of test-file goe_sample by Peter Harders · 4 years, 5 months ago
- 3281234 added sample file for testing by Peter Harders · 4 years, 10 months ago