1. 78f6714 Split tokenizer rules into language-specific and language-dependent by Akron · 2 years, 8 months ago
  2. 61948ef Restructure XFST sources by Akron · 2 years, 8 months ago
  3. 7aa1cbe Improve sentence endings further by Akron · 2 years, 8 months ago
  4. b98e4cf Improve Emoticons by Akron · 2 years, 8 months ago v0.1.5
  5. f94b9ce check parantheses at the end of sentences by Akron · 2 years, 8 months ago
  6. b428755 Support punctuation after quotes by Akron · 2 years, 8 months ago v0.1.4
  7. 4222ac8 Improve handling of ellipsis by Akron · 2 years, 9 months ago
  8. ece3f01 Support quote combinations at the end of sentences by Akron · 2 years, 9 months ago
  9. e200841 Further improve speech rule for eos with more quotation marks by Akron · 2 years, 9 months ago
  10. e96895f Improve handling of sentence splits including speech by Akron · 2 years, 9 months ago
  11. b02ad07 Improve handling of apostrophes by Akron · 2 years, 10 months ago
  12. 54ed7e7 Fix handling of "z.B." by Akron · 2 years, 11 months ago
  13. 936c0f5 Support Plusampersand words in compounds by Akron · 3 years ago
  14. e62e8eb Introducing Plusampersand-Compounds by Akron · 3 years ago
  15. e87906b Minor improvements by Akron · 3 years ago
  16. fac8abc Reorder longest match operator and update models by Akron · 3 years ago
  17. 65c0f21 Simplify tokenizer whitespace handling by Akron · 3 years ago
  18. c840636 Separate xml rule from main script by Akron · 3 years ago
  19. 7198645 Speed up build by Akron · 3 years ago
  20. 6742b96 Add XML entities by Akron · 3 years ago
  21. 11a05d9 Extend tokenizer fileending by Akron · 3 years, 1 month ago
  22. f1106ec Add single character abbreviations by Akron · 3 years, 1 month ago
  23. 17984c8 Improving time parsing by Akron · 3 years, 1 month ago
  24. 78dba06 Add time format to transducer by Akron · 3 years, 1 month ago
  25. 066d99c Fix XML empty element handling by Akron · 3 years, 1 month ago
  26. f6bdfdb Add trimming at the beginning of a text by Akron · 3 years, 1 month ago
  27. a854faa Introduce EOT (end-of-transmission) marker by Akron · 3 years, 1 month ago
  28. 4c2a1ad Introduce XML tests by Akron · 3 years, 3 months ago
  29. 3de361e Improved newline and abbreviation handling by Akron · 3 years, 3 months ago
  30. 1e10d00 Remove dir/Dir from abbreviation file by Akron · 3 years, 3 months ago
  31. 57d0161 Add known terms with special characters by Akron · 3 years, 3 months ago
  32. e8837b5 Add file scheme by Akron · 3 years, 3 months ago
  33. fd92d7e Update abbreviations according to KorAP-Tokenizer by Akron · 3 years, 3 months ago
  34. a0bded5 Add ordinals by Akron · 3 years, 3 months ago
  35. 4af79f1 Added support for streetnames by Akron · 3 years, 3 months ago
  36. 310905f Add foma sources by Akron · 3 years, 3 months ago