  1. a2f952f Support Wikipedia templates by Akron · 4 weeks ago
  2. d8d8895 Introduce hyphenated abbreviations in German tokenizer by Akron · 4 weeks ago
  3. 72a6422 Introduce English clitics by Akron · 2 years, 10 months ago
  4. d0dfea8 Added context rule for I by Akron · 2 years, 10 months ago
  5. be3d366 Introduce English tokenizer by Akron · 2 years, 10 months ago
  6. b15acb9 Rename token_symbol to token_bound by Akron · 3 years, 11 months ago
  7. d47c67e Add minor rules for XML support by Akron · 3 years, 11 months ago
  8. 6dcb6ce Add arrows by Akron · 3 years, 11 months ago
  9. 78f6714 Split tokenizer rules into language-specific and language-dependent by Akron · 3 years, 11 months ago
  10. 61948ef Restructure XFST sources by Akron · 4 years ago
  11. 7aa1cbe Improve sentence endings further by Akron · 4 years ago
  12. b98e4cf Improve Emoticons by Akron · 4 years ago v0.1.5
  13. f94b9ce Check parentheses at the end of sentences by Akron · 4 years ago
  14. b428755 Support punctuation after quotes by Akron · 4 years ago v0.1.4
  15. 4222ac8 Improve handling of ellipsis by Akron · 4 years ago
  16. ece3f01 Support quote combinations at the end of sentences by Akron · 4 years ago
  17. e200841 Further improve speech rule for eos with more quotation marks by Akron · 4 years ago
  18. e96895f Improve handling of sentence splits including speech by Akron · 4 years ago
  19. b02ad07 Improve handling of apostrophes by Akron · 4 years, 1 month ago
  20. 54ed7e7 Fix handling of "z.B." by Akron · 4 years, 2 months ago
  21. 936c0f5 Support Plusampersand words in compounds by Akron · 4 years, 3 months ago
  22. e62e8eb Introducing Plusampersand-Compounds by Akron · 4 years, 3 months ago
  23. e87906b Minor improvements by Akron · 4 years, 3 months ago
  24. fac8abc Reorder longest match operator and update models by Akron · 4 years, 4 months ago
  25. 65c0f21 Simplify tokenizer whitespace handling by Akron · 4 years, 4 months ago
  26. c840636 Separate xml rule from main script by Akron · 4 years, 4 months ago
  27. 7198645 Speed up build by Akron · 4 years, 4 months ago
  28. 6742b96 Add XML entities by Akron · 4 years, 4 months ago
  29. 11a05d9 Extend tokenizer fileending by Akron · 4 years, 4 months ago
  30. f1106ec Add single character abbreviations by Akron · 4 years, 4 months ago
  31. 17984c8 Improving time parsing by Akron · 4 years, 4 months ago
  32. 78dba06 Add time format to transducer by Akron · 4 years, 4 months ago
  33. 066d99c Fix XML empty element handling by Akron · 4 years, 4 months ago
  34. f6bdfdb Add trimming at the beginning of a text by Akron · 4 years, 4 months ago
  35. a854faa Introduce EOT (end-of-transmission) marker by Akron · 4 years, 4 months ago
  36. 4c2a1ad Introduce XML tests by Akron · 4 years, 6 months ago
  37. 3de361e Improved newline and abbreviation handling by Akron · 4 years, 7 months ago
  38. 1e10d00 Remove dir/Dir from abbreviation file by Akron · 4 years, 7 months ago
  39. 57d0161 Add known terms with special characters by Akron · 4 years, 7 months ago
  40. e8837b5 Add file scheme by Akron · 4 years, 7 months ago
  41. fd92d7e Update abbreviations according to KorAP-Tokenizer by Akron · 4 years, 7 months ago
  42. a0bded5 Add ordinals by Akron · 4 years, 7 months ago
  43. 4af79f1 Added support for streetnames by Akron · 4 years, 7 months ago
  44. 310905f Add foma sources by Akron · 4 years, 7 months ago