1. be3d366 Introduce english tokenizer by Akron · 1 year, 8 months ago
  2. b15acb9 Rename token_symbol to token_bound by Akron · 2 years, 8 months ago
  3. d47c67e Add minor rules for XML support by Akron · 2 years, 9 months ago
  4. 6dcb6ce Add arrows by Akron · 2 years, 9 months ago
  5. 78f6714 Split tokenizer rules into language-specific and language-dependent by Akron · 2 years, 9 months ago
  6. 61948ef Restructure XFST sources by Akron · 2 years, 9 months ago
  7. 7aa1cbe Improve sentence endings further by Akron · 2 years, 9 months ago
  8. b98e4cf Improve Emoticons by Akron · 2 years, 9 months ago v0.1.5
  9. f94b9ce check parantheses at the end of sentences by Akron · 2 years, 9 months ago
  10. b428755 Support punctuation after quotes by Akron · 2 years, 9 months ago v0.1.4
  11. 4222ac8 Improve handling of ellipsis by Akron · 2 years, 10 months ago
  12. ece3f01 Support quote combinations at the end of sentences by Akron · 2 years, 10 months ago
  13. e200841 Further improve speech rule for eos with more quotation marks by Akron · 2 years, 10 months ago
  14. e96895f Improve handling of sentence splits including speech by Akron · 2 years, 10 months ago
  15. b02ad07 Improve handling of apostrophes by Akron · 3 years ago
  16. 54ed7e7 Fix handling of "z.B." by Akron · 3 years ago
  17. 936c0f5 Support Plusampersand words in compounds by Akron · 3 years, 1 month ago
  18. e62e8eb Introducing Plusampersand-Compounds by Akron · 3 years, 1 month ago
  19. e87906b Minor improvements by Akron · 3 years, 1 month ago
  20. fac8abc Reorder longest match operator and update models by Akron · 3 years, 2 months ago
  21. 65c0f21 Simplify tokenizer whitespace handling by Akron · 3 years, 2 months ago
  22. c840636 Separate xml rule from main script by Akron · 3 years, 2 months ago
  23. 7198645 Speed up build by Akron · 3 years, 2 months ago
  24. 6742b96 Add XML entities by Akron · 3 years, 2 months ago
  25. 11a05d9 Extend tokenizer fileending by Akron · 3 years, 2 months ago
  26. f1106ec Add single character abbreviations by Akron · 3 years, 2 months ago
  27. 17984c8 Improving time parsing by Akron · 3 years, 2 months ago
  28. 78dba06 Add time format to transducer by Akron · 3 years, 2 months ago
  29. 066d99c Fix XML empty element handling by Akron · 3 years, 2 months ago
  30. f6bdfdb Add trimming at the beginning of a text by Akron · 3 years, 2 months ago
  31. a854faa Introduce EOT (end-of-transmission) marker by Akron · 3 years, 2 months ago
  32. 4c2a1ad Introduce XML tests by Akron · 3 years, 4 months ago
  33. 3de361e Improved newline and abbreviation handling by Akron · 3 years, 4 months ago
  34. 1e10d00 Remove dir/Dir from abbreviation file by Akron · 3 years, 5 months ago
  35. 57d0161 Add known terms with special characters by Akron · 3 years, 5 months ago
  36. e8837b5 Add file scheme by Akron · 3 years, 5 months ago
  37. fd92d7e Update abbreviations according to KorAP-Tokenizer by Akron · 3 years, 5 months ago
  38. a0bded5 Add ordinals by Akron · 3 years, 5 months ago
  39. 4af79f1 Added support for streetnames by Akron · 3 years, 5 months ago
  40. 310905f Add foma sources by Akron · 3 years, 5 months ago