  1. d0dfea8 Added context rule for I by Akron · 1 year, 7 months ago
  2. be3d366 Introduce english tokenizer by Akron · 1 year, 7 months ago
  3. b15acb9 Rename token_symbol to token_bound by Akron · 2 years, 7 months ago
  4. d47c67e Add minor rules for XML support by Akron · 2 years, 8 months ago
  5. 6dcb6ce Add arrows by Akron · 2 years, 8 months ago
  6. 78f6714 Split tokenizer rules into language-specific and language-dependent by Akron · 2 years, 8 months ago
  7. 61948ef Restructure XFST sources by Akron · 2 years, 8 months ago
  8. 7aa1cbe Improve sentence endings further by Akron · 2 years, 8 months ago
  9. b98e4cf Improve Emoticons by Akron · 2 years, 8 months ago v0.1.5
  10. f94b9ce check parentheses at the end of sentences by Akron · 2 years, 8 months ago
  11. b428755 Support punctuation after quotes by Akron · 2 years, 8 months ago v0.1.4
  12. 4222ac8 Improve handling of ellipsis by Akron · 2 years, 9 months ago
  13. ece3f01 Support quote combinations at the end of sentences by Akron · 2 years, 9 months ago
  14. e200841 Further improve speech rule for eos with more quotation marks by Akron · 2 years, 9 months ago
  15. e96895f Improve handling of sentence splits including speech by Akron · 2 years, 9 months ago
  16. b02ad07 Improve handling of apostrophes by Akron · 2 years, 10 months ago
  17. 54ed7e7 Fix handling of "z.B." by Akron · 2 years, 11 months ago
  18. 936c0f5 Support Plusampersand words in compounds by Akron · 3 years ago
  19. e62e8eb Introducing Plusampersand-Compounds by Akron · 3 years ago
  20. e87906b Minor improvements by Akron · 3 years ago
  21. fac8abc Reorder longest match operator and update models by Akron · 3 years ago
  22. 65c0f21 Simplify tokenizer whitespace handling by Akron · 3 years ago
  23. c840636 Separate xml rule from main script by Akron · 3 years ago
  24. 7198645 Speed up build by Akron · 3 years ago
  25. 6742b96 Add XML entities by Akron · 3 years ago
  26. 11a05d9 Extend tokenizer fileending by Akron · 3 years, 1 month ago
  27. f1106ec Add single character abbreviations by Akron · 3 years, 1 month ago
  28. 17984c8 Improving time parsing by Akron · 3 years, 1 month ago
  29. 78dba06 Add time format to transducer by Akron · 3 years, 1 month ago
  30. 066d99c Fix XML empty element handling by Akron · 3 years, 1 month ago
  31. f6bdfdb Add trimming at the beginning of a text by Akron · 3 years, 1 month ago
  32. a854faa Introduce EOT (end-of-transmission) marker by Akron · 3 years, 1 month ago
  33. 4c2a1ad Introduce XML tests by Akron · 3 years, 3 months ago
  34. 3de361e Improved newline and abbreviation handling by Akron · 3 years, 3 months ago
  35. 1e10d00 Remove dir/Dir from abbreviation file by Akron · 3 years, 3 months ago
  36. 57d0161 Add known terms with special characters by Akron · 3 years, 3 months ago
  37. e8837b5 Add file scheme by Akron · 3 years, 3 months ago
  38. fd92d7e Update abbreviations according to KorAP-Tokenizer by Akron · 3 years, 3 months ago
  39. a0bded5 Add ordinals by Akron · 3 years, 3 months ago
  40. 4af79f1 Added support for streetnames by Akron · 3 years, 3 months ago
  41. 310905f Add foma sources by Akron · 3 years, 3 months ago