- d0dfea8 Added context rule for I by Akron · 1 year, 7 months ago
- be3d366 Introduce english tokenizer by Akron · 1 year, 7 months ago
- b15acb9 Rename token_symbol to token_bound by Akron · 2 years, 7 months ago
- d47c67e Add minor rules for XML support by Akron · 2 years, 8 months ago
- 6dcb6ce Add arrows by Akron · 2 years, 8 months ago
- 78f6714 Split tokenizer rules into language-specific and language-dependent by Akron · 2 years, 8 months ago
- 61948ef Restructure XFST sources by Akron · 2 years, 8 months ago
- 7aa1cbe Improve sentence endings further by Akron · 2 years, 8 months ago
- b98e4cf Improve Emoticons by Akron · 2 years, 8 months ago v0.1.5
- f94b9ce check parantheses at the end of sentences by Akron · 2 years, 8 months ago
- b428755 Support punctuation after quotes by Akron · 2 years, 8 months ago v0.1.4
- 4222ac8 Improve handling of ellipsis by Akron · 2 years, 9 months ago
- ece3f01 Support quote combinations at the end of sentences by Akron · 2 years, 9 months ago
- e200841 Further improve speech rule for eos with more quotation marks by Akron · 2 years, 9 months ago
- e96895f Improve handling of sentence splits including speech by Akron · 2 years, 9 months ago
- b02ad07 Improve handling of apostrophes by Akron · 2 years, 10 months ago
- 54ed7e7 Fix handling of "z.B." by Akron · 2 years, 11 months ago
- 936c0f5 Support Plusampersand words in compounds by Akron · 3 years ago
- e62e8eb Introducing Plusampersand-Compounds by Akron · 3 years ago
- e87906b Minor improvements by Akron · 3 years ago
- fac8abc Reorder longest match operator and update models by Akron · 3 years ago
- 65c0f21 Simplify tokenizer whitespace handling by Akron · 3 years ago
- c840636 Separate xml rule from main script by Akron · 3 years ago
- 7198645 Speed up build by Akron · 3 years ago
- 6742b96 Add XML entities by Akron · 3 years ago
- 11a05d9 Extend tokenizer fileending by Akron · 3 years, 1 month ago
- f1106ec Add single character abbreviations by Akron · 3 years, 1 month ago
- 17984c8 Improving time parsing by Akron · 3 years, 1 month ago
- 78dba06 Add time format to transducer by Akron · 3 years, 1 month ago
- 066d99c Fix XML empty element handling by Akron · 3 years, 1 month ago
- f6bdfdb Add trimming at the beginning of a text by Akron · 3 years, 1 month ago
- a854faa Introduce EOT (end-of-transmission) marker by Akron · 3 years, 1 month ago
- 4c2a1ad Introduce XML tests by Akron · 3 years, 3 months ago
- 3de361e Improved newline and abbreviation handling by Akron · 3 years, 3 months ago
- 1e10d00 Remove dir/Dir from abbreviation file by Akron · 3 years, 3 months ago
- 57d0161 Add known terms with special characters by Akron · 3 years, 3 months ago
- e8837b5 Add file scheme by Akron · 3 years, 3 months ago
- fd92d7e Update abbreviations according to KorAP-Tokenizer by Akron · 3 years, 3 months ago
- a0bded5 Add ordinals by Akron · 3 years, 3 months ago
- 4af79f1 Added support for streetnames by Akron · 3 years, 3 months ago
- 310905f Add foma sources by Akron · 3 years, 3 months ago