- b84d469 Fix Godoc by Akron · 3 months ago master
- cd58d08 Bump github.com/alecthomas/kong from 0.8.1 to 0.9.0 (closes #10) by dependabot[bot] · 9 months ago
- b71b51f Add comment to tokenwriter by Akron · 9 months ago
- 0b9aa66 Bump github.com/stretchr/testify from 1.8.4 to 1.9.0 (closes #9) by dependabot[bot] · 9 months ago
- 90b3f9f Merge "Update dependencies" by Nils Diewald · 1 year, 1 month ago
- 1071ccd Update dependencies by Akron · 1 year, 1 month ago
- 4370f35 Merge "Bump github.com/alecthomas/kong from 0.8.0 to 0.8.1 (closes #8)" by Nils Diewald · 1 year, 1 month ago
- f5c2819 Fix DOI in README (2) by Akron · 1 year, 1 month ago
- 353e323 Bump github.com/alecthomas/kong from 0.8.0 to 0.8.1 (closes #8) by dependabot[bot] · 1 year, 1 month ago
- 7bbc30b Fix DOI in Readme by Akron · 1 year, 1 month ago
- f66dc14 Fix end of text behaviour in case of sentence positions by Akron · 1 year, 3 months ago v0.2.2
- 78d270d Add library usage explanation by Akron · 1 year, 3 months ago
- 8e80393 Minor performance improvements by Akron · 1 year, 7 months ago v0.2.1
- 5d68ae4 Update changes by Akron · 1 year, 3 months ago
- 7b61a0b Improve performance description by Akron · 1 year, 3 months ago v0.2.0
- 7efa7e5 Bump github.com/stretchr/testify from 1.8.2 to 1.8.4 by dependabot[bot] · 1 year, 6 months ago
- 3d637df Add reference to DeReKo by Akron · 1 year, 3 months ago
- dfe1bbc Added benchmark table regarding performance by Akron · 1 year, 3 months ago
- 9c4b1f7 Added new english intro video by Akron · 1 year, 3 months ago
- 0139bc5 Introduce the english model as being on the same level as german by Akron · 1 year, 3 months ago
- 72a6422 Introduce english clitics by Akron · 1 year, 7 months ago
- cab40cf Bump github.com/alecthomas/kong from 0.7.1 to 0.8.0 (closes #7) by dependabot[bot] · 1 year, 5 months ago
- cae3911 Fix buffer bug in token writer by Akron · 1 year, 7 months ago
- d0dfea8 Added context rule for I by Akron · 1 year, 7 months ago
- be3d366 Introduce english tokenizer by Akron · 1 year, 7 months ago
- 8a5596a Merge "Added update command" by Nils Diewald · 1 year, 9 months ago
- 96c6548 Added update command by Akron · 1 year, 9 months ago
- 8413f84 Bump github.com/stretchr/testify from 1.7.0 to 1.8.2 (fixes #4) by dependabot[bot] · 1 year, 9 months ago
- e995f76 Bump github.com/alecthomas/kong from 0.5.0 to 0.7.1 (closes #3) by dependabot[bot] · 1 year, 9 months ago
- 6c92763 New build by Akron · 1 year, 9 months ago
- 0597b27 Introduce dependabot support by Akron · 1 year, 9 months ago
- a25a7d5 Update pages for EURALEX publication by Akron · 2 years, 4 months ago
- 49ebd91 Add paper link (2) by Akron · 2 years, 5 months ago
- 656934d Add paper link by Akron · 2 years, 5 months ago
- fd120d3 Update dependencies by Akron · 2 years, 6 months ago
- 7e4b780 Move references to the end of the readme by Akron · 2 years, 6 months ago
- 79ec995 Add references by Akron · 2 years, 6 months ago
- a44944d Merge "Add notification regarding load factor" by Nils Diewald · 2 years, 7 months ago
- 6a4ce18 Add notification regarding load factor by Akron · 2 years, 7 months ago
- b15acb9 Rename token_symbol to token_bound by Akron · 2 years, 7 months ago
- d47c67e Add minor rules for XML support by Akron · 2 years, 8 months ago
- 6dcb6ce Add arrows by Akron · 2 years, 8 months ago
- 3b6c7fb Add Zenodo DOI Badge by Akron · 2 years, 8 months ago
- 78f6714 Split tokenizer rules into language-specific and language-dependent by Akron · 2 years, 8 months ago
- 61948ef Restructure XFST sources by Akron · 2 years, 8 months ago
- 7aa1cbe Improve sentence endings further by Akron · 2 years, 8 months ago
- b98e4cf Improve Emoticons by Akron · 2 years, 8 months ago v0.1.5
- f94b9ce check parantheses at the end of sentences by Akron · 2 years, 8 months ago
- b428755 Support punctuation after quotes by Akron · 2 years, 8 months ago v0.1.4
- df27581 Make tokenizer robust and never failing by Akron · 2 years, 8 months ago
- 4222ac8 Improve handling of ellipsis by Akron · 2 years, 9 months ago
- ece3f01 Support quote combinations at the end of sentences by Akron · 2 years, 9 months ago
- e200841 Further improve speech rule for eos with more quotation marks by Akron · 2 years, 9 months ago
- e96895f Improve handling of sentence splits including speech by Akron · 2 years, 9 months ago
- b02ad07 Improve handling of apostrophes by Akron · 2 years, 10 months ago
- 9a59471 Test single quote handling by Akron · 2 years, 10 months ago
- 54ed7e7 Fix handling of "z.B." by Akron · 2 years, 11 months ago
- d0c6e10 Fix datok tests to be more robust regarding tokenizer changes by Akron · 3 years ago
- 936c0f5 Support Plusampersand words in compounds by Akron · 3 years ago
- 00cecd1 Initialize identity for sigma < 256 by Akron · 3 years ago
- 4880fb6 Improve rune2symbol conversion by Akron · 3 years ago
- e62e8eb Introducing Plusampersand-Compounds by Akron · 3 years ago
- 22c565a Fix out of range bug by reverting buffer rewind improvement by Akron · 3 years ago v0.1.1
- 4ec8cec Prepare first official release by Akron · 3 years ago v0.1.0
- e87906b Minor improvements by Akron · 3 years ago
- 90aa45b Minor code simplifications by Akron · 3 years ago
- fac8abc Reorder longest match operator and update models by Akron · 3 years ago
- 3976804 Add benchmark rule to Makefile by Akron · 3 years ago
- 65c0f21 Simplify tokenizer whitespace handling by Akron · 3 years ago
- c840636 Separate xml rule from main script by Akron · 3 years ago
- 289414f Update benchmarks by Akron · 3 years ago
- 7198645 Speed up build by Akron · 3 years ago
- 6742b96 Add XML entities by Akron · 3 years ago
- 7e75ef0 Add makefile by Akron · 3 years, 1 month ago
- 11a05d9 Extend tokenizer fileending by Akron · 3 years, 1 month ago
- 9135b20 Test IPv4 handling by Akron · 3 years, 1 month ago
- f1106ec Add single character abbreviations by Akron · 3 years, 1 month ago
- 4a6e0ff Fix newline after eot behaiour by Akron · 3 years, 1 month ago
- 274600e Fix buffer flushing to work with tei2korapxml by Akron · 3 years, 1 month ago
- 9c3bf7f Change fmt to log for easier writing to STDOUT by Akron · 3 years, 1 month ago
- 3d31453 Add introduction video to readme by Akron · 3 years, 1 month ago
- 6792bd2 Improve Readme example by Akron · 3 years, 1 month ago
- 15bb13d Introduce dash flag for STDIN and input file handling for tokenization by Akron · 3 years, 1 month ago
- 17984c8 Improving time parsing by Akron · 3 years, 1 month ago
- 78dba06 Add time format to transducer by Akron · 3 years, 1 month ago
- 066d99c Fix XML empty element handling by Akron · 3 years, 1 month ago
- 04335c6 Update tests by Akron · 3 years, 1 month ago
- 9fb63af Optimize tests by avoiding reload of tokenizers by Akron · 3 years, 1 month ago
- 7035d2e Fix sentence_pos handling by Akron · 3 years, 1 month ago
- 96fdc9b Fix TokenWriter regarding sentence boundaries and remove simple TokenWriter by Akron · 3 years, 1 month ago
- 2612f99 Improve command help page by Akron · 3 years, 1 month ago
- 685861a Improve Readme by Akron · 3 years, 1 month ago
- 0f087ea Parse command line options as bit flags by Akron · 3 years, 1 month ago
- fceddb6 Add sentence flags (for printing and offsets) by Akron · 3 years, 1 month ago
- a9e0c42 Introduce --[no]-tokens flag by Akron · 3 years, 1 month ago
- e9431ec Ignore newline after EOT with a flag by Akron · 3 years, 1 month ago
- 8cc2dd9 Fix buffer rewind at end of transmission by Akron · 3 years, 1 month ago
- 4f6b28c Support token offsets in token writer by Akron · 3 years, 1 month ago
- 32416ce Support offsets in token writer by Akron · 3 years, 1 month ago
- 98fbfef Improve offset handling in buffers by Akron · 3 years, 1 month ago