- f5d4d75 Fix empty cardinal tokens and lemmas in FilterKeys by Marc Kupietz · 1 year, 10 months ago
- 545bb91 Extend isPunctuation unit test by Marc Kupietz · 2 years ago
- a4f6cf9 Add exclude punctuation option by Marc Kupietz · 2 years ago
- 24416b4 Add FilterKeys script and Readme by Marc Kupietz · 2 years, 1 month ago
- 47bd743 Drop failing stderr test by Marc Kupietz · 2 years, 1 month ago
- 869bfb9 Incorporate pseudonymization scripts into maven project by Marc Kupietz · 2 years, 1 month ago
- 0e72537 Add missing test resources by Marc Kupietz · 2 years, 1 month ago
- 37197a8 Fix bug in simple test by Marc Kupietz · 3 years, 2 months ago
- be29959 Test simple text also without padding by Marc Kupietz · 3 years, 2 months ago
- e4adb69 Make sure that start and end tags for empty texts are counted by Marc Kupietz · 3 years, 2 months ago
- a691041 Add --pad option to optionally add padding symbols at text edges by Marc Kupietz · 3 years, 2 months ago
- ead2a6f Improve null handling in tests by Marc Kupietz · 3 years, 2 months ago
- 1b717be Auto detect xz compression for input and output by Marc Kupietz · 3 years, 2 months ago
- 53623e0 Add --downcase/-d option to convert all token characters to lower case by Marc Kupietz · 3 years, 8 months ago
- c78b5a5 totalNGrams: unescape all XML entities (&, <, >, ") by Marc Kupietz · 4 years, 2 months ago
- aaf46f1 totalngrams: fix main class name by Marc Kupietz · 4 years, 4 months ago
- 3db37c5 totalngrams: add unit test for almost the whole pipeline by Marc Kupietz · 4 years, 4 months ago
- 2ea60bd Use cryptogrphic Blake2b hash as determisitic fold random source by Marc Kupietz · 4 years, 4 months ago
- 6638bb2 totalngrams: start adding unit tests by Marc Kupietz · 4 years, 4 months ago