1. 104c94b Add option -Z to exclude empty texts by Marc Kupietz · 1 year, 10 months ago
  2. f5d4d75 Fix empty cardinal tokens and lemmas in FilterKeys by Marc Kupietz · 1 year, 10 months ago
  3. 545bb91 Extend isPunctuation unit test by Marc Kupietz · 2 years ago
  4. a4f6cf9 Add exclude punctuation option by Marc Kupietz · 2 years ago
  5. 24416b4 Add FilterKeys script and Readme by Marc Kupietz · 2 years, 1 month ago
  6. 47bd743 Drop failing stderr test by Marc Kupietz · 2 years, 1 month ago
  7. 869bfb9 Incorporate pseudonymization scripts into maven project by Marc Kupietz · 2 years, 1 month ago
  8. 0e72537 Add missing test resources by Marc Kupietz · 2 years, 1 month ago
  9. 37197a8 Fix bug in simple test by Marc Kupietz · 3 years, 2 months ago
  10. be29959 Test simple text also without padding by Marc Kupietz · 3 years, 2 months ago
  11. e4adb69 Make sure that start and end tags for empty texts are counted by Marc Kupietz · 3 years, 2 months ago
  12. a691041 Add --pad option to optionally add padding symbols at text edges by Marc Kupietz · 3 years, 2 months ago
  13. ead2a6f Improve null handling in tests by Marc Kupietz · 3 years, 2 months ago
  14. 1b717be Auto detect xz compression for input and output by Marc Kupietz · 3 years, 2 months ago
  15. 53623e0 Add --downcase/-d option to convert all token characters to lower case by Marc Kupietz · 3 years, 8 months ago
  16. c78b5a5 totalNGrams: unescape all XML entities (&, <, >, ") by Marc Kupietz · 4 years, 2 months ago
  17. aaf46f1 totalngrams: fix main class name by Marc Kupietz · 4 years, 4 months ago
  18. 3db37c5 totalngrams: add unit test for almost the whole pipeline by Marc Kupietz · 4 years, 4 months ago
  19. 2ea60bd Use cryptogrphic Blake2b hash as determisitic fold random source by Marc Kupietz · 4 years, 4 months ago
  20. 6638bb2 totalngrams: start adding unit tests by Marc Kupietz · 4 years, 4 months ago