1. 104c94b Add option -Z to exclude empty texts by Marc Kupietz · 1 year, 10 months ago
  2. b028de4 Bump version to 2.2.3 by Marc Kupietz · 1 year, 10 months ago
  3. 426dbff Bump version to v2.2.2 by Marc Kupietz · 1 year, 10 months ago v2.2.2
  4. f5d4d75 Fix empty cardinal tokens and lemmas in FilterKeys by Marc Kupietz · 1 year, 10 months ago
  5. 95e3f4f Add DOI badge for v2.2.1 by Marc Kupietz · 1 year, 11 months ago v2.2.1
  6. 8044382 Bump version to 2.2.1 by Marc Kupietz · 1 year, 11 months ago
  7. af80952 Fix missing scripts in bin distribution by Marc Kupietz · 1 year, 11 months ago
  8. 4ab08ab Update Readme and Changelog by Marc Kupietz · 1 year, 11 months ago
  9. 5c28c9d Provide binary and source distribution artifacts in ci workflow by Marc Kupietz · 1 year, 11 months ago
  10. d235350 Name standalone jar target properly by Marc Kupietz · 1 year, 11 months ago
  11. 207a1e5 Generate source distribution by Marc Kupietz · 1 year, 11 months ago
  12. e3a72f7 Add Apache-2.0 license by Marc Kupietz · 1 year, 11 months ago
  13. 15df2de Bump org.jetbrains:annotations to 23.1.0 by Marc Kupietz · 1 year, 11 months ago
  14. 0c778db Drop javadoc building by Marc Kupietz · 1 year, 11 months ago
  15. 73397d8 Provide a proper binary distribution by Marc Kupietz · 1 year, 11 months ago
  16. 5b1d372 Bump version to 2.2.0 by Marc Kupietz · 1 year, 11 months ago
  17. 545bb91 Extend isPunctuation unit test by Marc Kupietz · 2 years ago
  18. 575ab1c Add IML to facilitate groovy integration in IntelliJ IDEA by Marc Kupietz · 2 years ago
  19. a4f6cf9 Add exclude punctuation option by Marc Kupietz · 2 years ago
  20. 464b1c8 Bump version to v2.1.0 by Marc Kupietz · 2 years ago
  21. 1e09a23 Cosmetic code changes by Marc Kupietz · 2 years ago
  22. a5c191f Add BLAKE2b reference to Readme by Marc Kupietz · 2 years ago
  23. b6f601f Update dependencies by Marc Kupietz · 2 years ago
  24. ed14736 clean up ci pipeline by Marc Kupietz · 2 years ago
  25. 31574d9 Include shell scripts in artifacts by Marc Kupietz · 2 years, 1 month ago
  26. 9b6b845 Extend LinkedList<String> correctly by Marc Kupietz · 2 years, 1 month ago
  27. 097f35a Make groovy scripts available as executables by Marc Kupietz · 2 years, 1 month ago
  28. 48070d9 Update .gitlab-ci.yml file by Marc Kupietz · 2 years, 1 month ago
  29. 3253079 Update changelog by Marc Kupietz · 2 years, 1 month ago
  30. ab91cf0 Update pom by Marc Kupietz · 2 years, 1 month ago
  31. 24416b4 Add FilterKeys script and Readme by Marc Kupietz · 2 years, 1 month ago
  32. 8250c4b Make surefire-plugin fork and give it more heap by Marc Kupietz · 2 years, 1 month ago
  33. 47bd743 Drop failing stderr test by Marc Kupietz · 2 years, 1 month ago
  34. 869bfb9 Incorporate pseudonymization scripts into maven project by Marc Kupietz · 2 years, 1 month ago
  35. e582d9c Add groovy script for pseudonymizing ngram lists by Marc Kupietz · 2 years, 1 month ago
  36. 42e37e1 Add groovy script for generation pseudonym keys by Marc Kupietz · 2 years, 1 month ago
  37. 098c820 Handle tab-separated ints in numeric secondary sort by Marc Kupietz · 2 years, 1 month ago
  38. 7722b47 Simplify unsorted output by Marc Kupietz · 2 years, 1 month ago
  39. fa273a5 Bump version to 2.1.0 by Marc Kupietz · 2 years, 1 month ago
  40. 6a151fb Add artifacts to ci script by Marc Kupietz · 2 years, 1 month ago
  41. 0e72537 Add missing test resources by Marc Kupietz · 2 years, 1 month ago
  42. 118b7a5 Add CI tests and dependency scans by Marc Kupietz · 2 years, 1 month ago
  43. b6f4fab Upgrade dependencies by Marc Kupietz · 2 years, 1 month ago
  44. 014cfc5 Bump version to 2.0.1 by Marc Kupietz · 2 years, 1 month ago
  45. 983cfc1 Bump version to 2.0 by Marc Kupietz · 3 years, 2 months ago
  46. 4a64b9c Rename jar from nGrammFoldCount to totalngrams by Marc Kupietz · 3 years, 2 months ago
  47. c73be80 Let number of folds (-F) default to 1 by Marc Kupietz · 3 years, 2 months ago
  48. 04dd984 Unify name to totalngrams by Marc Kupietz · 3 years, 2 months ago
  49. 37197a8 Fix bug in simple test by Marc Kupietz · 3 years, 2 months ago
  50. be29959 Test simple text also without padding by Marc Kupietz · 3 years, 2 months ago
  51. e4adb69 Make sure that start and end tags for empty texts are counted by Marc Kupietz · 3 years, 2 months ago
  52. 5d56653 Add text end padding only if in target fold by Marc Kupietz · 3 years, 2 months ago
  53. a691041 Add --pad option to optionally add padding symbols at text edges by Marc Kupietz · 3 years, 2 months ago
  54. ead2a6f Improve null handling in tests by Marc Kupietz · 3 years, 2 months ago
  55. 03f9dc0 Add option -N to sort keys with same freq numerically by Marc Kupietz · 3 years, 2 months ago
  56. 1b717be Auto detect xz compression for input and output by Marc Kupietz · 3 years, 2 months ago
  57. f14c096 Bump version to 1.9.900 by Marc Kupietz · 3 years, 8 months ago
  58. 53623e0 Add --downcase/-d option to convert all token characters to lower case by Marc Kupietz · 3 years, 8 months ago
  59. bc8bd3d totalngrams: for .(freq|tsv)(.gz)? input files automatically cumulate frequencies by Marc Kupietz · 4 years ago
  60. 2a65aee totalngrams: use stringbuilder for building output lines by Marc Kupietz · 4 years ago
  61. c3c172d totalngrams add log for unsorted ouput by Marc Kupietz · 4 years, 2 months ago
  62. 97f00c4 totalngrams: make sure to assume unsigned ints in frequency prints by Marc Kupietz · 4 years, 2 months ago
  63. 85a17b0 totalngrams: add -S to turn off output sorting by Marc Kupietz · 4 years, 2 months ago
  64. 775c666 totalngrams: dev version by Marc Kupietz · 4 years, 2 months ago
  65. ca29d55 totalNGrams: Bump version to 1.7 (automatic XML unescaping) by Marc Kupietz · 4 years, 2 months ago
  66. c78b5a5 totalNGrams: unescape all XML entities (&, <, >, ") by Marc Kupietz · 4 years, 2 months ago
  67. aca0a8d n-grams: Assert proper tab separated CoNLL-U Format with 10 columns input by Marc Kupietz · 4 years, 2 months ago
  68. aaf46f1 totalngrams: fix main class name by Marc Kupietz · 4 years, 4 months ago
  69. cf6ec7d totalngrams: fix junit5 tests with old maven versions by Marc Kupietz · 4 years, 4 months ago
  70. 3db37c5 totalngrams: add unit test for almost the whole pipeline by Marc Kupietz · 4 years, 4 months ago
  71. b66b832 totalngrams: create call script by Marc Kupietz · 4 years, 4 months ago
  72. 8b1bfab totalngrams: default to cores - 2 threads by Marc Kupietz · 4 years, 4 months ago
  73. b5f4868 totalngrams: remove redundant returns and imports by Marc Kupietz · 4 years, 4 months ago
  74. 2ea60bd Use cryptogrphic Blake2b hash as determisitic fold random source by Marc Kupietz · 4 years, 4 months ago
  75. 8780be5 totalngrams: add console logger for > WARNING levels by Marc Kupietz · 4 years, 4 months ago
  76. 30c9a81 totalngrams: fix missing imports by Marc Kupietz · 4 years, 4 months ago
  77. d49801a totalngrams: sort input files by size by Marc Kupietz · 4 years, 4 months ago
  78. 3e8a38d totalngrams: add reading .conllu and conllu.gz files directly by Marc Kupietz · 4 years, 4 months ago
  79. 0337db6 totalngrams: add -l option to count <token>\t<lemma>\t<pos> by Marc Kupietz · 4 years, 4 months ago
  80. 6638bb2 totalngrams: start adding unit tests by Marc Kupietz · 4 years, 4 months ago
  81. 9c63b57 totalngrams: Calculate only one fold at a time by Marc Kupietz · 4 years, 4 months ago
  82. a363746 totalngrams: use tab as separator also between n-gram tokens by Marc Kupietz · 4 years, 4 months ago
  83. fcee606 Revert "totalngrams: use chronicle map for larger maps" by Marc Kupietz · 4 years, 4 months ago
  84. 12c0411 Revert "Try Ehcache" by Marc Kupietz · 4 years, 4 months ago
  85. d245822 Try Ehcache by Marc Kupietz · 4 years, 4 months ago
  86. 7f45175 totalngrams: use chronicle map for larger maps by Marc Kupietz · 4 years, 4 months ago
  87. ac44043 totalngrams: factor out file creation by Marc Kupietz · 4 years, 4 months ago
  88. e23b6cb totalngrams: update parallelzip to 1.0.5 by Marc Kupietz · 4 years, 4 months ago
  89. b9ff157 totalngrams: fix calculations with unsigned integers > 2^31 by Marc Kupietz · 4 years, 4 months ago
  90. 34bbca5 totalngrams: use buffered output streams by Marc Kupietz · 4 years, 4 months ago
  91. 251bbac totalngrams: bump version to 1.1 by Marc Kupietz · 4 years, 4 months ago
  92. 5dc9fa5 totalngrams: improve logging by Marc Kupietz · 4 years, 4 months ago
  93. ba7c757 totalngrams: support gzipped output by Marc Kupietz · 4 years, 4 months ago
  94. 0b95783 totalngrams: sanitize worker node pool by Marc Kupietz · 4 years, 4 months ago
  95. 010903e totalngrams: reuse slidingWindowQueue by Marc Kupietz · 4 years, 4 months ago
  96. e31c4e1 totalngrams: improve progressbar looks by Marc Kupietz · 4 years, 4 months ago
  97. 9d64cde Use unsigned integers instead of longs for counts by Marc Kupietz · 4 years, 4 months ago
  98. 68dc73d totalngrams: parallelize and fix alphabetical secondary sorting by Marc Kupietz · 4 years, 4 months ago
  99. bd62aa0 totalngrams: log warnings and retry on errors by Marc Kupietz · 4 years, 4 months ago
  100. cc77d67 totalngrams: sanitize progressbar by Marc Kupietz · 4 years, 4 months ago