1. 97f00c4 totalngrams: make sure to assume unsigned ints in frequency prints by Marc Kupietz · 4 years, 2 months ago
  2. 85a17b0 totalngrams: add -S to turn off output sorting by Marc Kupietz · 4 years, 2 months ago
  3. c78b5a5 totalNGrams: unescape all XML entities (&, <, >, ") by Marc Kupietz · 4 years, 2 months ago
  4. aca0a8d n-grams: Assert proper tab separated CoNLL-U Format with 10 columns input by Marc Kupietz · 4 years, 2 months ago
  5. aaf46f1 totalngrams: fix main class name by Marc Kupietz · 4 years, 4 months ago
  6. cf6ec7d totalngrams: fix junit5 tests with old maven versions by Marc Kupietz · 4 years, 4 months ago
  7. 3db37c5 totalngrams: add unit test for almost the whole pipeline by Marc Kupietz · 4 years, 4 months ago
  8. 8b1bfab totalngrams: default to cores - 2 threads by Marc Kupietz · 4 years, 4 months ago
  9. b5f4868 totalngrams: remove redundant returns and imports by Marc Kupietz · 4 years, 4 months ago
  10. 2ea60bd Use cryptogrphic Blake2b hash as determisitic fold random source by Marc Kupietz · 4 years, 4 months ago
  11. 8780be5 totalngrams: add console logger for > WARNING levels by Marc Kupietz · 4 years, 4 months ago
  12. 30c9a81 totalngrams: fix missing imports by Marc Kupietz · 4 years, 4 months ago
  13. d49801a totalngrams: sort input files by size by Marc Kupietz · 4 years, 4 months ago
  14. 3e8a38d totalngrams: add reading .conllu and conllu.gz files directly by Marc Kupietz · 4 years, 4 months ago
  15. 0337db6 totalngrams: add -l option to count <token>\t<lemma>\t<pos> by Marc Kupietz · 4 years, 4 months ago
  16. 6638bb2 totalngrams: start adding unit tests by Marc Kupietz · 4 years, 4 months ago
  17. 9c63b57 totalngrams: Calculate only one fold at a time by Marc Kupietz · 4 years, 4 months ago
  18. a363746 totalngrams: use tab as separator also between n-gram tokens by Marc Kupietz · 4 years, 4 months ago
  19. fcee606 Revert "totalngrams: use chronicle map for larger maps" by Marc Kupietz · 4 years, 4 months ago
  20. 12c0411 Revert "Try Ehcache" by Marc Kupietz · 4 years, 4 months ago
  21. d245822 Try Ehcache by Marc Kupietz · 4 years, 4 months ago
  22. 7f45175 totalngrams: use chronicle map for larger maps by Marc Kupietz · 4 years, 4 months ago
  23. ac44043 totalngrams: factor out file creation by Marc Kupietz · 4 years, 4 months ago
  24. b9ff157 totalngrams: fix calculations with unsigned integers > 2^31 by Marc Kupietz · 4 years, 4 months ago
  25. 34bbca5 totalngrams: use buffered output streams by Marc Kupietz · 4 years, 4 months ago
  26. 5dc9fa5 totalngrams: improve logging by Marc Kupietz · 4 years, 4 months ago
  27. ba7c757 totalngrams: support gzipped output by Marc Kupietz · 4 years, 4 months ago
  28. 0b95783 totalngrams: sanitize worker node pool by Marc Kupietz · 4 years, 4 months ago
  29. 010903e totalngrams: reuse slidingWindowQueue by Marc Kupietz · 4 years, 4 months ago
  30. e31c4e1 totalngrams: improve progressbar looks by Marc Kupietz · 4 years, 4 months ago
  31. 9d64cde Use unsigned integers instead of longs for counts by Marc Kupietz · 4 years, 4 months ago
  32. 68dc73d totalngrams: parallelize and fix alphabetical secondary sorting by Marc Kupietz · 4 years, 4 months ago
  33. bd62aa0 totalngrams: log warnings and retry on errors by Marc Kupietz · 4 years, 4 months ago
  34. cc77d67 totalngrams: sanitize progressbar by Marc Kupietz · 4 years, 4 months ago
  35. 4cd3389 totalngrams: allow using a worker node pool by Marc Kupietz · 4 years, 4 months ago
  36. 97713f3 totalngrams: detect early if output file can be written by Marc Kupietz · 4 years, 4 months ago
  37. 8fae582 totalngram: Use our own ETA printer by Marc Kupietz · 4 years, 4 months ago
  38. 1c06250 Revert "totalngram: compute marginals in parallel" by Marc Kupietz · 4 years, 4 months ago
  39. 88d50e2 totalngram: compute marginals in parallel by Marc Kupietz · 4 years, 4 months ago
  40. 962a978 Revert "totalngram: use CircularFifoQueue as sliding window base" by Marc Kupietz · 4 years, 4 months ago
  41. 0184f9c totalngram: use CircularFifoQueue as sliding window base by Marc Kupietz · 4 years, 4 months ago
  42. 81e85e2 totalngram: Support > 1-grams by Marc Kupietz · 4 years, 4 months ago
  43. a37cc9f Efficient folded ngram frequency adder by Marc Kupietz · 4 years, 4 months ago