- 098c820 Handle tab-separated ints in numeric secondary sort by Marc Kupietz · 3 years ago
 - 7722b47 Simplify unsorted output by Marc Kupietz · 3 years ago
 - fa273a5 Bump version to 2.1.0 by Marc Kupietz · 3 years ago
 - 6a151fb Add artifacts to ci script by Marc Kupietz · 3 years, 1 month ago
 - 0e72537 Add missing test resources by Marc Kupietz · 3 years, 1 month ago
 - 118b7a5 Add CI tests and dependency scans by Marc Kupietz · 3 years, 1 month ago
 - b6f4fab Upgrade dependencies by Marc Kupietz · 3 years, 1 month ago
 - 014cfc5 Bump version to 2.0.1 by Marc Kupietz · 3 years, 1 month ago
 - 983cfc1 Bump version to 2.0 by Marc Kupietz · 4 years, 1 month ago
 - 4a64b9c Rename jar from nGrammFoldCount to totalngrams by Marc Kupietz · 4 years, 1 month ago
 - c73be80 Let number of folds (-F) default to 1 by Marc Kupietz · 4 years, 1 month ago
 - 04dd984 Unify name to totalngrams by Marc Kupietz · 4 years, 1 month ago
 - 37197a8 Fix bug in simple test by Marc Kupietz · 4 years, 1 month ago
 - be29959 Test simple text also without padding by Marc Kupietz · 4 years, 1 month ago
 - e4adb69 Make sure that start and end tags for empty texts are counted by Marc Kupietz · 4 years, 1 month ago
 - 5d56653 Add text end padding only if in target fold by Marc Kupietz · 4 years, 1 month ago
 - a691041 Add --pad option to optionally add padding symbols at text edges by Marc Kupietz · 4 years, 1 month ago
 - ead2a6f Improve null handling in tests by Marc Kupietz · 4 years, 1 month ago
 - 03f9dc0 Add option -N to sort keys with same freq numerically by Marc Kupietz · 4 years, 2 months ago
 - 1b717be Auto detect xz compression for input and output by Marc Kupietz · 4 years, 2 months ago
 - f14c096 Bump version to 1.9.900 by Marc Kupietz · 4 years, 7 months ago
 - 53623e0 Add --downcase/-d option to convert all token characters to lower case by Marc Kupietz · 4 years, 7 months ago
 - bc8bd3d totalngrams: for .(freq|tsv)(.gz)? input files automatically cumulate frequencies by Marc Kupietz · 5 years ago
 - 2a65aee totalngrams: use stringbuilder for building output lines by Marc Kupietz · 5 years ago
 - c3c172d totalngrams: add log for unsorted output by Marc Kupietz · 5 years ago
 - 97f00c4 totalngrams: make sure to assume unsigned ints in frequency prints by Marc Kupietz · 5 years ago
 - 85a17b0 totalngrams: add -S to turn off output sorting by Marc Kupietz · 5 years ago
 - 775c666 totalngrams: dev version by Marc Kupietz · 5 years ago
 - ca29d55 totalNGrams: Bump version to 1.7 (automatic XML unescaping) by Marc Kupietz · 5 years ago
 - c78b5a5 totalNGrams: unescape all XML entities (&amp;, &lt;, &gt;, &quot;) by Marc Kupietz · 5 years ago
 - aca0a8d n-grams: Assert proper tab separated CoNLL-U Format with 10 columns input by Marc Kupietz · 5 years ago
 - aaf46f1 totalngrams: fix main class name by Marc Kupietz · 5 years ago
 - cf6ec7d totalngrams: fix junit5 tests with old maven versions by Marc Kupietz · 5 years ago
 - 3db37c5 totalngrams: add unit test for almost the whole pipeline by Marc Kupietz · 5 years ago
 - b66b832 totalngrams: create call script by Marc Kupietz · 5 years ago
 - 8b1bfab totalngrams: default to cores - 2 threads by Marc Kupietz · 5 years ago
 - b5f4868 totalngrams: remove redundant returns and imports by Marc Kupietz · 5 years ago
 - 2ea60bd Use cryptographic Blake2b hash as deterministic fold random source by Marc Kupietz · 5 years ago
 - 8780be5 totalngrams: add console logger for > WARNING levels by Marc Kupietz · 5 years ago
 - 30c9a81 totalngrams: fix missing imports by Marc Kupietz · 5 years ago
 - d49801a totalngrams: sort input files by size by Marc Kupietz · 5 years ago
 - 3e8a38d totalngrams: add reading .conllu and conllu.gz files directly by Marc Kupietz · 5 years ago
 - 0337db6 totalngrams: add -l option to count <token>\t<lemma>\t<pos> by Marc Kupietz · 5 years ago
 - 6638bb2 totalngrams: start adding unit tests by Marc Kupietz · 5 years ago
 - 9c63b57 totalngrams: Calculate only one fold at a time by Marc Kupietz · 5 years ago
 - a363746 totalngrams: use tab as separator also between n-gram tokens by Marc Kupietz · 5 years ago
 - fcee606 Revert "totalngrams: use chronicle map for larger maps" by Marc Kupietz · 5 years ago
 - 12c0411 Revert "Try Ehcache" by Marc Kupietz · 5 years ago
 - d245822 Try Ehcache by Marc Kupietz · 5 years ago
 - 7f45175 totalngrams: use chronicle map for larger maps by Marc Kupietz · 5 years ago
 - ac44043 totalngrams: factor out file creation by Marc Kupietz · 5 years ago
 - e23b6cb totalngrams: update parallelzip to 1.0.5 by Marc Kupietz · 5 years ago
 - b9ff157 totalngrams: fix calculations with unsigned integers > 2^31 by Marc Kupietz · 5 years ago
 - 34bbca5 totalngrams: use buffered output streams by Marc Kupietz · 5 years ago
 - 251bbac totalngrams: bump version to 1.1 by Marc Kupietz · 5 years ago
 - 5dc9fa5 totalngrams: improve logging by Marc Kupietz · 5 years ago
 - ba7c757 totalngrams: support gzipped output by Marc Kupietz · 5 years ago
 - 0b95783 totalngrams: sanitize worker node pool by Marc Kupietz · 5 years ago
 - 010903e totalngrams: reuse slidingWindowQueue by Marc Kupietz · 5 years ago
 - e31c4e1 totalngrams: improve progressbar looks by Marc Kupietz · 5 years ago
 - 9d64cde Use unsigned integers instead of longs for counts by Marc Kupietz · 5 years ago
 - 68dc73d totalngrams: parallelize and fix alphabetical secondary sorting by Marc Kupietz · 5 years ago
 - bd62aa0 totalngrams: log warnings and retry on errors by Marc Kupietz · 5 years ago
 - cc77d67 totalngrams: sanitize progressbar by Marc Kupietz · 5 years ago
 - 4cd3389 totalngrams: allow using a worker node pool by Marc Kupietz · 5 years ago
 - 97713f3 totalngrams: detect early if output file can be written by Marc Kupietz · 5 years ago
 - 8fae582 totalngram: Use our own ETA printer by Marc Kupietz · 5 years ago
 - 1c06250 Revert "totalngram: compute marginals in parallel" by Marc Kupietz · 5 years ago
 - 88d50e2 totalngram: compute marginals in parallel by Marc Kupietz · 5 years ago
 - 962a978 Revert "totalngram: use CircularFifoQueue as sliding window base" by Marc Kupietz · 5 years ago
 - 0184f9c totalngram: use CircularFifoQueue as sliding window base by Marc Kupietz · 5 years ago
 - 81e85e2 totalngram: Support > 1-grams by Marc Kupietz · 5 years ago
 - a37cc9f Efficient folded ngram frequency adder by Marc Kupietz · 5 years ago