Gitiles
Code Review
Sign In
korap.ids-mannheim.de
/
IDS-Mannheim
/
totalngrams
/
30c9a81abe1df6a1fe8bd94e4e83a4a3da629168
30c9a81
totalngrams: fix missing imports
by Marc Kupietz
· 5 years ago
d49801a
totalngrams: sort input files by size
by Marc Kupietz
· 5 years ago
3e8a38d
totalngrams: add reading .conllu and conllu.gz files directly
by Marc Kupietz
· 5 years ago
0337db6
totalngrams: add -l option to count <token>\t<lemma>\t<pos>
by Marc Kupietz
· 5 years ago
6638bb2
totalngrams: start adding unit tests
by Marc Kupietz
· 5 years ago
9c63b57
totalngrams: Calculate only one fold at a time
by Marc Kupietz
· 5 years ago
a363746
totalngrams: use tab as separator also between n-gram tokens
by Marc Kupietz
· 5 years ago
fcee606
Revert "totalngrams: use chronicle map for larger maps"
by Marc Kupietz
· 5 years ago
12c0411
Revert "Try Ehcache"
by Marc Kupietz
· 5 years ago
d245822
Try Ehcache
by Marc Kupietz
· 5 years ago
7f45175
totalngrams: use chronicle map for larger maps
by Marc Kupietz
· 5 years ago
ac44043
totalngrams: factor out file creation
by Marc Kupietz
· 5 years ago
e23b6cb
totalngrams: update parallelzip to 1.0.5
by Marc Kupietz
· 5 years ago
b9ff157
totalngrams: fix calculations with unsigned integers > 2^31
by Marc Kupietz
· 5 years ago
34bbca5
totalngrams: use buffered output streams
by Marc Kupietz
· 5 years ago
251bbac
totalngrams: bump version to 1.1
by Marc Kupietz
· 5 years ago
5dc9fa5
totalngrams: improve logging
by Marc Kupietz
· 5 years ago
ba7c757
totalngrams: support gzipped output
by Marc Kupietz
· 5 years ago
0b95783
totalngrams: sanitize worker node pool
by Marc Kupietz
· 5 years ago
010903e
totalngrams: reuse slidingWindowQueue
by Marc Kupietz
· 5 years ago
e31c4e1
totalngrams: improve progressbar looks
by Marc Kupietz
· 5 years ago
9d64cde
Use unsigned integers instead of longs for counts
by Marc Kupietz
· 5 years ago
68dc73d
totalngrams: parallelize and fix alphabetical secondary sorting
by Marc Kupietz
· 5 years ago
bd62aa0
totalngrams: log warnings and retry on errors
by Marc Kupietz
· 5 years ago
cc77d67
totalngrams: sanitize progressbar
by Marc Kupietz
· 5 years ago
4cd3389
totalngrams: allow using a worker node pool
by Marc Kupietz
· 5 years ago
97713f3
totalngrams: detect early if output file can be written
by Marc Kupietz
· 5 years ago
8fae582
totalngram: Use our own ETA printer
by Marc Kupietz
· 5 years ago
1c06250
Revert "totalngram: compute marginals in parallel"
by Marc Kupietz
· 5 years ago
88d50e2
totalngram: compute marginals in parallel
by Marc Kupietz
· 5 years ago
962a978
Revert "totalngram: use CircularFifoQueue as sliding window base"
by Marc Kupietz
· 5 years ago
0184f9c
totalngram: use CircularFifoQueue as sliding window base
by Marc Kupietz
· 5 years ago
81e85e2
totalngram: Support > 1-grams
by Marc Kupietz
· 5 years ago
a37cc9f
Efficient folded ngram frequency adder
by Marc Kupietz
· 5 years ago