Gitiles
Code Review
Sign In
korap.ids-mannheim.de
/
KorAP
/
korapxmltool
/
4c4470a2482e5977911d20a2399c0b4f69fb0891
4c4470a
Don't wait for text that are already emitted to krill.tar
by Marc Kupietz
· 4 days ago
803394d
Calculate defaults for heap and threads adaptively
by Marc Kupietz
· 4 days ago
bc7e301
Fix filename comment in korapxml2conllu
by Marc Kupietz
· 4 days ago
81bd722
Fix threadId deprecation warning
by Marc Kupietz
· 5 days ago
2a3223d
Enable parallel test execution
by Marc Kupietz
· 5 days ago
9a52367
Split tests
by Marc Kupietz
· 5 days ago
a2c3398
Add conllu2korapxml part
by Marc Kupietz
· 5 days ago
cec4101
Add korapxml2conllu shortcut
by Marc Kupietz
· 5 days ago
481643b
Add korapxml2krill shortcut
by Marc Kupietz
· 5 days ago
e5ff4ea
Check if input zips exist before doing any processing
by Marc Kupietz
· 5 days ago
afd2c2e
Update Readme.md to reflect now command line options
by Marc Kupietz
· 5 days ago
954d40d
Overhaul and standardize command line options
by Marc Kupietz
· 5 days ago
7b3a16e
Process data, structure and constituency with DOM
by Marc Kupietz
· 6 days ago
d552017
Switch to StAX XML parser
by Marc Kupietz
· 6 days ago
8749d1a
Fix ZIP scanning
by Marc Kupietz
· 6 days ago
9708a40
Fix ConcurrentModificationException
by Marc Kupietz
· 6 days ago
5b34af1
Switch back to standard gzip output
by Marc Kupietz
· 6 days ago
d335fe9
Use thread pool for gzipping krill output
by Marc Kupietz
· 6 days ago
af39ebe
Disable expensive XML security features
by Marc Kupietz
· 6 days ago
5b16f65
Use thread local document builders in krill output
by Marc Kupietz
· 6 days ago
d4c6bd5
Add --lz4 option for krill output
by Marc Kupietz
· 7 days ago
acb21e3
Switch to GZIP compression level 1
by Marc Kupietz
· 7 days ago
ec64582
Extract KrillJsonGenerator
by Marc Kupietz
· 7 days ago
ed5e6d1
Extract KorAP XML output to own module
by Marc Kupietz
· 7 days ago
7841ec1
Extract CoNNL-U formatter to own module
by Marc Kupietz
· 7 days ago
4cad9dc
Extract some formatters
by Marc Kupietz
· 7 days ago
1774c7d
Add defaults for tagger and parser models
by Marc Kupietz
· 7 days ago
05d3bbb
Make executable executable for everybody
by Marc Kupietz
· 7 days ago
bab5d7e
Default KORAPXMLTOOL_MODELS_PATH to ${SCRIPT_DIR}/../lib/models
by Marc Kupietz
· 7 days ago
afcc766
Improve examples in Readme and help
by Marc Kupietz
· 7 days ago
1e405bf
Fix tests with working -D for zip output
by Marc Kupietz
· 7 days ago
634e9b3
Fix malt examples
by Marc Kupietz
· 7 days ago
e19a016
Fix -D and overwrite with zip output
by Marc Kupietz
· 7 days ago
df14956
CI: Downgrade compile JDK to 21
by Marc Kupietz
· 7 days ago
b4b5768
Introduce env variable KORAPXMLTOOL_MODELS_PATH
by Marc Kupietz
· 7 days ago
570d0e0
KORAPXMLTOOL_XMX_MB -> KORAPXMLTOOL_XMX
by Marc Kupietz
· 7 days ago
a995ba8
Update Readme.md and help examples
by Marc Kupietz
· 7 days ago
1869f50
Put executable into build/bin
by Marc Kupietz
· 8 days ago
17126e2
CI: export also executable
by Marc Kupietz
· 8 days ago
d8549e4
Fix XML APIs relocation warnings (CoreNLP needs an old version)
by Marc Kupietz
· 8 days ago
f0f4d43
Krill output: introduce month-aware sorting
by Marc Kupietz
· 8 days ago
c076b59
Remove deprecated ZipFile constructors
by Marc Kupietz
· 8 days ago
02cd8bf
Improve shebang script
by Marc Kupietz
· 8 days ago
b00b73f
Bump version to v2.99
by Marc Kupietz
· 8 days ago
fbfcd04
Auto create shebang executable
by Marc Kupietz
· 8 days ago
92187a1
Add rough constituency parse also as conllu output comment
by Marc Kupietz
· 8 days ago
38d3ca3
Add corenlp and krill output support to Readme.md
by Marc Kupietz
· 8 days ago
f846bcb
Add constituency parsing test
by Marc Kupietz
· 8 days ago
6cb3f27
Fix constituency parsing
by Marc Kupietz
· 9 days ago
319f3d5
Only add parser foundry if it's different from tagger foundry
by Marc Kupietz
· 9 days ago
a4600e2
Respect target dir also with zip output
by Marc Kupietz
· 9 days ago
ddea0c0
Add support for CoreNLP SR parser and fast tagger
by Marc Kupietz
· 9 days ago
8fab8fc
Make foundry names more general
by Marc Kupietz
· 9 days ago
72b3fee
Test that annotations are processed ordered
by Marc Kupietz
· 9 days ago
0b0bde8
Speed up Krill output tests by only processing once
by Marc Kupietz
· 9 days ago
09d45b7
Fix mixed foundry detection
by Marc Kupietz
· 10 days ago
4d704a3
Switch to XML parsing
by Marc Kupietz
· 11 days ago
9d2c64e
Neutralize identifiers for constituency and sentence annotations
by Marc Kupietz
· 11 days ago
bf622e9
Add constituencies and non base sentences to krill output
by Marc Kupietz
· 11 days ago
f1d1e7f
Fix escaping of $ and #
by Marc Kupietz
· 11 days ago
33ca8f1
Update .gitignore
by Marc Kupietz
· 11 days ago
16b4ccb
Add --non-word-token option and change default
by Marc Kupietz
· 11 days ago
ccce9d6
Fix progress bar to show output filename instead of generic label
by Marc Kupietz
· 12 days ago
82b4b64
Re-enable conditional incremental output for multi-text corpora
by Marc Kupietz
· 13 days ago
f5e0d2d
Clean up debug logging after confirming dependency fix
by Marc Kupietz
· 13 days ago
fb0862b
Fix scheduler termination and temporarily disable incremental output
by Marc Kupietz
· 13 days ago
0df533a
Fix missing base/s
by Marc Kupietz
· 13 days ago
ac82228
Escape hashes
by Marc Kupietz
· 13 days ago
782b3fe
Free memory after tar push
by Marc Kupietz
· 13 days ago
e1594dc
Log file contents more precisely
by Marc Kupietz
· 13 days ago
89adf73
Redirect logs earlier
by Marc Kupietz
· 13 days ago
ea048ff
Make incremental krill the only operation mode
by Marc Kupietz
· 13 days ago
899f233
Krill: log to file
by Marc Kupietz
· 13 days ago
1dd505f
Fix NPE
by Marc Kupietz
· 13 days ago
32f15f1
Make sure alle entries are processed in sorted order
by Marc Kupietz
· 13 days ago
5c14247
Switch to priority based scheduling
by Marc Kupietz
· 13 days ago
e706b8c
Improve logging
by Marc Kupietz
· 14 days ago
f7e06c2
Do not close tar stream before we are finished
by Marc Kupietz
· 14 days ago
7799272
Process ZIPs in sorted order
by Marc Kupietz
· 14 days ago
b447a8b
Read ZIP contents in parallel
by Marc Kupietz
· 14 days ago
7397838
Add ZIP scanning progressbar
by Marc Kupietz
· 14 days ago
e9af1c5
Use inventory instead of watermark
by Marc Kupietz
· 14 days ago
327dc6b
Krill output: Introduce watermark based output approach
by Marc Kupietz
· 2 weeks ago
7beb4af
Introduce incremental-krill
by Marc Kupietz
· 2 weeks ago
9131003
Krill output: Improvements for many texts
by Marc Kupietz
· 2 weeks ago
32d6d6f
Escape $ character in krill output
by Marc Kupietz
· 2 weeks ago
a723cc0
Fix some metadata
by Marc Kupietz
· 2 weeks ago
4fd32ef
Krill output: escape URLs in target refs
by Marc Kupietz
· 2 weeks ago
6d89917
Default destination folder always to .
by Marc Kupietz
· 2 weeks ago
f30fd4f
Fix more metadata types
by Marc Kupietz
· 2 weeks ago
86b055a
Add krill output format
by Marc Kupietz
· 2 weeks ago
c6b51e7
Add D/outputDir option
by Marc Kupietz
· 2 weeks ago
95b0fa6
Bump version to v2.20
by Marc Kupietz
· 3 weeks ago
0aebe74
Switch all to ROOT locale
by Marc Kupietz
· 3 weeks ago
8e281d9
Fix sentence spans in dependency zip output
by Marc Kupietz
· 3 weeks ago
44f2f89
Print target zip name with progressbar
by Marc Kupietz
· 3 weeks ago
5a410be
When zip output log to .log files
by Marc Kupietz
· 3 weeks ago
cae0970
Fix foundry folder with -f zip
by Marc Kupietz
· 3 weeks ago
4f7101d
Update CI and Readme
by Marc Kupietz
· 3 weeks ago
4c92944
Fix ConcurrentModificationException with CoNLL-U output
by Marc Kupietz
· 3 weeks ago
Next »