korap.ids-mannheim.de / KorAP / KorAP-XML-Krill / b05b842e8537543e2211410cc7974ef6946b0665 / lib / KorAP / XML
b05b842
Improve logging
by Akron
· 5 years ago
2029455
Added external link for AGD data in I5 meta
by Akron
· 5 years ago
0d68a4b
Added 'distributor' field to I5 metadata
by Akron
· 5 years ago
7d5e638
Added support for Talismane
by Akron
· 5 years ago
57510c1
Added DGD support
by Akron
· 6 years ago
9b04f60
Update version
by Akron
· 6 years ago
f021ad6
Improve error handling
by Akron
· 6 years ago
eaffe93
Fail hard on tokenization problems now
by Akron
· 6 years ago
955b75b
Remove extract_text and extract_doc in favor of extract_sigle
by Akron
· 6 years ago
31a08cb
Add extract_sigle method to archive
by Akron
· 6 years ago
6bf3cc9
Added links for wikipedia resources
by Akron
· 6 years ago
263274c
Support koral versioning
by Akron
· 6 years ago
c526e75
Include field serialization in versioned json output
by Akron
· 6 years ago
5eb3aa0
Set field types and serialize as koral:fields
by Akron
· 6 years ago
ed9baf0
Support non-word-tokens (fixes #5)
by Akron
· 6 years ago
6eff23b
Updated minimum perl
by Akron
· 6 years ago
dd1c0f1
Updated version
by Akron
· 6 years ago
c893ac3
Added tests and minor metadata parsing adjustments for HNC
by Akron
· 6 years ago
28dc17f
Fix certainty values in TreeTagger output
by Akron
· 7 years ago
0426176
Remove certainty value on lemmata in Treetagger
by Akron
· 7 years ago
4c67919
Support for LWC dependency annotations
by Akron
· 7 years ago
3c56f50
Support file extensions in base tokenization file
by Akron
· 7 years ago
9b67b93
Fix attribute generation for DeReKo
by Akron
· 7 years ago
9a062ce
Fix tarring to include only filenames
by Akron
· 7 years ago
0a6cce1
Remove non-core fc
by Akron
· 7 years ago
3abc03e
Fixed exit codes in script
by Akron
· 7 years ago
0f9b93a
Fixed minor issue in I5 meta parsing
by Akron
· 7 years ago
403934d
Fixed CMC for empty features
by Akron
· 7 years ago
36d4627
Fixed feature treatment in CMC morpho
by Akron
· 7 years ago
ce125b6
Improved documentation on new features
by Akron
· 7 years ago
e599379
Added treatment of CMC data
by Akron
· 7 years ago
918ce42
Fixed primary data handling for data with white space at the beginning and at the end
by Akron
· 7 years ago
a308c71
Start testing with DCK
by Akron
· 7 years ago
da3097e
Finished tar flag
by Akron
· 8 years ago
9ec8887
Introduced sequential extraction flag to circumvent troubles with parallel extraction
by Akron
· 8 years ago
3a486f8
Another unzip flag update (-uo)
by Akron
· 8 years ago
86db52e
Improved unzip overwriting mechanism
by Akron
· 8 years ago
0278ca2
Test zip overwriting
by Akron
· 8 years ago
8150010
Introduced temporary extraction
by Akron
· 8 years ago
636aa11
Added configuration to script
by Akron
· 8 years ago
821db3d
Add wildcard support for inputs
by Akron
· 8 years ago
55778f0
Added preliminary support for diacritic insensitivity support
by Akron
· 8 years ago
5809fea
Fixed casefolding for case insensitivity
by Akron
· 8 years ago
c11f798
Add auto-core-calculation
by Akron
· 8 years ago
3bd942f
Added marmot-support
by Akron
· 8 years ago
60a8caa
Treat prefixes correct for text sigles
by Akron
· 8 years ago
08d5445
Changed meta name for pages
by Akron
· 8 years ago
41ac10b
Added pagebreak annotations (with '~'-prefix)
by Akron
· 8 years ago
0465de5
Improved handling of weird metadata stuff
by Akron
· 8 years ago
3887301
More relaxed handling of document siglen
by Akron
· 8 years ago
a7d0e9f
Improved DRuKoLa meta data handling
by Akron
· 8 years ago
578af4b
Support translator meta data type
by Akron
· 8 years ago
4fa37c3
Added DRuKoLa support to korapxml2krill script
by Akron
· 8 years ago
c388150
Added more drukola tests
by Akron
· 8 years ago
3139917
Fixed DRuKoLa annotations
by Akron
· 8 years ago
ace612e
Added DRuKoLa annotations
by Akron
· 8 years ago
7e2eb88
Fixed analytic+monogr behaviour for metadata
by Akron
· 8 years ago
3ec0a1c
Updated to Mojolicious 7.20
by Akron
· 8 years ago
b7f130c
Added DRuKoLa meta data skeleton
by Akron
· 8 years ago
3741f8b
Added base-sentences and base-paragraphs options
by Akron
· 8 years ago
89df4fa
Fixed bug in tokenizer to recognize non-word-tokenizations
by Akron
· 8 years ago
6f9fef5
Ignore recursion in CoreNLP
by Akron
· 8 years ago
13d5662
Improved 'already processed' message
by Akron
· 8 years ago
2812ba2
Fixed archive handling and support multiple jobs for extraction
by Akron
· 8 years ago
2fd402b
Added support for wildcards in document siglen
by Akron
· 8 years ago
2080758
Added extraction method for documents in archives
by Akron
· 8 years ago
af670ae
Fixed root prefix in meta parser
by Akron
· 8 years ago
ad4cb01
Fixed conflict
by Akron
· 8 years ago
087d5db
Fixed rootdir in meta parser
by Akron
· 8 years ago
0e48977
Fixed windows support
by Nils Diewald
· 8 years ago
b3e9ccd
Fixed windows support
by Nils Diewald
· 8 years ago
4c0cf31
Fixed treatment of temporary files
by Akron
· 8 years ago
bdb6465
New version number
by Akron
· 8 years ago
7d4cdd8
Added archive test script
by Akron
· 8 years ago
5f51d42
Fixed annotation bug in script
by Akron
· 8 years ago
afb81ad
Fixed Mojolicious 7 support
by Akron
· 8 years ago
af0ae3f
Check sentence mapping in base/sentences
by Akron
· 8 years ago
fbf6638
Added support for direct I5 support
by Akron
· 8 years ago
e1dbc38
Added test for script calls
by Akron
· 8 years ago
cdf0e00
Added batch processing class for documents
by Akron
· 8 years ago
405f0c5
Test file processing for batch processing
by Akron
· 8 years ago
8b99052
Start splitting script file for better testing
by Akron
· 8 years ago
05ba547
Preliminary support for MDParser annotations
by Akron
· 8 years ago
a5920b1
Improved test suite for caching and rei
by Akron
· 8 years ago
0c3e375
Test multiple archives
by Akron
· 8 years ago
f3f0c94
Added malt dependency resource
by Akron
· 8 years ago
2cfe809
Added prefix negation to multiple archive support
by Akron
· 8 years ago
08385f6
First step to multi-archive support
by Akron
· 9 years ago
1924bbe
Added REI to test suite
by Akron
· 8 years ago
e8adfcc
Optimize performance of text listing
by Akron
· 9 years ago
1cd5b87
Use slashes as separators in siglen
by Akron
· 9 years ago
11c8030
Add metadata caching
by Akron
· 9 years ago
6396c30
Cleanup metadata files
by Akron
· 9 years ago
35db6e3
Simplified and modularized metadata processing
by Akron
· 9 years ago
151676d
Rename path of index and annotation
by Akron
· 9 years ago
8675b89
Fixing the sort order (2)
by Akron
· 9 years ago
7bc9d90
Fixed relying on p_end in relation tokens
by Akron
· 9 years ago
75ba57d
TUIs are now optional if not set
by Akron
· 9 years ago
fc10ea8
Partial fix of dependencies for sorting
by Akron
· 9 years ago
88000f9
Partial fix of dependencies without nodes
by Akron
· 9 years ago