Gitiles
Code Review
Sign In
korap.ids-mannheim.de
/
KorAP
/
KorAP-XML-Krill
/
57510c1b269b5127618bda0141079b8d560dbbfa
57510c1
Added DGD support
by Akron
· 6 years ago
9b04f60
Update version
by Akron
· 6 years ago
f021ad6
Improve error handling
by Akron
· 6 years ago
eaffe93
Fail hard on tokenization problems now
by Akron
· 6 years ago
94262ce
Renamed Institute for the German Language to Leibniz Institute for the German Language
by Akron
· 6 years ago
955b75b
Remove extract_text and extract_doc in favor of extract_sigle
by Akron
· 6 years ago
31a08cb
Add extract_sigle method to archive
by Akron
· 6 years ago
63d03ee
Ignore temporary-extraction on directory archiving
by Akron
· 6 years ago
6bf3cc9
Added links for wikipedia resources
by Akron
· 6 years ago
4e1712c
Add english wikipedia example
by Akron
· 6 years ago
263274c
Support koral versioning
by Akron
· 6 years ago
c526e75
Include field serialization in versioned json output
by Akron
· 6 years ago
5eb3aa0
Set field types and serialize as koral:fields
by Akron
· 6 years ago
ea9c364
Ignore DGD parser tests
by Akron
· 6 years ago
ed9baf0
Support non-word-tokens (fixes #5)
by Akron
· 6 years ago
6eff23b
Updated minimum perl
by Akron
· 6 years ago
ea1aed5
Activate HNC by default
by Akron
· 6 years ago
5fdc7e1
Fixed last change info in --version
by Akron
· 6 years ago
dd1c0f1
Updated version
by Akron
· 6 years ago
c893ac3
Added tests and minor metadata parsing adjustments for HNC
by Akron
· 6 years ago
f73ffb6
Fixed readme by mentioning preference regarding configuration parameters
by Akron
· 7 years ago
28dc17f
Fix certainty values in TreeTagger output
by Akron
· 7 years ago
0426176
Remove certainty value on lemmata in Treetagger
by Akron
· 7 years ago
6727b21
Fixed lwc tests
by Akron
· 7 years ago
4c67919
Support for LWC dependency annotations
by Akron
· 7 years ago
56dfb31
Added test regarding offset bug in KorAP
by Akron
· 7 years ago
d19e275
Recheck dependency tests
by Akron
· 7 years ago
3c56f50
Support file extensions in base tokenization file
by Akron
· 7 years ago
28c4e54
Fix missing command issue
by Akron
· 7 years ago
d5643ad
Warn on missing output parameter in extract
by Akron
· 7 years ago
9b67b93
Fix attribute generation for DeReKo
by Akron
· 7 years ago
9a062ce
Fix tarring to include only filenames
by Akron
· 7 years ago
0a6cce1
Remove non-core fc
by Akron
· 7 years ago
3abc03e
Fixed exit codes in script
by Akron
· 7 years ago
0f9b93a
Fixed minor issue in I5 meta parsing
by Akron
· 7 years ago
403934d
Fixed CMC for empty features
by Akron
· 8 years ago
36d4627
Fixed feature treatment in CMC morpho
by Akron
· 8 years ago
aaea246
One more missing permission problem in the test suite fixed
by Akron
· 8 years ago
5fd2d8e
Fixed more permission and dependency issues
by Akron
· 8 years ago
ce125b6
Improved documentation on new features
by Akron
· 8 years ago
d5bb434
Fixed permissions in test suite
by Akron
· 8 years ago
e599379
Added treatment of CMC data
by Akron
· 8 years ago
918ce42
Fixed primary data handling for data with white space at the beginning and at the end
by Akron
· 8 years ago
a308c71
Start testing with DCK
by Akron
· 8 years ago
da3097e
Finished tar flag
by Akron
· 8 years ago
486f9ab
Improved tar support
by Akron
· 8 years ago
081639e
Added preliminary tar support
by Akron
· 8 years ago
9ec8887
Introduced sequential extraction flag to circumvent troubles with parallel extraction
by Akron
· 8 years ago
3a486f8
Another unzip flag update (-uo)
by Akron
· 8 years ago
86db52e
Improved unzip overwriting mechanism
by Akron
· 8 years ago
0278ca2
Test zip overwriting
by Akron
· 8 years ago
bd3adda
Fixing behaviour for existing output directories
by Akron
· 8 years ago
442c4e9
Updated readme
by Akron
· 8 years ago
63f20d4
Support serial conversion and input-base
by Akron
· 8 years ago
8150010
Introduced temporary extraction
by Akron
· 8 years ago
636aa11
Added configuration to script
by Akron
· 8 years ago
821db3d
Add wildcard support for inputs
by Akron
· 8 years ago
55778f0
Added preliminary support for diacritic insensitivity support
by Akron
· 8 years ago
5809fea
Fixed casefolding for case insensitivity
by Akron
· 8 years ago
b2f1ab8
Improve test suite for MarMoT
by Akron
· 8 years ago
c11f798
Add auto-core-calculation
by Akron
· 8 years ago
3bd942f
Added marmot-support
by Akron
· 8 years ago
f624084
Added test for quotes in archives for archiving
by Akron
· 8 years ago
60a8caa
Treat prefixes correct for text sigles
by Akron
· 8 years ago
08d5445
Changed meta name for pages
by Akron
· 8 years ago
d35d2d3
Fixed pagebreak test
by Akron
· 8 years ago
3c11964
Added comment regarding missing pagebreaks in the data
by Akron
· 8 years ago
636bd9c
Fixed pagebreak treatment in script
by Akron
· 8 years ago
41ac10b
Added pagebreak annotations (with '~'-prefix)
by Akron
· 8 years ago
0465de5
Improved handling of weird metadata stuff
by Akron
· 8 years ago
3887301
More relaxed handling of document siglen
by Akron
· 8 years ago
a7d0e9f
Improved DRuKoLa meta data handling
by Akron
· 8 years ago
ce41be8
Updated announced dependency to Mojolicious
by Akron
· 8 years ago
578af4b
Support translator meta data type
by Akron
· 8 years ago
4fa37c3
Added DRuKoLa support to korapxml2krill script
by Akron
· 8 years ago
c388150
Added more drukola tests
by Akron
· 8 years ago
3139917
Fixed DRuKoLa annotations
by Akron
· 8 years ago
ace612e
Added DRuKoLa annotations
by Akron
· 8 years ago
7e2eb88
Fixed analytic+monogr behaviour for metadata
by Akron
· 8 years ago
3ec0a1c
Updated to Mojolicious 7.20
by Akron
· 8 years ago
b7f130c
Added DRuKoLa meta data skeleton
by Akron
· 8 years ago
3741f8b
Added base-sentences and base-paragraphs options
by Akron
· 8 years ago
53167fd
Added new test data without base annotations
by Akron
· 8 years ago
89df4fa
Fixed bug in tokenizer to recognize non-word-tokenizations
by Akron
· 8 years ago
6f9fef5
Ignore recursion in CoreNLP
by Akron
· 8 years ago
f1a1de9
Improved readme
by Akron
· 8 years ago
5c71a85
Fixed readme
by Akron
· 8 years ago
13d5662
Improved 'already processed' message
by Akron
· 8 years ago
2812ba2
Fixed archive handling and support multiple jobs for extraction
by Akron
· 8 years ago
2fd402b
Added support for wildcards in document siglen
by Akron
· 8 years ago
a76d835
Improved documentation (thx @margaretha)
by Akron
· 8 years ago
b4bbec7
Fixed naming scheme for folder archives
by Akron
· 8 years ago
2080758
Added extraction method for documents in archives
by Akron
· 8 years ago
7606afa
Improved documentation to be more precise regarding non-argument calls (thx @margaretha)
by Akron
· 8 years ago
a93d51b
Improved Readme
by Akron
· 8 years ago
28ae63a
Merge "Fixed root prefix in meta parser"
by Akron
· 8 years ago
3315021
Merge changes Ifdcfb816,I51994658
by Akron
· 8 years ago
345faf3
Merge changes I00796d75,I1c20452e
by Nils Diewald
· 8 years ago
af670ae
Fixed root prefix in meta parser
by Akron
· 8 years ago
ad4cb01
Fixed conflict
by Akron
· 8 years ago
Next »