korap.ids-mannheim.de / KorAP / KorAP-XML-Krill / cbf098a3f61c2261327d25da5f15189038ca2fe1 / t
cbf098a
Support scrambling of header files in scramble_korapxml tool
by Akron
· 4 years, 7 months ago
59a0e4b
Support rule files for scramble_korapxml tool
by Akron
· 4 years, 7 months ago
c403644
Improve RWK morphology parser to support multiple morphological key:value pairs
by Akron
· 4 years, 7 months ago
85eb5aa
Improve RWK structure parser for *-milestone elements
by Akron
· 4 years, 7 months ago
28299f4
Introduce special RWK structure parser
by Akron
· 4 years, 7 months ago
8ff5879
Introduce special RWK morphology parser
by Akron
· 4 years, 7 months ago
dec4312
Fixed gap behind last token and <base/s:t> length
by Akron
· 4 years, 9 months ago
b62d92a
Fixed span position offset bug and fixed milestones behind last token bug
by Akron
· 4 years, 9 months ago
57799fc
Fix editionStmt metadata parsing
by Akron
· 4 years, 10 months ago
f1849aa
Support non-verbal annotations
by Akron
· 5 years ago
c29b8e1
Added support for DGD pseudo-sentences based on anchor milestones
by Akron
· 5 years ago
2029455
Added external link for AGD data in I5 meta
by Akron
· 5 years ago
0d68a4b
Added 'distributor' field to I5 metadata
by Akron
· 5 years ago
7d5e638
Added support for Talismane
by Akron
· 5 years ago
57510c1
Added DGD support
by Akron
· 6 years ago
f021ad6
Improve error handling
by Akron
· 6 years ago
eaffe93
Fail hard on tokenization problems now
by Akron
· 6 years ago
955b75b
Remove extract_text and extract_doc in favor of extract_sigle
by Akron
· 6 years ago
31a08cb
Add extract_sigle method to archive
by Akron
· 6 years ago
63d03ee
Ignore temporary-extraction on directory archiving
by Akron
· 6 years ago
6bf3cc9
Added links for Wikipedia resources
by Akron
· 6 years ago
4e1712c
Add English Wikipedia example
by Akron
· 6 years ago
263274c
Support koral versioning
by Akron
· 6 years ago
c526e75
Include field serialization in versioned json output
by Akron
· 6 years ago
5eb3aa0
Set field types and serialize as koral:fields
by Akron
· 6 years ago
ed9baf0
Support non-word-tokens (fixes #5)
by Akron
· 6 years ago
c893ac3
Added tests and minor metadata parsing adjustments for HNC
by Akron
· 6 years ago
28dc17f
Fix certainty values in TreeTagger output
by Akron
· 7 years ago
0426176
Remove certainty value on lemmata in TreeTagger
by Akron
· 7 years ago
6727b21
Fixed LWC tests
by Akron
· 7 years ago
4c67919
Support for LWC dependency annotations
by Akron
· 7 years ago
56dfb31
Added test regarding offset bug in KorAP
by Akron
· 7 years ago
d19e275
Recheck dependency tests
by Akron
· 7 years ago
3c56f50
Support file extensions in base tokenization file
by Akron
· 7 years ago
9a062ce
Fix tarring to include only filenames
by Akron
· 7 years ago
aaea246
One more missing permission problem in the test suite fixed
by Akron
· 7 years ago
5fd2d8e
Fixed more permission and dependency issues
by Akron
· 7 years ago
d5bb434
Fixed permissions in test suite
by Akron
· 7 years ago
918ce42
Fixed primary data handling for data with white space at the beginning and at the end
by Akron
· 7 years ago
da3097e
Finished tar flag
by Akron
· 8 years ago
486f9ab
Improved tar support
by Akron
· 8 years ago
9ec8887
Introduced sequential extraction flag to circumvent troubles with parallel extraction
by Akron
· 8 years ago
bd3adda
Fixing behaviour for existing output directories
by Akron
· 8 years ago
63f20d4
Support serial conversion and input-base
by Akron
· 8 years ago
636aa11
Added configuration to script
by Akron
· 8 years ago
821db3d
Add wildcard support for inputs
by Akron
· 8 years ago
55778f0
Added preliminary support for diacritic insensitivity
by Akron
· 8 years ago
b2f1ab8
Improve test suite for MarMoT
by Akron
· 8 years ago
3bd942f
Added MarMoT support
by Akron
· 8 years ago
f624084
Added test for quotes in archives for archiving
by Akron
· 8 years ago
60a8caa
Treat prefixes correctly for text sigles
by Akron
· 8 years ago
08d5445
Changed meta name for pages
by Akron
· 8 years ago
d35d2d3
Fixed pagebreak test
by Akron
· 8 years ago
3c11964
Added comment regarding missing pagebreaks in the data
by Akron
· 8 years ago
636bd9c
Fixed pagebreak treatment in script
by Akron
· 8 years ago
41ac10b
Added pagebreak annotations (with '~'-prefix)
by Akron
· 8 years ago
0465de5
Improved handling of weird metadata stuff
by Akron
· 8 years ago
3887301
More relaxed handling of document siglen
by Akron
· 8 years ago
a7d0e9f
Improved DRuKoLa meta data handling
by Akron
· 8 years ago
578af4b
Support translator meta data type
by Akron
· 8 years ago
c388150
Added more DRuKoLa tests
by Akron
· 8 years ago
3139917
Fixed DRuKoLa annotations
by Akron
· 8 years ago
ace612e
Added DRuKoLa annotations
by Akron
· 8 years ago
7e2eb88
Fixed analytic+monogr behaviour for metadata
by Akron
· 8 years ago
3ec0a1c
Updated to Mojolicious 7.20
by Akron
· 8 years ago
3741f8b
Added base-sentences and base-paragraphs options
by Akron
· 8 years ago
53167fd
Added new test data without base annotations
by Akron
· 8 years ago
89df4fa
Fixed bug in tokenizer to recognize non-word-tokenizations
by Akron
· 8 years ago
6f9fef5
Ignore recursion in CoreNLP
by Akron
· 8 years ago
13d5662
Improved 'already processed' message
by Akron
· 8 years ago
2812ba2
Fixed archive handling and support multiple jobs for extraction
by Akron
· 8 years ago
2fd402b
Added support for wildcards in document siglen
by Akron
· 8 years ago
a76d835
Improved documentation (thx @margaretha)
by Akron
· 8 years ago
2080758
Added extraction method for documents in archives
by Akron
· 8 years ago
7606afa
Improved documentation to be more precise regarding non-argument calls (thx @margaretha)
by Akron
· 8 years ago
b3e9ccd
Fixed Windows support
by Nils Diewald
· 8 years ago
3ec4897
Added archive test for directories and parallel processing
by Akron
· 8 years ago
7d4cdd8
Added archive test script
by Akron
· 8 years ago
651cb8d
Fix extraction of multiple archives
by Akron
· 8 years ago
03b24db
Added test for sigles support in extract
by Akron
· 8 years ago
f98b669
Test meta switch in script
by Akron
· 8 years ago
e2b902d
Fixed output of version and help screens
by Akron
· 8 years ago
5f51d42
Fixed annotation bug in script
by Akron
· 8 years ago
92ad95b
Added test for script execution
by Akron
· 8 years ago
afb81ad
Fixed Mojolicious 7 support
by Akron
· 8 years ago
af0ae3f
Check sentence mapping in base/sentences
by Akron
· 8 years ago
fbf6638
Added direct I5 support
by Akron
· 8 years ago
e1dbc38
Added test for script calls
by Akron
· 8 years ago
cdf0e00
Added batch processing class for documents
by Akron
· 8 years ago
405f0c5
Test file processing for batch processing
by Akron
· 8 years ago
a86d94a
Fixed MDParser data and test suite
by Akron
· 8 years ago
05ba547
Preliminary support for MDParser annotations
by Akron
· 8 years ago
a5920b1
Improved test suite for caching and REI
by Akron
· 8 years ago
b0c88db
Added caching test
by Akron
· 8 years ago
0c3e375
Test multiple archives
by Akron
· 8 years ago
f3f0c94
Added malt dependency resource
by Akron
· 8 years ago
08385f6
First step to multi-archive support
by Akron
· 9 years ago
1924bbe
Added REI to test suite
by Akron
· 8 years ago
e8adfcc
Optimize performance of text listing
by Akron
· 9 years ago
1cd5b87
Use slashes as separators in siglen
by Akron
· 9 years ago