Gitiles
Code Review
Sign In
korap.ids-mannheim.de
/
KorAP
/
KorAP-XML-Krill
/
0a187b9da6c5cbd1775d983bd622a1ef9c57ac34
/
Changes
dec4312
Fixed gap behind last token and <base/s:t> length
by Akron
· 4 years, 9 months ago
b62d92a
Fixed span position offset bug and fixed milestones behind last token bug
by Akron
· 4 years, 9 months ago
a0d5af3
Fixed legacy XIP parser
by Akron
· 4 years, 9 months ago
6e886f7
Added benchmark mechanism
by Akron
· 4 years, 9 months ago
d4c5c10
Added documentation for supported I5 metadata fields
by Akron
· 4 years, 10 months ago
8f69d63
Added brief explanation of the format
by Akron
· 4 years, 10 months ago
c29b8e1
Added support for DGD pseudo-sentences based on anchor milestones
by Akron
· 5 years ago
b05b842
Improve logging
by Akron
· 5 years ago
2029455
Added external link for AGD data in I5 meta
by Akron
· 5 years ago
0d68a4b
Added 'distributor' field to I5 metadata
by Akron
· 5 years ago
7d5e638
Added support for Talismane
by Akron
· 5 years ago
57510c1
Added DGD support
by Akron
· 6 years ago
9b04f60
Update version
by Akron
· 6 years ago
eaffe93
Fail hard on tokenization problems now
by Akron
· 6 years ago
955b75b
Remove extract_text and extract_doc in favor of extract_sigle
by Akron
· 6 years ago
63d03ee
Ignore temporary-extraction on directory archiving
by Akron
· 6 years ago
6bf3cc9
Added links for wikipedia resources
by Akron
· 6 years ago
4e1712c
Add english wikipedia example
by Akron
· 6 years ago
263274c
Support koral versioning
by Akron
· 6 years ago
ed9baf0
Support non-word-tokens (fixes #5)
by Akron
· 6 years ago
6eff23b
Updated minimum perl
by Akron
· 6 years ago
dd1c0f1
Updated version
by Akron
· 6 years ago
28dc17f
Fix certainty values in TreeTagger output
by Akron
· 7 years ago
4c67919
Support for LWC dependency annotations
by Akron
· 7 years ago
3c56f50
Support file extensions in base tokenization file
by Akron
· 7 years ago
9a062ce
Fix tarring to include only filenames
by Akron
· 7 years ago
0a6cce1
Remove non-core fc
by Akron
· 7 years ago
3abc03e
Fixed exit codes in script
by Akron
· 7 years ago
ce125b6
Improved documentation on new features
by Akron
· 7 years ago
d5bb434
Fixed permissions in test suite
by Akron
· 7 years ago
da3097e
Finished tar flag
by Akron
· 8 years ago
9ec8887
Introduced sequential extraction flag to circumvent troubles with parallel extraction
by Akron
· 8 years ago
86db52e
Improved unzip overwriting mechanism
by Akron
· 8 years ago
63f20d4
Support serial conversion and input-base
by Akron
· 8 years ago
8150010
Introduced temporary extraction
by Akron
· 8 years ago
636aa11
Added configuration to script
by Akron
· 8 years ago
55778f0
Added preliminary support for diacritic insensitivity support
by Akron
· 8 years ago
5809fea
Fixed casefolding for case insensitivity
by Akron
· 8 years ago
3bd942f
Added marmot-support
by Akron
· 8 years ago
60a8caa
Treat prefixes correct for text sigles
by Akron
· 8 years ago
08d5445
Changed meta name for pages
by Akron
· 8 years ago
41ac10b
Added pagebreak annotations (with '~'-prefix)
by Akron
· 8 years ago
3887301
More relaxed handling of document siglen
by Akron
· 8 years ago
4fa37c3
Added DRuKoLa support to korapxml2krill script
by Akron
· 8 years ago
7e2eb88
Fixed analytic+monogr behaviour for metadata
by Akron
· 8 years ago
3ec0a1c
Updated to Mojolicious 7.20
by Akron
· 8 years ago
3741f8b
Added base-sentences and base-paragraphs options
by Akron
· 8 years ago
6f9fef5
Ignore recursion in CoreNLP
by Akron
· 8 years ago
13d5662
Improved 'already processed' message
by Akron
· 8 years ago
2812ba2
Fixed archive handling and support multiple jobs for extraction
by Akron
· 8 years ago
2fd402b
Added support for wildcards in document siglen
by Akron
· 8 years ago
b4bbec7
Fixed naming scheme for folder archives
by Akron
· 8 years ago
2080758
Added extraction method for documents in archives
by Akron
· 8 years ago
b3e9ccd
Fixed windows support
by Nils Diewald
· 8 years ago
4c0cf31
Fixed treatment of temporary files
by Akron
· 8 years ago
bdb6465
New version number
by Akron
· 8 years ago
7d4cdd8
Added archive test script
by Akron
· 8 years ago
651cb8d
Fix extraction of multiple archives
by Akron
· 8 years ago
03b24db
Added test for sigles support in extract
by Akron
· 8 years ago
e2b902d
Fixed output of version and help screens
by Akron
· 8 years ago
5f51d42
Fixed annotation bug in script
by Akron
· 8 years ago
92ad95b
Added test for script execution
by Akron
· 8 years ago
afb81ad
Fixed Mojolicious 7 support
by Akron
· 8 years ago
fbf6638
Added support for direct I5 support
by Akron
· 8 years ago
cdf0e00
Added batch processing class for documents
by Akron
· 8 years ago
a86d94a
Fixed MDParser data and test suite
by Akron
· 8 years ago
a5920b1
Improved test suite for caching and rei
by Akron
· 8 years ago
f3f0c94
Added malt dependency resource
by Akron
· 8 years ago
2cfe809
Added pefix negation to multiple archive support
by Akron
· 8 years ago
1924bbe
Added REI to test suite
by Akron
· 8 years ago
e8adfcc
Optimize performance of text listing
by Akron
· 9 years ago
1cd5b87
Use slashes as separators in siglen
by Akron
· 9 years ago
11c8030
Add metadata caching
by Akron
· 9 years ago
35db6e3
Simplified and modularized metadata processing
by Akron
· 9 years ago
c13a170
Removed BRZ and added Readme
by Akron
· 9 years ago
151676d
Rename path of index and annotation
by Akron
· 9 years ago
5b25431
Make a note that the current implementation is extremely slow
by Akron
· 9 years ago
a6ea30a
Ignore xip/dependency in tests
by Akron
· 9 years ago
44feb4e
Removed korapxml2krill_dir
by Akron
· 9 years ago
dc898d8
Fixed sentence bug in base
by Akron
· 9 years ago
e10ad32
Added 'extract' method support
by Akron
· 9 years ago
941c1a6
Merged executables
by Akron
· 9 years ago
96165ad
Added experimental support für parallel processing
by Akron
· 9 years ago
c1babed
Fixed tempdir issue in script
by Akron
· 9 years ago
150b29e
Added archive support to korapxml2krill_dir
by Akron
· 9 years ago
8c84aa5
Added meta tests for IDS
by Akron
· 9 years ago
49a4765
Updated version number
by Akron
· 9 years ago
93d620e
Update scripts and sgbr test suite
by Akron
· 9 years ago
226006a
Update to Changes
by Akron
· 9 years ago
e4c2e41
New structure is KorAP::XML::Krill
by Akron
· 9 years ago
9c0488f
Finished test suite
by Akron
· 9 years ago
69a4a2f
Added support for pagebreaks (i.e. empty elements)
by Akron
· 9 years ago
7867467
Minor fix for offset failures and updated scheme
by Nils Diewald
· 10 years ago
98d11c8
Missed some XIP data (3)
by Nils Diewald
· 10 years ago
f03c680
Sentence annotations for all providing foundries and a beginning subtokenization based on cschnobers code
by Nils Diewald
· 10 years ago
7b84722
Added text marker, added sentences from multiple foundries, changed paragraphs to base/para some tests, some bugfixes
by Nils Diewald
· 11 years ago