Gitiles
Code Review
Sign In
korap.ids-mannheim.de
/
KorAP
/
KorAP-XML-Krill
/
b62d92a0bb9d5b7c0c421df18061c7aa41ad6331
b62d92a
Fixed span position offset bug and fixed milestones behind last token bug
by Akron
· 4 years, 9 months ago
a0d5af3
Fixed legacy XIP parser
by Akron
· 4 years, 9 months ago
9711ed3
Fixed benchmark mechanism
by Akron
· 4 years, 9 months ago
6e886f7
Added benchmark mechanism
by Akron
· 4 years, 9 months ago
42f48c1
Rename KorapXML to KorAP-XML coherently
by Akron
· 4 years, 10 months ago
72bc522
Improve KorAP-XML documentation
by Akron
· 4 years, 10 months ago
d4c5c10
Added documentation for supported I5 metadata fields
by Akron
· 4 years, 10 months ago
57799fc
Fix editionStmt metadata parsing
by Akron
· 4 years, 10 months ago
8f69d63
Added brief explanation of the format
by Akron
· 4 years, 11 months ago
f1849aa
Support non-verbal annotations
by Akron
· 5 years ago
c29b8e1
Added support for DGD pseudo-sentences based on anchor milestones
by Akron
· 5 years ago
67b6eda
Support 'FOLK' as corpus sigle for DGD associated corpora
by Akron
· 5 years ago
b05b842
Improve logging
by Akron
· 5 years ago
2029455
Added external link for AGD data in I5 meta
by Akron
· 5 years ago
0d68a4b
Added 'distributor' field to I5 metadata
by Akron
· 5 years ago
7d5e638
Added support for Talismane
by Akron
· 5 years ago
c93a080
Document --to-tar option
by Akron
· 5 years ago
57510c1
Added DGD support
by Akron
· 6 years ago
9b04f60
Update version
by Akron
· 6 years ago
f021ad6
Improve error handling
by Akron
· 6 years ago
eaffe93
Fail hard on tokenization problems now
by Akron
· 6 years ago
94262ce
Renamed Institute for the German Language to Leibniz Institute for the German Language
by Akron
· 6 years ago
955b75b
Remove extract_text and extract_doc in favor of extract_sigle
by Akron
· 6 years ago
31a08cb
Add extract_sigle method to archive
by Akron
· 6 years ago
63d03ee
Ignore temporary-extraction on directory archiving
by Akron
· 6 years ago
6bf3cc9
Added links for wikipedia resources
by Akron
· 6 years ago
4e1712c
Add english wikipedia example
by Akron
· 6 years ago
263274c
Support koral versioning
by Akron
· 6 years ago
c526e75
Include field serialization in versioned json output
by Akron
· 6 years ago
5eb3aa0
Set field types and serialize as koral:fields
by Akron
· 6 years ago
ea9c364
Ignore DGD parser tests
by Akron
· 6 years ago
ed9baf0
Support non-word-tokens (fixes #5)
by Akron
· 6 years ago
6eff23b
Updated minimum perl
by Akron
· 6 years ago
ea1aed5
Activate HNC by default
by Akron
· 6 years ago
5fdc7e1
Fixed last change info in --version
by Akron
· 6 years ago
dd1c0f1
Updated version
by Akron
· 6 years ago
c893ac3
Added tests and minor metadata parsing adjustments for HNC
by Akron
· 6 years ago
f73ffb6
Fixed readme by mentioning preference regarding configuration parameters
by Akron
· 6 years ago
28dc17f
Fix certainty values in TreeTagger output
by Akron
· 7 years ago
0426176
Remove certainty value on lemmata in Treetagger
by Akron
· 7 years ago
6727b21
Fixed lwc tests
by Akron
· 7 years ago
4c67919
Support for LWC dependency annotations
by Akron
· 7 years ago
56dfb31
Added test regarding offset bug in KorAP
by Akron
· 7 years ago
d19e275
Recheck dependency tests
by Akron
· 7 years ago
3c56f50
Support file extensions in base tokenization file
by Akron
· 7 years ago
28c4e54
Fix missing command issue
by Akron
· 7 years ago
d5643ad
Warn on missing output parameter in extract
by Akron
· 7 years ago
9b67b93
Fix attribute generation for DeReKo
by Akron
· 7 years ago
9a062ce
Fix tarring to include only filenames
by Akron
· 7 years ago
0a6cce1
Remove non-core fc
by Akron
· 7 years ago
3abc03e
Fixed exit codes in script
by Akron
· 7 years ago
0f9b93a
Fixed minor issue in I5 meta parsing
by Akron
· 7 years ago
403934d
Fixed CMC for empty features
by Akron
· 7 years ago
36d4627
Fixed feature treatment in CMC morpho
by Akron
· 7 years ago
aaea246
One more missing permission problem in the test suite fixed
by Akron
· 7 years ago
5fd2d8e
Fixed more permission and dependency issues
by Akron
· 7 years ago
ce125b6
Improved documentation on new features
by Akron
· 7 years ago
d5bb434
Fixed permissions in test suite
by Akron
· 7 years ago
e599379
Added treatment of CMC data
by Akron
· 7 years ago
918ce42
Fixed primary data handling for data with white space at the beginning and at the end
by Akron
· 7 years ago
a308c71
Start testing with DCK
by Akron
· 7 years ago
da3097e
Finished tar flag
by Akron
· 8 years ago
486f9ab
Improved tar support
by Akron
· 8 years ago
081639e
Added preliminary tar support
by Akron
· 8 years ago
9ec8887
Introduced sequential extraction flag to circumvent troubles with parallel extraction
by Akron
· 8 years ago
3a486f8
Another unzip flag update (-uo)
by Akron
· 8 years ago
86db52e
Improved unzip overwriting mechanism
by Akron
· 8 years ago
0278ca2
Test zip overwriting
by Akron
· 8 years ago
bd3adda
Fixing behaviour for existing output directories
by Akron
· 8 years ago
442c4e9
Updated readme
by Akron
· 8 years ago
63f20d4
Support serial conversion and input-base
by Akron
· 8 years ago
8150010
Introduced temporary extraction
by Akron
· 8 years ago
636aa11
Added configuration to script
by Akron
· 8 years ago
821db3d
Add wildcard support for inputs
by Akron
· 8 years ago
55778f0
Added preliminary support for diacritic insensitivity support
by Akron
· 8 years ago
5809fea
Fixed casefolding for case insensitivity
by Akron
· 8 years ago
b2f1ab8
Improve test suite for MarMoT
by Akron
· 8 years ago
c11f798
Add auto-core-calculation
by Akron
· 8 years ago
3bd942f
Added marmot-support
by Akron
· 8 years ago
f624084
Added test for quotes in archives for archiving
by Akron
· 8 years ago
60a8caa
Treat prefixes correct for text sigles
by Akron
· 8 years ago
08d5445
Changed meta name for pages
by Akron
· 8 years ago
d35d2d3
Fixed pagebreak test
by Akron
· 8 years ago
3c11964
Added comment regarding missing pagebreaks in the data
by Akron
· 8 years ago
636bd9c
Fixed pagebreak treatment in script
by Akron
· 8 years ago
41ac10b
Added pagebreak annotations (with '~'-prefix)
by Akron
· 8 years ago
0465de5
Improved handling of weird metadata stuff
by Akron
· 8 years ago
3887301
More relaxed handling of document siglen
by Akron
· 8 years ago
a7d0e9f
Improved DRuKoLa meta data handling
by Akron
· 8 years ago
ce41be8
Updated announced dependency to Mojolicious
by Akron
· 8 years ago
578af4b
Support translator meta data type
by Akron
· 8 years ago
4fa37c3
Added DRuKoLa support to korapxml2krill script
by Akron
· 8 years ago
c388150
Added more drukola tests
by Akron
· 8 years ago
3139917
Fixed DRuKoLa annotations
by Akron
· 8 years ago
ace612e
Added DRuKoLa annotations
by Akron
· 8 years ago
7e2eb88
Fixed analytic+monogr behaviour for metadata
by Akron
· 8 years ago
3ec0a1c
Updated to Mojolicious 7.20
by Akron
· 8 years ago
b7f130c
Added DRuKoLa meta data skeleton
by Akron
· 8 years ago
3741f8b
Added base-sentences and base-paragraphs options
by Akron
· 8 years ago
53167fd
Added new test data without base annotations
by Akron
· 8 years ago
Next »