korap.ids-mannheim.de / KorAP / KorAP-XML-Krill / 34926b49b3ff570b8f644a6d5e9a8f6d072897ea / lib / KorAP / Tokenizer
31ec128 Consistent offset checks for spans and tokens by Nils Diewald · 12 years ago
27e9965 Minor fix for offset failures and updated scheme by Nils Diewald · 12 years ago
c95607a Updated new metadata scheme by Nils Diewald · 12 years ago
ebdc52d Added glemm workaround, removed author array, implemented but skipped punctuation support by Nils Diewald · 12 years ago
2c7e5b4 Lifted recall for span wraps 2 by Nils Diewald · 12 years ago
79a355c Fix script for new index (including new foundries) by Nils Diewald · 12 years ago
ff6d078 Solr export by Nils Diewald · 12 years ago
47c3ef3 Found some bugs in XIP/Constituency ... and introduced some new ones - yay by Nils Diewald · 12 years ago
21a3e1a Bugfixes in dependency converter, improved test suite by Nils Diewald · 12 years ago
7b84722 Added text marker, added sentences from multiple foundries, changed paragraphs to base/para some tests, some bugfixes by Nils Diewald · 12 years ago
3cf08c7 Fixed primary data problems, speedup using moar C and now provide layer info by Nils Diewald · 12 years ago
38b3b5a Made indexer a bit more robust by Nils Diewald · 12 years ago
3ece630 Fixed tiny offset issue for documents ending with non-tokens by Nils Diewald · 12 years ago
aba4710 Made the indexer more robust and ignore s**t my parser says by Nils Diewald · 12 years ago
092178e Fix dealing with no-span layers|Improve error messages for bughunting by Nils Diewald · 12 years ago
37478a8 Bugfixed right offset of spans by Nils Diewald · 13 years ago
ded8e83 Small bugfix regarding single span documents by Nils Diewald · 13 years ago
7364d1f Indexation script finished by Nils Diewald · 13 years ago
2db9ad0 Lucene field indexer written in perl by Nils Diewald · 13 years ago