korap.ids-mannheim.de / KorAP / Datok / 3976804df50d7334c44a4a4971b394d25aff3a63 / testdata / tokenizer.fst
17984c8 · Improving time parsing · by Akron · 3 years, 1 month ago
f6bdfdb · Add trimming at the beginning of a text · by Akron · 3 years, 1 month ago
a854faa · Introduce EOT (end-of-transmission) marker · by Akron · 3 years, 1 month ago
4c2a1ad · Introduce XML tests · by Akron · 3 years, 3 months ago
235ea12 · Update generated tokenizers · by Akron · 3 years, 3 months ago
e184a91 · Add new generated automata · by Akron · 3 years, 3 months ago
03c92fe · Support for tokenend MCS symbol · by Akron · 3 years, 4 months ago
b4bbb47 · Added sentence splitter capabilities · by Akron · 3 years, 4 months ago
c9d84a6 · Sort alphabet prior to xCheck · by Akron · 3 years, 4 months ago