Gitiles
Code Review
Sign In
korap.ids-mannheim.de
/
KorAP
/
KorAP-Tokenizer
/
de949deb083c43f4e0fed3713617aed768c000aa
/
src
/
main
783e2a2
Ignore quoted email names like "John Doe"@xx.com
by Marc Kupietz
· 4 years, 3 months ago
571c194
Empty text (<EOT><EOT>) -> empty output line
by Marc Kupietz
· 4 years, 3 months ago
6afd121
Use standard EOT/EOF character x04 instead of magic escape \n\x03\n
by Marc Kupietz
· 4 years, 3 months ago
b2666fc
Implement sentence splitter
by Marc Kupietz
· 4 years, 3 months ago
8192509
Use original Span class and implement Tokenizer interface from OpenNLP
by Marc Kupietz
· 4 years, 3 months ago
478632e
Clean up code
by Marc Kupietz
· 4 years, 3 months ago
45dc0fe
Add Apache copyright NOTICE
by Marc Kupietz
· 4 years, 3 months ago
3367773
Initial import from private/Ingestion
by Marc Kupietz
· 4 years, 3 months ago