Gitiles
Code Review
Sign In
korap.ids-mannheim.de
/
KorAP
/
KorAP-Tokenizer
/
a5804ff22cb9c1fe17992c317222cf6b20b8acbb
/
src
/
test
/
resources
a5804ff
Fix unwanted split at :innen + lc letter
by Marc Kupietz
· 2 weeks ago
2173013
Add lookahead to noun gender endings to prevent false matches
by Marc Kupietz
· 3 weeks ago
9ef5dec
Support German gender-sensitive DET, ADJ, PRON endings
by Marc Kupietz
· 3 weeks ago
6d28ed1
Separate apostrophe marked contractions and clitics for en and fr
by Marc Kupietz
· 4 years, 8 months ago
96bd87c
Improve systematicity of options -p, -s, --[no-]tokens
by Marc Kupietz
· 5 years ago
f5a7e04
Add French tokenizer (-l fr)
by Marc Kupietz
· 5 years ago
74141b3
Add -l command line option to choose language
by Marc Kupietz
· 5 years ago
8e197f3
Allow setting input encoding explicitely whith -e <encoding>
by Marc Kupietz
· 5 years ago
571c194
Empty text (<EOT><EOT>) -> empty output line
by Marc Kupietz
· 5 years ago
793f85d
Add first tests for IPC invocation scenario
by Marc Kupietz
· 5 years ago
3367773
Initial import from private/Ingestion
by Marc Kupietz
· 5 years ago