korap.ids-mannheim.de / KorAP / KorAP-Tokenizer / 8c7488bcf02d080f32c8d7b7c24a3b2be90ebf37 / src / main / jpc
6d28ed1
Separate apostrophe marked contractions and clitics for en and fr
by Marc Kupietz
· 4 years, 10 months ago
96bd87c
Improve systematicity of options -p, -s, --[no-]tokens
by Marc Kupietz
· 5 years ago
cf9b5f5
Add heuristics for distinguishing I. as abbreviation vs PPER / CARD
by Marc Kupietz
· 6 years ago
2199f76
Simplify German abbreviations
by Marc Kupietz
· 6 years ago
e3282b0
Accept URLs starting with "www." without URI scheme
by Marc Kupietz
· 6 years ago
4fb896a
Amend English abbreviation macro
by Marc Kupietz
· 6 years ago
f5a7e04
Add French tokenizer (-l fr)
by Marc Kupietz
· 6 years ago
74141b3
Add -l command line option to choose language
by Marc Kupietz
· 6 years ago
ce48102
Recognize {LETTER}+str. as abbreviation for Straße in de-tokenizer
by Marc Kupietz
· 6 years ago
67eed1c
Build language specific tokenizers: de, en
by Marc Kupietz
· 6 years ago