Gitiles
Code Review
Sign In
korap.ids-mannheim.de
/
KorAP
/
KorAP-Tokenizer
/
37018068f9423b8f229257c9b0c30fe1e64e66d8
3701806
Do not use commit ids for naming standalone jars
by Marc Kupietz
· 4 years, 10 months ago
751868b
Make tokenizer implementation exchangeable
by Marc Kupietz
· 4 years, 10 months ago
b9f45e0
Rename tokenizer class to KorAPDFATokenizer
by Marc Kupietz
· 4 years, 11 months ago
c419d5b
Add new command line options using picocli and sanitize code
by Marc Kupietz
· 4 years, 11 months ago
de949de
Bump version to 1.3
by Marc Kupietz
· 4 years, 11 months ago
f4df712
Change jar target naming conventions
by Marc Kupietz
· 4 years, 11 months ago
783e2a2
Ignore quoted email names like "John Doe"@xx.com
by Marc Kupietz
· 5 years ago
571c194
Empty text (<EOT><EOT>) -> empty output line
by Marc Kupietz
· 5 years ago
6afd121
Use standard EOT/EOF character x04 instead of magic escape \n\x03\n
by Marc Kupietz
· 5 years ago
793f85d
Add first tests for IPC invocation scenario
by Marc Kupietz
· 5 years ago
b9fb196
Update MAVEN_OPTS in Readme.md
by Marc Kupietz
· 5 years ago
b920f85
Set java version in pom
by Marc Kupietz
· 5 years ago
b2666fc
Implement sentence splitter
by Marc Kupietz
· 5 years ago
07d9714
Move tests to proper location
by Marc Kupietz
· 5 years ago
c315c2a
Add .gitgnore
by Marc Kupietz
· 5 years ago
8192509
Use original Span class and implement Tokenizer interface from OpenNLP
by Marc Kupietz
· 5 years ago
478632e
Clean up code
by Marc Kupietz
· 5 years ago
fe84dd0
Add Readme.md
by Marc Kupietz
· 5 years ago
45dc0fe
Add Apache copyright NOTICE
by Marc Kupietz
· 5 years ago
656055b
Add Apache LICENSE file
by Marc Kupietz
· 5 years ago
3367773
Initial import from private/Ingestion
by Marc Kupietz
· 5 years ago