korap.ids-mannheim.de / KorAP / KorAP-XML-TEI / 5fb5e8d0fe8f3b16277a77a68b732dd42a80657b / xt
f9c5124 parametrize internal tokenization by Peter Harders · 4 years, 4 months ago
b122717 clean up intern tokenization by Peter Harders · 4 years, 4 months ago
95bc98a Rename delHTMLcom to be in line with other naming conventions and make the function exportable by Akron · 4 years, 5 months ago
d962747 Establish tokenizer objects for aggressive and conservative base tokenization by Akron · 4 years, 5 months ago
510a88c Minor speedup in tokenization by merging array pushes by Akron · 4 years, 5 months ago
2d547bc Fix a bug in delHTMLcom where comments were left open by Akron · 4 years, 5 months ago
4f67cd4 Atomize and test comment stripping by Akron · 4 years, 5 months ago
aa229a2 Add simple benchmark script by Akron · 4 years, 9 months ago