korap.ids-mannheim.de / KorAP / KorAP-XML-TEI / f57ed81463dceb07312a6a3800c013a16d16c2fa / xt / benchmark.pl
994aff7 faster processing of UTF8-chars by Peter Harders · 4 years, 4 months ago
f9c5124 parametrize internal tokenization by Peter Harders · 4 years, 4 months ago
b122717 clean up intern tokenization by Peter Harders · 4 years, 4 months ago
95bc98a Rename delHTMLcom to be in line with other naming conventions and make the function exportable by Akron · 4 years, 5 months ago
d962747 Establish tokenizer objects for aggressive and conservative base tokenization by Akron · 4 years, 5 months ago
510a88c Minor speedup in tokenization by merging array pushes by Akron · 4 years, 5 months ago
2d547bc Fix a bug in delHTMLcom where comments were left open by Akron · 4 years, 5 months ago
4f67cd4 Atomize and test comment stripping by Akron · 4 years, 5 months ago
aa229a2 Add simple benchmark script by Akron · 4 years, 9 months ago