Gitiles
Code Review
Sign In
korap.ids-mannheim.de
/
KorAP
/
KorAP-XML-TEI
/
c3dabd93655df4a8be990ef20fdaae362409f80e
c3dabd9
Merge "Remove the call for select_tokenization as it needlessly doubles the tokenizer check"
by Peter Harders
· 4 years, 11 months ago
997e940
Remove the call for select_tokenization as it needlessly doubles the tokenizer check
by Akron
· 4 years, 11 months ago
95bc98a
Rename delHTMLcom to be in line with other naming conventions and make the function exportable
by Akron
· 4 years, 11 months ago
8b511f9
Establish tokenizer object for external base tokenization
by Akron
· 5 years ago
d962747
Establish tokenizer objects for aggressive and conservative base tokenization
by Akron
· 5 years ago
95612c3
Merge "Improve manpage"
by Akron
· 5 years ago
8571751
Create Zip-Factory for simpler handling of Zip streams
by Akron
· 5 years ago
3479082
Simplify conservative tokenization code
by Akron
· 5 years ago
ee434b1
Improve manpage
by Akron
· 5 years ago
510a88c
Minor speedup in tokenization by merging array pushes
by Akron
· 5 years ago
eac374d
Separate dummy tokenization from main script with minimal changes
by Akron
· 5 years ago
7fab93b
Replace recursion and non-essential regexes with index/substr
by Akron
· 5 years ago
2d547bc
Fix a bug in delHTMLcom where comments were left open
by Akron
· 5 years ago
5ca6efc
Merge "Atomize and test comment stripping"
by Akron
· 5 years ago
4f67cd4
Atomize and test comment stripping
by Akron
· 5 years ago
e913908
added tagged version of test-file goe_sample
by Peter Harders
· 5 years ago
9015734
fixed: segfaulting of XML::LibXML::Reader
by Peter Harders
· 5 years ago
6f526a3
rework: formatting, variablenames, comments, ...
by Peter Harders
· 5 years ago
aa229a2
Add simple benchmark script
by Akron
· 5 years ago
7c2505d
Use Test::XML::Loy instead of Test::XML::Simple for performance reasons
by Akron
· 5 years ago
d949e18
Introduce POD documentation and add license file
by Akron
· 5 years ago
d89ef82
Use Test::XML::Loy instead of Test::XML::Simple for performance reasons
by Akron
· 5 years ago
6896608
Added processing tests for example corpus
by Akron
· 5 years ago
2a60c53
Added processing tests for example corpus
by Akron
· 5 years ago
9cb1394
Improve portability of shebang
by Akron
· 5 years ago
7025713
Fix in 1st call of 'IO::Compress::Zip': 'Append => 0'
by Peter Harders
· 5 years ago
3281234
added sample file for testing
by Peter Harders
· 5 years ago
797e807
Added initial script test
by Akron
· 5 years ago
dd3f47f
Add Makefile
by Akron
· 5 years ago
d892a58
init. vers. tei2korapxml (former: dereko2korapxml)
by Peter Harders
· 5 years ago
5ffc377
Initial commit
by Akron
· 5 years ago