Gitiles
Code Review
Sign In
korap.ids-mannheim.de
/
KorAP
/
KorAP-XML-TEI
/
09e0b2c7f4ce5f2f7e1c1b95ac12776f9ad48063
/
script
09e0b2c
Establish collection object for token annotations
by Akron
· 5 years ago
e68ec0c
Test and benchmark conversion of inline annotations
by Akron
· 5 years ago
0465e9e
Add exportable XML escape function
by Akron
· 5 years ago
1c5ce15
change utf8_encode and utf8_decode
by Peter Harders
· 5 years ago
6d07f0e
Merge "Fix and extend documentation"
by Akron
· 5 years ago
edee6e5
Make tokenization chainable and remove unnecessary tokenization switch
by Akron
· 5 years ago
e19aa3e
Replace wrong line counting with $.
by Akron
· 5 years ago
4d1899f
Merge "Establish header object for corpus, doc and text header parsing"
by Akron
· 5 years ago
f57ed81
Establish header object for corpus, doc and text header parsing
by Akron
· 5 years ago
42e18a6
allow to specify both tokenizations (extern and intern)
by Peter Harders
· 5 years ago
4e603a5
Fix and extend documentation
by Akron
· 5 years ago
f9c5124
parametrize internal tokenization
by Peter Harders
· 5 years ago
b122717
clean up intern tokenization
by Peter Harders
· 5 years ago
71f072b
Bugfix: intern tokenization
by Peter Harders
· 5 years ago
41c3562
changed comments, variable- and function-name(s)
by Peter Harders
· 5 years ago
c3dabd9
Merge "Remove the call for select_tokenization as it needlessly doubles the tokenizer check"
by Peter Harders
· 5 years ago
997e940
Remove the call for select_tokenization as it needlessly doubles the tokenizer check
by Akron
· 5 years ago
95bc98a
Rename delHTMLcom to be in line with other naming conventions and make the function exportable
by Akron
· 5 years ago
8b511f9
Establish tokenizer object for external base tokenization
by Akron
· 5 years ago
d962747
Establish tokenizer objects for aggressive and conservative base tokenization
by Akron
· 5 years ago
95612c3
Merge "Improve manpage"
by Akron
· 5 years ago
8571751
Create Zip-Factory for simpler handling of Zip streams
by Akron
· 5 years ago
ee434b1
Improve manpage
by Akron
· 5 years ago
510a88c
Minor speedup in tokenization by merging array pushes
by Akron
· 5 years ago
eac374d
Separate dummy tokenization from main script with minimal changes
by Akron
· 5 years ago
4f67cd4
Atomize and test comment stripping
by Akron
· 5 years ago
9015734
fixed: segfaulting of XML::LibXML::Reader
by Peter Harders
· 5 years ago
6f526a3
rework: formatting, variablenames, comments, ...
by Peter Harders
· 5 years ago
d949e18
Introduce POD documentation and add license file
by Akron
· 5 years ago
9cb1394
Improve portability of shebang
by Akron
· 5 years ago
7025713
Fix in 1st call of 'IO::Compress::Zip': 'Append => 0'
by Peter Harders
· 5 years ago
d892a58
init. vers. tei2korapxml (former: dereko2korapxml)
by Peter Harders
· 5 years ago