blob: 5c4f5410361fbc8c038804f0e49c105aba78db12 [file] [log] [blame]
Marc Kupietza671ae52022-12-22 16:28:14 +01001 - Conversion of standard TEI P5 should now work, at least
2 in some cases.
3 - Option --xmlid-to-textsigle <from-regex>@<to-c/to-d/to-t>
4 added to convert standard P5 text id attributes to I5
5 sigles with three parts.
6
Akron2520a342022-03-29 18:18:05 +020072.3.4 2022-11-09
Akron85269c02022-11-07 14:03:31 +01008 - Improve stability of XML entity replacement.
Akron2520a342022-03-29 18:18:05 +02009 - Check version for script and KorAP-Tokenizer
10 library when requested.
Akron85269c02022-11-07 14:03:31 +010011
Akron2520a342022-03-29 18:18:05 +0200122.3.3 2022-03-30
Akronbd4281e2022-03-28 08:31:40 +020013 - Load KorAP-Tokenizer only on request.
14
Akrond708a612022-03-21 16:00:01 +0100152.3.2 2022-03-23
Akron540fd622022-03-21 18:20:05 +010016 - Do not reference metadata.xml
Akrond708a612022-03-21 16:00:01 +010017 - Remove schema references from header files.
Akron4ee372a2022-02-24 17:54:24 +010018 - Improve test suite for unability to use
19 KorAP-Tokenizer.
Akron540fd622022-03-21 18:20:05 +010020
Marc Kupietz0bca4f12022-01-14 13:24:22 +0100212.3.1 2022-01-14 Release
Akrona3799ce2021-10-15 16:27:30 +020022 - Improve script handling of broken data
23 - Improve handling of unknown header types
24 - Check for valid sigles to avoid broken directories
25 - Introduce exclusivity for inline tokens handling.
Akrona2cb2812021-10-30 10:29:08 +020026 - Use single dash for STDIN.
Marc Kupietz0bca4f12022-01-14 13:24:22 +010027 - Update KorAP-Tokenizer to v2.2.2 (single quote, "du." bug fixes)
Akrona3799ce2021-10-15 16:27:30 +020028
292.2.0 2021-08-26 Release
Akrond658df72021-02-18 18:58:56 +010030 - Remove unnecessary branch in recursive call
Akrondd0be8f2021-02-18 19:29:41 +010031 - Support inline-structures parameter
Akron26a71522021-02-19 10:27:37 +010032 - Introduce --base-foundry, --data-file, and --header-file parameters
Akron91705d72021-02-19 10:59:45 +010033 - Introduce --tokens-file parameter
Akron75d63142021-02-23 18:40:56 +010034 - Introduce --skip-inline-tokens parameter
Akrond3e1d282021-02-24 14:51:27 +010035 - Minor cleanups and improvements
Akron54c3ff12021-02-25 11:33:37 +010036 - Introduce --skip-inline-tags parameter
Akroneb12e232021-02-25 13:49:50 +010037 - Introduce KorAP::XML::TEI::Inline class
Akron692d17d2021-03-05 13:21:03 +010038 - Introduce --skip-inline-token-annotations parameter
39 - Deprecate KORAPXMLTEI_INLINE environment variable
40 in favor of --skip-inline-token-annotations
Akrond658df72021-02-18 18:58:56 +010041
Akrona3799ce2021-10-15 16:27:30 +0200421.0.0 2021-02-18 Release
Akrond3e1d282021-02-24 14:51:27 +010043 - -s option added that uses sentence boundaries
44 provided by the KorAP tokenizer (-tk)
Marc Kupietza1421f02021-02-18 15:32:38 +010045 - Tokenizer invocation comments removed from KorAP XML output
46 - Indentation of </span> tags fixed
Akrond3e1d282021-02-24 14:51:27 +010047 - Character entities used in DeReKo are automatically
48 replaced by their corresponding characters
Marc Kupietza1421f02021-02-18 15:32:38 +010049 - Resources defined in Makefile
50 - Fixed possible IO deadlock with KorAP tokenizer
Akron4e3c7e32021-02-18 15:19:53 +010051 - Simplified debugging by combining with X::C::T line numbers
Akron1a5271a2021-02-18 13:18:15 +010052 - Support inline-tokens parameter
Akronf8088e62021-02-18 16:18:59 +010053 - Move verbose code documentation to trailing
54 script section
Marc Kupietzeed4cb12021-02-17 19:39:32 +010055
Akronf7084c42021-01-07 10:25:22 +0100560.03 2021-01-12
Marc Kupietzb505d442021-01-06 16:40:29 +010057 - Update KorAP-Tokenizer to released 2.0 version
Akronf7084c42021-01-07 10:25:22 +010058 - Improve test suite for recent version
59 of Mojolicious.
60
Marc Kupietz44b1f252020-11-26 16:31:40 +0100610.02 2020-11-27
Akronf7084c42021-01-07 10:25:22 +010062 - Update KorAP-Tokenizer to v2.0.0.
Akroneaa96232020-10-15 17:06:15 +020063 - Switch input encoding based on XML
64 processing instruction.
Marc Kupietz44b1f252020-11-26 16:31:40 +010065 - Fix handling of UTF-8 in sigles.
Akroneaa96232020-10-15 17:06:15 +020066
Akron0c41ab32020-09-29 07:33:33 +0200670.01 2020-09-28
68 - Initial release to GitHub.