Akron | a472a24 | 2023-02-13 13:46:30 +0100 | [diff] [blame^] | 1 | 0.50 2023-02-13 |
2 | - Fix 'temporary-extract' configuration | ||||
3 | information. | ||||
4 | |||||
Akron | 4a7ab01 | 2023-02-12 12:59:38 +0100 | [diff] [blame] | 5 | 0.49 2023-02-12 |
Marc Kupietz | 400590b | 2022-12-23 16:02:36 +0100 | [diff] [blame] | 6 | - Support for UDPipe POS, lemma and dependency |
7 | annotations (kupietz). | ||||
Akron | 4a7ab01 | 2023-02-12 12:59:38 +0100 | [diff] [blame] | 8 | - Remove last bit of Sys::Info dependency. |
9 | (fixes #9) | ||||
Marc Kupietz | 400590b | 2022-12-23 16:02:36 +0100 | [diff] [blame] | 10 | |
Akron | 2dd0e5d | 2022-11-15 09:44:43 +0100 | [diff] [blame] | 11 | 0.48 2022-11-15 |
Akron | aa166fa | 2022-11-10 14:15:14 +0100 | [diff] [blame] | 12 | - Improve support for text siglen including |
13 | underscore in corpus parts. | ||||
Akron | 2dd0e5d | 2022-11-15 09:44:43 +0100 | [diff] [blame] | 14 | - Split morphological features in NKJP. |
Akron | aa166fa | 2022-11-10 14:15:14 +0100 | [diff] [blame] | 15 | |
Akron | ddf3319 | 2022-08-08 16:44:39 +0200 | [diff] [blame] | 16 | 0.47 2022-08-08 |
Akron | 64f7fae | 2022-07-27 12:45:33 +0200 | [diff] [blame] | 17 | - Support for preferred language transformation. |
Akron | 1a2535d | 2022-07-28 16:31:43 +0200 | [diff] [blame] | 18 | - Support for NKJP taxonomies. |
Akron | ddf3319 | 2022-08-08 16:44:39 +0200 | [diff] [blame] | 19 | - Support for NKJP 'orig' values. |
Akron | 64f7fae | 2022-07-27 12:45:33 +0200 | [diff] [blame] | 20 | |
Akron | a65cd68 | 2022-07-21 15:40:40 +0200 | [diff] [blame] | 21 | 0.46 2022-07-21 |
22 | - Support NKJP Meta, Morpho and NamedEntities. | ||||
23 | |||||
Akron | 3c9b27c | 2022-03-04 13:08:13 +0100 | [diff] [blame] | 24 | 0.45 2022-03-04 |
Akron | eb370a0 | 2022-02-24 13:33:40 +0100 | [diff] [blame] | 25 | - Due to problems installing Archive::Tar::Builder |
26 | in certain environments, this is now optional, | ||||
27 | with a pure perl fallback archiver. | ||||
Akron | 3c9b27c | 2022-03-04 13:08:13 +0100 | [diff] [blame] | 28 | - Support externalLink and internalLink universally in |
29 | i5 meta data. | ||||
Akron | eb370a0 | 2022-02-24 13:33:40 +0100 | [diff] [blame] | 30 | |
Akron | f683310 | 2022-02-17 18:35:03 +0100 | [diff] [blame] | 31 | 0.44 2022-02-17 |
Akron | 8c85e9f | 2022-01-03 16:27:10 +0100 | [diff] [blame] | 32 | - Improve Gingko Metadata support. |
Akron | e1cde96 | 2022-02-07 20:00:29 +0100 | [diff] [blame] | 33 | - Fix data-URIs by always refering to UTF-8. |
Akron | f683310 | 2022-02-17 18:35:03 +0100 | [diff] [blame] | 34 | - Warn on wrong token order. |
Akron | c4ad747 | 2022-01-28 19:12:50 +0100 | [diff] [blame] | 35 | - Improve Gingko Metadata support. |
36 | - Updated all dependencies. | ||||
Akron | 8c85e9f | 2022-01-03 16:27:10 +0100 | [diff] [blame] | 37 | |
Akron | 9a2545e | 2022-01-16 15:15:50 +0100 | [diff] [blame] | 38 | 0.43 2022-01-17 |
39 | - Fix temporary extract handling when defined | ||||
40 | in a config file. | ||||
Akron | 303c4fd | 2022-01-16 15:14:46 +0100 | [diff] [blame] | 41 | - Improve handling of invalid certainty values |
42 | in TreeTagger. | ||||
Akron | 84b53ad | 2022-01-14 12:39:15 +0100 | [diff] [blame] | 43 | - Add log slimming function. |
Akron | 9a2545e | 2022-01-16 15:15:50 +0100 | [diff] [blame] | 44 | |
Akron | 8c85e9f | 2022-01-03 16:27:10 +0100 | [diff] [blame] | 45 | 0.42 2021-10-11 |
Akron | b9c3381 | 2020-10-21 16:19:35 +0200 | [diff] [blame] | 46 | - Replaced Log4perl with Log::Any. |
Akron | 0ffbd52 | 2021-02-16 12:01:19 +0100 | [diff] [blame] | 47 | - Ignore level < 0 structures in DeReKo, but support |
48 | them for base annotations. | ||||
Akron | 6882d7d | 2021-02-08 09:43:57 +0100 | [diff] [blame] | 49 | - Define resources in Makefile. |
Akron | 56d5f17 | 2021-03-16 18:37:39 +0100 | [diff] [blame] | 50 | - Add GitHub action for CI. |
Akron | fca010b | 2021-10-11 15:52:48 +0200 | [diff] [blame] | 51 | - Remove MANIFEST file from repo. |
Akron | abb3690 | 2021-10-11 15:51:06 +0200 | [diff] [blame] | 52 | - Introduce Gingko support. |
Akron | 8ad06c4 | 2022-01-11 17:07:49 +0100 | [diff] [blame] | 53 | - Fix data URIs to always encode percentage-wise. |
Akron | b9c3381 | 2020-10-21 16:19:35 +0200 | [diff] [blame] | 54 | |
55 | 0.41 2020-08-10 | ||||
Akron | 07e2477 | 2020-04-23 14:00:54 +0200 | [diff] [blame] | 56 | - Added support for RWK annotations. |
Akron | 1cdbc9d | 2020-05-07 15:28:54 +0200 | [diff] [blame] | 57 | - Improved DGD support. |
Akron | e3e0536 | 2020-06-16 17:19:09 +0200 | [diff] [blame] | 58 | - Fixed bug in RWK support that broke on |
59 | some KorAP-XML files. | ||||
Akron | 414ec95 | 2020-08-03 15:48:43 +0200 | [diff] [blame] | 60 | - Separate "real data" test suite from artificial |
61 | tests to prepare for CPAN release. | ||||
Akron | 39df7ce | 2020-08-04 15:55:26 +0200 | [diff] [blame] | 62 | - Optimizations and cleanup based on profiling. |
Akron | 129e441 | 2020-08-05 15:30:12 +0200 | [diff] [blame] | 63 | - Remove MultiTerm->add() in favor of |
64 | MultiTerm->add_by_term(). | ||||
Akron | 47426f0 | 2020-08-06 13:28:53 +0200 | [diff] [blame] | 65 | - Optimization by reducing calls to _offset(). |
Akron | 6a4cb16 | 2020-08-06 16:00:33 +0200 | [diff] [blame] | 66 | - Introduced add_span() method to MultiTermToken. |
Akron | 11daf96 | 2020-08-07 16:29:22 +0200 | [diff] [blame] | 67 | - Removed deprecated 'primary' flag. |
Akron | 6aed056 | 2020-08-07 16:46:00 +0200 | [diff] [blame] | 68 | - Removed deprecated 'pretty' flag. |
Akron | 56deacb | 2020-08-10 10:03:55 +0200 | [diff] [blame] | 69 | - Fix RWK paragraph handling. |
Akron | d2cd8e4 | 2020-10-30 16:37:19 +0100 | [diff] [blame] | 70 | - Updated 'Clone' dependency in Makefile. |
Akron | 0b04b31 | 2020-10-30 17:39:18 +0100 | [diff] [blame] | 71 | - Make Sys::Info optional. |
Akron | dcbee64 | 2020-10-30 18:01:43 +0100 | [diff] [blame] | 72 | - Fixes a bug in XIP::Dependency and added |
73 | dependency checks for all annotation libraries. | ||||
Akron | 07e2477 | 2020-04-23 14:00:54 +0200 | [diff] [blame] | 74 | |
Akron | dec4312 | 2020-03-03 11:22:25 +0100 | [diff] [blame] | 75 | 0.40 2020-03-03 |
Akron | a0d5af3 | 2020-03-01 12:46:30 +0100 | [diff] [blame] | 76 | - Fixed XIP parser. |
Akron | b62d92a | 2020-03-01 16:32:00 +0100 | [diff] [blame] | 77 | - Added example corpus of the |
78 | Redewiedergabe-Korpus. | ||||
79 | - Fixed span offset bug. | ||||
80 | - Fixed milestones behind the last | ||||
81 | token bug. | ||||
Akron | dec4312 | 2020-03-03 11:22:25 +0100 | [diff] [blame] | 82 | - Fixed gap behind last token bug. |
83 | - Fixed <base/s:t> length. | ||||
Akron | a0d5af3 | 2020-03-01 12:46:30 +0100 | [diff] [blame] | 84 | |
Akron | 6e886f7 | 2020-02-19 07:42:32 +0100 | [diff] [blame] | 85 | 0.39 2020-02-19 |
Akron | 7d5e638 | 2019-08-08 16:36:27 +0200 | [diff] [blame] | 86 | - Added Talismane support. |
Akron | 0d68a4b | 2019-11-13 15:42:11 +0100 | [diff] [blame] | 87 | - Added "distributor" field to I5 metadata. |
Akron | 2029455 | 2019-11-29 16:15:35 +0100 | [diff] [blame] | 88 | - Added DGD link field to I5 metadata. |
Akron | b05b842 | 2019-12-11 13:47:57 +0100 | [diff] [blame] | 89 | - Improve logging. |
Akron | c29b8e1 | 2019-12-16 14:28:09 +0100 | [diff] [blame] | 90 | - Added support for DGD pseudo-sentences |
91 | based on anchor milestones. | ||||
Akron | 8f69d63 | 2020-01-15 16:58:11 +0100 | [diff] [blame] | 92 | - Added brief explanation of the format. |
Akron | d4c5c10 | 2020-02-11 11:47:59 +0100 | [diff] [blame] | 93 | - Fixed parsing of editionStmt. |
94 | - Added documentation for supported I5 metadata | ||||
95 | fields. | ||||
Akron | 6e886f7 | 2020-02-19 07:42:32 +0100 | [diff] [blame] | 96 | - Added integrated benchmark mechanism. |
Akron | 7d5e638 | 2019-08-08 16:36:27 +0200 | [diff] [blame] | 97 | |
Akron | 57510c1 | 2019-01-04 14:58:53 +0100 | [diff] [blame] | 98 | 0.38 2019-05-22 |
Akron | 9b04f60 | 2019-03-08 18:45:35 +0100 | [diff] [blame] | 99 | - Stop file processing when base tokenization |
100 | is wrong. | ||||
Akron | 57510c1 | 2019-01-04 14:58:53 +0100 | [diff] [blame] | 101 | - Added DGD support. |
Akron | 9b04f60 | 2019-03-08 18:45:35 +0100 | [diff] [blame] | 102 | |
Akron | eaffe93 | 2019-03-07 17:14:42 +0100 | [diff] [blame] | 103 | 0.37 2019-03-06 |
Akron | 263274c | 2019-02-07 09:48:30 +0100 | [diff] [blame] | 104 | - Support for 'koral:field' array. |
105 | - Support for Koral versioning. | ||||
Akron | 4e1712c | 2019-02-04 22:29:37 +0100 | [diff] [blame] | 106 | - Added tests for english sources. |
Akron | 6bf3cc9 | 2019-02-07 12:11:20 +0100 | [diff] [blame] | 107 | - Added support for external links for |
108 | Wikipedia resources. | ||||
Akron | 63d03ee | 2019-02-13 18:49:38 +0100 | [diff] [blame] | 109 | - Ignore temporary extraction |
110 | on directory archiving. | ||||
Akron | 955b75b | 2019-02-21 14:28:41 +0100 | [diff] [blame] | 111 | - Remove extract_text and extract_doc in |
112 | favor of extract_sigle for archives. | ||||
Akron | 263274c | 2019-02-07 09:48:30 +0100 | [diff] [blame] | 113 | |
Akron | ed9baf0 | 2019-01-22 17:03:25 +0100 | [diff] [blame] | 114 | 0.36 2019-01-22 |
115 | - Support for non-word tokens (fixes #5). | ||||
116 | |||||
Akron | 6eff23b | 2018-09-24 10:31:20 +0200 | [diff] [blame] | 117 | 0.35 2018-09-24 |
118 | - Lift minimum version of Perl to 5.16 as for | ||||
119 | "fc"-feature. | ||||
120 | |||||
Akron | dd1c0f1 | 2018-07-19 06:45:28 +0200 | [diff] [blame] | 121 | 0.34 2018-07-19 |
122 | - Preliminary support for HNC. | ||||
123 | |||||
Akron | 28dc17f | 2018-02-01 15:31:41 +0100 | [diff] [blame] | 124 | 0.33 2018-02-01 |
Akron | 4c67919 | 2018-01-16 17:41:49 +0100 | [diff] [blame] | 125 | - Added LWC support. |
Akron | 28dc17f | 2018-02-01 15:31:41 +0100 | [diff] [blame] | 126 | - Fixed TreeTagger certainties. |
Akron | 4c67919 | 2018-01-16 17:41:49 +0100 | [diff] [blame] | 127 | |
Akron | 3c56f50 | 2017-10-24 15:37:27 +0200 | [diff] [blame] | 128 | 0.32 2017-10-24 |
Akron | 9a062ce | 2017-07-04 19:12:05 +0200 | [diff] [blame] | 129 | - Fixed tar building process in script. |
Akron | 3c56f50 | 2017-10-24 15:37:27 +0200 | [diff] [blame] | 130 | - Support file extensions in base tokenization parameter. |
Akron | 9a062ce | 2017-07-04 19:12:05 +0200 | [diff] [blame] | 131 | |
Akron | 0a6cce1 | 2017-06-30 23:03:21 +0200 | [diff] [blame] | 132 | 0.31 2017-06-30 |
Akron | 3abc03e | 2017-06-29 16:23:35 +0200 | [diff] [blame] | 133 | - Fixed exit codes in script. |
Akron | 0a6cce1 | 2017-06-30 23:03:21 +0200 | [diff] [blame] | 134 | - Use CORE::fc for case folding. |
Akron | 3abc03e | 2017-06-29 16:23:35 +0200 | [diff] [blame] | 135 | |
Akron | d5bb434 | 2017-06-19 11:50:49 +0200 | [diff] [blame] | 136 | 0.30 2017-06-19 |
137 | - Fixed permission handling in test suite. | ||||
Akron | ce125b6 | 2017-06-19 11:54:36 +0200 | [diff] [blame] | 138 | - Added preliminary CMC support. |
Akron | d5bb434 | 2017-06-19 11:50:49 +0200 | [diff] [blame] | 139 | |
Akron | da3097e | 2017-04-23 19:53:57 +0200 | [diff] [blame] | 140 | 0.29 2017-04-23 |
141 | - support --to-tar flag. | ||||
142 | |||||
Akron | 9ec8887 | 2017-04-12 16:29:06 +0200 | [diff] [blame] | 143 | 0.28 2017-04-12 |
Akron | 86db52e | 2017-04-11 20:36:43 +0200 | [diff] [blame] | 144 | - Improved overwriting behaviour for unzip. |
Akron | 9ec8887 | 2017-04-12 16:29:06 +0200 | [diff] [blame] | 145 | - Introduced --sequential-extraction flag. |
Akron | 86db52e | 2017-04-11 20:36:43 +0200 | [diff] [blame] | 146 | |
Akron | 63f20d4 | 2017-04-10 23:40:29 +0200 | [diff] [blame] | 147 | 0.27 2017-04-10 |
Akron | 636aa11 | 2017-04-07 18:48:56 +0200 | [diff] [blame] | 148 | - Support configuration files. |
Akron | 8150010 | 2017-04-07 20:45:44 +0200 | [diff] [blame] | 149 | - Support temporary extraction. |
Akron | 63f20d4 | 2017-04-10 23:40:29 +0200 | [diff] [blame] | 150 | - Support serial conversion. |
151 | - Support input-base. | ||||
Akron | 636aa11 | 2017-04-07 18:48:56 +0200 | [diff] [blame] | 152 | |
153 | 0.26 2017-04-06 | ||||
154 | - Support wildcards on input. | ||||
155 | |||||
Akron | 5809fea | 2017-03-14 20:02:26 +0100 | [diff] [blame] | 156 | 0.25 2017-03-14 |
Akron | 7e2eb88 | 2017-01-18 17:28:07 +0100 | [diff] [blame] | 157 | - Updated to Mojolicious 7.20 |
158 | - Fixed meta treatment in case analytic and monogr | ||||
159 | are available | ||||
Akron | 4fa37c3 | 2017-01-20 14:43:10 +0100 | [diff] [blame] | 160 | - Added DRuKoLa support to script |
Akron | 3887301 | 2017-02-06 20:27:37 +0100 | [diff] [blame] | 161 | - Liberated document and text sigle handling to be |
162 | compliant with CoRoLa. | ||||
Akron | 41ac10b | 2017-02-08 22:47:25 +0100 | [diff] [blame] | 163 | - Added support for pagebreak annotations. |
Akron | 08d5445 | 2017-02-16 23:19:49 +0100 | [diff] [blame] | 164 | - Renamed "pages" to "srcPages". |
Akron | 60a8caa | 2017-02-17 21:51:27 +0100 | [diff] [blame] | 165 | - Fixed handling of prefixes for text sigles. |
Akron | 3bd942f | 2017-02-20 20:09:14 +0100 | [diff] [blame] | 166 | - Support for MarMoT. |
Akron | 5809fea | 2017-03-14 20:02:26 +0100 | [diff] [blame] | 167 | - Fix case insensitivity. |
Akron | 55778f0 | 2017-03-14 20:47:26 +0100 | [diff] [blame] | 168 | - Added preliminary support for diacritic insensitivity. |
Akron | 3ec0a1c | 2017-01-18 14:41:55 +0100 | [diff] [blame] | 169 | |
Akron | 3741f8b | 2016-12-21 19:55:21 +0100 | [diff] [blame] | 170 | 0.24 2016-12-21 |
171 | - Added --base-sentences and --base-paragraphs options | ||||
172 | |||||
Akron | 6f9fef5 | 2016-11-03 17:06:40 +0100 | [diff] [blame] | 173 | 0.23 2016-11-03 |
Akron | 2fd402b | 2016-10-27 21:26:48 +0200 | [diff] [blame] | 174 | - Added wildcard support for document extraction |
Akron | 2812ba2 | 2016-10-28 21:55:59 +0200 | [diff] [blame] | 175 | - Fixed archive iteration to not duplicate the first archive |
176 | - Added parallel extraction for document sigles | ||||
Akron | 13d5662 | 2016-10-31 14:54:49 +0100 | [diff] [blame] | 177 | - Improved return value for existing files |
Akron | 3741f8b | 2016-12-21 19:55:21 +0100 | [diff] [blame] | 178 | - Don't warn on recursion in CoreNLP/Constituency |
Akron | 2fd402b | 2016-10-27 21:26:48 +0200 | [diff] [blame] | 179 | |
Akron | 2080758 | 2016-10-26 17:11:34 +0200 | [diff] [blame] | 180 | 0.22 2016-10-26 |
181 | - Added support for document extraction | ||||
Akron | b4bbec7 | 2016-10-26 20:21:02 +0200 | [diff] [blame] | 182 | - Fixed archive naming |
Akron | 2080758 | 2016-10-26 17:11:34 +0200 | [diff] [blame] | 183 | |
Akron | b4bbec7 | 2016-10-26 20:21:02 +0200 | [diff] [blame] | 184 | 0.21 2016-10-24 |
Nils Diewald | b3e9ccd | 2016-10-24 15:16:52 +0200 | [diff] [blame] | 185 | - Improved Windows support |
186 | |||||
Akron | 4c0cf31 | 2016-10-15 16:42:09 +0200 | [diff] [blame] | 187 | 0.20 2016-10-15 |
188 | - Fixed treatment of temporary folders in script | ||||
189 | |||||
Akron | bdb6465 | 2016-08-17 23:30:01 +0200 | [diff] [blame] | 190 | 0.19 2016-08-17 |
Akron | 92ad95b | 2016-08-15 23:38:56 +0200 | [diff] [blame] | 191 | - Added test for direct I5 support. |
192 | - Fixed support for Mojolicious 7. | ||||
193 | - Added script test. | ||||
Akron | 5f51d42 | 2016-08-16 16:26:43 +0200 | [diff] [blame] | 194 | - Fixed setting multiple annotations in |
195 | script. | ||||
Akron | e2b902d | 2016-08-16 16:50:11 +0200 | [diff] [blame] | 196 | - Fixed output of version and help messages. |
Akron | 7d4cdd8 | 2016-08-17 21:39:45 +0200 | [diff] [blame] | 197 | - Added script test for extraction. |
Akron | 651cb8d | 2016-08-16 21:44:49 +0200 | [diff] [blame] | 198 | - Fixed extraction with multiple archives and prefix |
199 | negation support. | ||||
Akron | 7d4cdd8 | 2016-08-17 21:39:45 +0200 | [diff] [blame] | 200 | - Added script test for archives. |
Akron | 1924bbe | 2016-06-22 16:05:41 +0200 | [diff] [blame] | 201 | |
Akron | bdb6465 | 2016-08-17 23:30:01 +0200 | [diff] [blame] | 202 | 0.18 2016-07-08 |
203 | - Added REI test. | ||||
204 | - Added multiple archive support to korapxml2krill. | ||||
205 | - Added support for prefix negation in korapxml2krill. | ||||
206 | - Added support for Malt#Dependency. | ||||
207 | - Improved test suite for caching and REI. | ||||
208 | - Added support for MDParser annotation. | ||||
209 | - Added batch processing class for documents. | ||||
210 | |||||
Akron | 1cd5b87 | 2016-03-22 00:23:46 +0100 | [diff] [blame] | 211 | 0.17 2016-03-22 |
212 | - Rewrite siglen to use slashes as separators. | ||||
Akron | 5f51d42 | 2016-08-16 16:26:43 +0200 | [diff] [blame] | 213 | - Zip listing optimized. Does no longer work with primary data |
214 | in text.xml files. | ||||
Akron | 1cd5b87 | 2016-03-22 00:23:46 +0100 | [diff] [blame] | 215 | |
Akron | 11c8030 | 2016-03-18 19:44:43 +0100 | [diff] [blame] | 216 | 0.16 2016-03-18 |
217 | - Added caching mechanism for | ||||
Akron | 5f51d42 | 2016-08-16 16:26:43 +0200 | [diff] [blame] | 218 | metadata. |
Akron | 11c8030 | 2016-03-18 19:44:43 +0100 | [diff] [blame] | 219 | |
Akron | 35db6e3 | 2016-03-17 22:42:22 +0100 | [diff] [blame] | 220 | 0.15 2016-03-17 |
221 | - Modularized metadata handling. | ||||
222 | - Simplified metadata handling. | ||||
Akron | 5f51d42 | 2016-08-16 16:26:43 +0200 | [diff] [blame] | 223 | - Added --meta option to script. |
224 | - Removed deprecated --human option from script. | ||||
Akron | 35db6e3 | 2016-03-17 22:42:22 +0100 | [diff] [blame] | 225 | |
Akron | c13a170 | 2016-03-15 19:33:14 +0100 | [diff] [blame] | 226 | 0.14 2016-03-15 |
Akron | 151676d | 2016-03-14 20:12:14 +0100 | [diff] [blame] | 227 | - Renamed ::Index to ::Annotate and ::Field to ::Index. |
Akron | 5f51d42 | 2016-08-16 16:26:43 +0200 | [diff] [blame] | 228 | - Renamed 'allow' to 'anno' as parameters of the script. |
229 | - Added readme. | ||||
Akron | 151676d | 2016-03-14 20:12:14 +0100 | [diff] [blame] | 230 | |
Akron | 5b25431 | 2016-03-10 00:29:56 +0100 | [diff] [blame] | 231 | 0.13 2016-03-10 |
Akron | 44feb4e | 2016-03-02 12:45:47 +0100 | [diff] [blame] | 232 | - Removed korapxml2krill_dir. |
Akron | 5f51d42 | 2016-08-16 16:26:43 +0200 | [diff] [blame] | 233 | - Renamed dependency nodes. |
234 | - Made dependency relations more effective (trimmed down TUIs) | ||||
235 | ! This is currently very slow ! | ||||
Akron | 44feb4e | 2016-03-02 12:45:47 +0100 | [diff] [blame] | 236 | |
Akron | dc898d8 | 2016-02-28 23:49:19 +0100 | [diff] [blame] | 237 | 0.12 2016-02-28 |
Akron | e10ad32 | 2016-02-27 10:54:26 +0100 | [diff] [blame] | 238 | - Added extract method to korapxml2krill. |
Akron | 5f51d42 | 2016-08-16 16:26:43 +0200 | [diff] [blame] | 239 | - Fixed Mate/Dependency. |
240 | - Fixed skip flag in korapxml2krill. | ||||
241 | - Ignore spans outside the token range | ||||
242 | (i.e. character offsets end before tokens have started). | ||||
Akron | e10ad32 | 2016-02-27 10:54:26 +0100 | [diff] [blame] | 243 | |
Akron | 941c1a6 | 2016-02-23 17:41:41 +0100 | [diff] [blame] | 244 | 0.11 2016-02-23 |
Akron | 44feb4e | 2016-03-02 12:45:47 +0100 | [diff] [blame] | 245 | - Merged korapxml2krill and korapxml2krill_dir. |
Akron | 941c1a6 | 2016-02-23 17:41:41 +0100 | [diff] [blame] | 246 | |
Akron | 96165ad | 2016-02-15 18:09:41 +0100 | [diff] [blame] | 247 | 0.10 2016-02-15 |
248 | - Added EXPERIMENTAL support for parallel jobs. | ||||
249 | |||||
Akron | c1babed | 2016-02-15 11:48:18 +0100 | [diff] [blame] | 250 | 0.09 2016-02-15 |
251 | - Fixed temporary directory handling in scripts. | ||||
Akron | 5f51d42 | 2016-08-16 16:26:43 +0200 | [diff] [blame] | 252 | - Improved skipping for archive handling in scripts. |
Akron | c1babed | 2016-02-15 11:48:18 +0100 | [diff] [blame] | 253 | |
Akron | 150b29e | 2016-02-14 23:06:48 +0100 | [diff] [blame] | 254 | 0.08 2016-02-14 |
255 | - Added support for archive streaming. | ||||
Akron | 5f51d42 | 2016-08-16 16:26:43 +0200 | [diff] [blame] | 256 | - Improved scripts. |
Akron | 150b29e | 2016-02-14 23:06:48 +0100 | [diff] [blame] | 257 | |
Akron | 8c84aa5 | 2016-02-13 21:26:54 +0100 | [diff] [blame] | 258 | 0.07 2016-02-13 |
259 | - Improved support for Schreibgebrauch meta data | ||||
Akron | 5f51d42 | 2016-08-16 16:26:43 +0200 | [diff] [blame] | 260 | (IDS flavour). |
Akron | 8c84aa5 | 2016-02-13 21:26:54 +0100 | [diff] [blame] | 261 | |
262 | 0.06 2016-02-11 | ||||
Akron | 49a4765 | 2016-02-12 18:17:19 +0100 | [diff] [blame] | 263 | - Improved support for Schreibgebrauch meta data |
Akron | 5f51d42 | 2016-08-16 16:26:43 +0200 | [diff] [blame] | 264 | (Duden flavour). |
Akron | 49a4765 | 2016-02-12 18:17:19 +0100 | [diff] [blame] | 265 | |
Akron | 93d620e | 2016-02-05 19:40:05 +0100 | [diff] [blame] | 266 | 0.05 2016-02-04 |
Akron | e4c2e41 | 2016-01-28 15:10:50 +0100 | [diff] [blame] | 267 | - Changed KorAP::Document to KorAP::XML::Krill. |
Akron | 5f51d42 | 2016-08-16 16:26:43 +0200 | [diff] [blame] | 268 | - Renamed "Schreibgebrauch" to "Sgbr". |
269 | - Preparation for GitHub release. | ||||
Akron | e4c2e41 | 2016-01-28 15:10:50 +0100 | [diff] [blame] | 270 | |
Akron | 9c0488f | 2016-01-28 14:17:15 +0100 | [diff] [blame] | 271 | 0.04 2016-01-28 |
Akron | 69a4a2f | 2016-01-17 12:55:50 +0100 | [diff] [blame] | 272 | - Added PTI to all payloads. |
Akron | 5f51d42 | 2016-08-16 16:26:43 +0200 | [diff] [blame] | 273 | - Added support for empty elements. |
274 | - Added support for element attributes in struct. | ||||
275 | - Added meta data support for Schreibgebrauch. | ||||
276 | - Fixed test suite for meta data. | ||||
Akron | 69a4a2f | 2016-01-17 12:55:50 +0100 | [diff] [blame] | 277 | |
278 | 0.03 2014-11-03 | ||||
Nils Diewald | 7867467 | 2014-11-03 21:43:12 +0000 | [diff] [blame] | 279 | - Added new metadata scheme. |
Akron | 5f51d42 | 2016-08-16 16:26:43 +0200 | [diff] [blame] | 280 | - Fixed a minor bug in the constituency tree building. |
281 | - Sorted terms in tokens a priori. | ||||
Nils Diewald | 7867467 | 2014-11-03 21:43:12 +0000 | [diff] [blame] | 282 | |
Akron | 69a4a2f | 2016-01-17 12:55:50 +0100 | [diff] [blame] | 283 | 0.02 2014-07-21 |
Nils Diewald | f03c680 | 2014-07-21 16:39:44 +0000 | [diff] [blame] | 284 | - Sentence annotations for all providing foundries |
Akron | 5f51d42 | 2016-08-16 16:26:43 +0200 | [diff] [blame] | 285 | - Starting subtokenization |
Nils Diewald | f03c680 | 2014-07-21 16:39:44 +0000 | [diff] [blame] | 286 | |
Akron | 69a4a2f | 2016-01-17 12:55:50 +0100 | [diff] [blame] | 287 | 0.01 2014-04-15 |
Akron | 5f51d42 | 2016-08-16 16:26:43 +0200 | [diff] [blame] | 288 | - [bugfix] for first token annotations |
Nils Diewald | 7b84722 | 2014-04-23 11:14:00 +0000 | [diff] [blame] | 289 | - Sentences are now available from all foundries that have it |
290 | - <>:p is now <>:base/para | ||||
Akron | 5f51d42 | 2016-08-16 16:26:43 +0200 | [diff] [blame] | 291 | - Added <>:base/text |