Akron | a351837 | 2024-01-22 23:29:00 +0100 | [diff] [blame^] | 1 | 0.52 2023-01-23 |
2 | - Introduced 'quiet' flag. | ||||
3 | |||||
Akron | 2daf8fe | 2023-02-27 12:55:04 +0100 | [diff] [blame] | 4 | 0.51 2023-12-23 |
Akron | 2532f1b | 2023-05-15 13:41:24 +0200 | [diff] [blame] | 5 | - Support ICC meta. |
Akron | 01c6fb5 | 2023-08-25 12:22:33 +0200 | [diff] [blame] | 6 | - Fix date handling for years of length < 2. |
Akron | 2daf8fe | 2023-02-27 12:55:04 +0100 | [diff] [blame] | 7 | - Improve emoji detection (rebecca). |
Akron | 18ce3b3 | 2023-12-13 15:44:11 +0100 | [diff] [blame] | 8 | - Upgrade minimum perl version required. |
Akron | 2532f1b | 2023-05-15 13:41:24 +0200 | [diff] [blame] | 9 | |
Akron | a472a24 | 2023-02-13 13:46:30 +0100 | [diff] [blame] | 10 | 0.50 2023-02-13 |
11 | - Fix 'temporary-extract' configuration | ||||
12 | information. | ||||
13 | |||||
Akron | 4a7ab01 | 2023-02-12 12:59:38 +0100 | [diff] [blame] | 14 | 0.49 2023-02-12 |
Marc Kupietz | 400590b | 2022-12-23 16:02:36 +0100 | [diff] [blame] | 15 | - Support for UDPipe POS, lemma and dependency |
16 | annotations (kupietz). | ||||
Akron | 4a7ab01 | 2023-02-12 12:59:38 +0100 | [diff] [blame] | 17 | - Remove last bit of Sys::Info dependency. |
18 | (fixes #9) | ||||
Marc Kupietz | 400590b | 2022-12-23 16:02:36 +0100 | [diff] [blame] | 19 | |
Akron | 2dd0e5d | 2022-11-15 09:44:43 +0100 | [diff] [blame] | 20 | 0.48 2022-11-15 |
Akron | aa166fa | 2022-11-10 14:15:14 +0100 | [diff] [blame] | 21 | - Improve support for text siglen including |
22 | underscore in corpus parts. | ||||
Akron | 2dd0e5d | 2022-11-15 09:44:43 +0100 | [diff] [blame] | 23 | - Split morphological features in NKJP. |
Akron | aa166fa | 2022-11-10 14:15:14 +0100 | [diff] [blame] | 24 | |
Akron | ddf3319 | 2022-08-08 16:44:39 +0200 | [diff] [blame] | 25 | 0.47 2022-08-08 |
Akron | 64f7fae | 2022-07-27 12:45:33 +0200 | [diff] [blame] | 26 | - Support for preferred language transformation. |
Akron | 1a2535d | 2022-07-28 16:31:43 +0200 | [diff] [blame] | 27 | - Support for NKJP taxonomies. |
Akron | ddf3319 | 2022-08-08 16:44:39 +0200 | [diff] [blame] | 28 | - Support for NKJP 'orig' values. |
Akron | 64f7fae | 2022-07-27 12:45:33 +0200 | [diff] [blame] | 29 | |
Akron | a65cd68 | 2022-07-21 15:40:40 +0200 | [diff] [blame] | 30 | 0.46 2022-07-21 |
31 | - Support NKJP Meta, Morpho and NamedEntities. | ||||
32 | |||||
Akron | 3c9b27c | 2022-03-04 13:08:13 +0100 | [diff] [blame] | 33 | 0.45 2022-03-04 |
Akron | eb370a0 | 2022-02-24 13:33:40 +0100 | [diff] [blame] | 34 | - Due to problems installing Archive::Tar::Builder |
35 | in certain environments, this is now optional, | ||||
36 | with a pure perl fallback archiver. | ||||
Akron | 3c9b27c | 2022-03-04 13:08:13 +0100 | [diff] [blame] | 37 | - Support externalLink and internalLink universally in |
38 | i5 meta data. | ||||
Akron | eb370a0 | 2022-02-24 13:33:40 +0100 | [diff] [blame] | 39 | |
Akron | f683310 | 2022-02-17 18:35:03 +0100 | [diff] [blame] | 40 | 0.44 2022-02-17 |
Akron | 8c85e9f | 2022-01-03 16:27:10 +0100 | [diff] [blame] | 41 | - Improve Gingko Metadata support. |
Akron | e1cde96 | 2022-02-07 20:00:29 +0100 | [diff] [blame] | 42 | - Fix data-URIs by always refering to UTF-8. |
Akron | f683310 | 2022-02-17 18:35:03 +0100 | [diff] [blame] | 43 | - Warn on wrong token order. |
Akron | c4ad747 | 2022-01-28 19:12:50 +0100 | [diff] [blame] | 44 | - Improve Gingko Metadata support. |
45 | - Updated all dependencies. | ||||
Akron | 8c85e9f | 2022-01-03 16:27:10 +0100 | [diff] [blame] | 46 | |
Akron | 9a2545e | 2022-01-16 15:15:50 +0100 | [diff] [blame] | 47 | 0.43 2022-01-17 |
48 | - Fix temporary extract handling when defined | ||||
49 | in a config file. | ||||
Akron | 303c4fd | 2022-01-16 15:14:46 +0100 | [diff] [blame] | 50 | - Improve handling of invalid certainty values |
51 | in TreeTagger. | ||||
Akron | 84b53ad | 2022-01-14 12:39:15 +0100 | [diff] [blame] | 52 | - Add log slimming function. |
Akron | 9a2545e | 2022-01-16 15:15:50 +0100 | [diff] [blame] | 53 | |
Akron | 8c85e9f | 2022-01-03 16:27:10 +0100 | [diff] [blame] | 54 | 0.42 2021-10-11 |
Akron | b9c3381 | 2020-10-21 16:19:35 +0200 | [diff] [blame] | 55 | - Replaced Log4perl with Log::Any. |
Akron | 0ffbd52 | 2021-02-16 12:01:19 +0100 | [diff] [blame] | 56 | - Ignore level < 0 structures in DeReKo, but support |
57 | them for base annotations. | ||||
Akron | 6882d7d | 2021-02-08 09:43:57 +0100 | [diff] [blame] | 58 | - Define resources in Makefile. |
Akron | 56d5f17 | 2021-03-16 18:37:39 +0100 | [diff] [blame] | 59 | - Add GitHub action for CI. |
Akron | fca010b | 2021-10-11 15:52:48 +0200 | [diff] [blame] | 60 | - Remove MANIFEST file from repo. |
Akron | abb3690 | 2021-10-11 15:51:06 +0200 | [diff] [blame] | 61 | - Introduce Gingko support. |
Akron | 8ad06c4 | 2022-01-11 17:07:49 +0100 | [diff] [blame] | 62 | - Fix data URIs to always encode percentage-wise. |
Akron | b9c3381 | 2020-10-21 16:19:35 +0200 | [diff] [blame] | 63 | |
64 | 0.41 2020-08-10 | ||||
Akron | 07e2477 | 2020-04-23 14:00:54 +0200 | [diff] [blame] | 65 | - Added support for RWK annotations. |
Akron | 1cdbc9d | 2020-05-07 15:28:54 +0200 | [diff] [blame] | 66 | - Improved DGD support. |
Akron | e3e0536 | 2020-06-16 17:19:09 +0200 | [diff] [blame] | 67 | - Fixed bug in RWK support that broke on |
68 | some KorAP-XML files. | ||||
Akron | 414ec95 | 2020-08-03 15:48:43 +0200 | [diff] [blame] | 69 | - Separate "real data" test suite from artificial |
70 | tests to prepare for CPAN release. | ||||
Akron | 39df7ce | 2020-08-04 15:55:26 +0200 | [diff] [blame] | 71 | - Optimizations and cleanup based on profiling. |
Akron | 129e441 | 2020-08-05 15:30:12 +0200 | [diff] [blame] | 72 | - Remove MultiTerm->add() in favor of |
73 | MultiTerm->add_by_term(). | ||||
Akron | 47426f0 | 2020-08-06 13:28:53 +0200 | [diff] [blame] | 74 | - Optimization by reducing calls to _offset(). |
Akron | 6a4cb16 | 2020-08-06 16:00:33 +0200 | [diff] [blame] | 75 | - Introduced add_span() method to MultiTermToken. |
Akron | 11daf96 | 2020-08-07 16:29:22 +0200 | [diff] [blame] | 76 | - Removed deprecated 'primary' flag. |
Akron | 6aed056 | 2020-08-07 16:46:00 +0200 | [diff] [blame] | 77 | - Removed deprecated 'pretty' flag. |
Akron | 56deacb | 2020-08-10 10:03:55 +0200 | [diff] [blame] | 78 | - Fix RWK paragraph handling. |
Akron | d2cd8e4 | 2020-10-30 16:37:19 +0100 | [diff] [blame] | 79 | - Updated 'Clone' dependency in Makefile. |
Akron | 0b04b31 | 2020-10-30 17:39:18 +0100 | [diff] [blame] | 80 | - Make Sys::Info optional. |
Akron | dcbee64 | 2020-10-30 18:01:43 +0100 | [diff] [blame] | 81 | - Fixes a bug in XIP::Dependency and added |
82 | dependency checks for all annotation libraries. | ||||
Akron | 07e2477 | 2020-04-23 14:00:54 +0200 | [diff] [blame] | 83 | |
Akron | dec4312 | 2020-03-03 11:22:25 +0100 | [diff] [blame] | 84 | 0.40 2020-03-03 |
Akron | a0d5af3 | 2020-03-01 12:46:30 +0100 | [diff] [blame] | 85 | - Fixed XIP parser. |
Akron | b62d92a | 2020-03-01 16:32:00 +0100 | [diff] [blame] | 86 | - Added example corpus of the |
87 | Redewiedergabe-Korpus. | ||||
88 | - Fixed span offset bug. | ||||
89 | - Fixed milestones behind the last | ||||
90 | token bug. | ||||
Akron | dec4312 | 2020-03-03 11:22:25 +0100 | [diff] [blame] | 91 | - Fixed gap behind last token bug. |
92 | - Fixed <base/s:t> length. | ||||
Akron | a0d5af3 | 2020-03-01 12:46:30 +0100 | [diff] [blame] | 93 | |
Akron | 6e886f7 | 2020-02-19 07:42:32 +0100 | [diff] [blame] | 94 | 0.39 2020-02-19 |
Akron | 7d5e638 | 2019-08-08 16:36:27 +0200 | [diff] [blame] | 95 | - Added Talismane support. |
Akron | 0d68a4b | 2019-11-13 15:42:11 +0100 | [diff] [blame] | 96 | - Added "distributor" field to I5 metadata. |
Akron | 2029455 | 2019-11-29 16:15:35 +0100 | [diff] [blame] | 97 | - Added DGD link field to I5 metadata. |
Akron | b05b842 | 2019-12-11 13:47:57 +0100 | [diff] [blame] | 98 | - Improve logging. |
Akron | c29b8e1 | 2019-12-16 14:28:09 +0100 | [diff] [blame] | 99 | - Added support for DGD pseudo-sentences |
100 | based on anchor milestones. | ||||
Akron | 8f69d63 | 2020-01-15 16:58:11 +0100 | [diff] [blame] | 101 | - Added brief explanation of the format. |
Akron | d4c5c10 | 2020-02-11 11:47:59 +0100 | [diff] [blame] | 102 | - Fixed parsing of editionStmt. |
103 | - Added documentation for supported I5 metadata | ||||
104 | fields. | ||||
Akron | 6e886f7 | 2020-02-19 07:42:32 +0100 | [diff] [blame] | 105 | - Added integrated benchmark mechanism. |
Akron | 7d5e638 | 2019-08-08 16:36:27 +0200 | [diff] [blame] | 106 | |
Akron | 57510c1 | 2019-01-04 14:58:53 +0100 | [diff] [blame] | 107 | 0.38 2019-05-22 |
Akron | 9b04f60 | 2019-03-08 18:45:35 +0100 | [diff] [blame] | 108 | - Stop file processing when base tokenization |
109 | is wrong. | ||||
Akron | 57510c1 | 2019-01-04 14:58:53 +0100 | [diff] [blame] | 110 | - Added DGD support. |
Akron | 9b04f60 | 2019-03-08 18:45:35 +0100 | [diff] [blame] | 111 | |
Akron | eaffe93 | 2019-03-07 17:14:42 +0100 | [diff] [blame] | 112 | 0.37 2019-03-06 |
Akron | 263274c | 2019-02-07 09:48:30 +0100 | [diff] [blame] | 113 | - Support for 'koral:field' array. |
114 | - Support for Koral versioning. | ||||
Akron | 4e1712c | 2019-02-04 22:29:37 +0100 | [diff] [blame] | 115 | - Added tests for english sources. |
Akron | 6bf3cc9 | 2019-02-07 12:11:20 +0100 | [diff] [blame] | 116 | - Added support for external links for |
117 | Wikipedia resources. | ||||
Akron | 63d03ee | 2019-02-13 18:49:38 +0100 | [diff] [blame] | 118 | - Ignore temporary extraction |
119 | on directory archiving. | ||||
Akron | 955b75b | 2019-02-21 14:28:41 +0100 | [diff] [blame] | 120 | - Remove extract_text and extract_doc in |
121 | favor of extract_sigle for archives. | ||||
Akron | 263274c | 2019-02-07 09:48:30 +0100 | [diff] [blame] | 122 | |
Akron | ed9baf0 | 2019-01-22 17:03:25 +0100 | [diff] [blame] | 123 | 0.36 2019-01-22 |
124 | - Support for non-word tokens (fixes #5). | ||||
125 | |||||
Akron | 6eff23b | 2018-09-24 10:31:20 +0200 | [diff] [blame] | 126 | 0.35 2018-09-24 |
127 | - Lift minimum version of Perl to 5.16 as for | ||||
128 | "fc"-feature. | ||||
129 | |||||
Akron | dd1c0f1 | 2018-07-19 06:45:28 +0200 | [diff] [blame] | 130 | 0.34 2018-07-19 |
131 | - Preliminary support for HNC. | ||||
132 | |||||
Akron | 28dc17f | 2018-02-01 15:31:41 +0100 | [diff] [blame] | 133 | 0.33 2018-02-01 |
Akron | 4c67919 | 2018-01-16 17:41:49 +0100 | [diff] [blame] | 134 | - Added LWC support. |
Akron | 28dc17f | 2018-02-01 15:31:41 +0100 | [diff] [blame] | 135 | - Fixed TreeTagger certainties. |
Akron | 4c67919 | 2018-01-16 17:41:49 +0100 | [diff] [blame] | 136 | |
Akron | 3c56f50 | 2017-10-24 15:37:27 +0200 | [diff] [blame] | 137 | 0.32 2017-10-24 |
Akron | 9a062ce | 2017-07-04 19:12:05 +0200 | [diff] [blame] | 138 | - Fixed tar building process in script. |
Akron | 3c56f50 | 2017-10-24 15:37:27 +0200 | [diff] [blame] | 139 | - Support file extensions in base tokenization parameter. |
Akron | 9a062ce | 2017-07-04 19:12:05 +0200 | [diff] [blame] | 140 | |
Akron | 0a6cce1 | 2017-06-30 23:03:21 +0200 | [diff] [blame] | 141 | 0.31 2017-06-30 |
Akron | 3abc03e | 2017-06-29 16:23:35 +0200 | [diff] [blame] | 142 | - Fixed exit codes in script. |
Akron | 0a6cce1 | 2017-06-30 23:03:21 +0200 | [diff] [blame] | 143 | - Use CORE::fc for case folding. |
Akron | 3abc03e | 2017-06-29 16:23:35 +0200 | [diff] [blame] | 144 | |
Akron | d5bb434 | 2017-06-19 11:50:49 +0200 | [diff] [blame] | 145 | 0.30 2017-06-19 |
146 | - Fixed permission handling in test suite. | ||||
Akron | ce125b6 | 2017-06-19 11:54:36 +0200 | [diff] [blame] | 147 | - Added preliminary CMC support. |
Akron | d5bb434 | 2017-06-19 11:50:49 +0200 | [diff] [blame] | 148 | |
Akron | da3097e | 2017-04-23 19:53:57 +0200 | [diff] [blame] | 149 | 0.29 2017-04-23 |
150 | - support --to-tar flag. | ||||
151 | |||||
Akron | 9ec8887 | 2017-04-12 16:29:06 +0200 | [diff] [blame] | 152 | 0.28 2017-04-12 |
Akron | 86db52e | 2017-04-11 20:36:43 +0200 | [diff] [blame] | 153 | - Improved overwriting behaviour for unzip. |
Akron | 9ec8887 | 2017-04-12 16:29:06 +0200 | [diff] [blame] | 154 | - Introduced --sequential-extraction flag. |
Akron | 86db52e | 2017-04-11 20:36:43 +0200 | [diff] [blame] | 155 | |
Akron | 63f20d4 | 2017-04-10 23:40:29 +0200 | [diff] [blame] | 156 | 0.27 2017-04-10 |
Akron | 636aa11 | 2017-04-07 18:48:56 +0200 | [diff] [blame] | 157 | - Support configuration files. |
Akron | 8150010 | 2017-04-07 20:45:44 +0200 | [diff] [blame] | 158 | - Support temporary extraction. |
Akron | 63f20d4 | 2017-04-10 23:40:29 +0200 | [diff] [blame] | 159 | - Support serial conversion. |
160 | - Support input-base. | ||||
Akron | 636aa11 | 2017-04-07 18:48:56 +0200 | [diff] [blame] | 161 | |
162 | 0.26 2017-04-06 | ||||
163 | - Support wildcards on input. | ||||
164 | |||||
Akron | 5809fea | 2017-03-14 20:02:26 +0100 | [diff] [blame] | 165 | 0.25 2017-03-14 |
Akron | 7e2eb88 | 2017-01-18 17:28:07 +0100 | [diff] [blame] | 166 | - Updated to Mojolicious 7.20 |
167 | - Fixed meta treatment in case analytic and monogr | ||||
168 | are available | ||||
Akron | 4fa37c3 | 2017-01-20 14:43:10 +0100 | [diff] [blame] | 169 | - Added DRuKoLa support to script |
Akron | 3887301 | 2017-02-06 20:27:37 +0100 | [diff] [blame] | 170 | - Liberated document and text sigle handling to be |
171 | compliant with CoRoLa. | ||||
Akron | 41ac10b | 2017-02-08 22:47:25 +0100 | [diff] [blame] | 172 | - Added support for pagebreak annotations. |
Akron | 08d5445 | 2017-02-16 23:19:49 +0100 | [diff] [blame] | 173 | - Renamed "pages" to "srcPages". |
Akron | 60a8caa | 2017-02-17 21:51:27 +0100 | [diff] [blame] | 174 | - Fixed handling of prefixes for text sigles. |
Akron | 3bd942f | 2017-02-20 20:09:14 +0100 | [diff] [blame] | 175 | - Support for MarMoT. |
Akron | 5809fea | 2017-03-14 20:02:26 +0100 | [diff] [blame] | 176 | - Fix case insensitivity. |
Akron | 55778f0 | 2017-03-14 20:47:26 +0100 | [diff] [blame] | 177 | - Added preliminary support for diacritic insensitivity. |
Akron | 3ec0a1c | 2017-01-18 14:41:55 +0100 | [diff] [blame] | 178 | |
Akron | 3741f8b | 2016-12-21 19:55:21 +0100 | [diff] [blame] | 179 | 0.24 2016-12-21 |
180 | - Added --base-sentences and --base-paragraphs options | ||||
181 | |||||
Akron | 6f9fef5 | 2016-11-03 17:06:40 +0100 | [diff] [blame] | 182 | 0.23 2016-11-03 |
Akron | 2fd402b | 2016-10-27 21:26:48 +0200 | [diff] [blame] | 183 | - Added wildcard support for document extraction |
Akron | 2812ba2 | 2016-10-28 21:55:59 +0200 | [diff] [blame] | 184 | - Fixed archive iteration to not duplicate the first archive |
185 | - Added parallel extraction for document sigles | ||||
Akron | 13d5662 | 2016-10-31 14:54:49 +0100 | [diff] [blame] | 186 | - Improved return value for existing files |
Akron | 3741f8b | 2016-12-21 19:55:21 +0100 | [diff] [blame] | 187 | - Don't warn on recursion in CoreNLP/Constituency |
Akron | 2fd402b | 2016-10-27 21:26:48 +0200 | [diff] [blame] | 188 | |
Akron | 2080758 | 2016-10-26 17:11:34 +0200 | [diff] [blame] | 189 | 0.22 2016-10-26 |
190 | - Added support for document extraction | ||||
Akron | b4bbec7 | 2016-10-26 20:21:02 +0200 | [diff] [blame] | 191 | - Fixed archive naming |
Akron | 2080758 | 2016-10-26 17:11:34 +0200 | [diff] [blame] | 192 | |
Akron | b4bbec7 | 2016-10-26 20:21:02 +0200 | [diff] [blame] | 193 | 0.21 2016-10-24 |
Nils Diewald | b3e9ccd | 2016-10-24 15:16:52 +0200 | [diff] [blame] | 194 | - Improved Windows support |
195 | |||||
Akron | 4c0cf31 | 2016-10-15 16:42:09 +0200 | [diff] [blame] | 196 | 0.20 2016-10-15 |
197 | - Fixed treatment of temporary folders in script | ||||
198 | |||||
Akron | bdb6465 | 2016-08-17 23:30:01 +0200 | [diff] [blame] | 199 | 0.19 2016-08-17 |
Akron | 92ad95b | 2016-08-15 23:38:56 +0200 | [diff] [blame] | 200 | - Added test for direct I5 support. |
201 | - Fixed support for Mojolicious 7. | ||||
202 | - Added script test. | ||||
Akron | 5f51d42 | 2016-08-16 16:26:43 +0200 | [diff] [blame] | 203 | - Fixed setting multiple annotations in |
204 | script. | ||||
Akron | e2b902d | 2016-08-16 16:50:11 +0200 | [diff] [blame] | 205 | - Fixed output of version and help messages. |
Akron | 7d4cdd8 | 2016-08-17 21:39:45 +0200 | [diff] [blame] | 206 | - Added script test for extraction. |
Akron | 651cb8d | 2016-08-16 21:44:49 +0200 | [diff] [blame] | 207 | - Fixed extraction with multiple archives and prefix |
208 | negation support. | ||||
Akron | 7d4cdd8 | 2016-08-17 21:39:45 +0200 | [diff] [blame] | 209 | - Added script test for archives. |
Akron | 1924bbe | 2016-06-22 16:05:41 +0200 | [diff] [blame] | 210 | |
Akron | bdb6465 | 2016-08-17 23:30:01 +0200 | [diff] [blame] | 211 | 0.18 2016-07-08 |
212 | - Added REI test. | ||||
213 | - Added multiple archive support to korapxml2krill. | ||||
214 | - Added support for prefix negation in korapxml2krill. | ||||
215 | - Added support for Malt#Dependency. | ||||
216 | - Improved test suite for caching and REI. | ||||
217 | - Added support for MDParser annotation. | ||||
218 | - Added batch processing class for documents. | ||||
219 | |||||
Akron | 1cd5b87 | 2016-03-22 00:23:46 +0100 | [diff] [blame] | 220 | 0.17 2016-03-22 |
221 | - Rewrite siglen to use slashes as separators. | ||||
Akron | 5f51d42 | 2016-08-16 16:26:43 +0200 | [diff] [blame] | 222 | - Zip listing optimized. Does no longer work with primary data |
223 | in text.xml files. | ||||
Akron | 1cd5b87 | 2016-03-22 00:23:46 +0100 | [diff] [blame] | 224 | |
Akron | 11c8030 | 2016-03-18 19:44:43 +0100 | [diff] [blame] | 225 | 0.16 2016-03-18 |
226 | - Added caching mechanism for | ||||
Akron | 5f51d42 | 2016-08-16 16:26:43 +0200 | [diff] [blame] | 227 | metadata. |
Akron | 11c8030 | 2016-03-18 19:44:43 +0100 | [diff] [blame] | 228 | |
Akron | 35db6e3 | 2016-03-17 22:42:22 +0100 | [diff] [blame] | 229 | 0.15 2016-03-17 |
230 | - Modularized metadata handling. | ||||
231 | - Simplified metadata handling. | ||||
Akron | 5f51d42 | 2016-08-16 16:26:43 +0200 | [diff] [blame] | 232 | - Added --meta option to script. |
233 | - Removed deprecated --human option from script. | ||||
Akron | 35db6e3 | 2016-03-17 22:42:22 +0100 | [diff] [blame] | 234 | |
Akron | c13a170 | 2016-03-15 19:33:14 +0100 | [diff] [blame] | 235 | 0.14 2016-03-15 |
Akron | 151676d | 2016-03-14 20:12:14 +0100 | [diff] [blame] | 236 | - Renamed ::Index to ::Annotate and ::Field to ::Index. |
Akron | 5f51d42 | 2016-08-16 16:26:43 +0200 | [diff] [blame] | 237 | - Renamed 'allow' to 'anno' as parameters of the script. |
238 | - Added readme. | ||||
Akron | 151676d | 2016-03-14 20:12:14 +0100 | [diff] [blame] | 239 | |
Akron | 5b25431 | 2016-03-10 00:29:56 +0100 | [diff] [blame] | 240 | 0.13 2016-03-10 |
Akron | 44feb4e | 2016-03-02 12:45:47 +0100 | [diff] [blame] | 241 | - Removed korapxml2krill_dir. |
Akron | 5f51d42 | 2016-08-16 16:26:43 +0200 | [diff] [blame] | 242 | - Renamed dependency nodes. |
243 | - Made dependency relations more effective (trimmed down TUIs) | ||||
244 | ! This is currently very slow ! | ||||
Akron | 44feb4e | 2016-03-02 12:45:47 +0100 | [diff] [blame] | 245 | |
Akron | dc898d8 | 2016-02-28 23:49:19 +0100 | [diff] [blame] | 246 | 0.12 2016-02-28 |
Akron | e10ad32 | 2016-02-27 10:54:26 +0100 | [diff] [blame] | 247 | - Added extract method to korapxml2krill. |
Akron | 5f51d42 | 2016-08-16 16:26:43 +0200 | [diff] [blame] | 248 | - Fixed Mate/Dependency. |
249 | - Fixed skip flag in korapxml2krill. | ||||
250 | - Ignore spans outside the token range | ||||
251 | (i.e. character offsets end before tokens have started). | ||||
Akron | e10ad32 | 2016-02-27 10:54:26 +0100 | [diff] [blame] | 252 | |
Akron | 941c1a6 | 2016-02-23 17:41:41 +0100 | [diff] [blame] | 253 | 0.11 2016-02-23 |
Akron | 44feb4e | 2016-03-02 12:45:47 +0100 | [diff] [blame] | 254 | - Merged korapxml2krill and korapxml2krill_dir. |
Akron | 941c1a6 | 2016-02-23 17:41:41 +0100 | [diff] [blame] | 255 | |
Akron | 96165ad | 2016-02-15 18:09:41 +0100 | [diff] [blame] | 256 | 0.10 2016-02-15 |
257 | - Added EXPERIMENTAL support for parallel jobs. | ||||
258 | |||||
Akron | c1babed | 2016-02-15 11:48:18 +0100 | [diff] [blame] | 259 | 0.09 2016-02-15 |
260 | - Fixed temporary directory handling in scripts. | ||||
Akron | 5f51d42 | 2016-08-16 16:26:43 +0200 | [diff] [blame] | 261 | - Improved skipping for archive handling in scripts. |
Akron | c1babed | 2016-02-15 11:48:18 +0100 | [diff] [blame] | 262 | |
Akron | 150b29e | 2016-02-14 23:06:48 +0100 | [diff] [blame] | 263 | 0.08 2016-02-14 |
264 | - Added support for archive streaming. | ||||
Akron | 5f51d42 | 2016-08-16 16:26:43 +0200 | [diff] [blame] | 265 | - Improved scripts. |
Akron | 150b29e | 2016-02-14 23:06:48 +0100 | [diff] [blame] | 266 | |
Akron | 8c84aa5 | 2016-02-13 21:26:54 +0100 | [diff] [blame] | 267 | 0.07 2016-02-13 |
268 | - Improved support for Schreibgebrauch meta data | ||||
Akron | 5f51d42 | 2016-08-16 16:26:43 +0200 | [diff] [blame] | 269 | (IDS flavour). |
Akron | 8c84aa5 | 2016-02-13 21:26:54 +0100 | [diff] [blame] | 270 | |
271 | 0.06 2016-02-11 | ||||
Akron | 49a4765 | 2016-02-12 18:17:19 +0100 | [diff] [blame] | 272 | - Improved support for Schreibgebrauch meta data |
Akron | 5f51d42 | 2016-08-16 16:26:43 +0200 | [diff] [blame] | 273 | (Duden flavour). |
Akron | 49a4765 | 2016-02-12 18:17:19 +0100 | [diff] [blame] | 274 | |
Akron | 93d620e | 2016-02-05 19:40:05 +0100 | [diff] [blame] | 275 | 0.05 2016-02-04 |
Akron | e4c2e41 | 2016-01-28 15:10:50 +0100 | [diff] [blame] | 276 | - Changed KorAP::Document to KorAP::XML::Krill. |
Akron | 5f51d42 | 2016-08-16 16:26:43 +0200 | [diff] [blame] | 277 | - Renamed "Schreibgebrauch" to "Sgbr". |
278 | - Preparation for GitHub release. | ||||
Akron | e4c2e41 | 2016-01-28 15:10:50 +0100 | [diff] [blame] | 279 | |
Akron | 9c0488f | 2016-01-28 14:17:15 +0100 | [diff] [blame] | 280 | 0.04 2016-01-28 |
Akron | 69a4a2f | 2016-01-17 12:55:50 +0100 | [diff] [blame] | 281 | - Added PTI to all payloads. |
Akron | 5f51d42 | 2016-08-16 16:26:43 +0200 | [diff] [blame] | 282 | - Added support for empty elements. |
283 | - Added support for element attributes in struct. | ||||
284 | - Added meta data support for Schreibgebrauch. | ||||
285 | - Fixed test suite for meta data. | ||||
Akron | 69a4a2f | 2016-01-17 12:55:50 +0100 | [diff] [blame] | 286 | |
287 | 0.03 2014-11-03 | ||||
Nils Diewald | 7867467 | 2014-11-03 21:43:12 +0000 | [diff] [blame] | 288 | - Added new metadata scheme. |
Akron | 5f51d42 | 2016-08-16 16:26:43 +0200 | [diff] [blame] | 289 | - Fixed a minor bug in the constituency tree building. |
290 | - Sorted terms in tokens a priori. | ||||
Nils Diewald | 7867467 | 2014-11-03 21:43:12 +0000 | [diff] [blame] | 291 | |
Akron | 69a4a2f | 2016-01-17 12:55:50 +0100 | [diff] [blame] | 292 | 0.02 2014-07-21 |
Nils Diewald | f03c680 | 2014-07-21 16:39:44 +0000 | [diff] [blame] | 293 | - Sentence annotations for all providing foundries |
Akron | 5f51d42 | 2016-08-16 16:26:43 +0200 | [diff] [blame] | 294 | - Starting subtokenization |
Nils Diewald | f03c680 | 2014-07-21 16:39:44 +0000 | [diff] [blame] | 295 | |
Akron | 69a4a2f | 2016-01-17 12:55:50 +0100 | [diff] [blame] | 296 | 0.01 2014-04-15 |
Akron | 5f51d42 | 2016-08-16 16:26:43 +0200 | [diff] [blame] | 297 | - [bugfix] for first token annotations |
Nils Diewald | 7b84722 | 2014-04-23 11:14:00 +0000 | [diff] [blame] | 298 | - Sentences are now available from all foundries that have it |
299 | - <>:p is now <>:base/para | ||||
Akron | 5f51d42 | 2016-08-16 16:26:43 +0200 | [diff] [blame] | 300 | - Added <>:base/text |