Akron | af18edb | 2024-07-26 12:11:48 +0200 | [diff] [blame] | 1 | 0.57 2024-07-26 |
Akron | 9d01c1e | 2024-07-25 18:29:10 +0200 | [diff] [blame] | 2 | - Support award notes in i5. |
Akron | 1f95122 | 2024-07-25 17:45:21 +0200 | [diff] [blame] | 3 | - Add support for idno (with @rend) in i5. |
4 | - Add support for ISBN in i5. | ||||
Akron | af18edb | 2024-07-26 12:11:48 +0200 | [diff] [blame] | 5 | - Translator is now indexed as Text in i5, when |
6 | K2K_TRANSLATOR_TEXT is set as an environment | ||||
7 | variable. | ||||
Akron | 9d01c1e | 2024-07-25 18:29:10 +0200 | [diff] [blame] | 8 | |
Akron | 5530a55 | 2022-02-17 17:53:15 +0100 | [diff] [blame] | 9 | 0.56 2024-06-05 |
10 | - Add support fΓΌr corpusexplorer. | ||||
11 | |||||
Akron | 24ad3c0 | 2024-06-03 12:38:20 +0200 | [diff] [blame] | 12 | 0.55 2024-06-04 |
13 | - Add support for xenodata to i5. | ||||
14 | |||||
Akron | 2cfe9bd | 2024-05-02 16:28:52 +0200 | [diff] [blame] | 15 | 0.54 2024-05-02 |
Akron | c0ac4ff | 2024-04-15 18:03:15 +0200 | [diff] [blame] | 16 | - Fix 'cache' parameter. (reported by kupietz) |
17 | - Fix cache deletion for certain scenarios. | ||||
Akron | ebbac2e | 2024-03-22 10:31:23 +0100 | [diff] [blame] | 18 | - Improve information on the number of jobs |
19 | running in parallel. | ||||
Akron | 2cfe9bd | 2024-05-02 16:28:52 +0200 | [diff] [blame] | 20 | - Add support for KoKoKom <u> attributes. |
Akron | c0ac4ff | 2024-04-15 18:03:15 +0200 | [diff] [blame] | 21 | |
Akron | 2cfe9bd | 2024-05-02 16:28:52 +0200 | [diff] [blame] | 22 | 0.53 2024-03-20 |
Marc Kupietz | b8c5382 | 2024-03-16 18:54:08 +0100 | [diff] [blame] | 23 | - Added Spacy support. (kupietz) |
Marc Kupietz | 7fe9cd9 | 2024-03-18 11:50:22 +0100 | [diff] [blame] | 24 | - Support 'pos' as an alternative to 'ctag' |
25 | in Treetagger. (kupietz) | ||||
26 | - Change default certainty value in TreeTagger | ||||
27 | to 1. | ||||
Marc Kupietz | b8c5382 | 2024-03-16 18:54:08 +0100 | [diff] [blame] | 28 | |
Akron | 2cfe9bd | 2024-05-02 16:28:52 +0200 | [diff] [blame] | 29 | 0.52 2024-01-23 |
Akron | a351837 | 2024-01-22 23:29:00 +0100 | [diff] [blame] | 30 | - Introduced 'quiet' flag. |
31 | |||||
Akron | 2daf8fe | 2023-02-27 12:55:04 +0100 | [diff] [blame] | 32 | 0.51 2023-12-23 |
Akron | 2532f1b | 2023-05-15 13:41:24 +0200 | [diff] [blame] | 33 | - Support ICC meta. |
Akron | 01c6fb5 | 2023-08-25 12:22:33 +0200 | [diff] [blame] | 34 | - Fix date handling for years of length < 2. |
Akron | 2daf8fe | 2023-02-27 12:55:04 +0100 | [diff] [blame] | 35 | - Improve emoji detection (rebecca). |
Akron | 18ce3b3 | 2023-12-13 15:44:11 +0100 | [diff] [blame] | 36 | - Upgrade minimum perl version required. |
Akron | 2532f1b | 2023-05-15 13:41:24 +0200 | [diff] [blame] | 37 | |
Akron | a472a24 | 2023-02-13 13:46:30 +0100 | [diff] [blame] | 38 | 0.50 2023-02-13 |
39 | - Fix 'temporary-extract' configuration | ||||
40 | information. | ||||
41 | |||||
Akron | 4a7ab01 | 2023-02-12 12:59:38 +0100 | [diff] [blame] | 42 | 0.49 2023-02-12 |
Marc Kupietz | 400590b | 2022-12-23 16:02:36 +0100 | [diff] [blame] | 43 | - Support for UDPipe POS, lemma and dependency |
44 | annotations (kupietz). | ||||
Akron | 4a7ab01 | 2023-02-12 12:59:38 +0100 | [diff] [blame] | 45 | - Remove last bit of Sys::Info dependency. |
46 | (fixes #9) | ||||
Marc Kupietz | 400590b | 2022-12-23 16:02:36 +0100 | [diff] [blame] | 47 | |
Akron | 2dd0e5d | 2022-11-15 09:44:43 +0100 | [diff] [blame] | 48 | 0.48 2022-11-15 |
Akron | aa166fa | 2022-11-10 14:15:14 +0100 | [diff] [blame] | 49 | - Improve support for text siglen including |
50 | underscore in corpus parts. | ||||
Akron | 2dd0e5d | 2022-11-15 09:44:43 +0100 | [diff] [blame] | 51 | - Split morphological features in NKJP. |
Akron | aa166fa | 2022-11-10 14:15:14 +0100 | [diff] [blame] | 52 | |
Akron | ddf3319 | 2022-08-08 16:44:39 +0200 | [diff] [blame] | 53 | 0.47 2022-08-08 |
Akron | 64f7fae | 2022-07-27 12:45:33 +0200 | [diff] [blame] | 54 | - Support for preferred language transformation. |
Akron | 1a2535d | 2022-07-28 16:31:43 +0200 | [diff] [blame] | 55 | - Support for NKJP taxonomies. |
Akron | ddf3319 | 2022-08-08 16:44:39 +0200 | [diff] [blame] | 56 | - Support for NKJP 'orig' values. |
Akron | 64f7fae | 2022-07-27 12:45:33 +0200 | [diff] [blame] | 57 | |
Akron | a65cd68 | 2022-07-21 15:40:40 +0200 | [diff] [blame] | 58 | 0.46 2022-07-21 |
59 | - Support NKJP Meta, Morpho and NamedEntities. | ||||
60 | |||||
Akron | 3c9b27c | 2022-03-04 13:08:13 +0100 | [diff] [blame] | 61 | 0.45 2022-03-04 |
Akron | eb370a0 | 2022-02-24 13:33:40 +0100 | [diff] [blame] | 62 | - Due to problems installing Archive::Tar::Builder |
63 | in certain environments, this is now optional, | ||||
64 | with a pure perl fallback archiver. | ||||
Akron | 3c9b27c | 2022-03-04 13:08:13 +0100 | [diff] [blame] | 65 | - Support externalLink and internalLink universally in |
66 | i5 meta data. | ||||
Akron | eb370a0 | 2022-02-24 13:33:40 +0100 | [diff] [blame] | 67 | |
Akron | f683310 | 2022-02-17 18:35:03 +0100 | [diff] [blame] | 68 | 0.44 2022-02-17 |
Akron | 8c85e9f | 2022-01-03 16:27:10 +0100 | [diff] [blame] | 69 | - Improve Gingko Metadata support. |
Akron | e1cde96 | 2022-02-07 20:00:29 +0100 | [diff] [blame] | 70 | - Fix data-URIs by always refering to UTF-8. |
Akron | f683310 | 2022-02-17 18:35:03 +0100 | [diff] [blame] | 71 | - Warn on wrong token order. |
Akron | c4ad747 | 2022-01-28 19:12:50 +0100 | [diff] [blame] | 72 | - Improve Gingko Metadata support. |
73 | - Updated all dependencies. | ||||
Akron | 8c85e9f | 2022-01-03 16:27:10 +0100 | [diff] [blame] | 74 | |
Akron | 9a2545e | 2022-01-16 15:15:50 +0100 | [diff] [blame] | 75 | 0.43 2022-01-17 |
76 | - Fix temporary extract handling when defined | ||||
77 | in a config file. | ||||
Akron | 303c4fd | 2022-01-16 15:14:46 +0100 | [diff] [blame] | 78 | - Improve handling of invalid certainty values |
79 | in TreeTagger. | ||||
Akron | 84b53ad | 2022-01-14 12:39:15 +0100 | [diff] [blame] | 80 | - Add log slimming function. |
Akron | 9a2545e | 2022-01-16 15:15:50 +0100 | [diff] [blame] | 81 | |
Akron | 8c85e9f | 2022-01-03 16:27:10 +0100 | [diff] [blame] | 82 | 0.42 2021-10-11 |
Akron | b9c3381 | 2020-10-21 16:19:35 +0200 | [diff] [blame] | 83 | - Replaced Log4perl with Log::Any. |
Akron | 0ffbd52 | 2021-02-16 12:01:19 +0100 | [diff] [blame] | 84 | - Ignore level < 0 structures in DeReKo, but support |
85 | them for base annotations. | ||||
Akron | 6882d7d | 2021-02-08 09:43:57 +0100 | [diff] [blame] | 86 | - Define resources in Makefile. |
Akron | 56d5f17 | 2021-03-16 18:37:39 +0100 | [diff] [blame] | 87 | - Add GitHub action for CI. |
Akron | fca010b | 2021-10-11 15:52:48 +0200 | [diff] [blame] | 88 | - Remove MANIFEST file from repo. |
Akron | abb3690 | 2021-10-11 15:51:06 +0200 | [diff] [blame] | 89 | - Introduce Gingko support. |
Akron | 8ad06c4 | 2022-01-11 17:07:49 +0100 | [diff] [blame] | 90 | - Fix data URIs to always encode percentage-wise. |
Akron | b9c3381 | 2020-10-21 16:19:35 +0200 | [diff] [blame] | 91 | |
92 | 0.41 2020-08-10 | ||||
Akron | 07e2477 | 2020-04-23 14:00:54 +0200 | [diff] [blame] | 93 | - Added support for RWK annotations. |
Akron | 1cdbc9d | 2020-05-07 15:28:54 +0200 | [diff] [blame] | 94 | - Improved DGD support. |
Akron | e3e0536 | 2020-06-16 17:19:09 +0200 | [diff] [blame] | 95 | - Fixed bug in RWK support that broke on |
96 | some KorAP-XML files. | ||||
Akron | 414ec95 | 2020-08-03 15:48:43 +0200 | [diff] [blame] | 97 | - Separate "real data" test suite from artificial |
98 | tests to prepare for CPAN release. | ||||
Akron | 39df7ce | 2020-08-04 15:55:26 +0200 | [diff] [blame] | 99 | - Optimizations and cleanup based on profiling. |
Akron | 129e441 | 2020-08-05 15:30:12 +0200 | [diff] [blame] | 100 | - Remove MultiTerm->add() in favor of |
101 | MultiTerm->add_by_term(). | ||||
Akron | 47426f0 | 2020-08-06 13:28:53 +0200 | [diff] [blame] | 102 | - Optimization by reducing calls to _offset(). |
Akron | 6a4cb16 | 2020-08-06 16:00:33 +0200 | [diff] [blame] | 103 | - Introduced add_span() method to MultiTermToken. |
Akron | 11daf96 | 2020-08-07 16:29:22 +0200 | [diff] [blame] | 104 | - Removed deprecated 'primary' flag. |
Akron | 6aed056 | 2020-08-07 16:46:00 +0200 | [diff] [blame] | 105 | - Removed deprecated 'pretty' flag. |
Akron | 56deacb | 2020-08-10 10:03:55 +0200 | [diff] [blame] | 106 | - Fix RWK paragraph handling. |
Akron | d2cd8e4 | 2020-10-30 16:37:19 +0100 | [diff] [blame] | 107 | - Updated 'Clone' dependency in Makefile. |
Akron | 0b04b31 | 2020-10-30 17:39:18 +0100 | [diff] [blame] | 108 | - Make Sys::Info optional. |
Akron | dcbee64 | 2020-10-30 18:01:43 +0100 | [diff] [blame] | 109 | - Fixes a bug in XIP::Dependency and added |
110 | dependency checks for all annotation libraries. | ||||
Akron | 07e2477 | 2020-04-23 14:00:54 +0200 | [diff] [blame] | 111 | |
Akron | dec4312 | 2020-03-03 11:22:25 +0100 | [diff] [blame] | 112 | 0.40 2020-03-03 |
Akron | a0d5af3 | 2020-03-01 12:46:30 +0100 | [diff] [blame] | 113 | - Fixed XIP parser. |
Akron | b62d92a | 2020-03-01 16:32:00 +0100 | [diff] [blame] | 114 | - Added example corpus of the |
115 | Redewiedergabe-Korpus. | ||||
116 | - Fixed span offset bug. | ||||
117 | - Fixed milestones behind the last | ||||
118 | token bug. | ||||
Akron | dec4312 | 2020-03-03 11:22:25 +0100 | [diff] [blame] | 119 | - Fixed gap behind last token bug. |
120 | - Fixed <base/s:t> length. | ||||
Akron | a0d5af3 | 2020-03-01 12:46:30 +0100 | [diff] [blame] | 121 | |
Akron | 6e886f7 | 2020-02-19 07:42:32 +0100 | [diff] [blame] | 122 | 0.39 2020-02-19 |
Akron | 7d5e638 | 2019-08-08 16:36:27 +0200 | [diff] [blame] | 123 | - Added Talismane support. |
Akron | 0d68a4b | 2019-11-13 15:42:11 +0100 | [diff] [blame] | 124 | - Added "distributor" field to I5 metadata. |
Akron | 2029455 | 2019-11-29 16:15:35 +0100 | [diff] [blame] | 125 | - Added DGD link field to I5 metadata. |
Akron | b05b842 | 2019-12-11 13:47:57 +0100 | [diff] [blame] | 126 | - Improve logging. |
Akron | c29b8e1 | 2019-12-16 14:28:09 +0100 | [diff] [blame] | 127 | - Added support for DGD pseudo-sentences |
128 | based on anchor milestones. | ||||
Akron | 8f69d63 | 2020-01-15 16:58:11 +0100 | [diff] [blame] | 129 | - Added brief explanation of the format. |
Akron | d4c5c10 | 2020-02-11 11:47:59 +0100 | [diff] [blame] | 130 | - Fixed parsing of editionStmt. |
131 | - Added documentation for supported I5 metadata | ||||
132 | fields. | ||||
Akron | 6e886f7 | 2020-02-19 07:42:32 +0100 | [diff] [blame] | 133 | - Added integrated benchmark mechanism. |
Akron | 7d5e638 | 2019-08-08 16:36:27 +0200 | [diff] [blame] | 134 | |
Akron | 57510c1 | 2019-01-04 14:58:53 +0100 | [diff] [blame] | 135 | 0.38 2019-05-22 |
Akron | 9b04f60 | 2019-03-08 18:45:35 +0100 | [diff] [blame] | 136 | - Stop file processing when base tokenization |
137 | is wrong. | ||||
Akron | 57510c1 | 2019-01-04 14:58:53 +0100 | [diff] [blame] | 138 | - Added DGD support. |
Akron | 9b04f60 | 2019-03-08 18:45:35 +0100 | [diff] [blame] | 139 | |
Akron | eaffe93 | 2019-03-07 17:14:42 +0100 | [diff] [blame] | 140 | 0.37 2019-03-06 |
Akron | 263274c | 2019-02-07 09:48:30 +0100 | [diff] [blame] | 141 | - Support for 'koral:field' array. |
142 | - Support for Koral versioning. | ||||
Akron | 4e1712c | 2019-02-04 22:29:37 +0100 | [diff] [blame] | 143 | - Added tests for english sources. |
Akron | 6bf3cc9 | 2019-02-07 12:11:20 +0100 | [diff] [blame] | 144 | - Added support for external links for |
145 | Wikipedia resources. | ||||
Akron | 63d03ee | 2019-02-13 18:49:38 +0100 | [diff] [blame] | 146 | - Ignore temporary extraction |
147 | on directory archiving. | ||||
Akron | 955b75b | 2019-02-21 14:28:41 +0100 | [diff] [blame] | 148 | - Remove extract_text and extract_doc in |
149 | favor of extract_sigle for archives. | ||||
Akron | 263274c | 2019-02-07 09:48:30 +0100 | [diff] [blame] | 150 | |
Akron | ed9baf0 | 2019-01-22 17:03:25 +0100 | [diff] [blame] | 151 | 0.36 2019-01-22 |
152 | - Support for non-word tokens (fixes #5). | ||||
153 | |||||
Akron | 6eff23b | 2018-09-24 10:31:20 +0200 | [diff] [blame] | 154 | 0.35 2018-09-24 |
155 | - Lift minimum version of Perl to 5.16 as for | ||||
156 | "fc"-feature. | ||||
157 | |||||
Akron | dd1c0f1 | 2018-07-19 06:45:28 +0200 | [diff] [blame] | 158 | 0.34 2018-07-19 |
159 | - Preliminary support for HNC. | ||||
160 | |||||
Akron | 28dc17f | 2018-02-01 15:31:41 +0100 | [diff] [blame] | 161 | 0.33 2018-02-01 |
Akron | 4c67919 | 2018-01-16 17:41:49 +0100 | [diff] [blame] | 162 | - Added LWC support. |
Akron | 28dc17f | 2018-02-01 15:31:41 +0100 | [diff] [blame] | 163 | - Fixed TreeTagger certainties. |
Akron | 4c67919 | 2018-01-16 17:41:49 +0100 | [diff] [blame] | 164 | |
Akron | 3c56f50 | 2017-10-24 15:37:27 +0200 | [diff] [blame] | 165 | 0.32 2017-10-24 |
Akron | 9a062ce | 2017-07-04 19:12:05 +0200 | [diff] [blame] | 166 | - Fixed tar building process in script. |
Akron | 3c56f50 | 2017-10-24 15:37:27 +0200 | [diff] [blame] | 167 | - Support file extensions in base tokenization parameter. |
Akron | 9a062ce | 2017-07-04 19:12:05 +0200 | [diff] [blame] | 168 | |
Akron | 0a6cce1 | 2017-06-30 23:03:21 +0200 | [diff] [blame] | 169 | 0.31 2017-06-30 |
Akron | 3abc03e | 2017-06-29 16:23:35 +0200 | [diff] [blame] | 170 | - Fixed exit codes in script. |
Akron | 0a6cce1 | 2017-06-30 23:03:21 +0200 | [diff] [blame] | 171 | - Use CORE::fc for case folding. |
Akron | 3abc03e | 2017-06-29 16:23:35 +0200 | [diff] [blame] | 172 | |
Akron | d5bb434 | 2017-06-19 11:50:49 +0200 | [diff] [blame] | 173 | 0.30 2017-06-19 |
174 | - Fixed permission handling in test suite. | ||||
Akron | ce125b6 | 2017-06-19 11:54:36 +0200 | [diff] [blame] | 175 | - Added preliminary CMC support. |
Akron | d5bb434 | 2017-06-19 11:50:49 +0200 | [diff] [blame] | 176 | |
Akron | da3097e | 2017-04-23 19:53:57 +0200 | [diff] [blame] | 177 | 0.29 2017-04-23 |
178 | - support --to-tar flag. | ||||
179 | |||||
Akron | 9ec8887 | 2017-04-12 16:29:06 +0200 | [diff] [blame] | 180 | 0.28 2017-04-12 |
Akron | 86db52e | 2017-04-11 20:36:43 +0200 | [diff] [blame] | 181 | - Improved overwriting behaviour for unzip. |
Akron | 9ec8887 | 2017-04-12 16:29:06 +0200 | [diff] [blame] | 182 | - Introduced --sequential-extraction flag. |
Akron | 86db52e | 2017-04-11 20:36:43 +0200 | [diff] [blame] | 183 | |
Akron | 63f20d4 | 2017-04-10 23:40:29 +0200 | [diff] [blame] | 184 | 0.27 2017-04-10 |
Akron | 636aa11 | 2017-04-07 18:48:56 +0200 | [diff] [blame] | 185 | - Support configuration files. |
Akron | 8150010 | 2017-04-07 20:45:44 +0200 | [diff] [blame] | 186 | - Support temporary extraction. |
Akron | 63f20d4 | 2017-04-10 23:40:29 +0200 | [diff] [blame] | 187 | - Support serial conversion. |
188 | - Support input-base. | ||||
Akron | 636aa11 | 2017-04-07 18:48:56 +0200 | [diff] [blame] | 189 | |
190 | 0.26 2017-04-06 | ||||
191 | - Support wildcards on input. | ||||
192 | |||||
Akron | 5809fea | 2017-03-14 20:02:26 +0100 | [diff] [blame] | 193 | 0.25 2017-03-14 |
Akron | 7e2eb88 | 2017-01-18 17:28:07 +0100 | [diff] [blame] | 194 | - Updated to Mojolicious 7.20 |
195 | - Fixed meta treatment in case analytic and monogr | ||||
196 | are available | ||||
Akron | 4fa37c3 | 2017-01-20 14:43:10 +0100 | [diff] [blame] | 197 | - Added DRuKoLa support to script |
Akron | 3887301 | 2017-02-06 20:27:37 +0100 | [diff] [blame] | 198 | - Liberated document and text sigle handling to be |
199 | compliant with CoRoLa. | ||||
Akron | 41ac10b | 2017-02-08 22:47:25 +0100 | [diff] [blame] | 200 | - Added support for pagebreak annotations. |
Akron | 08d5445 | 2017-02-16 23:19:49 +0100 | [diff] [blame] | 201 | - Renamed "pages" to "srcPages". |
Akron | 60a8caa | 2017-02-17 21:51:27 +0100 | [diff] [blame] | 202 | - Fixed handling of prefixes for text sigles. |
Akron | 3bd942f | 2017-02-20 20:09:14 +0100 | [diff] [blame] | 203 | - Support for MarMoT. |
Akron | 5809fea | 2017-03-14 20:02:26 +0100 | [diff] [blame] | 204 | - Fix case insensitivity. |
Akron | 55778f0 | 2017-03-14 20:47:26 +0100 | [diff] [blame] | 205 | - Added preliminary support for diacritic insensitivity. |
Akron | 3ec0a1c | 2017-01-18 14:41:55 +0100 | [diff] [blame] | 206 | |
Akron | 3741f8b | 2016-12-21 19:55:21 +0100 | [diff] [blame] | 207 | 0.24 2016-12-21 |
208 | - Added --base-sentences and --base-paragraphs options | ||||
209 | |||||
Akron | 6f9fef5 | 2016-11-03 17:06:40 +0100 | [diff] [blame] | 210 | 0.23 2016-11-03 |
Akron | 2fd402b | 2016-10-27 21:26:48 +0200 | [diff] [blame] | 211 | - Added wildcard support for document extraction |
Akron | 2812ba2 | 2016-10-28 21:55:59 +0200 | [diff] [blame] | 212 | - Fixed archive iteration to not duplicate the first archive |
213 | - Added parallel extraction for document sigles | ||||
Akron | 13d5662 | 2016-10-31 14:54:49 +0100 | [diff] [blame] | 214 | - Improved return value for existing files |
Akron | 3741f8b | 2016-12-21 19:55:21 +0100 | [diff] [blame] | 215 | - Don't warn on recursion in CoreNLP/Constituency |
Akron | 2fd402b | 2016-10-27 21:26:48 +0200 | [diff] [blame] | 216 | |
Akron | 2080758 | 2016-10-26 17:11:34 +0200 | [diff] [blame] | 217 | 0.22 2016-10-26 |
218 | - Added support for document extraction | ||||
Akron | b4bbec7 | 2016-10-26 20:21:02 +0200 | [diff] [blame] | 219 | - Fixed archive naming |
Akron | 2080758 | 2016-10-26 17:11:34 +0200 | [diff] [blame] | 220 | |
Akron | b4bbec7 | 2016-10-26 20:21:02 +0200 | [diff] [blame] | 221 | 0.21 2016-10-24 |
Nils Diewald | b3e9ccd | 2016-10-24 15:16:52 +0200 | [diff] [blame] | 222 | - Improved Windows support |
223 | |||||
Akron | 4c0cf31 | 2016-10-15 16:42:09 +0200 | [diff] [blame] | 224 | 0.20 2016-10-15 |
225 | - Fixed treatment of temporary folders in script | ||||
226 | |||||
Akron | bdb6465 | 2016-08-17 23:30:01 +0200 | [diff] [blame] | 227 | 0.19 2016-08-17 |
Akron | 92ad95b | 2016-08-15 23:38:56 +0200 | [diff] [blame] | 228 | - Added test for direct I5 support. |
229 | - Fixed support for Mojolicious 7. | ||||
230 | - Added script test. | ||||
Akron | 5f51d42 | 2016-08-16 16:26:43 +0200 | [diff] [blame] | 231 | - Fixed setting multiple annotations in |
232 | script. | ||||
Akron | e2b902d | 2016-08-16 16:50:11 +0200 | [diff] [blame] | 233 | - Fixed output of version and help messages. |
Akron | 7d4cdd8 | 2016-08-17 21:39:45 +0200 | [diff] [blame] | 234 | - Added script test for extraction. |
Akron | 651cb8d | 2016-08-16 21:44:49 +0200 | [diff] [blame] | 235 | - Fixed extraction with multiple archives and prefix |
236 | negation support. | ||||
Akron | 7d4cdd8 | 2016-08-17 21:39:45 +0200 | [diff] [blame] | 237 | - Added script test for archives. |
Akron | 1924bbe | 2016-06-22 16:05:41 +0200 | [diff] [blame] | 238 | |
Akron | bdb6465 | 2016-08-17 23:30:01 +0200 | [diff] [blame] | 239 | 0.18 2016-07-08 |
240 | - Added REI test. | ||||
241 | - Added multiple archive support to korapxml2krill. | ||||
242 | - Added support for prefix negation in korapxml2krill. | ||||
243 | - Added support for Malt#Dependency. | ||||
244 | - Improved test suite for caching and REI. | ||||
245 | - Added support for MDParser annotation. | ||||
246 | - Added batch processing class for documents. | ||||
247 | |||||
Akron | 1cd5b87 | 2016-03-22 00:23:46 +0100 | [diff] [blame] | 248 | 0.17 2016-03-22 |
249 | - Rewrite siglen to use slashes as separators. | ||||
Akron | 5f51d42 | 2016-08-16 16:26:43 +0200 | [diff] [blame] | 250 | - Zip listing optimized. Does no longer work with primary data |
251 | in text.xml files. | ||||
Akron | 1cd5b87 | 2016-03-22 00:23:46 +0100 | [diff] [blame] | 252 | |
Akron | 11c8030 | 2016-03-18 19:44:43 +0100 | [diff] [blame] | 253 | 0.16 2016-03-18 |
254 | - Added caching mechanism for | ||||
Akron | 5f51d42 | 2016-08-16 16:26:43 +0200 | [diff] [blame] | 255 | metadata. |
Akron | 11c8030 | 2016-03-18 19:44:43 +0100 | [diff] [blame] | 256 | |
Akron | 35db6e3 | 2016-03-17 22:42:22 +0100 | [diff] [blame] | 257 | 0.15 2016-03-17 |
258 | - Modularized metadata handling. | ||||
259 | - Simplified metadata handling. | ||||
Akron | 5f51d42 | 2016-08-16 16:26:43 +0200 | [diff] [blame] | 260 | - Added --meta option to script. |
261 | - Removed deprecated --human option from script. | ||||
Akron | 35db6e3 | 2016-03-17 22:42:22 +0100 | [diff] [blame] | 262 | |
Akron | c13a170 | 2016-03-15 19:33:14 +0100 | [diff] [blame] | 263 | 0.14 2016-03-15 |
Akron | 151676d | 2016-03-14 20:12:14 +0100 | [diff] [blame] | 264 | - Renamed ::Index to ::Annotate and ::Field to ::Index. |
Akron | 5f51d42 | 2016-08-16 16:26:43 +0200 | [diff] [blame] | 265 | - Renamed 'allow' to 'anno' as parameters of the script. |
266 | - Added readme. | ||||
Akron | 151676d | 2016-03-14 20:12:14 +0100 | [diff] [blame] | 267 | |
Akron | 5b25431 | 2016-03-10 00:29:56 +0100 | [diff] [blame] | 268 | 0.13 2016-03-10 |
Akron | 44feb4e | 2016-03-02 12:45:47 +0100 | [diff] [blame] | 269 | - Removed korapxml2krill_dir. |
Akron | 5f51d42 | 2016-08-16 16:26:43 +0200 | [diff] [blame] | 270 | - Renamed dependency nodes. |
271 | - Made dependency relations more effective (trimmed down TUIs) | ||||
272 | ! This is currently very slow ! | ||||
Akron | 44feb4e | 2016-03-02 12:45:47 +0100 | [diff] [blame] | 273 | |
Akron | dc898d8 | 2016-02-28 23:49:19 +0100 | [diff] [blame] | 274 | 0.12 2016-02-28 |
Akron | e10ad32 | 2016-02-27 10:54:26 +0100 | [diff] [blame] | 275 | - Added extract method to korapxml2krill. |
Akron | 5f51d42 | 2016-08-16 16:26:43 +0200 | [diff] [blame] | 276 | - Fixed Mate/Dependency. |
277 | - Fixed skip flag in korapxml2krill. | ||||
278 | - Ignore spans outside the token range | ||||
279 | (i.e. character offsets end before tokens have started). | ||||
Akron | e10ad32 | 2016-02-27 10:54:26 +0100 | [diff] [blame] | 280 | |
Akron | 941c1a6 | 2016-02-23 17:41:41 +0100 | [diff] [blame] | 281 | 0.11 2016-02-23 |
Akron | 44feb4e | 2016-03-02 12:45:47 +0100 | [diff] [blame] | 282 | - Merged korapxml2krill and korapxml2krill_dir. |
Akron | 941c1a6 | 2016-02-23 17:41:41 +0100 | [diff] [blame] | 283 | |
Akron | 96165ad | 2016-02-15 18:09:41 +0100 | [diff] [blame] | 284 | 0.10 2016-02-15 |
285 | - Added EXPERIMENTAL support for parallel jobs. | ||||
286 | |||||
Akron | c1babed | 2016-02-15 11:48:18 +0100 | [diff] [blame] | 287 | 0.09 2016-02-15 |
288 | - Fixed temporary directory handling in scripts. | ||||
Akron | 5f51d42 | 2016-08-16 16:26:43 +0200 | [diff] [blame] | 289 | - Improved skipping for archive handling in scripts. |
Akron | c1babed | 2016-02-15 11:48:18 +0100 | [diff] [blame] | 290 | |
Akron | 150b29e | 2016-02-14 23:06:48 +0100 | [diff] [blame] | 291 | 0.08 2016-02-14 |
292 | - Added support for archive streaming. | ||||
Akron | 5f51d42 | 2016-08-16 16:26:43 +0200 | [diff] [blame] | 293 | - Improved scripts. |
Akron | 150b29e | 2016-02-14 23:06:48 +0100 | [diff] [blame] | 294 | |
Akron | 8c84aa5 | 2016-02-13 21:26:54 +0100 | [diff] [blame] | 295 | 0.07 2016-02-13 |
296 | - Improved support for Schreibgebrauch meta data | ||||
Akron | 5f51d42 | 2016-08-16 16:26:43 +0200 | [diff] [blame] | 297 | (IDS flavour). |
Akron | 8c84aa5 | 2016-02-13 21:26:54 +0100 | [diff] [blame] | 298 | |
299 | 0.06 2016-02-11 | ||||
Akron | 49a4765 | 2016-02-12 18:17:19 +0100 | [diff] [blame] | 300 | - Improved support for Schreibgebrauch meta data |
Akron | 5f51d42 | 2016-08-16 16:26:43 +0200 | [diff] [blame] | 301 | (Duden flavour). |
Akron | 49a4765 | 2016-02-12 18:17:19 +0100 | [diff] [blame] | 302 | |
Akron | 93d620e | 2016-02-05 19:40:05 +0100 | [diff] [blame] | 303 | 0.05 2016-02-04 |
Akron | e4c2e41 | 2016-01-28 15:10:50 +0100 | [diff] [blame] | 304 | - Changed KorAP::Document to KorAP::XML::Krill. |
Akron | 5f51d42 | 2016-08-16 16:26:43 +0200 | [diff] [blame] | 305 | - Renamed "Schreibgebrauch" to "Sgbr". |
306 | - Preparation for GitHub release. | ||||
Akron | e4c2e41 | 2016-01-28 15:10:50 +0100 | [diff] [blame] | 307 | |
Akron | 9c0488f | 2016-01-28 14:17:15 +0100 | [diff] [blame] | 308 | 0.04 2016-01-28 |
Akron | 69a4a2f | 2016-01-17 12:55:50 +0100 | [diff] [blame] | 309 | - Added PTI to all payloads. |
Akron | 5f51d42 | 2016-08-16 16:26:43 +0200 | [diff] [blame] | 310 | - Added support for empty elements. |
311 | - Added support for element attributes in struct. | ||||
312 | - Added meta data support for Schreibgebrauch. | ||||
313 | - Fixed test suite for meta data. | ||||
Akron | 69a4a2f | 2016-01-17 12:55:50 +0100 | [diff] [blame] | 314 | |
315 | 0.03 2014-11-03 | ||||
Nils Diewald | 7867467 | 2014-11-03 21:43:12 +0000 | [diff] [blame] | 316 | - Added new metadata scheme. |
Akron | 5f51d42 | 2016-08-16 16:26:43 +0200 | [diff] [blame] | 317 | - Fixed a minor bug in the constituency tree building. |
318 | - Sorted terms in tokens a priori. | ||||
Nils Diewald | 7867467 | 2014-11-03 21:43:12 +0000 | [diff] [blame] | 319 | |
Akron | 69a4a2f | 2016-01-17 12:55:50 +0100 | [diff] [blame] | 320 | 0.02 2014-07-21 |
Nils Diewald | f03c680 | 2014-07-21 16:39:44 +0000 | [diff] [blame] | 321 | - Sentence annotations for all providing foundries |
Akron | 5f51d42 | 2016-08-16 16:26:43 +0200 | [diff] [blame] | 322 | - Starting subtokenization |
Nils Diewald | f03c680 | 2014-07-21 16:39:44 +0000 | [diff] [blame] | 323 | |
Akron | 69a4a2f | 2016-01-17 12:55:50 +0100 | [diff] [blame] | 324 | 0.01 2014-04-15 |
Akron | 5f51d42 | 2016-08-16 16:26:43 +0200 | [diff] [blame] | 325 | - [bugfix] for first token annotations |
Nils Diewald | 7b84722 | 2014-04-23 11:14:00 +0000 | [diff] [blame] | 326 | - Sentences are now available from all foundries that have it |
327 | - <>:p is now <>:base/para | ||||
Akron | 5f51d42 | 2016-08-16 16:26:43 +0200 | [diff] [blame] | 328 | - Added <>:base/text |