0.3.1 2026-02-11
    - Introduce hyphenated abbreviations in German tokenizer.
    - Support Wikipedia templates.
    - Introduce multiple gender forms for nouns
      in German tokenizer
      (from KorAP-Tokenizer).
    - Add short forms for determiners, adjectives, pronouns:
      "eine(n)", "gute:r", "ihm/r", "diese(r)", "ein(e)".

0.2.2 2023-09-06
    - Fix behaviour for end-of-text character positions
      when no end of sentence occurred before.

0.2.1 2023-09-05
    - Add English tokenizer.
    - Fix buffer bug.
    - Improve Readme.
    - Minor performance improvements.

0.1.7 2023-02-28
    - Add dependabot checks.
    - Add update command.

0.1.6 2022-04-14
    - Rename TOKEN_SYMBOL to TOKEN_BOUND.

0.1.5 2022-03-28
    - Improve emoticon list.

0.1.4 2022-03-27
    - Improve handling of ellipsis.
    - Make algorithm more robust so that it never fails.
    - Remove match option.

0.1.3 2022-03-08
    - Introduce refined handling of sentences including speech.

0.1.2 2021-12-07
    - Improve performance of rune to symbol conversion in transduction
      method.
    - Support Plusampersand word list in compounds.