blob: 235124b3970b1dc14eeace47fbd070e428433481 [file] [log] [blame]
Akron2f7f6f32026-02-11 15:12:48 +010010.3.1 2026-02-11
2 - Introduce hyphenated abbreviations in german tokenizer.
3 - Support Wikipedia templates.
4 - Introduced multiple gender forms for nouns
5 in german tokenizer.
6 (from KorAP-Tokenizer)
7 - Added short forms for determiners, adjectives, pronouns
8 "eine(n)", "gute:r", "ihm/r", "diese(r)", "ein(e)"
Akrond8d88952026-02-04 09:02:09 +01009
Akronf66dc142023-09-06 20:00:47 +0200100.2.2 2023-09-06
Akron2f7f6f32026-02-11 15:12:48 +010011 - Fix behaviour for end of text character positions
12 when no end of sentence occured before.
Akronf66dc142023-09-06 20:00:47 +020013
Akron8e803932023-04-18 10:19:19 +0200140.2.1 2023-09-05
Akron2f7f6f32026-02-11 15:12:48 +010015 - Add english tokenizer.
16 - Fix buffer bug.
17 - Improve Readme.
18 - Minor performance improvements.
Akroncae39112023-04-26 19:43:16 +020019
Akron96c65482023-02-28 09:08:48 +0100200.1.7 2023-02-28
Akron2f7f6f32026-02-11 15:12:48 +010021 - Add dependabot checks.
22 - Add update command.
Akron0597b272023-02-23 15:04:11 +010023
Akronb15acb92022-04-16 11:01:46 +0200240.1.6 2022-04-14
Akron2f7f6f32026-02-11 15:12:48 +010025 - Rename TOKEN_SYMBOL to TOKEN_BOUND.
Akronb15acb92022-04-16 11:01:46 +020026
Akronb98e4cf2022-03-27 23:56:49 +0200270.1.5 2022-03-28
Akron2f7f6f32026-02-11 15:12:48 +010028 - Improve Emoticon-List.
Akronb98e4cf2022-03-27 23:56:49 +020029
Akronb4287552022-03-27 14:11:24 +0200300.1.4 2022-03-27
Akron2f7f6f32026-02-11 15:12:48 +010031 - Improved handling of ellipsis.
32 - Make algorithm more robust to nevere fail.
33 - Remove match option.
Akron4222ac82022-03-11 01:06:21 +010034
Akrone96895f2022-03-08 19:58:37 +0100350.1.3 2022-03-08
Akron2f7f6f32026-02-11 15:12:48 +010036 - Introduced refined handling of sentences including speech.
Akrone96895f2022-03-08 19:58:37 +010037
Akron936c0f52021-12-07 11:30:53 +0100380.1.2 2021-12-07
Akron2f7f6f32026-02-11 15:12:48 +010039 - Improve performance of rune to symbol conversion in transduction
40method.
41 - Support Plusampersand word list in compounds.