blob: 02f5d40e0bee7cb2e74b9f901b8d4c55567a1832 [file] [log] [blame]
Akrona2f952f2026-02-04 09:51:51 +010010.3.1 2026-02-04
Akrond8d88952026-02-04 09:02:09 +01002 - Introduce hyphenated abbreviations in german tokenizer.
Akrona2f952f2026-02-04 09:51:51 +01003 - Support Wikipedia templates.
Akron3dd560e2026-02-04 11:23:08 +01004 - Introduced multiple gender forms for nouns
5 in german tokenizer.
Akrond8d88952026-02-04 09:02:09 +01006
Akronf66dc142023-09-06 20:00:47 +020070.2.2 2023-09-06
8 - Fix behaviour for end of text character positions
9 when no end of sentence occured before.
10
Akron8e803932023-04-18 10:19:19 +0200110.2.1 2023-09-05
Akroncae39112023-04-26 19:43:16 +020012 - Add english tokenizer.
13 - Fix buffer bug.
Akron5d68ae42023-09-05 12:14:53 +020014 - Improve Readme.
Akron8e803932023-04-18 10:19:19 +020015 - Minor performance improvements.
Akroncae39112023-04-26 19:43:16 +020016
Akron96c65482023-02-28 09:08:48 +0100170.1.7 2023-02-28
Akron0597b272023-02-23 15:04:11 +010018 - Add dependabot checks.
Akron96c65482023-02-28 09:08:48 +010019 - Add update command.
Akron0597b272023-02-23 15:04:11 +010020
Akronb15acb92022-04-16 11:01:46 +0200210.1.6 2022-04-14
22 - Rename TOKEN_SYMBOL to TOKEN_BOUND.
23
Akronb98e4cf2022-03-27 23:56:49 +0200240.1.5 2022-03-28
25 - Improve Emoticon-List.
26
Akronb4287552022-03-27 14:11:24 +0200270.1.4 2022-03-27
Akron4222ac82022-03-11 01:06:21 +010028 - Improved handling of ellipsis.
Akronb4287552022-03-27 14:11:24 +020029 - Make algorithm more robust to nevere fail.
30 - Remove match option.
Akron4222ac82022-03-11 01:06:21 +010031
Akrone96895f2022-03-08 19:58:37 +0100320.1.3 2022-03-08
33 - Introduced refined handling of sentences including speech.
34
Akron936c0f52021-12-07 11:30:53 +0100350.1.2 2021-12-07
Akron4880fb62021-12-05 12:03:05 +010036 - Improve performance of rune to symbol conversion in transduction
37 method.
Akron936c0f52021-12-07 11:30:53 +010038 - Support Plusampersand word list in compounds.