blob: 78c0829a76f88e4b0900d7ae29447d658161e45d [file] [log] [blame]
Marc Kupietzd0bf2772022-06-26 19:27:58 +02001 - --word2vec|lm-training-data option added to print word2vec input format
2 - --extract-metadata-regex added to extract some metadata values as context input for language model training
Marc Kupietz15c84fd2021-10-12 12:20:27 +02003 - by default sentence boundary information is now read from structure.xml files (use --s-bounds-from-morpho otherwise)
Marc Kupietzf1fdc192021-10-08 13:29:59 +02004 - korapxml2conllu: use morpho.xml if present when run on base zips
Marc Kupietzd7d5d6a2021-10-11 17:52:58 +02005 - korapxml2conllu: new option -c <columns>
Marc Kupietz97ba2ba2021-10-11 17:55:47 +02006 - conllu2korapxml: ignore _-lemmas
Marc Kupietzf1fdc192021-10-08 13:29:59 +02007
Marc Kupietza7d90c62021-07-31 23:48:13 +020080.4.1 2021-07-31
9 - korapxml2conllu: fix patterns not extracted for last texts in archive
10
Marc Kupietz6beca9d2021-07-29 18:26:09 +0200110.4 2021-07-29
Marc Kupietzeb7d06a2021-03-19 16:29:16 +010012 - korapxml2conllu option -e <regex> added to extract element/attributes to comments
Marc Kupietz0ab8a2c2021-03-19 16:21:00 +010013
Marc Kupietz22858f82021-02-15 14:22:05 +0100140.3 2021-02-15
Marc Kupietz79ba1e52021-02-12 17:26:54 +010015 - Provide conllu2korapxml to convert from ConLL-U to KorAP-XML zip
16
Marc Kupietzb96c3862021-02-12 08:33:44 +0100170.2 2021-02-12
Marc Kupietzd8455832021-02-11 17:30:29 +010018 - Convert also KorAP-XML base zips
19
Marc Kupietz396b4d62021-02-12 08:29:35 +0100200.1 2020-09-23
21 - Initial release to GitHub.