  1. 07e2477 Include RWK annotations in script by Akron · 4 years, 8 months ago
  2. c403644 Improve RWK morphology parser to support multiple morphological key:value pairs by Akron · 4 years, 8 months ago
  3. 85eb5aa Improve RWK structure parser for *-milestone elements by Akron · 4 years, 8 months ago
  4. 28299f4 Introduce special RWK structure parser by Akron · 4 years, 8 months ago
  5. 8ff5879 Introduce special RWK morphology parser by Akron · 4 years, 8 months ago
  6. 158bd50 Catch if files are not writable to output by Akron · 4 years, 9 months ago
  7. 0a187b9 Fix I5 Meta documentation to not confuse dot-separated tag names in by Akron · 4 years, 9 months ago
  8. 9fdd08b Merge "Add tool to rewrite licenses for the sample corpus" by Akron · 4 years, 10 months ago
  9. 3eadf2e Add tool to rewrite licenses for the sample corpus by Akron · 4 years, 10 months ago
  10. dec4312 Fixed gap behind last token and <base/s:t> length by Akron · 4 years, 10 months ago
  11. b62d92a Fixed span position offset bug and fixed milestones behind last token bug by Akron · 4 years, 10 months ago
  12. a0d5af3 Fixed legacy XIP parser by Akron · 4 years, 10 months ago
  13. 9711ed3 Fixed benchmark mechanism by Akron · 4 years, 10 months ago
  14. 6e886f7 Added benchmark mechanism by Akron · 4 years, 10 months ago
  15. 42f48c1 Rename KorapXML to KorAP-XML coherently by Akron · 4 years, 11 months ago
  16. 72bc522 Improve KorAP-XML documentation by Akron · 4 years, 11 months ago
  17. d4c5c10 Added documentation for supported I5 metadata fields by Akron · 4 years, 11 months ago
  18. 57799fc Fix editionStmt metadata parsing by Akron · 4 years, 11 months ago
  19. 8f69d63 Added brief explanation of the format by Akron · 5 years ago
  20. f1849aa Support non-verbal annotations by Akron · 5 years ago
  21. c29b8e1 Added support for DGD pseudo-sentences based on anchor milestones by Akron · 5 years ago
  22. 67b6eda Support 'FOLK' as corpus sigle for DGD associated corpora by Akron · 5 years ago
  23. b05b842 Improve logging by Akron · 5 years ago
  24. 2029455 Added external link for AGD data in I5 meta by Akron · 5 years ago
  25. 0d68a4b Added 'distributor' field to I5 metadata by Akron · 5 years ago
  26. 7d5e638 Added support for Talismane by Akron · 5 years ago
  27. c93a080 Document --to-tar option by Akron · 5 years ago
  28. 57510c1 Added DGD support by Akron · 6 years ago
  29. 9b04f60 Update version by Akron · 6 years ago
  30. f021ad6 Improve error handling by Akron · 6 years ago
  31. eaffe93 Fail hard on tokenization problems now by Akron · 6 years ago
  32. 94262ce Renamed Institute for the German Language to Leibniz Institute for the German Language by Akron · 6 years ago
  33. 955b75b Remove extract_text and extract_doc in favor of extract_sigle by Akron · 6 years ago
  34. 31a08cb Add extract_sigle method to archive by Akron · 6 years ago
  35. 63d03ee Ignore temporary-extraction on directory archiving by Akron · 6 years ago
  36. 6bf3cc9 Added links for wikipedia resources by Akron · 6 years ago
  37. 4e1712c Add english wikipedia example by Akron · 6 years ago
  38. 263274c Support koral versioning by Akron · 6 years ago
  39. c526e75 Include field serialization in versioned json output by Akron · 6 years ago
  40. 5eb3aa0 Set field types and serialize as koral:fields by Akron · 6 years ago
  41. ea9c364 Ignore DGD parser tests by Akron · 6 years ago
  42. ed9baf0 Support non-word-tokens (fixes #5) by Akron · 6 years ago
  43. 6eff23b Updated minimum perl by Akron · 6 years ago
  44. ea1aed5 Activate HNC by default by Akron · 6 years ago
  45. 5fdc7e1 Fixed last change info in --version by Akron · 6 years ago
  46. dd1c0f1 Updated version by Akron · 6 years ago
  47. c893ac3 Added tests and minor metadata parsing adjustments for HNC by Akron · 6 years ago
  48. f73ffb6 Fixed readme by mentioning preference regarding configuration parameters by Akron · 6 years ago
  49. 28dc17f Fix certainty values in TreeTagger output by Akron · 7 years ago
  50. 0426176 Remove certainty value on lemmata in Treetagger by Akron · 7 years ago
  51. 6727b21 Fixed lwc tests by Akron · 7 years ago
  52. 4c67919 Support for LWC dependency annotations by Akron · 7 years ago
  53. 56dfb31 Added test regarding offset bug in KorAP by Akron · 7 years ago
  54. d19e275 Recheck dependency tests by Akron · 7 years ago
  55. 3c56f50 Support file extensions in base tokenization file by Akron · 7 years ago
  56. 28c4e54 Fix missing command issue by Akron · 7 years ago
  57. d5643ad Warn on missing output parameter in extract by Akron · 7 years ago
  58. 9b67b93 Fix attribute generation for DeReKo by Akron · 7 years ago
  59. 9a062ce Fix tarring to include only filenames by Akron · 7 years ago
  60. 0a6cce1 Remove non-core fc by Akron · 7 years ago
  61. 3abc03e Fixed exit codes in script by Akron · 7 years ago
  62. 0f9b93a Fixed minor issue in I5 meta parsing by Akron · 7 years ago
  63. 403934d Fixed CMC for empty features by Akron · 8 years ago
  64. 36d4627 Fixed feature treatment in CMC morpho by Akron · 8 years ago
  65. aaea246 One more missing permission problem in the test suite fixed by Akron · 8 years ago
  66. 5fd2d8e Fixed more permission and dependency issues by Akron · 8 years ago
  67. ce125b6 Improved documentation on new features by Akron · 8 years ago
  68. d5bb434 Fixed permissions in test suite by Akron · 8 years ago
  69. e599379 Added treatment of CMC data by Akron · 8 years ago
  70. 918ce42 Fixed primary data handling for data with white space at the beginning and at the end by Akron · 8 years ago
  71. a308c71 Start testing with DCK by Akron · 8 years ago
  72. da3097e Finished tar flag by Akron · 8 years ago
  73. 486f9ab Improved tar support by Akron · 8 years ago
  74. 081639e Added preliminary tar support by Akron · 8 years ago
  75. 9ec8887 Introduced sequential extraction flag to circumvent troubles with parallel extraction by Akron · 8 years ago
  76. 3a486f8 Another unzip flag update (-uo) by Akron · 8 years ago
  77. 86db52e Improved unzip overwriting mechanism by Akron · 8 years ago
  78. 0278ca2 Test zip overwriting by Akron · 8 years ago
  79. bd3adda Fixing behaviour for existing output directories by Akron · 8 years ago
  80. 442c4e9 Updated readme by Akron · 8 years ago
  81. 63f20d4 Support serial conversion and input-base by Akron · 8 years ago
  82. 8150010 Introduced temporary extraction by Akron · 8 years ago
  83. 636aa11 Added configuration to script by Akron · 8 years ago
  84. 821db3d Add wildcard support for inputs by Akron · 8 years ago
  85. 55778f0 Added preliminary support for diacritic insensitivity support by Akron · 8 years ago
  86. 5809fea Fixed casefolding for case insensitivity by Akron · 8 years ago
  87. b2f1ab8 Improve test suite for MarMoT by Akron · 8 years ago
  88. c11f798 Add auto-core-calculation by Akron · 8 years ago
  89. 3bd942f Added marmot-support by Akron · 8 years ago
  90. f624084 Added test for quotes in archives for archiving by Akron · 8 years ago
  91. 60a8caa Treat prefixes correct for text sigles by Akron · 8 years ago
  92. 08d5445 Changed meta name for pages by Akron · 8 years ago
  93. d35d2d3 Fixed pagebreak test by Akron · 8 years ago
  94. 3c11964 Added comment regarding missing pagebreaks in the data by Akron · 8 years ago
  95. 636bd9c Fixed pagebreak treatment in script by Akron · 8 years ago
  96. 41ac10b Added pagebreak annotations (with '~'-prefix) by Akron · 8 years ago
  97. 0465de5 Improved handling of weird metadata stuff by Akron · 8 years ago
  98. 3887301 More relaxed handling of document siglen by Akron · 8 years ago
  99. a7d0e9f Improved DRuKoLa meta data handling by Akron · 8 years ago
  100. ce41be8 Updated announced dependency to Mojolicious by Akron · 8 years ago