  1. c403644 Improve RWK morphology parser to support multiple morphological key:value pairs by Akron · 4 years, 7 months ago
  2. 85eb5aa Improve RWK structure parser for *-milestone elements by Akron · 4 years, 7 months ago
  3. 28299f4 Introduce special RWK structure parser by Akron · 4 years, 7 months ago
  4. 8ff5879 Introduce special RWK morphology parser by Akron · 4 years, 7 months ago
  5. dec4312 Fixed gap behind last token and <base/s:t> length by Akron · 4 years, 9 months ago
  6. b62d92a Fixed span position offset bug and fixed milestones behind last token bug by Akron · 4 years, 9 months ago
  7. 57799fc Fix editionStmt metadata parsing by Akron · 4 years, 10 months ago
  8. f1849aa Support non-verbal annotations by Akron · 5 years ago
  9. c29b8e1 Added support for DGD pseudo-sentences based on anchor milestones by Akron · 5 years ago
  10. 2029455 Added external link for AGD data in I5 meta by Akron · 5 years ago
  11. 0d68a4b Added 'distributor' field to I5 metadata by Akron · 5 years ago
  12. 7d5e638 Added support for Talismane by Akron · 5 years ago
  13. 57510c1 Added DGD support by Akron · 6 years ago
  14. f021ad6 Improve error handling by Akron · 6 years ago
  15. eaffe93 Fail hard on tokenization problems now by Akron · 6 years ago
  16. 955b75b Remove extract_text and extract_doc in favor of extract_sigle by Akron · 6 years ago
  17. 31a08cb Add extract_sigle method to archive by Akron · 6 years ago
  18. 63d03ee Ignore temporary-extraction on directory archiving by Akron · 6 years ago
  19. 6bf3cc9 Added links for wikipedia resources by Akron · 6 years ago
  20. 4e1712c Add english wikipedia example by Akron · 6 years ago
  21. 263274c Support koral versioning by Akron · 6 years ago
  22. c526e75 Include field serialization in versioned json output by Akron · 6 years ago
  23. 5eb3aa0 Set field types and serialize as koral:fields by Akron · 6 years ago
  24. ed9baf0 Support non-word-tokens (fixes #5) by Akron · 6 years ago
  25. c893ac3 Added tests and minor metadata parsing adjustments for HNC by Akron · 6 years ago
  26. 28dc17f Fix certainty values in TreeTagger output by Akron · 7 years ago
  27. 0426176 Remove certainty value on lemmata in Treetagger by Akron · 7 years ago
  28. 6727b21 Fixed lwc tests by Akron · 7 years ago
  29. 4c67919 Support for LWC dependency annotations by Akron · 7 years ago
  30. 56dfb31 Added test regarding offset bug in KorAP by Akron · 7 years ago
  31. d19e275 Recheck dependency tests by Akron · 7 years ago
  32. 3c56f50 Support file extensions in base tokenization file by Akron · 7 years ago
  33. 9a062ce Fix tarring to include only filenames by Akron · 7 years ago
  34. aaea246 One more missing permission problem in the test suite fixed by Akron · 7 years ago
  35. 5fd2d8e Fixed more permission and dependency issues by Akron · 7 years ago
  36. d5bb434 Fixed permissions in test suite by Akron · 7 years ago
  37. 918ce42 Fixed primary data handling for data with white space at the beginning and at the end by Akron · 7 years ago
  38. da3097e Finished tar flag by Akron · 8 years ago
  39. 486f9ab Improved tar support by Akron · 8 years ago
  40. 9ec8887 Introduced sequential extraction flag to circumvent troubles with parallel extraction by Akron · 8 years ago
  41. bd3adda Fixing behaviour for existing output directories by Akron · 8 years ago
  42. 63f20d4 Support serial conversion and input-base by Akron · 8 years ago
  43. 636aa11 Added configuration to script by Akron · 8 years ago
  44. 821db3d Add wildcard support for inputs by Akron · 8 years ago
  45. 55778f0 Added preliminary support for diacritic insensitivity support by Akron · 8 years ago
  46. b2f1ab8 Improve test suite for MarMoT by Akron · 8 years ago
  47. 3bd942f Added marmot-support by Akron · 8 years ago
  48. f624084 Added test for quotes in archives for archiving by Akron · 8 years ago
  49. 60a8caa Treat prefixes correct for text sigles by Akron · 8 years ago
  50. 08d5445 Changed meta name for pages by Akron · 8 years ago
  51. d35d2d3 Fixed pagebreak test by Akron · 8 years ago
  52. 3c11964 Added comment regarding missing pagebreaks in the data by Akron · 8 years ago
  53. 636bd9c Fixed pagebreak treatment in script by Akron · 8 years ago
  54. 41ac10b Added pagebreak annotations (with '~'-prefix) by Akron · 8 years ago
  55. 0465de5 Improved handling of weird metadata stuff by Akron · 8 years ago
  56. 3887301 More relaxed handling of document siglen by Akron · 8 years ago
  57. a7d0e9f Improved DRuKoLa meta data handling by Akron · 8 years ago
  58. 578af4b Support translator meta data type by Akron · 8 years ago
  59. c388150 Added more drukola tests by Akron · 8 years ago
  60. 3139917 Fixed DRuKoLa annotations by Akron · 8 years ago
  61. ace612e Added DRuKoLa annotations by Akron · 8 years ago
  62. 7e2eb88 Fixed analytic+monogr behaviour for metadata by Akron · 8 years ago
  63. 3ec0a1c Updated to Mojolicious 7.20 by Akron · 8 years ago
  64. 3741f8b Added base-sentences and base-paragraphs options by Akron · 8 years ago
  65. 53167fd Added new test data without base annotations by Akron · 8 years ago
  66. 89df4fa Fixed bug in tokenizer to recognize non-word-tokenizations by Akron · 8 years ago
  67. 6f9fef5 Ignore recursion in CoreNLP by Akron · 8 years ago
  68. 13d5662 Improved 'already processed' message by Akron · 8 years ago
  69. 2812ba2 Fixed archive handling and support multiple jobs for extraction by Akron · 8 years ago
  70. 2fd402b Added support for wildcards in document siglen by Akron · 8 years ago
  71. a76d835 Improved documentation (thx @margaretha) by Akron · 8 years ago
  72. 2080758 Added extraction method for documents in archives by Akron · 8 years ago
  73. 7606afa Improved documentation to be more precise regarding non-argument calls (thx @margaretha) by Akron · 8 years ago
  74. b3e9ccd Fixed windows support by Nils Diewald · 8 years ago
  75. 3ec4897 Added archive test for directories and parallel processing by Akron · 8 years ago
  76. 7d4cdd8 Added archive test script by Akron · 8 years ago
  77. 651cb8d Fix extraction of multiple archives by Akron · 8 years ago
  78. 03b24db Added test for sigles support in extract by Akron · 8 years ago
  79. f98b669 Test meta switch in script by Akron · 8 years ago
  80. e2b902d Fixed output of version and help screens by Akron · 8 years ago
  81. 5f51d42 Fixed annotation bug in script by Akron · 8 years ago
  82. 92ad95b Added test for script execution by Akron · 8 years ago
  83. afb81ad Fixed Mojolicious 7 support by Akron · 8 years ago
  84. af0ae3f Check sentence mapping in base/sentences by Akron · 8 years ago
  85. fbf6638 Added support for direct I5 support by Akron · 8 years ago
  86. e1dbc38 Added test for script calls by Akron · 8 years ago
  87. cdf0e00 Added batch processing class for documents by Akron · 8 years ago
  88. 405f0c5 Test file processing for batch processing by Akron · 8 years ago
  89. a86d94a Fixed MDParser data and test suite by Akron · 8 years ago
  90. 05ba547 Preliminary support for MDParser annotations by Akron · 8 years ago
  91. a5920b1 Improved test suite for caching and rei by Akron · 8 years ago
  92. b0c88db Added caching test by Akron · 8 years ago
  93. 0c3e375 Test multiple archives by Akron · 8 years ago
  94. f3f0c94 Added malt dependency resource by Akron · 8 years ago
  95. 08385f6 First step to multi-archive support by Akron · 9 years ago
  96. 1924bbe Added REI to test suite by Akron · 8 years ago
  97. e8adfcc Optimize performance of text listing by Akron · 9 years ago
  98. 1cd5b87 Use slashes as separators in siglen by Akron · 9 years ago
  99. 6396c30 Cleanup metadata files by Akron · 9 years ago
  100. 35db6e3 Simplified and modularized metadata processing by Akron · 9 years ago