1. 7f3fe79 Add debug flag by Marc Kupietz · 1 year, 5 months ago
  2. 18c6159 Add DeLiKo@DNB to corpus title by Marc Kupietz · 1 year, 5 months ago
  3. 5f0db26 Index translator as a text using env variable by Marc Kupietz · 1 year, 5 months ago
  4. bec2ff0 Update Saxon from 12.4 to 12.5 by Marc Kupietz · 1 year, 5 months ago
  5. 9744860 Update Readme by Marc Kupietz · 1 year, 5 months ago
  6. 5d6c6fa Introduce env var INDEX for docker compose by Marc Kupietz · 1 year, 5 months ago
  7. 142012e Remove kustvakt-full docker compose configurations by Marc Kupietz · 1 year, 5 months ago
  8. 8e97481 Revert "docker-compose: use absolute paths" by Marc Kupietz · 1 year, 5 months ago
  9. 84da39d Now look for EPUBs in SRC_DIR/*/*.epub by Marc Kupietz · 1 year, 5 months ago
  10. 7582d57 Implement new URN solution by Marc Kupietz · 1 year, 5 months ago
  11. 877daea Readme: add hint on korap docker restart by Marc Kupietz · 1 year, 7 months ago
  12. 161d011 Make 'make test' easier and document it by Marc Kupietz · 1 year, 5 months ago
  13. c4ea409 Add rend="URN" attribute to idno elements of type URN by Marc Kupietz · 1 year, 5 months ago
  14. 8653ed5 Add test that idno elements are present by Marc Kupietz · 1 year, 5 months ago
  15. 68928f8 docker-compose:listen to port 4000 instead of 80 by Marc Kupietz · 1 year, 6 months ago
  16. 3e30bb8 docker-compose: explicitely use kustvakt:0.73.2 instead of latest by Marc Kupietz · 1 year, 6 months ago
  17. 7ba4273 docker-compose: use absolute paths by Marc Kupietz · 1 year, 6 months ago
  18. 1fbd37f Relax corpus sigle regex, to also allow for 2 digit sigles by Nicolas Arnold · 1 year, 6 months ago
  19. c1c3083 FIX: Handle EPUB with .htm files as well by Nicolas Arnold · 1 year, 6 months ago
  20. 7bb2b6f Move idno elements from idsDoc to idsText by Rebecca Wilm · 1 year, 6 months ago
  21. c169e83 Add more italic span mappings by Nicolas Arnold · 1 year, 6 months ago
  22. 3211880 Some additions to the genretable by Nicolas Arnold · 1 year, 7 months ago
  23. 5003c04 Update Readme by Marc Kupietz · 1 year, 7 months ago
  24. c214b4c Add more text type/gerne heuristics by Marc Kupietz · 1 year, 7 months ago
  25. 68b4f8a Use catalogs provided by xmlresolver-data by Marc Kupietz · 1 year, 7 months ago
  26. 5f68033 Upgrade from Saxon 9 to 12.4 by Marc Kupietz · 1 year, 7 months ago
  27. 0df2a57 Use base-uri instead of document-uri to get source file name by Marc Kupietz · 1 year, 7 months ago
  28. 9864524 Add uncommentable debugging messages by Marc Kupietz · 1 year, 7 months ago
  29. 7cc8fc7 Add xml decl to xsl stylesheet by Marc Kupietz · 1 year, 7 months ago
  30. 2635563 Improve typofix to be less ambiguous by Nicolas Arnold · 1 year, 7 months ago
  31. e5f055a Include idnos in metadata by Rebecca Wilm · 1 year, 7 months ago
  32. df3aacd CI: fix index and deploy condition by Marc Kupietz · 1 year, 7 months ago
  33. f9a703b CI: try to make manual jobs not block PRs by Marc Kupietz · 1 year, 7 months ago
  34. 105913d CI: skip index build and deploy jobs on merge requests by Marc Kupietz · 1 year, 7 months ago
  35. 54ab7e5 Merge branch 'fix-typo' into 'main' by Marc Kupietz · 1 year, 7 months ago
  36. 72aa7a1 WIP Fix: Possible typo? by Nicolas Arnold · 1 year, 7 months ago
  37. fe5cce5 Merge branch '15-make-sure-to-get-the-right-metadata-from-the-dnb-sru-api' into 'main' by Marc Kupietz · 1 year, 7 months ago
  38. f614a4b Only get metadata for the given record by Rebecca Wilm · 1 year, 7 months ago
  39. 9d87e9d Add genre classification based on metadata keywords by Marc Kupietz · 1 year, 8 months ago
  40. 0c24663 Distinguish between idno, isbn and dnbidno by Marc Kupietz · 1 year, 8 months ago
  41. ed3cc3a Simplify testing by Marc Kupietz · 1 year, 8 months ago
  42. 8467724 Get rid of mallet warnings about missing log props by Marc Kupietz · 1 year, 8 months ago
  43. 4dc1163 Update textclassifier by Marc Kupietz · 1 year, 8 months ago
  44. 5df1b16 Fix missing ERROR inc by Marc Kupietz · 1 year, 8 months ago
  45. 54ec28b Fix missing spaces at <br/> elements by Marc Kupietz · 1 year, 8 months ago
  46. 15e7d61 Handle editors by Marc Kupietz · 1 year, 8 months ago
  47. eaa9013 Handle translator by Marc Kupietz · 1 year, 8 months ago
  48. fb0f2c3 Handle span/@class='it' by Marc Kupietz · 1 year, 8 months ago
  49. ad4d446 Handle span/@class='norm' by Marc Kupietz · 1 year, 8 months ago
  50. 73a26bf Count errors in script and don´t ignore them in make test by Marc Kupietz · 1 year, 8 months ago
  51. 5b734ce CI: only dnb I5 files are important artefacts by Marc Kupietz · 1 year, 8 months ago
  52. edce85c CI: build dnb13.i5.xml by Marc Kupietz · 1 year, 8 months ago
  53. 41c4238 Add publication year to corpus title by Marc Kupietz · 1 year, 8 months ago
  54. fddbb51 Add test with [Übersetzer] in metadata by Marc Kupietz · 1 year, 8 months ago
  55. de2ca53 Pick authors only from dc:creator fields that contain [Verfasser] or nothing in brackets by Marc Kupietz · 1 year, 8 months ago
  56. 73422ce CI: use bash explicitly for assert tests by Marc Kupietz · 1 year, 8 months ago
  57. ab0a733 Add framework for semantic CI tests by Marc Kupietz · 1 year, 8 months ago
  58. 3460d26 CI: install jre-17 and fix test by Marc Kupietz · 1 year, 8 months ago
  59. 8d47303 Disallow robots by Marc Kupietz · 1 year, 8 months ago
  60. 3146b63 Increase heap again for marmot+malt annotation by Marc Kupietz · 1 year, 8 months ago
  61. 52aa505 Allow doc sigles to be only 2 chars long by Marc Kupietz · 1 year, 8 months ago
  62. 398b596 Make doc sigles even safer by Marc Kupietz · 1 year, 8 months ago
  63. ec784c8 CI: load domain classificator by Marc Kupietz · 1 year, 8 months ago
  64. 3989c74 Handle h4-h6 and strong by Marc Kupietz · 1 year, 8 months ago
  65. 77b6aa9 Make doc sigle more fail safe by Marc Kupietz · 1 year, 8 months ago
  66. ccf0904 Fix marmot and malt output by Marc Kupietz · 1 year, 8 months ago
  67. 15144ad Make: let test depend on models/dereko_domains_s.classifier by Marc Kupietz · 1 year, 8 months ago
  68. 8a1e465 Add models/dereko_domains_s.classifier to dependencies by Marc Kupietz · 1 year, 8 months ago
  69. a553865 Add topic domain classification in XSLT pass2 by Marc Kupietz · 1 year, 8 months ago
  70. 09745e1 Error out on invalid text sigles by Marc Kupietz · 1 year, 8 months ago
  71. d653bb8 Make: drop too slow spacy for now by Marc Kupietz · 1 year, 8 months ago
  72. cd32598 Make: increase heap for annotation tasks by Marc Kupietz · 1 year, 8 months ago
  73. d70fe26 Fix surname initial of second author by Marc Kupietz · 1 year, 8 months ago
  74. 66618ca Fix class attribute conditions by Marc Kupietz · 1 year, 8 months ago
  75. 33d1128 Resolve hi priorities in pass3 by Marc Kupietz · 1 year, 8 months ago
  76. ad1f3b8 Fix title -> doc sigle stop words by Marc Kupietz · 1 year, 8 months ago
  77. 2d15922 Fix getting second author initial by Marc Kupietz · 1 year, 8 months ago
  78. 568240f Use last 5 digits of ISBN as text number by Marc Kupietz · 1 year, 8 months ago
  79. 2badfb1 Error out if no author by Marc Kupietz · 1 year, 8 months ago
  80. 815cc6c Error out if no title by Marc Kupietz · 1 year, 8 months ago
  81. d320f99 CI fix global SRC_DIR, YEAR variables by Marc Kupietz · 1 year, 8 months ago
  82. cfda5f0 Update .gitignore by Marc Kupietz · 1 year, 8 months ago
  83. e97a4ef Update Readme.md by Marc Kupietz · 1 year, 8 months ago
  84. 3c72db8 Let external kalamar port default to 80 by Marc Kupietz · 1 year, 8 months ago
  85. 74fb31d Ignore faulty xhtml input files and conversion errors by Marc Kupietz · 1 year, 8 months ago
  86. 5652c81 Keep more intermediate files as long as we debug by Marc Kupietz · 1 year, 8 months ago
  87. b7a4f6c Die with ERROR if no year could be extracted from DNB DC metadata by Marc Kupietz · 1 year, 8 months ago
  88. 13e2858 Make: default SRC_DIR to production sample by Marc Kupietz · 1 year, 8 months ago
  89. 8e4e23e ISBN checksum 10 is encoded as X by Marc Kupietz · 1 year, 8 months ago
  90. ea684b8 Old ISBN numbers only have 10 digits by Marc Kupietz · 1 year, 8 months ago
  91. 38019b1 Catch two more common span classes i and b by Marc Kupietz · 1 year, 8 months ago
  92. ab1f3ac Build index for all parts by Marc Kupietz · 1 year, 8 months ago
  93. 2fe36fa Fix CI test by Marc Kupietz · 1 year, 8 months ago
  94. deb9546 Sanitize Makefile by dropping YY - use YEARS instead by Marc Kupietz · 1 year, 8 months ago
  95. d059d2d Fix heap calculation for annotations by Marc Kupietz · 1 year, 8 months ago
  96. 0961994 Give enough heap space to marmot and mal by Marc Kupietz · 1 year, 8 months ago
  97. 8759751 Improve korapxml2krill performance by Marc Kupietz · 1 year, 8 months ago
  98. 958df03 Join adjacent hi elements by Marc Kupietz · 1 year, 8 months ago
  99. 8d29363 Delete empty and only nbsp p and div elements by Marc Kupietz · 1 year, 8 months ago
  100. 6bcec63 Remove empty idsTexts and idsDocs by Marc Kupietz · 1 year, 8 months ago