1. 84da39d Now look for EPUBs in SRC_DIR/*/*.epub by Marc Kupietz · 1 year, 5 months ago
  2. 7582d57 Implement new URN solution by Marc Kupietz · 1 year, 5 months ago
  3. 877daea Readme: add hint on korap docker restart by Marc Kupietz · 1 year, 7 months ago
  4. 161d011 Make 'make test' easier and document it by Marc Kupietz · 1 year, 5 months ago
  5. c4ea409 Add rend="URN" attribute to idno elements of type URN by Marc Kupietz · 1 year, 5 months ago
  6. 8653ed5 Add test that idno elements are present by Marc Kupietz · 1 year, 5 months ago
  7. 68928f8 docker-compose:listen to port 4000 instead of 80 by Marc Kupietz · 1 year, 6 months ago
  8. 3e30bb8 docker-compose: explicitely use kustvakt:0.73.2 instead of latest by Marc Kupietz · 1 year, 6 months ago
  9. 7ba4273 docker-compose: use absolute paths by Marc Kupietz · 1 year, 6 months ago
  10. 1fbd37f Relax corpus sigle regex, to also allow for 2 digit sigles by Nicolas Arnold · 1 year, 6 months ago
  11. c1c3083 FIX: Handle EPUB with .htm files as well by Nicolas Arnold · 1 year, 6 months ago
  12. 7bb2b6f Move idno elements from idsDoc to idsText by Rebecca Wilm · 1 year, 6 months ago
  13. c169e83 Add more italic span mappings by Nicolas Arnold · 1 year, 6 months ago
  14. 3211880 Some additions to the genretable by Nicolas Arnold · 1 year, 7 months ago
  15. 5003c04 Update Readme by Marc Kupietz · 1 year, 7 months ago
  16. c214b4c Add more text type/gerne heuristics by Marc Kupietz · 1 year, 7 months ago
  17. 68b4f8a Use catalogs provided by xmlresolver-data by Marc Kupietz · 1 year, 7 months ago
  18. 5f68033 Upgrade from Saxon 9 to 12.4 by Marc Kupietz · 1 year, 7 months ago
  19. 0df2a57 Use base-uri instead of document-uri to get source file name by Marc Kupietz · 1 year, 7 months ago
  20. 9864524 Add uncommentable debugging messages by Marc Kupietz · 1 year, 7 months ago
  21. 7cc8fc7 Add xml decl to xsl stylesheet by Marc Kupietz · 1 year, 7 months ago
  22. 2635563 Improve typofix to be less ambiguous by Nicolas Arnold · 1 year, 7 months ago
  23. e5f055a Include idnos in metadata by Rebecca Wilm · 1 year, 7 months ago
  24. df3aacd CI: fix index and deploy condition by Marc Kupietz · 1 year, 7 months ago
  25. f9a703b CI: try to make manual jobs not block PRs by Marc Kupietz · 1 year, 7 months ago
  26. 105913d CI: skip index build and deploy jobs on merge requests by Marc Kupietz · 1 year, 7 months ago
  27. 54ab7e5 Merge branch 'fix-typo' into 'main' by Marc Kupietz · 1 year, 7 months ago
  28. 72aa7a1 WIP Fix: Possible typo? by Nicolas Arnold · 1 year, 7 months ago
  29. fe5cce5 Merge branch '15-make-sure-to-get-the-right-metadata-from-the-dnb-sru-api' into 'main' by Marc Kupietz · 1 year, 7 months ago
  30. f614a4b Only get metadata for the given record by Rebecca Wilm · 1 year, 7 months ago
  31. 9d87e9d Add genre classification based on metadata keywords by Marc Kupietz · 1 year, 8 months ago
  32. 0c24663 Distinguish between idno, isbn and dnbidno by Marc Kupietz · 1 year, 8 months ago
  33. ed3cc3a Simplify testing by Marc Kupietz · 1 year, 8 months ago
  34. 8467724 Get rid of mallet warnings about missing log props by Marc Kupietz · 1 year, 8 months ago
  35. 4dc1163 Update textclassifier by Marc Kupietz · 1 year, 8 months ago
  36. 5df1b16 Fix missing ERROR inc by Marc Kupietz · 1 year, 8 months ago
  37. 54ec28b Fix missing spaces at <br/> elements by Marc Kupietz · 1 year, 8 months ago
  38. 15e7d61 Handle editors by Marc Kupietz · 1 year, 8 months ago
  39. eaa9013 Handle translator by Marc Kupietz · 1 year, 8 months ago
  40. fb0f2c3 Handle span/@class='it' by Marc Kupietz · 1 year, 8 months ago
  41. ad4d446 Handle span/@class='norm' by Marc Kupietz · 1 year, 8 months ago
  42. 73a26bf Count errors in script and don´t ignore them in make test by Marc Kupietz · 1 year, 8 months ago
  43. 5b734ce CI: only dnb I5 files are important artefacts by Marc Kupietz · 1 year, 8 months ago
  44. edce85c CI: build dnb13.i5.xml by Marc Kupietz · 1 year, 8 months ago
  45. 41c4238 Add publication year to corpus title by Marc Kupietz · 1 year, 8 months ago
  46. fddbb51 Add test with [Übersetzer] in metadata by Marc Kupietz · 1 year, 8 months ago
  47. de2ca53 Pick authors only from dc:creator fields that contain [Verfasser] or nothing in brackets by Marc Kupietz · 1 year, 8 months ago
  48. 73422ce CI: use bash explicitly for assert tests by Marc Kupietz · 1 year, 8 months ago
  49. ab0a733 Add framework for semantic CI tests by Marc Kupietz · 1 year, 8 months ago
  50. 3460d26 CI: install jre-17 and fix test by Marc Kupietz · 1 year, 8 months ago
  51. 8d47303 Disallow robots by Marc Kupietz · 1 year, 8 months ago
  52. 3146b63 Increase heap again for marmot+malt annotation by Marc Kupietz · 1 year, 8 months ago
  53. 52aa505 Allow doc sigles to be only 2 chars long by Marc Kupietz · 1 year, 8 months ago
  54. 398b596 Make doc sigles even safer by Marc Kupietz · 1 year, 8 months ago
  55. ec784c8 CI: load domain classificator by Marc Kupietz · 1 year, 8 months ago
  56. 3989c74 Handle h4-h6 and strong by Marc Kupietz · 1 year, 8 months ago
  57. 77b6aa9 Make doc sigle more fail safe by Marc Kupietz · 1 year, 8 months ago
  58. ccf0904 Fix marmot and malt output by Marc Kupietz · 1 year, 8 months ago
  59. 15144ad Make: let test depend on models/dereko_domains_s.classifier by Marc Kupietz · 1 year, 8 months ago
  60. 8a1e465 Add models/dereko_domains_s.classifier to dependencies by Marc Kupietz · 1 year, 8 months ago
  61. a553865 Add topic domain classification in XSLT pass2 by Marc Kupietz · 1 year, 8 months ago
  62. 09745e1 Error out on invalid text sigles by Marc Kupietz · 1 year, 8 months ago
  63. d653bb8 Make: drop too slow spacy for now by Marc Kupietz · 1 year, 8 months ago
  64. cd32598 Make: increase heap for annotation tasks by Marc Kupietz · 1 year, 8 months ago
  65. d70fe26 Fix surname initial of second author by Marc Kupietz · 1 year, 8 months ago
  66. 66618ca Fix class attribute conditions by Marc Kupietz · 1 year, 8 months ago
  67. 33d1128 Resolve hi priorities in pass3 by Marc Kupietz · 1 year, 8 months ago
  68. ad1f3b8 Fix title -> doc sigle stop words by Marc Kupietz · 1 year, 8 months ago
  69. 2d15922 Fix getting second author initial by Marc Kupietz · 1 year, 8 months ago
  70. 568240f Use last 5 digits of ISBN as text number by Marc Kupietz · 1 year, 8 months ago
  71. 2badfb1 Error out if no author by Marc Kupietz · 1 year, 8 months ago
  72. 815cc6c Error out if no title by Marc Kupietz · 1 year, 8 months ago
  73. d320f99 CI fix global SRC_DIR, YEAR variables by Marc Kupietz · 1 year, 8 months ago
  74. cfda5f0 Update .gitignore by Marc Kupietz · 1 year, 8 months ago
  75. e97a4ef Update Readme.md by Marc Kupietz · 1 year, 8 months ago
  76. 3c72db8 Let external kalamar port default to 80 by Marc Kupietz · 1 year, 8 months ago
  77. 74fb31d Ignore faulty xhtml input files and conversion errors by Marc Kupietz · 1 year, 8 months ago
  78. 5652c81 Keep more intermediate files as long as we debug by Marc Kupietz · 1 year, 8 months ago
  79. b7a4f6c Die with ERROR if no year could be extracted from DNB DC metadata by Marc Kupietz · 1 year, 8 months ago
  80. 13e2858 Make: default SRC_DIR to production sample by Marc Kupietz · 1 year, 8 months ago
  81. 8e4e23e ISBN checksum 10 is encoded as X by Marc Kupietz · 1 year, 8 months ago
  82. ea684b8 Old ISBN numbers only have 10 digits by Marc Kupietz · 1 year, 8 months ago
  83. 38019b1 Catch two more common span classes i and b by Marc Kupietz · 1 year, 8 months ago
  84. ab1f3ac Build index for all parts by Marc Kupietz · 1 year, 8 months ago
  85. 2fe36fa Fix CI test by Marc Kupietz · 1 year, 8 months ago
  86. deb9546 Sanitize Makefile by dropping YY - use YEARS instead by Marc Kupietz · 1 year, 8 months ago
  87. d059d2d Fix heap calculation for annotations by Marc Kupietz · 1 year, 8 months ago
  88. 0961994 Give enough heap space to marmot and mal by Marc Kupietz · 1 year, 8 months ago
  89. 8759751 Improve korapxml2krill performance by Marc Kupietz · 1 year, 8 months ago
  90. 958df03 Join adjacent hi elements by Marc Kupietz · 1 year, 8 months ago
  91. 8d29363 Delete empty and only nbsp p and div elements by Marc Kupietz · 1 year, 8 months ago
  92. 6bcec63 Remove empty idsTexts and idsDocs by Marc Kupietz · 1 year, 8 months ago
  93. 5e87311 Ignore highlighting for "regular" classes by Marc Kupietz · 1 year, 8 months ago
  94. 13c986a Let final I5 depend on pass 2 and 3 stylesheets by Marc Kupietz · 1 year, 8 months ago
  95. 164a283 Fix last I5 validity errors by Marc Kupietz · 1 year, 8 months ago
  96. 28f48e1 Run xslt pass 2 and three on the whole year volumes by Marc Kupietz · 1 year, 8 months ago
  97. f62bc90 Turn off attribute expansion by Marc Kupietz · 1 year, 8 months ago
  98. 1a370e0 Start with 2nd XSLT pass by Marc Kupietz · 1 year, 8 months ago
  99. bf47ae7 Fix headings nested in ps by Marc Kupietz · 1 year, 8 months ago
  100. 10903f3 Use map in country code function by Marc Kupietz · 1 year, 8 months ago