1. a51907c minimal goal reached: data should now be ingestible; there is a niche of inefficiency, though, that may prevent my desktop from processing the entire dataset (other than KOT) by Piotr Banski · 2 years, 6 months ago
  2. 09096ee many fixes, structure.xml all done, retaining comments with the surface forms for now by Piotr Banski · 2 years, 6 months ago
  3. fdc858a I think it's done. Will clean it tomorrow and extend to morpho and beyond. My goodness. by Piotr Banski · 2 years, 6 months ago
  4. 69f3c5f this is just for demonstration that the offsets are now done in the KorAP way by Piotr Banski · 2 years, 6 months ago
  5. 5fe4bae far from perfect, but the road is straight now; adding the structure.xml doc, a bit schizophrenic in the indexing by Piotr Banski · 2 years, 6 months ago
  6. 6a4a252 this version attempts to re-traverse the tree over 6k times per single output document with structure in it, and I can't seem to be able to help that. It does the necessary calculation perfectly, but, naturally, in doing so it crashes my desktop by Piotr Banski · 2 years, 6 months ago
  7. f8af3a9 just saving the next step by Piotr Banski · 2 years, 6 months ago
  8. 4f4c2d2 one step further and I just want to save it by Piotr Banski · 2 years, 6 months ago
  9. 9dc1000 begin the switch from text.xml to ann_segmentation.xml; for now, data.xml is properly created (whitespace and tokenization alternatives). A lot of code cleanup has not yet happened. by bansp · 2 years, 6 months ago
  10. d1bf1db before migration from calc_content_length to calc_offsets by bansp · 2 years, 8 months ago
  11. a8e5cf1 forgot to save, already pushed the previous one, sorry by bansp · 2 years, 8 months ago
  12. b599253 this is a safety commit, before I take some stuff apart by bansp · 2 years, 8 months ago
  13. f2b24e6 add ability to skip some document IDs as a comma-separated parameter by bansp · 2 years, 8 months ago
  14. e726b4a stylesheet redone for handling larger datasets; just struct and morpho for now, though by bansp · 2 years, 8 months ago
  15. 8f6700b initial modification that I need to commit by bansp · 2 years, 8 months ago
  16. 97ba7ce update the I5 DTD identifiers, just in case by bansp · 2 years, 8 months ago
  17. ba37fb9 update gitignore by bansp · 2 years, 8 months ago
  18. 9103aab attempt to add to the headers (they are black boxes) by bansp · 2 years, 8 months ago
  19. 3e5b20c fix structure.xml, create morpho.xml by bansp · 2 years, 8 months ago
  20. 5f84173 derive structure.xml; the script isn't optimized yet but I would like to submit the output for a check by bansp · 2 years, 8 months ago
  21. 608b102 fix textSigle by bansp · 2 years, 9 months ago
  22. 5e2d1c0 first touch: make sure that I can grab at the data and send it where I want it to go by bansp · 2 years, 9 months ago
  23. 68529a8 proof of concept that I'm reaching for the right info by bansp · 2 years, 9 months ago
  24. 0748682 correct empty namespace nodes in the output by bansp · 2 years, 9 months ago
  25. f79443e version working in oXygen, data.xml should be identical modulo whitespace; missing references: metadata.xml, text.rng by bansp · 2 years, 9 months ago
  26. 9a8ee3e Added initial conversion script and example xspec by Akron · 2 years, 10 months ago