Challenges in the Management of Large Corpora — programme posted

The programme of the 3rd meeting of the workshop on Challenges in the Management of Large Corpora (CMLC-3) has been posted, with the open-content publication of the proceedings volume scheduled for the beginning of July at the latest.

The workshop is going to be held on the 20th of July as part of the Corpus Linguistics 2015 (CL2015) conference.

Kalamar – User Frontend released on GitHub!

Mojolicious-based Frontend to KorAP

We are happy to announce the open source release of Kalamar, the Mojolicious-based frontend for KorAP!

Kalamar is written in Perl and JavaScript, acts as a proof-of-concept for the KorAP API, and provides, among other features, …

  • aligned KWIC views,
  • multiple highlighting,
  • table views of morphological annotations,
  • tree views of hierarchical annotations,
  • localization,
  • a language-independent query helper for multiple tag sets,
  • and an embedded and interactive documentation!

Screenshots

Expect more features to come! You can already use Kalamar from inside the IDS and download the sources from GitHub.

EDIT: The IDS-Instance of KorAP is currently not accessible from outside the IDS.

KoralQuery at the QueryVis Workshop in Vilnius

KoralQuery, the general Corpus Query Protocol used for inter-component communication in KorAP, was presented on May 11th at the workhop on Innovative Corpus Query and Visualization Tools (QueryVis). The workshop was part of the 20th Nordic Conference of Computational Linguistics (Nodalida) in Vilnius, Lithuania. Proceedings are already available.

We would like to thank the reviewers and organizers for a great workshop!

Please cite this work as:
Joachim Bingel, Nils Diewald (2015). KoralQuery – a General Corpus Query Protocol, Proceedings of the Workshop on Innovative Corpus Query and Visualization Tools at NODALIDA 2015, May 11-13, 2015, Vilnius, Lithuania.

Krill – Lucene-based Search Backend released on GitHub!

A Corpusdata Retrieval Index using Lucene for Look-Ups

We are happy to announce the open source release of Krill, the Lucene-based search backend for KorAP! Krill is the reference implementation for KoralQuery, covering most of the protocols features, including …

  • Fulltext search
  • Token-based annotation search
  • Span-based annotation search
  • Distance search
  • Positional search
  • Nested queries

… and many more!

You can download Krill on GitHub – feedback and contributions are very welcome!