We are happy to announce the open source release of Rabbid (“Recherche- und Analyse-Basis für Belegstellen in Diskursen”). Rabbid is a standalone rapid application development environment for KorAP and used in production for the creation and management of collections of textual examples in the area of discourse analysis and discourse lexicography.
The development of Rabbid was a joint effort by the KorAP project and Dr. Ruth Mell of the Demokratiediskurs 1918-1925 project at the Institute for the German Language in Mannheim.
Unlike KorAP, Rabbid provides only a limited set of search operators for small, non-annotated corpora.
You can download Rabbid from GitHub. Rabbid is free software published under the BSD-2 License.
The proceedings volume of the third meeting of CMLC (Challenges in the Management of Large Corpora) is available from the Library Services of IDS Mannheim.
Individual proceedings items as well as previous volumes are also available from the workshop page.
The programme of the 3rd meeting of the workshop on Challenges in the Management of Large Corpora (CMLC-3) has been posted, with the open-content publication of the proceedings volume scheduled for the beginning of July at the latest.
The workshop is going to be held on the 20th of July as part of the Corpus Linguistics 2015 (CL2015) conference.
We are happy to announce the open source release of Kalamar, the Mojolicious-based frontend for KorAP!
- aligned KWIC views,
- multiple highlighting,
- table views of morphological annotations,
- tree views of hierarchical annotations,
- a language-independent query helper for multiple tag sets,
- and an embedded and interactive documentation!
Expect more features to come! You can already use Kalamar and download the sources from GitHub.
KoralQuery, the general Corpus Query Protocol used for inter-component communication in KorAP, was presented on May 11th at the workhop on Innovative Corpus Query and Visualization Tools (QueryVis). The workshop was part of the 20th Nordic Conference of Computational Linguistics (Nodalida) in Vilnius, Lithuania. Proceedings are already available.
We would like to thank the reviewers and organizers for a great workshop!
Please cite this work as:
Joachim Bingel, Nils Diewald (2015). KoralQuery – a General Corpus Query Protocol, Proceedings of the Workshop on Innovative Corpus Query and Visualization Tools at NODALIDA 2015, May 11-13, 2015, Vilnius, Lithuania.
We are happy to announce the open source release of Krill, the Lucene-based search backend for KorAP! Krill is the reference implementation for KoralQuery, covering most of the protocols features, including …
- Fulltext search
- Token-based annotation search
- Span-based annotation search
- Distance search
- Positional search
- Nested queries
… and many more!
You can download Krill on GitHub – feedback and contributions are very welcome!
We are happy to announce the release of Koral, the module which KorAP uses to translate queries from its supported query languages into KoralQuery, a general protocol for queries to corpus analysis systems. Taking a query string as its input, Koral generates a corresponding KoralQuery instance which represents that query independently of the source query language, such that the system may work in a query language-agnostic fashion. Besides the actual linguistic query, KoralQuery also has facilities to represent virtual collection definitions as well as error and warning messages that may arise during query processing.
You can access and download the Koral sources from the KorAP GitHub repository. Please note that the current version 0.1.0 is not a final version and subject to work in progress, which will result in further releases in the not-so-far future.
The second frontend, which in time is going to become the official one, is now back up, with improvements. It is only reachable IDS-internally, for the time being. (But it won’t be long!)
It has its own half-interactive tutorial built in, and many nifty visualisation features. Do give it a try.
In order to familiarize users with the current KorAP interface, a new selection of tutorial videos has been made available:
Furthermore, the frontend has been updated to version 1.084, which includes fixes for several stability issues and layout changes.
In the current stage of development, several corpus archives of the current release candidate of DeReKo (2014-II), prepared by Marc Kupietz, have been transformed and annotated to work with the next release of the search engines of KorAP.
- Wikipedia (2013 release)
- Goethes Werke
- Bonner Zeitungskorpus
Further archives are in planning and progress.
New versions of KorAP’s backend and frontend application have been made accessible for users connecting from within the IDS network.
Service version: 1.51
Frontend version: 1.083
Updates include several stability bug fixes as well as a further integration with a second search engine option to generate KWIC results, developed by Piotr Pęzik based on Neo4j. This feature is currently only available via API access, but will be made available via the user interface in the next release of the frontend version.