| commit | f0768c721b7e08bda41c957f2ec2dbf41e97dc86 | [log] [tgz] |
|---|---|---|
| author | Akron <nils@diewald-online.de> | Fri Nov 03 09:38:58 2017 +0100 |
| committer | Akron <nils@diewald-online.de> | Fri Nov 03 09:38:58 2017 +0100 |
| tree | 2b1f3329837b25444ea91552e503ce49ebb7f01a | |
| parent | 71fc0ec4078fe16f5bee42aa96d122dcfb7b03b8 [diff] |
Improve composition on koral result enrichments Change-Id: I60b2d26cf0be47324109e4031ea9033ce7ead64f

The Krawfish Prototype is a testbed search backend for KorAP, to implement design concepts both for Kanalito and Krill.
Krawfish Prototype focusses on
Krawfish Prototype is still work in progress. The segment handling of Krawfish is based on Krill and therefore heavily inspired by Lucene and Lucy.
! This software is in its early stages and not stable yet! Use it on your own risk!
Krawfish Prototype requires Perl of at least v5.10.1. The recommended environment is based on Perlbrew with App::cpanminus.
$ git clone https://github.com/KorAP/Krawfish-Prototype $ cd Krawfish-Prototype $ cpanm --installdeps . $ perl Makefile.PL $ make test
use Krawfish::Koral;
use Krawfish::Index;
# Add documents to index
my $index = Krawfish::Index->new;
$index->introduce_field('docID' => 'de_DE');
$index->add_doc('t/data/doc1.jsonld');
$index->add_doc('t/data/doc2.jsonld');
$index->commit;
# Start KoralQuery object
my $koral = Krawfish::Koral->new;
# Define a query
# [einen|"d.*"][][Hut]
my $query = $koral->query_builder;
$koral->query(
$query->seq(
$query->token(
$query->bool_or(
'einen',
$query->term_re('d.*')
)
),
$query->anywhere,
$query->term('Hut')
)
);
# Define a virtual corpus
my $corpus = $koral->corpus_builder;
$koral->corpus(
$corpus->bool_and(
$corpus->string('license=free'),
$corpus->string('corpus=corpus-2')
)
);
# Define a compilation target
my $compile = $koral->compilation_builder;
$koral->compilation(
$compile->aggregate(
$compile->a_fields('license'),
$compile->a_frequencies
),
$compile->enrich(
$compile->e_fields('textLength')
),
$compile->sort_by(
$compile->s_field('docID')
)
);
# Execute KoralQuery
my $request = $koral->to_query
->identify($index->dict)
->optimize($index->segment);
if ($request->next) {
print $request->current_match->to_string;
};
Copyright (C) 2017, IDS Mannheim
Author: Nils Diewald
Krawfish Prototype is free software published under the BSD-2 License.