matchStart
and matchEnd
columns to corpusQuery results, containing the start and end positions of the match in the textmergeDuplicateCollocates
function to merge collocation analysis results for different context positionscollocationAnalysis
functionstextMetadata
method to use new metadata fields API, if available, to retrieve custom metadata for a text based on its sigleUpdates recommended RKorAPClient version to 0.8.0
Added textMetadata
KorAPConnection method to retrieve all metadata for a text based on its sigle
Added webUiRequestUrl
column also to corpusStats results, so that also virtual corpus definitions can be linked to / tested directly in the KorAP UI
Uses server side tokenized matches in collocation analysis, if supported by KorAP server
Unless metadataOnly
is set, also tokenized snippets are now retrieved in corpus queries (stored in res.slots['collectedMatches']['tokens.left']
, res.slots['collectedMatches']['tokens.match']
, res.slots['collectedMatches']['tokens.right']
). Because Pandas data frames cannot store lists, tokens are stored as strings, tab separated.
Python 3.11 and 3.12 are now supported
Python 3.7 support has been dropped (by rpy2 dependency)