commit | 05b2277b70f8ac3f20ab4de8c676716add32e30d | |
---|---|---|
author | Marc Kupietz <kupietz@ids-mannheim.de> | Tue Feb 18 21:58:42 2020 +0100 |
committer | Marc Kupietz <kupietz@ids-mannheim.de> | Tue Feb 18 22:37:53 2020 +0100 |
tree | ebec81a624b892d6f39f72c34f4e0f1e008c73e1 | |
parent | e65ac41dec1cdd7777192e7a9c504bf90cc0db6d | |
Don't run long-running and redundant tests by default
Use --run-donttest to run them
Change-Id: Idec047eff02b9a3e2dababc6c6fc0347ca43e5de
A simple R package for accessing the web service API of the KorAP Corpus Analysis Platform, developed at the IDS Mannheim.
This package is in its early stages and not yet stable! In particular, expect objects, functions, and parameters, as well as their names and identifiers, to keep changing without notice. Use it at your own risk!
There is no binary package on CRAN yet, so you have to install the development version from our Gerrit server using the devtools package:
    # install.packages("devtools")
    library(devtools)
    install_git("https://korap.ids-mannheim.de/gerrit/KorAP/RKorAPClient")
    library(RKorAPClient)
    ?corpusQuery
    ?frequencyQuery
    library(RKorAPClient)
    new("KorAPConnection", verbose=TRUE) %>%
      corpusQuery("Hello world") %>%
      fetchAll()
    library(RKorAPClient)
    library(ggplot2)
    kco <- new("KorAPConnection", verbose=TRUE)
    expand_grid(condition = c("textDomain = /Wirtschaft.*/", "textDomain != /Wirtschaft.*/"),
                year = (2002:2018)) %>%
      cbind(frequencyQuery(kco, "[tt/l=Heuschrecke]",
                           paste0(.$condition," & pubDate in ", .$year))) %>%
      ipm() %>%
      ggplot(aes(x = year, y = ipm, fill = condition, colour = condition)) +
      geom_freq_by_year_ci()
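To make the `ipm()` step in the pipeline above more concrete: ipm stands for instances per million, i.e. the relative frequency of a query's hits scaled by 10^6. The following is a minimal base-R sketch using made-up counts. The data frame, its column names, and the use of `prop.test` for the confidence interval are assumptions for illustration only and are not necessarily how RKorAPClient computes these values.

```r
# Hypothetical counts (not real corpus data): hits of a query and the
# size of the corresponding virtual corpus in tokens, per year.
freq <- data.frame(
  year       = c(2005, 2006),
  totalHits  = c(120, 45),
  corpusSize = c(40e6, 30e6)
)

# Relative frequency, scaled to instances per million tokens
freq$ipm <- freq$totalHits / freq$corpusSize * 1e6

# 95% confidence interval of each proportion, also scaled to ipm.
# prop.test() is an assumed stand-in for whatever the package uses.
ci <- t(mapply(
  function(x, n) as.numeric(prop.test(x, n)$conf.int) * 1e6,
  freq$totalHits, freq$corpusSize
))
freq$conf.low  <- ci[, 1]
freq$conf.high <- ci[, 2]

print(freq)
```

Plotting `ipm` together with `conf.low` and `conf.high` per year then gives the kind of frequency-by-year chart with confidence bands that the example above produces.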
    library(RKorAPClient)
    query = c("macht []{0,3} Sinn", "ergibt []{0,3} Sinn")
    years = c(1980:2010)
    as.alternatives = TRUE
    vc = "textType = /Zeit.*/ & pubDate in"
    new("KorAPConnection", verbose=TRUE) %>%
      frequencyQuery(query, paste(vc, years), as.alternatives = as.alternatives) %>%
      hc_freq_by_year_ci(as.alternatives)
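The call above relies on R's vectorised `paste` to generate one virtual corpus definition per year. The sketch below shows the resulting strings for a shortened year range and, with made-up counts, what relative frequencies among alternatives could look like. The counts and the variable names `vcs`, `hits` and `share` are hypothetical, and the proportion computation is only an assumed reading of `as.alternatives`, not the package's own code.

```r
vc <- "textType = /Zeit.*/ & pubDate in"
years <- 1980:1982

# paste() recycles the single prefix over the vector of years,
# yielding one virtual corpus definition per year
vcs <- paste(vc, years)
print(vcs)

# Hypothetical hit counts for the two competing variants in one year
hits <- c("macht []{0,3} Sinn" = 30, "ergibt []{0,3} Sinn" = 70)

# Treated as alternatives, each variant's share is taken relative to
# the sum of all variants rather than to the corpus size
share <- hits / sum(hits)
print(share)
```

Under this reading, the shares of all variants add up to 1 for each year, so the plot shows how the variants compete with each other rather than how frequent they are in the corpus overall.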
More elaborate R scripts demonstrating the use of the package can be found in the demo folder.
Authors: Marc Kupietz
Copyright (c) 2019, IDS Mannheim, Germany
This package is developed as part of the KorAP Corpus Analysis Platform at the Leibniz Institute for German Language (IDS).
It is published under the BSD-2 License.
Contributions are very welcome!
Your contributions should ideally be committed via our Gerrit server to facilitate reviewing (see Gerrit Code Review - A Quick Introduction if you are not familiar with Gerrit). However, we are also happy to accept comments and pull requests via GitHub.
Please note that unless you explicitly state otherwise any contribution intentionally submitted for inclusion into this software shall – as this software itself – be under the BSD-2 License.
Kupietz, Marc / Margaretha, Eliza / Diewald, Nils / Lüngen, Harald / Fankhauser, Peter (2019): What's New in EuReCo? Interoperability, Comparable Corpora, Licensing. In: Bański, Piotr / Barbaresi, Adrien / Biber, Hanno / Breiteneder, Evelyn / Clematide, Simon / Kupietz, Marc / Lüngen, Harald / Iliadi, Caroline (eds.): Proceedings of the International Corpus Linguistics Conference 2019 Workshop "Challenges in the Management of Large Corpora (CMLC-7)", 22nd of July. Mannheim: Leibniz-Institut für Deutsche Sprache, 33-39.