commit | 0d4c909871b8bde87d656ef39d1eb514b6dfd778 | [log] [tgz] |
---|---|---|
author | Marc Kupietz <kupietz@ids-mannheim.de> | Mon Mar 23 09:02:30 2020 +0100 |
committer | Marc Kupietz <kupietz@ids-mannheim.de> | Mon Mar 23 09:07:22 2020 +0100 |
tree | a0d4761ead574e334d87d5a44899fd5b6593ed02 | |
parent | 384dc5c2027bb1968bcf89c97c89e3a693ac7c85 [diff] |
Fix printing of virtual corpus description Change-Id: Ie3a6f3eca6f66150929b80062d0734288ad0f3bf
R client package to access the web service API of the KorAP Corpus Analysis Platform developed at the IDS Mannheim
install.packages("RKorAPClient")
devtools::install_github("KorAP/RKorAPClient") remotes::install_github("KorAP/RKorAPClient") devtools::install_git("https://korap.ids-mannheim.de/gerrit/KorAP/RKorAPClient") remotes::install_git("https://korap.ids-mannheim.de/gerrit/KorAP/RKorAPClient")
library(RKorAPClient) new("KorAPConnection", verbose=TRUE) %>% corpusQuery("Hello world") %>% fetchAll()
library(RKorAPClient) library(ggplot2) kco <- new("KorAPConnection", verbose=TRUE) expand_grid(condition = c("textDomain = /Wirtschaft.*/", "textDomain != /Wirtschaft.*/"), year = (2002:2018)) %>% cbind(frequencyQuery(kco, "[tt/l=Heuschrecke]", paste0(.$condition," & pubDate in ", .$year))) %>% ipm() %>% ggplot(aes(x = year, y = ipm, fill = condition, colour = condition)) + geom_freq_by_year_ci()
See the Highcharts license notes below.
library(RKorAPClient) query = c("macht []{0,3} Sinn", "ergibt []{0,3} Sinn") years = c(1980:2010) as.alternatives = TRUE vc = "textType = /Zeit.*/ & pubDate in" new("KorAPConnection", verbose=T) %>% frequencyQuery(query, paste(vc, years), as.alternatives = as.alternatives) %>% hc_freq_by_year_ci(as.alternatives)
More elaborate R scripts demonstrating the use of the package can be found in the demo folder.
Authors: Marc Kupietz, Nils Diewald
Copyright (c) 2020, Leibniz Institute for the German Language, Mannheim, Germany
This package is developed as part of the KorAP Corpus Analysis Platform at the Leibniz Institute for German Language (IDS).
It is published under the BSD-2 License.
RKorAPClient imports parts of the highcharter package which has a dependency on Highcharts, a commercial JavaScript charting library. Highcharts offers both a commercial license as well as a free non-commercial license. Please review the licensing options and terms before using the highcharter plot options, as the RKorAPClient
license neither provides nor implies a license for Highcharts.
Highcharts is a Highsoft product which is not free for commercial and governmental use.
Contributions are very welcome!
Your contributions should ideally be committed via our Gerrit server to facilitate reviewing (see Gerrit Code Review - A Quick Introduction if you are not familiar with Gerrit). However, we are also happy to accept comments and pull requests via GitHub.
Please note that unless you explicitly state otherwise any contribution intentionally submitted for inclusion into this software shall – as this software itself – be under the BSD-2 License.
Kupietz, Marc / Margaretha, Eliza / Diewald, Nils / Lüngen, Harald / Fankhauser, Peter (2019): What’s New in EuReCo? Interoperability, Comparable Corpora, Licensing. In: Bański, Piotr/Barbaresi, Adrien/Biber, Hanno/Breiteneder, Evelyn/Clematide, Simon/Kupietz, Marc/Lüngen, Harald/Iliadi, Caroline (eds.): Proceedings of the International Corpus Linguistics Conference 2019 Workshop "Challenges in the Management of Large Corpora (CMLC-7)", 22nd of July Mannheim: Leibniz-Institut für Deutsche Sprache, 33-39.
Kupietz, Marc / Diewald, Nils / Margaretha, Eliza (forthcoming): RKorAPClient: An R package for accessing the German Reference Corpus DeReKo via KorAP. In: Proceedings of the Twelfth International Conference on Language Resources and Evaluation (LREC 2020). Marseille/Paris: European Language Resources Association (ELRA).