Add English CA based on Wikipedia for comparison

Change-Id: Ic164e21dc25640a4de1d53561c5fc2e64d4d503c
2 files changed
tree: b3e244b474fc21246bb96c17280fc35c093a8b1a
  1. ci/
  2. css/
  3. data/
  4. R/
  5. .gitignore
  6. .gitlab-ci.yml
  7. icc-iclc10.Rproj
  8. Readme.md
Readme.md

Resources and R-Scripts

used for the poster

Marc Kupietz, Adrien Barbaresi, Anna Cermakova, Małgorzata Czachor, Nils Diewald, Jarle Ebeling, Rafał L. Górski, John Kirk, Michal Křen, Harald Lüngen, Eliza Margaretha, Signe Oksefjell Ebeling, Mícheál Ó Meachair, Ines Pisetta, Elaine Uí Dhonnchadha, Friedemann Vogel, Rebecca Wilm, Jiajin Xu and Rameela Yaddehige:

News from the International Comparable Corpus: First launch of ICC written

(To be) presented at ICLC-10

Latest artifacts built by the CI pipeline can be found here

Some artifacts

Tokens per ICC genre

tokens per ICC genre

Tokens per year

tokens per year

POS proportions

POS proportions