| commit | d1531de5fa1aa3dba2e4b383ea5fe98de34818f9 | [log] [tgz] |
|---|---|---|
| author | Marc Kupietz <kupietz@ids-mannheim.de> | Sat Mar 21 14:35:32 2026 +0100 |
| committer | Marc Kupietz <kupietz@ids-mannheim.de> | Sat Mar 21 14:54:39 2026 +0100 |
| tree | f1efd6b3009c1e5fa290a8962421b98b6ffa26e0 | |
| parent | 187bcdc918f76307d7d84bc57241274c84abded5 [diff] |
Switch to @yao-pkg/pkg to build executables again Change-Id: I33c78c05b28c0050dc54c506cdc217b4be93f2d9
Reads CoNLL-U format from stdin and annotates emojis, emoticons, hashtags, URLs, email addresses, action words, @names, and Wikipedia emoji templates with their corresponding STTS-IBK POS tag (Beißwenger/Bartsch/Evert/Würzner 2016). Writes CoNLL-U format to stdout.
# Annotate CoNLL-U input korapxml2conllu kyc.zip | docker run --rm -i korap/conllu-cmc # With sparse output (only annotated lines) korapxml2conllu kyc.zip | docker run --rm -i korap/conllu-cmc -s # Generate KorAP-XML zip with CMC annotations korapxml2conllu kyc.zip | docker run --rm -i korap/conllu-cmc -s | conllu2korapxml > kyc.cmc.zip # Show help docker run --rm korap/conllu-cmc --help
korapxml2conllu kyc.zip | npx conllu-cmc
korapxml2conllu kyc.zip | ./conllu2cmc
korapxml2conllu kyc.zip | conllu2cmc -s | conllu2korapxml > kyc.cmc.zip
docker pull korap/conllu-cmc
Download pre-built executables from the Releases page:
conllu2cmc - Linux x64conllu2cmc - macOS x64conllu2cmc.exe - Windows x64npm install 'git+https://gitlab.ids-mannheim.de/KorAP/conllu-cmc-docker.git'
npm install
# Build for all platforms npm run pkg-all # Or build for specific platforms npm run pkg-linux # Linux x64 npm run pkg-macos # macOS x64 npm run pkg-win # Windows x64
Executables are created in bin/linux/, bin/macos/, and bin/win/.
Beißwenger, Michael/Bartsch, Sabine/Evert, Stefan/Würzner, Kay-Michael (2016): EmpiriST 2015: A Shared Task on the Automatic Linguistic Annotation of Computer-Mediated Communication and Web Corpora. In: Proceedings of the 10th Web as Corpus Workshop. Berlin: Association for Computational Linguistics, S. 44–56. https://doi.org/10.18653/v1/W16-2606.