Marc Kupietz | 4689e79 | 2021-09-07 23:29:51 +0200 | [diff] [blame] | 1 | % layout 'main', title => 'KorAP: Corpus Data'; |
Nils Diewald | a31a515 | 2015-04-17 21:05:23 +0000 | [diff] [blame] | 2 | |
Akron | 9490e3b | 2019-10-17 12:26:29 +0200 | [diff] [blame] | 3 | %= page_title |
| 4 | |
Akron | c072577 | 2019-07-18 16:23:18 +0200 | [diff] [blame] | 5 | <p>KorAP is developed as being the main access point to |
Akron | 9490e3b | 2019-10-17 12:26:29 +0200 | [diff] [blame] | 6 | <%= ext_link_to 'DeReKo', 'http://www1.ids-mannheim.de/kl/projekte/korpora' %>, |
| 7 | being the successor of <%= ext_link_to 'COSMAS II', 'https://cosmas2.ids-mannheim.de/cosmas2-web/' %> in that regard. |
| 8 | But KorAP is not focussed on any specific corpus, it is, for example, now also used for the Romanian national corpus <%= ext_link_to 'CoRoLa', 'http://corola.racai.ro/' %>.</p> |
Nils Diewald | a31a515 | 2015-04-17 21:05:23 +0000 | [diff] [blame] | 9 | |
Akron | c072577 | 2019-07-18 16:23:18 +0200 | [diff] [blame] | 10 | <p>In KorAP, corpus texts are allowed to have arbitrary metadata information, that partially can be used to create subcorpora (so-called virtual corpora).</p> |
| 11 | |
Akron | 3cfa26d | 2019-10-24 15:17:34 +0200 | [diff] [blame] | 12 | <p>KorAP also supports an arbitrary number of <%= embedded_link_to 'doc', 'Annotations', 'data', 'annotation' %> from different sources (called <em>foundries</em>) with different <em>layers</em>.</p> |
Akron | c072577 | 2019-07-18 16:23:18 +0200 | [diff] [blame] | 13 | |
| 14 | <dl> |
| 15 | <p>Annotations of the following kind are supported:</p> |
| 16 | <dt>Tokens</dt> |
| 17 | <dd>Annotations associated to single tokens (e.g. words or numbers)</dd> |
| 18 | |
| 19 | <dt>Spans</dt> |
| 20 | <dd>Annotations to a sequence of words or nodes (e.g. sentences, phrases, constituency annotations)</dd> |
| 21 | |
| 22 | <dt>Relations</dt> |
| 23 | <dd>Annotations of relations between tokens or spans (e.g. dependency annotations)</dd> |
| 24 | |
| 25 | <dt>Attributes</dt> |
| 26 | <dd>Attribute information for tokens, spans, or relations (e.g. attributes of HTML elements)</dd> |
| 27 | </dl> |