blob: b7d280ecedfc63c4bb1262af4c56b727ac926a61 [file] [log] [blame]
Nils Diewald4af3f0b2014-06-25 01:43:17 +00001% content main => begin
Nils Diewald2329e1d2014-06-12 16:07:57 +00002
Nils Diewaldbd56adc2014-06-22 18:44:53 +00003%# Store the id of an active section in the session, so the system is able to directly scroll to the relevant section
4%# This should be stored when clicking on a specific query
5%# but the remembered section contains the id - not the query
Nils Diewald2329e1d2014-06-12 16:07:57 +00006
Nils Diewald1eba6572014-06-17 19:49:53 +00007<h2>KorAP-Tutorial</h2>
Nils Diewald2329e1d2014-06-12 16:07:57 +00008
Nils Diewald0d2dcc82014-06-18 17:10:49 +00009<!--
Nils Diewaldbd56adc2014-06-22 18:44:53 +000010<p>Links to Blog, FAQ, About, Contact ...</p>
Nils Diewald2329e1d2014-06-12 16:07:57 +000011<ul>
12 <li>Introduction to KorAP</li>
13 <li>How to use Poliqarp+ QL?</li>
14 <li>How to use Cosmas-II QL?</li>
15 <li>How to use CQL?</li>
16</ul>
Nils Diewald0d2dcc82014-06-18 17:10:49 +000017-->
Nils Diewald2329e1d2014-06-12 16:07:57 +000018
Nils Diewaldbd56adc2014-06-22 18:44:53 +000019<section id="intro">
20<h3>Introduction</h3>
Nils Diewald2329e1d2014-06-12 16:07:57 +000021<p>This is a Tutorial to KorAP. It may be maintained separately (as a Wiki?) and
Nils Diewald0d2dcc82014-06-18 17:10:49 +000022has some nice features - like embedded example queries - just click on the queries below:</p>
Nils Diewald2329e1d2014-06-12 16:07:57 +000023
24%= korap_tut_query poliqarp => '[base=baum]'
25
Nils Diewald2329e1d2014-06-12 16:07:57 +000026%= korap_tut_query cosmas2 => 'der /w5 Baum'
27
Nils Diewald4af3f0b2014-06-25 01:43:17 +000028%= korap_tut_query poliqarp => 'contains(<cnx/c=np>,[opennlp/p=KOUS])'
29
30%= korap_tut_query poliqarp => 'startswith(<s>, {1:[cnx/m=PRES]}{2:[base=der]})'
31
Nils Diewald0d2dcc82014-06-18 17:10:49 +000032<p>And here is a short cheat sheet for foundries and layers</p>
Nils Diewaldbd56adc2014-06-22 18:44:53 +000033</section>
Nils Diewald2329e1d2014-06-12 16:07:57 +000034
Nils Diewaldbd56adc2014-06-22 18:44:53 +000035<section id="cheatsheet">
36 <h3>Cheatsheet</h3>
37 <ul>
38 <li><strong>base</strong>
39 <ul>
40 <li>Supports two types of spans: <strong>&lt;s&gt;</strong> for sentences and <strong>&lt;p&gt;</strong> for paragraphs - this will likely change in the next index version. These spans lack prefix information!</li>
41 </ul>
42 </li>
43 <li><strong>cnx</strong>
44 <ul>
45 <li><strong>l</strong> (Token:Lemma): All lemmas are written in lower case. Composita are split, e.g. the token &quot;Leitfähigkeit&quot; is matched by the lemmas &quot;leit&quot; and &quot;fähigkeit&quot; - not by the lemma &quot;leitfähigkeit&quot;</li>
46 <li><strong>p</strong> (Token:Part of Speech): All pos infos are written in capital letters and are based on STTS</li>
47 <li><strong>syn</strong> (Token:Syntactical information): Includes token based information like @PREMOD, @NH, @MAIN ...</li>
48 <li><strong>m</strong> (Token:Morphosyntactical information): Includes information about tense (&quot;PRES&quot; ...), mode (&quot;IND&qut;), number (&quot;PL&quot; ...) etc.</li>
49 <li><strong>c</strong> (Span:Phrases): Only nominal phrases are available and all nominal phrases are written in lower case (&quot;np&quot;)</li>
50 </ul>
51 </li>
52 <li><strong>corenlp</strong>
53 <ul>
54 <li><strong>ne_hgc_175m_600</strong> (Token:Named Entity): Contains named entities like &quot;I-PER&quot;, &quot;I-ORG&quot; etc. </li>
55 <li><strong>ne_dewac_175_175m_600</strong> (Token:Named Entity): see above</li>
56 </ul>
57 </li>
58 <li><strong>tt</strong>
59 <ul>
60 <li><strong>l</strong> (Token:Lemma): All non-noun lemmas are written in lower case, nouns are written upper case. Composita stay intact (e.g. &quot;Normalbedingung&quot;)</li>
61 <li><strong>p</strong> (Token:Part of Speech): All pos infos are written in capital letters and are based on STTS</li>
62 </ul>
63 </li>
64 <li><strong>mate</strong>
65 <ul>
66 <li><strong>l</strong> (Token:Lemma): All lemmas are written in lower case. Composita stay intact (e.g. &quot;buchstabenbezeichnung&quot;)</li>
67 <li><strong>p</strong> (Token:Part of Speech): All pos infos are written in capital letters and are based on STTS</li>
68 <li><strong>m</strong> (Token:Morphosyntactical information): Includes information about tense (&quot;tense:pres&quot; ...), mode (&quot;mood:ind&qut;), number (&quot;number:pl&quot; ...), gender (&quot;gender:masc&quot; etc.</li>
69 </ul>
70 </li>
71 <li><strong>opennlp</strong>
72 <ul>
73 <li><strong>p</strong> (Token:Part of Speech): All pos infos are written in capital letters and are based on STTS</li>
74 </ul>
75 </li>
76 <li><strong>xip</strong>
77 <ul>
78 <li><strong>l</strong> (Token:Lemma): All non-noun lemmas are written in lower case, nouns are written upper case. Composita are split, e.g. the token &quot;Leitfähigkeit&quot; is matched by the lemmas &quot;leiten&quot; and &quot;Fähigkeit&quot; - and by a merged and pretty useless &quot;leitenfähigkeit&quot; (This is going to change)</li>
79 <li><strong>p</strong> (Token:Part of Speech): All pos infos are written in capital letters and are based on STTS</li>
80 <li><strong>c</strong> (Span:Phrases): Some phrases to create sentences, all upper case (&quot;NP&quot;, &quot;NPA&quot;, &quot;NOUN&quot;, &quot;VERB&quot;, &quot;PREP&quot;, &quot;AP&quot; ...)</li>
81 </ul>
82 </li>
83 </ul>
84</section>
Nils Diewald2329e1d2014-06-12 16:07:57 +000085
Nils Diewald4af3f0b2014-06-25 01:43:17 +000086% end