Blame - templates/tutorial.html.ep - KorAP/Kalamar

blob: c2f7c8a1d27db65e9b452f2c053ef5213e030f2b

Nils Diewald	4af3f0b	2014-06-25 01:43:17 +0000	[diff] [blame]	1	% content main => begin
Nils Diewald	2329e1d	2014-06-12 16:07:57 +0000	[diff] [blame]	2
Nils Diewald	bd56adc	2014-06-22 18:44:53 +0000	[diff] [blame]	3	%# Store the id of an active section in the session, so the system is able to directly scroll to the relevant section
				4	%# This should be stored when clicking on a specific query
				5	%# but the remembered section contains the id - not the query
Nils Diewald	2329e1d	2014-06-12 16:07:57 +0000	[diff] [blame]	6
Nils Diewald	1eba657	2014-06-17 19:49:53 +0000	[diff] [blame]	7	<h2>KorAP-Tutorial</h2>
Nils Diewald	2329e1d	2014-06-12 16:07:57 +0000	[diff] [blame]	8
Nils Diewald	0d2dcc8	2014-06-18 17:10:49 +0000	[diff] [blame]	9	<!--
Nils Diewald	bd56adc	2014-06-22 18:44:53 +0000	[diff] [blame]	10	<p>Links to Blog, FAQ, About, Contact ...</p>
Nils Diewald	2329e1d	2014-06-12 16:07:57 +0000	[diff] [blame]	11	<ul>
				12	<li>Introduction to KorAP</li>
				13	<li>How to use Poliqarp+ QL?</li>
				14	<li>How to use Cosmas-II QL?</li>
				15	<li>How to use CQL?</li>
Nils Diewald	7cad840	2014-07-08 17:06:56 +0000	[diff] [blame^]	16	<li>API</li>
				17	<li>Search</li>
Nils Diewald	2329e1d	2014-06-12 16:07:57 +0000	[diff] [blame]	18	</ul>
Nils Diewald	0d2dcc8	2014-06-18 17:10:49 +0000	[diff] [blame]	19	-->
Nils Diewald	2329e1d	2014-06-12 16:07:57 +0000	[diff] [blame]	20
Nils Diewald	7cad840	2014-07-08 17:06:56 +0000	[diff] [blame^]	21	<section name="intro">
				22	<h3>Example Queries</h3>
				23	%# <p>This is a Tutorial to KorAP. It may be maintained separately (as a Wiki?) and has some nice features - like embedded example queries - just click on the queries below:</p>
Nils Diewald	2329e1d	2014-06-12 16:07:57 +0000	[diff] [blame]	24
Nils Diewald	7cad840	2014-07-08 17:06:56 +0000	[diff] [blame^]	25	<p><strong>Poliqarp</strong>: Find all occurrences of the lemma "baum" as annotated by the default foundry.</p>
Nils Diewald	2329e1d	2014-06-12 16:07:57 +0000	[diff] [blame]	26	%= korap_tut_query poliqarp => '[base=baum]'
				27
Nils Diewald	7cad840	2014-07-08 17:06:56 +0000	[diff] [blame^]	28	<p><strong>Cosmas-II</strong>: Find all occurrences of the words "der" and "Baum" that are at most 5 tokens apart. The order is not relevant.</p>
Nils Diewald	2329e1d	2014-06-12 16:07:57 +0000	[diff] [blame]	29	%= korap_tut_query cosmas2 => 'der /w5 Baum'
				30
Nils Diewald	4af3f0b	2014-06-25 01:43:17 +0000	[diff] [blame]	31
Nils Diewald	7cad840	2014-07-08 17:06:56 +0000	[diff] [blame^]	32	<p><strong>Poliqarp+</strong>: Find all nominal phrases as annotated by Connexor that contain an adverb as annotated by OpenNLP.</p>
				33	%= korap_tut_query poliqarp => 'contains(<cnx/c=np>,[opennlp/p=ADV])'
				34
				35	<p><strong>Poliqarp+</strong>: Find all sentences as annotated by the base foundry that start with a sequence of a token in present tense, as annotated by Connexor, followed by the lemma "der", as annotated by the default foundry. Highlight both terms of the sequence.</p>
Nils Diewald	4af3f0b	2014-06-25 01:43:17 +0000	[diff] [blame]	36	%= korap_tut_query poliqarp => 'startswith(<s>, {1:[cnx/m=PRES]}{2:[base=der]})'
				37
Nils Diewald	7cad840	2014-07-08 17:06:56 +0000	[diff] [blame^]	38
				39	%# <p>And here is a short cheat sheet for foundries and layers</p>
Nils Diewald	bd56adc	2014-06-22 18:44:53 +0000	[diff] [blame]	40	</section>
Nils Diewald	2329e1d	2014-06-12 16:07:57 +0000	[diff] [blame]	41
Nils Diewald	7cad840	2014-07-08 17:06:56 +0000	[diff] [blame^]	42	<section name="cheatsheet">
Nils Diewald	bd56adc	2014-06-22 18:44:53 +0000	[diff] [blame]	43	<h3>Cheatsheet</h3>
				44	<ul>
				45	<li><strong>base</strong>
				46	<ul>
				47	<li>Supports two types of spans: <strong>&lt;s&gt;</strong> for sentences and <strong>&lt;p&gt;</strong> for paragraphs - this will likely change in the next index version. These spans lack prefix information!</li>
				48	</ul>
				49	</li>
				50	<li><strong>cnx</strong>
				51	<ul>
				52	<li><strong>l</strong> (Token:Lemma): All lemmas are written in lower case. Compounds are split, e.g. the token "Leitfähigkeit" is matched by the lemmas "leit" and "fähigkeit" - not by the lemma "leitfähigkeit"</li>
				53	<li><strong>p</strong> (Token:Part of Speech): All part-of-speech tags are written in capital letters and are based on STTS</li>
				54	<li><strong>syn</strong> (Token:Syntactical information): Includes token-based information like @PREMOD, @NH, @MAIN ...</li>
				55	<li><strong>m</strong> (Token:Morphosyntactical information): Includes information about tense ("PRES" ...), mood ("IND" ...), number ("PL" ...) etc.</li>
				56	<li><strong>c</strong> (Span:Phrases): Only nominal phrases are available and all nominal phrases are written in lower case ("np")</li>
				57	</ul>
				58	</li>
				59	<li><strong>corenlp</strong>
				60	<ul>
				61	<li><strong>ne_hgc_175m_600</strong> (Token:Named Entity): Contains named entities like "I-PER", "I-ORG" etc. </li>
				62	<li><strong>ne_dewac_175_175m_600</strong> (Token:Named Entity): see above</li>
				63	</ul>
				64	</li>
				65	<li><strong>tt</strong>
				66	<ul>
				67	<li><strong>l</strong> (Token:Lemma): All non-noun lemmas are written in lower case; nouns are capitalized. Compounds stay intact (e.g. "Normalbedingung")</li>
				68	<li><strong>p</strong> (Token:Part of Speech): All part-of-speech tags are written in capital letters and are based on STTS</li>
				69	</ul>
				70	</li>
				71	<li><strong>mate</strong>
				72	<ul>
				73	<li><strong>l</strong> (Token:Lemma): All lemmas are written in lower case. Compounds stay intact (e.g. "buchstabenbezeichnung")</li>
				74	<li><strong>p</strong> (Token:Part of Speech): All part-of-speech tags are written in capital letters and are based on STTS</li>
				75	<li><strong>m</strong> (Token:Morphosyntactical information): Includes information about tense ("tense:pres" ...), mood ("mood:ind" ...), number ("number:pl" ...), gender ("gender:masc" ...) etc.</li>
				76	</ul>
				77	</li>
				78	<li><strong>opennlp</strong>
				79	<ul>
				80	<li><strong>p</strong> (Token:Part of Speech): All part-of-speech tags are written in capital letters and are based on STTS</li>
				81	</ul>
				82	</li>
				83	<li><strong>xip</strong>
				84	<ul>
				85	<li><strong>l</strong> (Token:Lemma): All non-noun lemmas are written in lower case; nouns are capitalized. Compounds are split, e.g. the token "Leitfähigkeit" is matched by the lemmas "leiten" and "Fähigkeit" - and by a merged and pretty useless "leitenfähigkeit" (this is going to change)</li>
				86	<li><strong>p</strong> (Token:Part of Speech): All part-of-speech tags are written in capital letters and are based on STTS</li>
				87	<li><strong>c</strong> (Span:Phrases): Some phrases to create sentences, all upper case ("NP", "NPA", "NOUN", "VERB", "PREP", "AP" ...)</li>
				88	</ul>
				89	</li>
				90	</ul>
				91	</section>
Nils Diewald	2329e1d	2014-06-12 16:07:57 +0000	[diff] [blame]	92
Nils Diewald	4af3f0b	2014-06-25 01:43:17 +0000	[diff] [blame]	93	% end