-
Notifications
You must be signed in to change notification settings - Fork 2
/
Copy pathhelp.html
66 lines (56 loc) · 4.73 KB
/
help.html
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
<!DOCTYPE html>
<html>
<head>
<link rel="stylesheet" type="text/css" href="style.css">
<meta http-equiv="content-type" content="text/html; charset=utf-8">
</head>
<body>
<header>
<h1>Deutsch Diachron Digital</h1>
<p>
<a href="./index.html">Simplified search</a> -
<a href="https://korpling.german.hu-berlin.de/annis3/instance-ddd">Advanced search</a> -
Documentation
</p>
</header>
<div>
<h2>Documentation simplified search</h2>
<h3>Einleitung</h3>
<p>The simplified search page is meant to be one of two entrance points for accessing the corpora in the Deutsch Diachron Digital project.</p>
<p>The first entrance point is the <a href="https://korpling.german.hu-berlin.de/annis3/instance-ddd">ANNIS search interface</a>, which offers a <a href="http://www.sfb632.uni-potsdam.de/annis/aql.html">formulaic query language (AQL)</a> and a more visual query builder. The ANNIS search options offer a highly accurate and thorough tackle on the available corpora and are advised for scientific research.</p>
<p>The simplified search page is the second entrance point, and is intended for the general audience, or for quick and imprecise searches.</p>
<h3>Simplified search</h3>
<p>The simplified search page consists of three parts. The upper part merely states the project name, but also offers a link to the <a href="https://korpling.german.hu-berlin.de/annis3/instance-ddd">advised ANNIS search tool</a>, <a href="./index.html">the simplified search</a>, and this documentation page.</p>
<p>The middle part consists of a large query box in which a query can be entered. More information about this box is provided below.</p>
<p>At the bottom of the page, tickboxes allow the non-scientific user to constrain the query in the query box to a number of ad-hoc categories. More information on these meta-categories is provided below.</p>
<h4>Query box</h4>
<p>The query box is set up to parse the input that it receives in such a way that a more complex ANNIS query can be written. We need to distinguish between "single word" and "multiple words" queries.</p>
<p>A single word query distinguishes itself from a multiple words query simply by the fact that more than one word is entered. If a single word is entered, the query is parsed as follows: the parser goes through selected annotation levels (translation, lemma, txt (in this order)), and through all the annotation values per level; if for a specific level some values are found that match the input, that level is queried. If more than word is entered, the parsing goes as follows: for each word, the parsing is identical to the parsing of a single word; after all single words are parsed, the retrieved queries are concatenated, allowing for up to three words in between.</p>
<p>The parser of the query box also resolves diacritics from simple ascii input to the diacritics that occur in the corpora.</p>
<p>Finally, it is possible to use simple regular expressions in the query box</p>
<h4>Meta constraints</h4>
<p>For the convenience of the non-scientific user, a number of ad-hoc (and as such non-authorative) categories of meta-information are provided. The values in these categories are most certainly debatable, and in no way final. Moreover, the attribution of these values to specific texts is in certain cases questionable. These meta-constraints are therefore only usuable for the non-scientific user, and any informed scholar would prefer the ANNIS search tool, in which the queries can be precisely formulated.</p>
<ul>
<li>The time constraint is based on the century in which the text is supposedly written down.</li>
<li>The text type constraint is based on the interpretation of the scholars in the DDD project.</li>
<li>The geospatial constraint is based on an interpretation of the dialect area that some texts may be typical for.</li>
</ul>
<p>Once again, the levels of these meta-categories and the attributions of these levels to the texts are highly debatable, not to be considered as final or authorative.</p>
<h3>Example queries</h3>
<p>A number of things that can be typed in for your inspiration:</p>
<ul>
<li>fater unser</li>
<li>vater unser</li>
<li>.ater unser</li>
<li>jesus, with meta constraint on Alemannisch</li>
<li>...</li>
</ul>
<h3>Source and development</h3>
<p>The source code and development of the simplified search can be monitored on <a href="http://www.github.com/ruettet/dddsimplesearch">GitHub</a>.</p>
</div>
<footer>
<hr/>
<p>Documentation to the simplified search of the Deutsch Diachron Digital corpora.</p>
<p>Questions or comments: <a href="http://www2.hu-berlin.de/sprachgeschichte/mitarbeiter/ruette.php">Tom Ruette</a></p>
</footer>
</html>