langtech:lcp:dqd:start
Differences
This shows you the differences between two versions of the page.
| Both sides previous revisionPrevious revisionNext revision | Previous revision | ||
| langtech:lcp:dqd:start [2024/03/26 13:02] – johannes.graen.uzh.ch | langtech:lcp:dqd:start [Unknown date] (current) – removed - external edit (Unknown date) 127.0.0.1 | ||
|---|---|---|---|
| Line 1: | Line 1: | ||
| - | ====== The DQD Query Language ====== | ||
| - | |||
| - | LCP's query language DQD (Descriptive Query Definition) follows the idea of Entity-Relationshop models. Entities sets are defined by logical formulae on properties and relations between them. The query engine then searches for those constellations inside the corpus or corpora selected. Along the lines of first order logic, quantors can be employed to enforce the existence or non-existence of constellations. | ||
| - | |||
| - | Every query needs to specify at least one result set, which is either a (plain) list of entities comprising the query matches, a statistical or a collocational analysis. T | ||
| - | |||
| - | |||
| - | ===== Introduction ===== | ||
| - | |||
| - | In a text corpus with standard annotations (viz the [[https:// | ||
| - | <code dqd> | ||
| - | Segment s | ||
| - | |||
| - | Token t | ||
| - | form = " | ||
| - | |||
| - | Result => plain | ||
| - | context | ||
| - | s | ||
| - | entities | ||
| - | t | ||
| - | </ | ||
| - | |||
| - | |||
| - | |||
| - | ==== Instanciation and Scopes ==== | ||
| - | |||
| - | Entities are instanciated by providing the entity name followed by a (unique) identifier, which can be used to reference the entitiy. | ||
| - | By instanciating an entity, it is by default existantially quantified and its scope ranges from its instanciation to the end of the qeuery. Exceptions are negative existential quantification and entities inside sequences with repetitions (see below). | ||
| - | <code dqd> | ||
| - | Segment s | ||
| - | </ | ||
| - | |||
| - | |||
| - | |||
| - | |||
| - | ===== Quick Reference ===== | ||
| - | |||
| - | <wrap #entity /> | ||
| - | ==== Entities ==== | ||
| - | |||
| - | Entities are defined in the [[|Corpus Template]]. Though aleatory names can be used, we use // | ||
| - | |||
| - | |||
| - | <wrap #attribute /> | ||
| - | ==== Attributes ==== | ||
| - | |||
| - | Attributes are also defined in the [[|Corpus Template]] and thus their naming is free for a corpus creator to define. As a standard set, we use //form// for word forms, //lemma// for lemmas, //upos// for [[https:// | ||
| - | |||
langtech/lcp/dqd/start.1711458138.txt.gz · Last modified: by johannes.graen.uzh.ch
