LiRI Wiki

Linguistic Research Infrastructure - University of Zurich


langtech:corpora

Differences

This shows you the differences between two versions of the page.

langtech:corpora [2023/01/31 12:51] Gerold Schneider
langtech:corpora [2023/02/02 13:27] (current) johannes.graen.uzh.ch
Line 1: Line 1:
-Back to [[https://www.liri.uzh.ch/en.html|Linguistic Research Infrastructure]] (LiRI) 
- 
 ====== Corpora & Assistive Technology ======
  
Line 11: Line 9:
   * Data crawling/scraping and processing of web sources, batch download of documents
   * Data extraction and conversion
 +
 +===== Examples of our work =====
 +
 +Swissdox@LiRI -- web-based service for extracting subcorpora from the Swiss media database Swissdox
 +
 +{{swissdox.png?600|}}
 +
 +[[https://swissdox.linguistik.uzh.ch/]]
 +----
 +
 +VIAN -- web application for multimodal corpora; comprises a corpus querying interface, a multimodal corpus viewer, a video and audio player, and a timeline with time-aligned text and annotations
 +
 +{{vian.png?600|}}
 +----
 +
 +CoLiCaSlav -- web corpus application used as an empirical basis for teaching and studying the principal categories and concepts relevant to the Slavic languages
 +
 +{{lehrkorpus.png?600|}}
 +
 +[[https://lehrkorpus-slav.linguistik.uzh.ch/]]
 +----
 +
 +Kollo -- command-line tool for extracting collocations from VERT-formatted corpora
 +
 +{{kollo1.png?400|}}
 +{{kollo2.png?400|}}
  
langtech/corpora.1675169495.txt.gz · Last modified: by Gerold Schneider
