LiRI Wiki

Linguistic Research Infrastructure - University of Zurich


langtech:corpora

Differences

This shows you the differences between two versions of the page.

langtech:corpora [2023/01/31 13:52] – old revision restored (2022/12/30 04:41) johannes.graen.uzh.ch
langtech:corpora [2023/02/02 13:27] (current) johannes.graen.uzh.ch
Line 9:
   * Data crawling/scraping and processing of web sources, batch download of documents
   * Data extraction and conversion
 +
 +===== Examples of our work =====
 +
 +Swissdox@LiRI -- web-based service for extracting subcorpora from the Swiss media database Swissdox
 +
 +{{swissdox.png?600|}}
 +
 +[[https://swissdox.linguistik.uzh.ch/]]
 +----
 +
 +VIAN -- web application for multimodal corpora; comprises a corpus querying interface, a multimodal corpus viewer, a video and audio player, and a timeline with time-aligned text and annotations
 +
 +{{vian.png?600|}}
 +----
 +
 +CoLiCaSlav -- web corpus application used as an empirical basis for teaching and studying the principal categories and concepts relevant to the Slavic languages
 +
 +{{lehrkorpus.png?600|}}
 +
 +[[https://lehrkorpus-slav.linguistik.uzh.ch/]]
 +----
 +
 +Kollo -- command-line tool for extracting collocations from VERT-formatted corpora
 +
 +{{kollo1.png?400|}}
 +{{kollo2.png?400|}}
  
langtech/corpora.1675173167.txt.gz · Last modified: by johannes.graen.uzh.ch
