LiRI Wiki

Linguistic Research Infrastructure - University of Zurich

This is an old revision of the document!


Corpora & Assistive Technology

The Language Technology group has expertise in handling various types of corpora. We build tailor-made applications for exploring large and structurally complex collections of language data. In particular, we have experience in:

  • Designing databases to hold application-relevant data
  • Generating interactive visualizations
  • Querying large data collections efficiently (in particular corpora)
  • Anonymising large data sets
  • Crawling and scraping web sources, processing the results, and batch-downloading documents
  • Extracting and converting data
langtech/corpora.1675173167.txt.gz · Last modified: by johannes.graen.uzh.ch
