CLARIN (link, new window) is a European Research Infrastructure Consortium (ERIC) devoted to developing language technology (LT) and language resources as e-science and promoting LT-based computer tools in aid of research in the humanities and social sciences where text and speech make up an important kind of primary research data. CLARIN is of increasing importance because of the sheer volume of available “born-digital” language data, because of large-scale digitization of our cultural heritage, and because of increasing visibility and importance of e-science in all fields. Sweden became a member of the CLARIN ERIC in October 2014, and formed the e-science consortium Swe-Clarin (link, new window)
As a node in Swe-Clarin, the Humanities Lab provides speech and language technological support to a wide range of projects (link, new window).
The Lab also provides tools and expertise related to language archiving, corpus and (meta)data management, with a continued emphasis on multimodal corpora, many of which contain Swedish resources, but also other (often endangered) languages, multilingual or learner corpora. A primary resource is The Humanities Lab corpus server, containing a varied set of multimodal language corpora with standardised metadata and linked layers of annotations and other resources.
The corpus server.