Browse and access data
The archive browser is where users search, browse, and download data hosted on the Corpus Server. Anyone can see what is available without registering, but whether data is accessible or not depends on what access level the depositor/researcher has requested. Being registered will not automatically permit blanket access to data. For example data sets may be open access for anyone, or even completely off limits (metadata will still be available). In general, access restrictions and conditions is decided by each, respective depositor/researcher. There are four access levels for archived materials:
- Open: No login or registration is required
- Available to registered users
- Access needs to be requested: you can apply for access to these materials as a registered user
- Closed: These materials are currently not accessible, mostly due to the sensitivity of the material
Archiving and managing data
We have recently migrated to new archive software and are currently working on implementing its web interface for user deposits. Until this is finished, please contact the archive manager if you wish to archive with us.
Data is by default only accessible by the user managing the uploads and/or to users who already have download access to the node/s in question. Access levels can then be set to be more open or restricted, depending on what is agreed upon.
Metadata is information describing your data (data about your data). It is what allows users to locate data and is central to the archiving principle. Information that can be included but is not limited to a general description, location where the data was collected, languages spoken, participants etc.
All metadata on the Humanities Lab Corpus Server is publicly visible by design, as opposed to the actual data files where access restrictions may apply. Each depositor will have to decide whether a piece of information should be included or not in the metadata. For example, including the real name of a participant may or may not be the cause for privacy concerns, since these will also be visible to anyone browsing archive.
The server provides metadata via the CMDI metadata standard (profiles clarin.eu:cr1:p_1407745712035 and clarin.eu:cr1:p_1407745712064). This is required as part a deposit.
The archive manager can help with workflows and options for how to create the final metadata-files. A general tip is to create a spreadsheet describing your data. Each row could contain a description corresponding to one or multiple data files. Columns represent the kinds of information you want linked to each item, such as date (YYYY-MM-DD), languages, participants, general description, location among others.
Contact and Support
All questions relating to user accounts, the access or archiving of data can be directed to the archive manager.