If you want to know more about depositing your research data on the Humanities Lab Corpus Server, please contact the corpus manager. You can also find answers to some of the more common questions below.
What kind of data can I store?
The corpus server currently hosts images, audio, video, and various kinds of text data such as word lists, transcriptions, and KML. To ensure that the data is still readable in the future, uncompressed, and preferably open formats should be used when possible. (For example, MP3 is not allowed, since it is both compressed and a proprietary format.)
Please contact the corpus manager for questions regarding formats.
Will my research data have to become publicly available?
No. As a depositor, you have no obligation to make your material accessible to the public. You will make the decision as to who can access what, in accordance with ethical and legal concerns specific to your collection. This may result in sensitive material being permanently unavailable for public access, if their distribution could potentially cause harm or distress to the participants.
Since research data may have different levels of sensitivity, even within the same collection, each individual item can be designated an individual access specification. The depositor may also choose to apply other conditions on the access or use of the data at their own discretion.
It is the depositor’s responsibility to ensure that they have sought the appropriate permissions from participants and the creators of the materials in deciding on access levels.
What kind of documentation (metadata) will my data need to have?
Data hosted on the Corpus Server need to be documented using the IMDI metadata standard before being uploaded. If unsure how to document your data, a spreadsheet is usually a good starting point. Contact the corpus manager for options on how to continue. Information such as collection date, content-type (e.g. narratives, conversations, experiment data, songs etc.), language and participants may or may not apply to you data, but those and more can all be indiviudally described with the IMDI standard.
Are the data backed up?
Yes. Data are backed up by LUNARC, The Lund Center for Scientific and Technical Computing, a part of SNIC, Swedish National Infrastructure for Computing.