CMDI 1.1 Metadata
Header
MdCreator: Kristin Hagen
MdCreationDate: 2021-03-26
MdProfile: clarin.eu:cr1:p_1407745711925
MdCollectionDisplayName: Clarino - Textlab
Resources
ResourceProxyList:
ResourceProxy [id=‘lia-norsk-lp’]:
ResourceType [mimetype=‘’]: LandingPage
ResourceRef: http://tekstlab.uio.no/LIA/norsk/index.html
ResourceProxy [id=‘lia-transcriptions’]:
ResourceType [mimetype=‘’]: Resource
ResourceRef: http://tekstlab.uio.no/LIA/norsk/index.html
ResourceProxy [id=‘lia-corpus’]:
ResourceType: Resource
ResourceRef: https://tekstlab.uio.no/glossa3/lia_norsk
JournalFileProxyList:
ResourceRelationList:
ResourceRelation:
RelationType: transcriptions
Res1 [ref=‘lia-corpus’]:
Res2 [ref=‘lia-transcriptions’]:
IsPartOfList:
Components
corpusProfile:
resourceCommonInfo [ComponentId=‘clarin.eu:cr1:c_1396012485126’] [ref=‘lia-transcriptions’]:
resourceType: corpus
identificationInfo [ComponentId=‘clarin.eu:cr1:c_1396012485125’]:
resourceName [xml:lang=‘nb’]: Transkripsjoner og utvalgte lydfiler fra LIA norsk til nedlasting
resourceName [xml:lang=‘en’]: Transcriptions and selected audio files from LIA Norwegian for download
description [xml:lang=‘en’]: All transcriptions from LIA Norwegian are downloadable in plain text format.

A folder containing 553 transcriptions from LIA Norwegian, in ELAN format, along with their corresponding audio, can moreover be downloaded. These recordings contain no sensitive information and can be used freely by linguists or for other technological purposes.

LIA Norwegian is a speech corpus with old recordings (1939 - 1996) from four Norwegian universities: NTNU, UoB, UoO and UoT. Many of the LIA recordings have content that has been deemed sensitive. Such content has not been transcribed, such that the recordings can still be used in the corpus. These recordings are not available for download.
description [xml:lang=‘nb’]: Alle transkripsjoner fra LIA norsk kan lastes ned i tekstformat.

553 transkripsjoner i ELAN-format fra LIA norsk er sammen med tilhørende lydfiler dessuten samla i ei mappe for nedlasting. Dette er opptak som ikke inneholder sensitiv informasjon og kan brukes fritt til både lingvistiske og språkteknologiske formål.

LIA norsk er et talespråkskorpus med gamle opptak (1939 - 1996) fra fire norske universitet: NTNU, UiB, UiO og UiT. Mange opptak i LIA norsk har noe innhold som kan karakteriseres som sensitiv informasjon. Denne informasjonen er ikke transkribert, derfor kan opptaka brukes i korpuset, men lydfilene kan ikke frigis siden informasjonen fremdeles ligger der.
resourceShortName [xml:lang=‘en’]: LIA Norwegian Download
resourceShortName [xml:lang=‘nb’]: LIA norsk for nedlasting
url: http://tekstlab.uio.no/LIA/norsk/index.html
url: http://tekstlab.uio.no/LIA/norsk/index_english.html
PID: http://hdl.handle.net/11538/0000-000C-368B-B
distributionInfo [ComponentId=‘clarin.eu:cr1:c_1396012485124’]:
licenceInfo [ComponentId=‘clarin.eu:cr1:c_1396012485158’]:
userCategory: Public
distributionAccessMedium: downloadable
downloadLocation: http://tekstlab.uio.no/LIA/norsk/index.html
licence [ComponentId=‘clarin.eu:cr1:c_1447674760330’] [ref=‘lia-transcriptions’]:
licenceFamily [ref=‘lia-transcriptions’]: Creative Commons (CC)
licenceName: Creative_Commons-BY-NC-SA (CC-BY-NC-SA)
licenceURL: http://creativecommons.org/licenses/by-nc-sa/4.0/
conditionsOfUse: BY
conditionsOfUse: NC
conditionsOfUse: SA
nonStandardConditionsOfUse: The corpus has audio and video recordings classified as personal data. In agreement with NSD, the Data Protection Official in Norway, the audio and video files are accessible only through Glossa, a search and post-processing tool developed by the Text Laboratory.
Please note that every individual researcher is responsible for treating the participants in the corpus with respect and sincerity. Furthermore, the participants must be kept anonymous in every published paper or other output.
licensor:
actorInfo [ComponentId=‘clarin.eu:cr1:c_1396012485194’]:
actorType: organization
organizationInfo [ComponentId=‘clarin.eu:cr1:c_1407745711883’]:
organizationName [xml:lang=‘en’]: University of Oslo
organizationName [xml:lang=‘no’]: Universitetet i Oslo
organizationShortName [xml:lang=‘no’]: UiO
organizationShortName [xml:lang=‘en’]: UoO
departmentName [xml:lang=‘en’]: Department of Linguistics and Scandinavian Studies
departmentName [xml:lang=‘no’]: Institutt for lingvistiske og nordiske studier (ILN)
communicationInfo [ComponentId=‘clarin.eu:cr1:c_1352813745460’]:
email: tekstlab-post@iln.uio.no
url: http://www.hf.uio.no/iln/om/organisasjon/tekstlab/
address: Box 1102 Blindern
zipCode: 0317
city: OSLO
country: Norway
distributionRightsHolder:
actorInfo [ComponentId=‘clarin.eu:cr1:c_1396012485194’]:
actorType: organization
organizationInfo [ComponentId=‘clarin.eu:cr1:c_1407745711883’]:
organizationName [xml:lang=‘en’]: University of Oslo
organizationName [xml:lang=‘no’]: Universitetet i Oslo
organizationShortName [xml:lang=‘no’]: UiO
organizationShortName [xml:lang=‘en’]: UoO
departmentName [xml:lang=‘en’]: Department of Linguistics and Scandinavian Studies
departmentName [xml:lang=‘no’]: Institutt for lingvistiske og nordiske studier (ILN)
communicationInfo [ComponentId=‘clarin.eu:cr1:c_1352813745460’]:
email: tekstlab-post@iln.uio.no
url: http://www.hf.uio.no/iln/om/organisasjon/tekstlab/
address: Box 1102 Blindern
zipCode: 0317
city: OSLO
country: Norway
contact:
actorInfo [ComponentId=‘clarin.eu:cr1:c_1396012485194’]:
actorType: organization
organizationInfo [ComponentId=‘clarin.eu:cr1:c_1407745711883’]:
organizationName: The Text Laboratory
organizationShortName: Textlab
departmentName: Department of Linguistics and Scandinavian Studies, University of Oslo
communicationInfo [ComponentId=‘clarin.eu:cr1:c_1352813745460’]:
email: tekstlab-post@iln.uio.no
url: http://www.hf.uio.no/iln/om/organisasjon/tekstlab/
address: Box 1102 Blindern
zipCode: 0317
city: OSLO
country: Norway
metadataInfo [ComponentId=‘clarin.eu:cr1:c_1407745711922’]:
metadataCreationDate: 2018-09-26
metadataLastDateUpdated: 2023-12-07
metadataCreator:
actorInfo [ComponentId=‘clarin.eu:cr1:c_1396012485194’]:
actorType: person
personInfo [ComponentId=‘clarin.eu:cr1:c_1396012485192’]:
surname: Hagen
givenName: Kristin
organizationInfo [ComponentId=‘clarin.eu:cr1:c_1407745711883’]:
organizationName: The Text Laboratory
organizationShortName: Textlab
departmentName: Department of Linguistics and Scandinavian Studies, University of Oslo
communicationInfo [ComponentId=‘clarin.eu:cr1:c_1352813745460’]:
email: kristin.hagen@iln.uio.no
url: http://www.hf.uio.no/iln/om/organisasjon/tekstlab/
address: Box 1102 Blindern
zipCode: 0317
city: OSLO
country: Norway
versionInfo [ComponentId=‘clarin.eu:cr1:c_1430905751648’]:
version: First version (Transcriptions from 15. September 2019)
validationInfo [ComponentId=‘clarin.eu:cr1:c_1407745711923’]:
validated: true
validationType: content
validationMode: manual
validationModeDetails: The transcriptions are proofread against the audio files.
validationExtent: partial
validator:
actorInfo [ComponentId=‘clarin.eu:cr1:c_1396012485194’]:
actorType: organization
organizationInfo [ComponentId=‘clarin.eu:cr1:c_1407745711883’]:
organizationName: The LIA project
organizationShortName: LIA
departmentName: Department of Linguistics and Scandinavian Studies, University of Oslo
communicationInfo [ComponentId=‘clarin.eu:cr1:c_1352813745460’]:
email: tekstlab-post@iln.uio.no
url: http://www.hf.uio.no/iln/om/organisasjon/tekstlab/
address: Box 1102 Blindern
zipCode: 0317
city: OSLO
country: Norway
resourceDocumentationInfo [ComponentId=‘clarin.eu:cr1:c_1355150532301’]:
documentationUnstructured [ComponentId=‘clarin.eu:cr1:c_1355150532302’]:
role: documentation
documentUnstructured: Brukarrettleiing for LIA norsk - korpus av eldre dialektopptak: http://tekstlab.uio.no/brukerveiledninger/LIA%20norsk/index.html
documentationStructured [ComponentId=‘clarin.eu:cr1:c_1361876010648’]:
role: documentation
documentInfo [ComponentId=‘clarin.eu:cr1:c_1353678848788’]:
documentType: other
title: Heimesida til LIA-korpuset for norske dialekter
url: http://tekstlab.uio.no/LIA/norsk/index.html
resourceCreationInfo [ComponentId=‘clarin.eu:cr1:c_1407745711921’]:
creationStartDate: 2014-04-01
creationEndDate: 2018-06-31
resourceCreator:
actorInfo [ComponentId=‘clarin.eu:cr1:c_1396012485194’]:
actorType: organization
organizationInfo [ComponentId=‘clarin.eu:cr1:c_1407745711883’]:
organizationName: The LIA project
(Project participants and employees in the LIA project)
communicationInfo [ComponentId=‘clarin.eu:cr1:c_1352813745460’]:
email: tekstlab-post@iln.uio.no
url: http://tekstlab.uio.no/LIA/
address: Box 1102 Blindern
zipCode: 0317
city: OSLO
country: Norway
fundingProject:
projectInfo [ComponentId=‘clarin.eu:cr1:c_1430905751647’]:
projectName [xml:lang=‘nb’]: LIA (Language Infrastructure made Accessible)
projectShortName: LIA
projectID: 22 59 41
url: http://tekstlab.uio.no/LIA/
url: https://www.hf.uio.no/iln/english/research/projects/language-infrastructure-made-accessible/index.html
fundingType: nationalFunds
funder: The Research Council of Norway
fundingCountry: Norway
projectStartDate: 2014-01-04
projectEndDate: 2019-12-31
corpusInfo [ComponentId=‘clarin.eu:cr1:c_1407745711878’]:
corpusType: Multimodal Corpus
corpusPartInfo [ComponentId=‘clarin.eu:cr1:c_1407745711885’] [ref=‘lia-transcriptions’]:
mediaType: text
corpusTextInfo [ComponentId=‘clarin.eu:cr1:c_1396012485188’]:
textFormatInfo [ComponentId=‘clarin.eu:cr1:c_1427452477072’] [ref=‘lia-transcriptions’]:
mimeType: Downloadable transcriptionns in txt and ELAN format
sizePerTextFormat [ComponentId=‘clarin.eu:cr1:c_1447674760342’]:
sizeInfo [ComponentId=‘clarin.eu:cr1:c_1353678848785’]:
size: 3 481 547
sizeUnit: tokens
characterEncodingInfo [ComponentId=‘clarin.eu:cr1:c_1447674760355’]:
characterEncoding: utf-8
corpusPartGeneralInfo [ComponentId=‘clarin.eu:cr1:c_1407745711882’]:
personSourceSetInfo [ComponentId=‘clarin.eu:cr1:c_1360931019775’]:
numberOfPersons: 1374
ageOfPersons: teenager
ageOfPersons: adult
ageOfPersons: elderly
ageRangeStart: 10
ageRangeEnd: 99
sexOfPersons: mixed
originOfPersons: native
dialectAccentOfPersons: Dialects from 222 places in Norway
geographicDistributionOfPersons: All over Norway
lingualityInfo [ComponentId=‘clarin.eu:cr1:c_1355150532313’]:
lingualityType: monolingual
languageInfo [ComponentId=‘clarin.eu:cr1:c_1428388179423’]:
languageId: No
languageName: Norwegian
languageInfo [ComponentId=‘clarin.eu:cr1:c_1428388179423’]:
languageId: Nn
languageName: Norwegian Nynorsk
modalityInfo [ComponentId=‘clarin.eu:cr1:c_1447674760356’]:
modalityType: spokenLanguage
modalityTypeDetails: Norwegian dialects. Two annotation modes: One phonetic (with Norwegian alphabet) and one orthographic.
sizeInfo [ComponentId=‘clarin.eu:cr1:c_1353678848785’]:
size: 3 481 547
sizeUnit: tokens
annotationInfo [ComponentId=‘clarin.eu:cr1:c_1407745711924’]:
annotationType: speechAnnotation-phoneticTranscription
annotationType: speechAnnotation-orthographicTranscription
annotationManualUnstructured [ComponentId=‘clarin.eu:cr1:c_1355150532325’]:
role: annotationManual
documentUnstructured: Orthographic transcription,cf Nynorskordboka: https://ordbok.uib.no/
annotationManualStructured [ComponentId=‘clarin.eu:cr1:c_1361876010647’]:
role: annotationManual
documentInfo [ComponentId=‘clarin.eu:cr1:c_1353678848788’]:
documentType: manual
title [xml:lang=‘nn’]: Transkripsjonsrettleiing for LIA
author: Kristin Hagen and Live Håberg and Eirik Olsen and Åshild Søfteland
year: 2018
url: http://tekstlab.uio.no/LIA/pdf/transkripsjonsrettleiing_lia.pdf
annotationManualStructured [ComponentId=‘clarin.eu:cr1:c_1361876010647’]:
role: annotationManual
documentInfo [ComponentId=‘clarin.eu:cr1:c_1353678848788’]:
documentType: manual
title [xml:lang=‘nn’]: LIA:Translitterering frå dialekt til nynorsk
author: Anneke Askeland, Kristin Hagen, Live Håberg,Janne Bondi Johannessen, Linn Iren Sjånes Rødvand og Eirik Tengesdal
year: 2019
url: http://www.tekstlab.uio.no/LIA/pdf/rettleiing-translitterator.pdf
annotationTool [ComponentId=‘clarin.eu:cr1:c_1355150532326’]:
targetResourceNameURI: https://www.hf.uio.no/iln/english/about/organization/text-laboratory/services/oslo-transliterator/index.html
classificationInfo [ComponentId=‘clarin.eu:cr1:c_1403588862809’]:
genreInfo [ComponentId=‘clarin.eu:cr1:c_1407745711877’]:
genreType: speechGenre
genre: informal
unstandardisedGenre: conversations and informal interviews
classificationInfo [ComponentId=‘clarin.eu:cr1:c_1403588862809’]:
genreInfo [ComponentId=‘clarin.eu:cr1:c_1407745711877’]:
genreType: speechGenre
genre: semi formal
unstandardisedGenre: interviews
timeCoverageInfo [ComponentId=‘clarin.eu:cr1:c_1447674760358’]:
timeCoverage: 1939 - 1995
geographicCoverageInfo [ComponentId=‘clarin.eu:cr1:c_1447674760357’]:
geographicCoverage: All over Norway