CMDI 1.1. Metadata
Header
MdCreator: Kristin Hagen
MdCreationDate: 2017-04-06
MdSelfLink:
MdProfile: clarin.eu:cr1:p_1407745711925
MdCollectionDisplayName: Clarino - Textlab
Resources
ResourceProxyList:
JournalFileProxyList:
ResourceRelationList:
IsPartOfList:
Components
corpusProfile:
resourceCommonInfo [ComponentId=‘clarin.eu:cr1:c_1396012485126’]:
resourceType: corpus
identificationInfo [ComponentId=‘clarin.eu:cr1:c_1396012485125’]:
resourceName [xml:lang=‘nb’]: Lege-pasient-korpuset fra Ahus
resourceName [xml:lang=‘en’]: Corpus of Doctor-Patient Consultations from Ahus
description [xml:lang=‘en’]: The Corpus of Doctor-Patient Consultations from Ahus is a unique corpus of transcribed dialogue between doctors and patients in different types of consultations at Akershus University Hospital (Ahus). The audio files are not available in the corpus due to their sensitive nature.
Regional Ethics Committee for Medical and Health Research has approved permanent storage of the transcriptions for research purposes.
Version 2 of the corpus (June 2015) has 220 consultations, well over 950 000 words.
description [xml:lang=‘nb’]: Lege-pasient-korpuset er et unikt korpus med transkripsjoner av samtaler mellom leger og pasienter i forskjellige typer konsultasjoner på Akershus universitetssykehus (Ahus). Fordi materialet er sensitivt, er ikke lydfilene tilgjengelige i korpuset.
Transkripsjonene i korpuset bygger på videoopptak av samtaler mellom lege og pasient/pårørende ved Ahus i 2007 og 2008. Materialet ble samlet inn i forbindelse med en studie der formålet var å studere effekten av et kurs i kommunikasjon for sykehusleger. Legene representerer alle ikke-psykiatriske kliniske fagområder (indremedisin, kirurgi, ortopedi, gynekologi, pediatri, nevrologi, øre-nese-hals, anestesiologi) ved sykehuset. Det ble gjort inntil 8 opptak av hver lege med ulike pasienter, i alt 497 opptak. Versjon 2 av korpuset (juni 2015) inneholder 220 samtaler og drøye 950 000 ord.
Regional etisk komité for medisinsk forskning har godkjent varig lagring av transkripsjonene for forskning.
resourceShortName: Lege-pasient
url: http://www.hf.uio.no/iln/om/organisasjon/tekstlab/prosjekter/lege-pasient/
url: http://www.hf.uio.no/iln/english/about/organization/text-laboratory/projects/doctor-patient/index.html
PID: http://hdl.handle.net/11538/0000-000B-C020-7
distributionInfo [ComponentId=‘clarin.eu:cr1:c_1396012485124’]:
licenceInfo [ComponentId=‘clarin.eu:cr1:c_1396012485158’]:
userCategory: Academic
distributionAccessMedium: accessibleThroughInterface
executionLocation: http://www.hf.uio.no/iln/om/organisasjon/tekstlab/prosjekter/lege-pasient/
executionLocation: http://www.hf.uio.no/iln/english/about/organization/text-laboratory/projects/doctor-patient/index.html
licence [ComponentId=‘clarin.eu:cr1:c_1447674760330’]:
licenceFamily: CLARIN
licenceName: CLARIN_ACA-NC-LOC-PRIV-ND-*
licenceURL: https://kitwiki.csc.fi/twiki/bin/view/FinCLARIN/ClarinEulaAca?ID=1&AFFIL=EDU&BY=1&NC=1&LOC=1&PRIV=1&NORED=1&ND=1
conditionsOfUse: *
conditionsOfUse: BY
conditionsOfUse: ID
conditionsOfUse: LOC
conditionsOfUse: NC
conditionsOfUse: ND
conditionsOfUse: NORED
conditionsOfUse: PRIV
nonStandardConditionsOfUse: The corpus contains transcripts of sensitive hospital consultations. In agreement with Regional Ethics Committee for Medical and Health Research and the Ipr holders of the material, the corpus is accessible only through Glossa, a search and post-processing tool developed by the Text Laboratory.
The Ipr holders want the users of the corpus to send a message to them that the corpus is used, and in what kind of research. Where it is natural to draw medical or psychological expertise into the research, the Ipr holders should be asked whether they wish to participate, before eventually seeking expertise elsewhere. Contact: pal.gulbrandsen by medisin.uio.no
licensor:
actorInfo [ComponentId=‘clarin.eu:cr1:c_1396012485194’]:
actorType: organization
organizationInfo [ComponentId=‘clarin.eu:cr1:c_1407745711883’]:
organizationName [xml:lang=‘en’]: University of Oslo
organizationName [xml:lang=‘no’]: Universitetet i Oslo
organizationShortName [xml:lang=‘no’]: UiO
organizationShortName [xml:lang=‘en’]: UoO
departmentName [xml:lang=‘en’]: Institute of Clinical Medicine
departmentName [xml:lang=‘no’]: Institutt for klinisk medisin
communicationInfo [ComponentId=‘clarin.eu:cr1:c_1352813745460’]:
email: pal.gulbrandsen@medisin.uio.no
url: http://www.med.uio.no/klinmed/personer/vit/paalgul/index.html
address: Akershus universitetssykehus
zipCode: 1478
city: LØRENSKOG
country: Norway
distributionRightsHolder:
actorInfo [ComponentId=‘clarin.eu:cr1:c_1396012485194’]:
actorType: organization
organizationInfo [ComponentId=‘clarin.eu:cr1:c_1407745711883’]:
organizationName [xml:lang=‘en’]: University of Oslo
organizationName [xml:lang=‘no’]: Universitetet i Oslo
organizationShortName [xml:lang=‘no’]: UiO
organizationShortName [xml:lang=‘en’]: UoO
departmentName [xml:lang=‘en’]: Department of Linguistics and Scandinavian Studies
departmentName [xml:lang=‘no’]: Institutt for lingvistiske og nordiske studier (ILN)
communicationInfo [ComponentId=‘clarin.eu:cr1:c_1352813745460’]:
email: tekstlab-post@iln.uio.no
url: http://www.hf.uio.no/iln/om/organisasjon/tekstlab/
address: Box 1102 Blindern
zipCode: 0317
city: OSLO
country: Norway
iprHolder:
actorInfo [ComponentId=‘clarin.eu:cr1:c_1396012485194’]:
actorType: person
personInfo [ComponentId=‘clarin.eu:cr1:c_1396012485192’]:
surname: Gulbrandsen
givenName: Pål
sex: male
affiliation:
organizationInfo [ComponentId=‘clarin.eu:cr1:c_1407745711883’]:
organizationName [xml:lang=‘en’]: University of Oslo
organizationName [xml:lang=‘no’]: Universitetet i Oslo
organizationShortName [xml:lang=‘en’]: UoO
organizationShortName [xml:lang=‘no’]: UiO
departmentName [xml:lang=‘en’]: Institute of Clinical Medicine
departmentName [xml:lang=‘no’]: Institutt for klinisk medisin
communicationInfo [ComponentId=‘clarin.eu:cr1:c_1352813745460’]:
email: pal.gulbrandsen@medisin.uio.no
url: http://www.med.uio.no/klinmed/personer/vit/paalgul/index.html
address: Akershus universitetssykehus
zipCode: 1478
city: LØRENSKOG
country: Norway
actorInfo [ComponentId=‘clarin.eu:cr1:c_1396012485194’]:
actorType: person
personInfo [ComponentId=‘clarin.eu:cr1:c_1396012485192’]:
surname: Finset
givenName: Arnstein
sex: male
affiliation:
organizationInfo [ComponentId=‘clarin.eu:cr1:c_1407745711883’]:
organizationName [xml:lang=‘en’]: University of Oslo
organizationName [xml:lang=‘no’]: Universitetet i Oslo
organizationShortName [xml:lang=‘en’]: UoO
organizationShortName [xml:lang=‘no’]: UiO
departmentName [xml:lang=‘en’]: Institute of Basic Medical Sciences
departmentName [xml:lang=‘no’]: Institutt for medisinske basalfag
communicationInfo [ComponentId=‘clarin.eu:cr1:c_1352813745460’]:
email: arnstein.finset @medisin.uio.no
url: http://www.med.uio.no/imb/english/people/aca/arnsten/index.html
address: Postboks 1111 Blindern
zipCode: 0317
city: Oslo
actorInfo [ComponentId=‘clarin.eu:cr1:c_1396012485194’]:
actorType: person
personInfo [ComponentId=‘clarin.eu:cr1:c_1396012485192’]:
surname: Jensen
givenName: Bård Fossli
sex: male
contact:
actorInfo [ComponentId=‘clarin.eu:cr1:c_1396012485194’]:
actorType: organization
organizationInfo [ComponentId=‘clarin.eu:cr1:c_1407745711883’]:
organizationName: The Text Laboratory
organizationShortName: Textlab
departmentName: Department of Linguistics and Scandinavian Studies, University of Oslo
communicationInfo [ComponentId=‘clarin.eu:cr1:c_1352813745460’]:
email: tekstlab-post@iln.uio.no
url: http://www.hf.uio.no/iln/om/organisasjon/tekstlab/
address: Box 1102 Blindern
zipCode: 0317
city: OSLO
country: Norway
actorInfo [ComponentId=‘clarin.eu:cr1:c_1396012485194’]:
actorType: person
personInfo [ComponentId=‘clarin.eu:cr1:c_1396012485192’]:
surname: Gulbrandsen
givenName: Pål
sex: male
affiliation:
organizationInfo [ComponentId=‘clarin.eu:cr1:c_1407745711883’]:
organizationName [xml:lang=‘en’]: University of Oslo
organizationName [xml:lang=‘no’]: Universitetet i Oslo
organizationShortName [xml:lang=‘en’]: UoO
organizationShortName [xml:lang=‘no’]: UiO
departmentName [xml:lang=‘en’]: Institute of Clinical Medicine
departmentName [xml:lang=‘no’]: Institutt for klinisk medisin
communicationInfo [ComponentId=‘clarin.eu:cr1:c_1352813745460’]:
email: pal.gulbrandsen@medisin.uio.no
url: http://www.med.uio.no/klinmed/personer/vit/paalgul/index.html
address: Akershus universitetssykehus
zipCode: 1478
city: LØRENSKOG
country: Norway
metadataInfo [ComponentId=‘clarin.eu:cr1:c_1407745711922’]:
metadataCreationDate: 2017-04-06
metadataLastDateUpdated: 2017-08-14
metadataCreator:
actorInfo [ComponentId=‘clarin.eu:cr1:c_1396012485194’]:
actorType: person
personInfo [ComponentId=‘clarin.eu:cr1:c_1396012485192’]:
surname: Hagen
givenName: Kristin
organizationInfo [ComponentId=‘clarin.eu:cr1:c_1407745711883’]:
organizationName: The Text Laboratory
organizationShortName: Textlab
departmentName: Department of Linguistics and Scandinavian Studies, University of Oslo
communicationInfo [ComponentId=‘clarin.eu:cr1:c_1352813745460’]:
email: kristin.hagen@iln.uio.no
url: http://www.hf.uio.no/iln/om/organisasjon/tekstlab/
address: Box 1102 Blindern
zipCode: 0317
city: OSLO
country: Norway
versionInfo [ComponentId=‘clarin.eu:cr1:c_1430905751648’]:
version: Version 2
lastDateUpdated: 2015-06-01
validationInfo [ComponentId=‘clarin.eu:cr1:c_1407745711923’]:
validated: true
validationType: content
validationMode: manual
validationModeDetails: The transcriptions are proof read against the audio files.
validationExtent: full
validator:
actorInfo [ComponentId=‘clarin.eu:cr1:c_1396012485194’]:
actorType: organization
organizationInfo [ComponentId=‘clarin.eu:cr1:c_1407745711883’]:
organizationName: The Text Laboratory
organizationShortName: Textlab
departmentName: Department of Linguistics and Scandinavian Studies, University of Oslo
communicationInfo [ComponentId=‘clarin.eu:cr1:c_1352813745460’]:
email: tekstlab-post@iln.uio.no
url: http://www.hf.uio.no/iln/om/organisasjon/tekstlab/
address: Box 1102 Blindern
zipCode: 0317
city: OSLO
country: Norway
resourceDocumentationInfo [ComponentId=‘clarin.eu:cr1:c_1355150532301’]:
documentationUnstructured [ComponentId=‘clarin.eu:cr1:c_1355150532302’]:
role: documentation
documentUnstructured: In Norwegian only:
http://www.hf.uio.no/iln/om/organisasjon/tekstlab/prosjekter/lege-pasient/lege-pasient-transkripsjonsveil7.pdf
resourceCreationInfo [ComponentId=‘clarin.eu:cr1:c_1407745711921’]:
creationStartDate: 2007-01-01
creationEndDate: 2015-06-01
corpusInfo [ComponentId=‘clarin.eu:cr1:c_1407745711878’]:
corpusType: Multimodal Corpus
corpusPartInfo [ComponentId=‘clarin.eu:cr1:c_1407745711885’]:
mediaType: text
corpusTextInfo [ComponentId=‘clarin.eu:cr1:c_1396012485188’]:
textFormatInfo [ComponentId=‘clarin.eu:cr1:c_1427452477072’]:
mimeType: txt
sizePerTextFormat [ComponentId=‘clarin.eu:cr1:c_1447674760342’]:
sizeInfo [ComponentId=‘clarin.eu:cr1:c_1353678848785’]:
size: 958 830
sizeUnit: words
characterEncodingInfo [ComponentId=‘clarin.eu:cr1:c_1447674760355’]:
characterEncoding: Latin1
corpusPartInfo [ComponentId=‘clarin.eu:cr1:c_1407745711885’]:
mediaType: video
corpusVideoInfo [ComponentId=‘clarin.eu:cr1:c_1407745711880’]:
videoContentInfo [ComponentId=‘clarin.eu:cr1:c_1360931019779’]:
typeOfVideoContent: Conversations between doctors and patients in different types of consultations
videoFormatInfo [ComponentId=‘clarin.eu:cr1:c_1427452477073’]:
mimeType: The videos are not available in the corpus due to the sensitiveness of the conversations
corpusPartGeneralInfo [ComponentId=‘clarin.eu:cr1:c_1407745711882’]:
personSourceSetInfo [ComponentId=‘clarin.eu:cr1:c_1360931019775’]:
numberOfPersons: 220
ageOfPersons: teenager
ageOfPersons: adult
ageOfPersons: elderly
ageRangeStart: 1
ageRangeEnd: 100
sexOfPersons: mixed
originOfPersons: mixed
dialectAccentOfPersons: Some of the patients and doctors speak Norwegian as a second language.
lingualityInfo [ComponentId=‘clarin.eu:cr1:c_1355150532313’]:
lingualityType: monolingual
languageInfo [ComponentId=‘clarin.eu:cr1:c_1428388179423’]:
languageId: No
languageName: Norwegian
languageInfo [ComponentId=‘clarin.eu:cr1:c_1428388179423’]:
languageId: Nb
languageName: Norwegian Bokmål
modalityInfo [ComponentId=‘clarin.eu:cr1:c_1447674760356’]:
modalityType: spokenLanguage
modalityTypeDetails: Orthographic transcription of 220 patients (some together with their next of kin) their doctors and other health personnel. There are many descriptions/stage directions of the consultant situation to make up for the missing videos.
sizeInfo [ComponentId=‘clarin.eu:cr1:c_1353678848785’]:
size: 958 830
sizeUnit: words
annotationInfo [ComponentId=‘clarin.eu:cr1:c_1407745711924’]:
annotationType: morphosyntacticAnnotation-posTagging
annotatedElements: other
segmentationLevel: word
tagset: POS tagset created for the statistical NoTa-tagger - based on the tagset of the Oslo Bergen Tagger.
tagsetLanguageId: Nb
tagsetLanguageName: Norwegian Bokmål
theoreticModel: TreeTagger
annotationMode: automatic
annotationManualStructured [ComponentId=‘clarin.eu:cr1:c_1361876010647’]:
role: annotationManual
documentInfo [ComponentId=‘clarin.eu:cr1:c_1353678848788’]:
documentType: article
title [xml:lang=‘en’]: Tagging a Norwegian Speech Corpus
author: Anders Nøklestad and Åshild Søfteland
editor: Joakim Nivre,Heiki-Jaan Kaalep,Kadri Muischnek, Mare Koit
year: 2007
bookTitle: Proceedings of the 16th Nordic Conference of Computational Linguistics NODALIDA-2007
pages: 245–248
conference: Nodalida 2007
documentLanguageName: English
documentLanguageId: en
annotationManualStructured [ComponentId=‘clarin.eu:cr1:c_1361876010647’]:
role: annotationManual
documentInfo [ComponentId=‘clarin.eu:cr1:c_1353678848788’]:
documentType: article
title [xml:lang=‘nb’]: Manuell morfologisk
tagging av NoTa-materialet med støtte fra en statistisk tagger.
author: Åshild Søfteland og Anders Nøklestad
editor: Janne Bondi Johannessen og Kristin Hagen
year: 2008
publisher: Novus forlag
bookTitle: Språk i Oslo. Ny forskning omkring talespråk
pages: 226–234.
ISBN: 978-82-7099-471-7
documentLanguageName: Norwegian
documentLanguageId: nb
annotationManualStructured [ComponentId=‘clarin.eu:cr1:c_1361876010647’]:
role: annotationManual
documentInfo [ComponentId=‘clarin.eu:cr1:c_1353678848788’]:
documentType: manual
title [xml:lang=‘nb’]: NoTa-taggeren: TAGGEVEILEDNING
author: Åshild Søfteland
year: 2007
url: http://www.tekstlab.uio.no/nota/oslo/Taggeveiledning2.pdf
documentLanguageName: Norwegian bokmål
documentLanguageId: nb
annotationInfo [ComponentId=‘clarin.eu:cr1:c_1407745711924’]:
annotationType: speechAnnotation-orthographicTranscription
annotationManualUnstructured [ComponentId=‘clarin.eu:cr1:c_1355150532325’]:
role: annotationManual
documentUnstructured: Orthographic transcription,cf Bokmålsordboka (Wangensteen 2004)
annotationManualStructured [ComponentId=‘clarin.eu:cr1:c_1361876010647’]:
role: annotationManual
documentInfo [ComponentId=‘clarin.eu:cr1:c_1353678848788’]:
documentType: manual
title [xml:lang=‘nb’]: Transkripsjonsveiledning for NoTa-Oslo
author: Kristin Hagen
year: 2008
url: http://www.tekstlab.uio.no/nota/oslo/transkripsjon/NoTa-transkripsjonsveil22.pdf
annotationTool [ComponentId=‘clarin.eu:cr1:c_1355150532326’]:
targetResourceNameURI: Transcriber (http://trans.sourceforge.net/en/presentation.php
)
classificationInfo [ComponentId=‘clarin.eu:cr1:c_1403588862809’]:
genreInfo [ComponentId=‘clarin.eu:cr1:c_1407745711877’]:
genreType: speechGenre
genre: informal
unstandardisedGenre: patient conversations
timeCoverageInfo [ComponentId=‘clarin.eu:cr1:c_1447674760358’]:
timeCoverage: 2007 - 2008