CMDI 1.1 Metadata

Header

MdCreator: Kristin Hagen

MdCreationDate: 2017-06-20

MdProfile: clarin.eu:cr1:p_1407745711925

MdCollectionDisplayName: Clarino - Textlab

Resources

ResourceProxyList:

ResourceProxy [id=‘LBK2013-lp’]:

ResourceType [mimetype=‘’]: LandingPage

ResourceRef: https://www.hf.uio.no/iln/om/organisasjon/tekstlab/prosjekter/lbk/

JournalFileProxyList:

ResourceRelationList:

IsPartOfList:

Components

corpusProfile:

resourceCommonInfo [ComponentId=‘clarin.eu:cr1:c_1396012485126’]:

resourceType: corpus

identificationInfo [ComponentId=‘clarin.eu:cr1:c_1396012485125’]:

resourceName [xml:lang=‘nb’]: Leksikografisk bokmålskorpus

resourceName [xml:lang=‘en’]: The Lexicographic Corpus for Norwegian Bokmål

description [xml:lang=‘en’]: The corpus consists of texts collected from available literature/prose from 1985 to 2013. The corpus is composed of texts from five genres: non-fiction prose (45 %) fiction (35 %) newpapers/magazines (10 %), TV subtitles (5 %), and non-standardized, unpublished texts (5 %), all in all 100 mill words.
The corpus is grammatically tagged with the original version of The Oslo-Bergen tagger.

description [xml:lang=‘nb’]: Korpuset består av tekster hentet fra tilgjengelig litteratur/prosa fra 1985 til 2013. Korpuset har tekster fra fem sjangere: sakprosa (45%) skjønnlitteratur (35%) aviser og periodika (10%), TV-teksting( 5%), og upublisert materiale, småtrykk (5%), alt i alt 100 mill ord.
Korpuset er grammatisk merket med den opprinnelige versjonen av Oslo-Bergen taggeren.

resourceShortName [xml:lang=‘nb’]: LBK2013

resourceShortName [xml:lang=‘en’]: LBK2013

url: http://www.hf.uio.no/iln/om/organisasjon/tekstlab/prosjekter/lbk/

PID: http://hdl.handle.net/11538/0000-000B-C022-5

distributionInfo [ComponentId=‘clarin.eu:cr1:c_1396012485124’]:

licenceInfo [ComponentId=‘clarin.eu:cr1:c_1396012485158’]:

userCategory: Academic

distributionAccessMedium: accessibleThroughInterface

executionLocation: https://tekstlab.uio.no/glossa3/bokmal

licence [ComponentId=‘clarin.eu:cr1:c_1447674760330’]:

licenceFamily: CLARIN

licenceName: CLARIN_ACA-NC-LOC-ND

licenceURL: https://kitwiki.csc.fi/twiki/bin/view/FinCLARIN/ClarinEulaAca?ID=1&AFFIL=EDU&BY=1&NC=1&LOC=1&NORED=1&ND=1

conditionsOfUse: BY

conditionsOfUse: ID

conditionsOfUse: LOC

conditionsOfUse: NC

conditionsOfUse: ND

conditionsOfUse: NORED

nonStandardConditionsOfUse: Due to agreements with the third party copyright holders, the corpus is only available through Glossa, a search and post-processing tool developed by the Text Laboratory.

licensor:

actorInfo [ComponentId=‘clarin.eu:cr1:c_1396012485194’]:

actorType: organization

organizationInfo [ComponentId=‘clarin.eu:cr1:c_1407745711883’]:

organizationName [xml:lang=‘en’]: University of Oslo

organizationName [xml:lang=‘no’]: Universitetet i Oslo

organizationShortName [xml:lang=‘no’]: UiO

organizationShortName [xml:lang=‘en’]: UoO

departmentName [xml:lang=‘en’]: Department of Linguistics and Scandinavian Studies

departmentName [xml:lang=‘no’]: Institutt for lingvistiske og nordiske studier (ILN)

communicationInfo [ComponentId=‘clarin.eu:cr1:c_1352813745460’]:

email: r.e.v.fjeld@iln.uio.no

url: http://www.hf.uio.no/iln/english/

address: Box 1102 Blindern

zipCode: 0317

city: OSLO

country: Norway

distributionRightsHolder:

actorInfo [ComponentId=‘clarin.eu:cr1:c_1396012485194’]:

actorType: organization

organizationInfo [ComponentId=‘clarin.eu:cr1:c_1407745711883’]:

organizationName [xml:lang=‘en’]: University of Oslo

organizationName [xml:lang=‘no’]: Universitetet i Oslo

organizationShortName [xml:lang=‘no’]: UiO

organizationShortName [xml:lang=‘en’]: UoO

departmentName [xml:lang=‘en’]: Department of Linguistics and Scandinavian Studies

departmentName [xml:lang=‘no’]: Institutt for lingvistiske og nordiske studier (ILN)

communicationInfo [ComponentId=‘clarin.eu:cr1:c_1352813745460’]:

email: tekstlab-post@iln.uio.no

url: http://www.hf.uio.no/iln/english/

address: Box 1102 Blindern

zipCode: 0317

city: OSLO

country: Norway

contact:

actorInfo [ComponentId=‘clarin.eu:cr1:c_1396012485194’]:

actorType: person

organizationInfo [ComponentId=‘clarin.eu:cr1:c_1407745711883’]:

organizationName: The Text Laboratory

organizationShortName: Textlab

departmentName: Department of Linguistics and Scandinavian Studies, University of Oslo

communicationInfo [ComponentId=‘clarin.eu:cr1:c_1352813745460’]:

email: tekstlab-post@iln.uio.no

url: http://www.hf.uio.no/iln/om/organisasjon/tekstlab/

address: Box 1102 Blindern

zipCode: 0317

city: OSLO

country: Norway

actorInfo [ComponentId=‘clarin.eu:cr1:c_1396012485194’]:

actorType: person

personInfo [ComponentId=‘clarin.eu:cr1:c_1396012485192’]:

surname: Ruth E. Vatvedt

givenName: Fjeld

affiliation:

organizationInfo [ComponentId=‘clarin.eu:cr1:c_1407745711883’]:

organizationName [xml:lang=‘en’]: University of Oslo

organizationName [xml:lang=‘no’]: Universitetet i Oslo

organizationShortName [xml:lang=‘no’]: UiO

organizationShortName [xml:lang=‘en’]: UoO

departmentName [xml:lang=‘en’]: Department of Linguistics and Scandinavian Studies

departmentName [xml:lang=‘no’]: Institutt for lingvistiske og nordiske studier (ILN)

metadataInfo [ComponentId=‘clarin.eu:cr1:c_1407745711922’]:

metadataCreationDate: 2015-08-07

metadataLastDateUpdated: 2023-11-22

metadataCreator:

actorInfo [ComponentId=‘clarin.eu:cr1:c_1396012485194’]:

actorType: person

personInfo [ComponentId=‘clarin.eu:cr1:c_1396012485192’]:

surname: Hagen

givenName: Kristin

organizationInfo [ComponentId=‘clarin.eu:cr1:c_1407745711883’]:

organizationName: The Text Laboratory

organizationShortName: Textlab

departmentName: Department of Linguistics and Scandinavian Studies, University of Oslo

communicationInfo [ComponentId=‘clarin.eu:cr1:c_1352813745460’]:

email: kristin.hagen@iln.uio.no

url: http://www.hf.uio.no/iln/om/organisasjon/tekstlab/

address: Box 1102 Blindern

zipCode: 0317

city: OSLO

country: Norway

versionInfo [ComponentId=‘clarin.eu:cr1:c_1430905751648’]:

version: 2013

resourceDocumentationInfo [ComponentId=‘clarin.eu:cr1:c_1355150532301’]:

documentationUnstructured [ComponentId=‘clarin.eu:cr1:c_1355150532302’]:

role: documentation

documentUnstructured: http://www.hf.uio.no/iln/om/organisasjon/tekstlab/prosjekter/lbk/

resourceCreationInfo [ComponentId=‘clarin.eu:cr1:c_1407745711921’]:

creationEndDate: 2013-12-31

resourceCreator:

actorInfo [ComponentId=‘clarin.eu:cr1:c_1396012485194’]:

actorType: organization

organizationInfo [ComponentId=‘clarin.eu:cr1:c_1407745711883’]:

organizationName [xml:lang=‘en’]: University of Oslo

organizationName [xml:lang=‘no’]: Universitetet i Oslo

organizationShortName [xml:lang=‘no’]: UiO

organizationShortName [xml:lang=‘en’]: UoO

departmentName [xml:lang=‘en’]: Department of Linguistics and Scandinavian Studies

departmentName [xml:lang=‘no’]: Institutt for lingvistiske og nordiske studier (ILN)

communicationInfo [ComponentId=‘clarin.eu:cr1:c_1352813745460’]:

email: r.e.v.fjeld@iln.uio.no

url: http://www.hf.uio.no/iln/om/organisasjon/tekstlab/

address: Box 1102 Blindern

zipCode: 0317

city: OSLO

country: Norway

actorInfo [ComponentId=‘clarin.eu:cr1:c_1396012485194’]:

actorType: organization

organizationInfo [ComponentId=‘clarin.eu:cr1:c_1407745711883’]:

organizationName: The Text Laboratory

organizationShortName: Textlab

departmentName: Department of Linguistics and Scandinavian Studies, University of Oslo

communicationInfo [ComponentId=‘clarin.eu:cr1:c_1352813745460’]:

email: tekstlab-post@iln.uio.no

url: http://www.hf.uio.no/iln/om/organisasjon/tekstlab/

address: Box 1102 Blindern

zipCode: 0317

city: OSLO

country: Norway

corpusInfo [ComponentId=‘clarin.eu:cr1:c_1407745711878’]:

corpusType: Written Corpus

corpusPartInfo [ComponentId=‘clarin.eu:cr1:c_1407745711885’]:

mediaType: text

corpusTextInfo [ComponentId=‘clarin.eu:cr1:c_1396012485188’]:

textFormatInfo [ComponentId=‘clarin.eu:cr1:c_1427452477072’]:

mimeType: txt

characterEncodingInfo [ComponentId=‘clarin.eu:cr1:c_1447674760355’]:

characterEncoding: latin1

corpusPartGeneralInfo [ComponentId=‘clarin.eu:cr1:c_1407745711882’]:

sourceWorkInfo [ComponentId=‘clarin.eu:cr1:c_1407745712071’]:

workDescription: The corpus consists of texts collected from available literature/prose from 1985 to 2013. The corpus is composed of texts from five genres: non-fiction prose (45 %) fiction (35 %) newpapers/magazines (10 %), TV subtitles (5 %), and non-standardized, unpublished texts (5 %), all in all 100 mill words.

lingualityInfo [ComponentId=‘clarin.eu:cr1:c_1355150532313’]:

lingualityType: monolingual

languageInfo [ComponentId=‘clarin.eu:cr1:c_1428388179423’]:

languageId: Nb

languageName: Norwegian Bokmål

modalityInfo [ComponentId=‘clarin.eu:cr1:c_1447674760356’]:

modalityType: writtenLanguage

sizeInfo [ComponentId=‘clarin.eu:cr1:c_1353678848785’]:

size: 100 mill

sizeUnit: tokens

annotationInfo [ComponentId=‘clarin.eu:cr1:c_1407745711924’]:

annotationType: morphosyntacticAnnotation-posTagging

annotationType: lemmatization

segmentationLevel: word

tagset: The Oslo Bergen-tagger tagset: http://tekstlab.uio.no/obt-ny/english/index.html

tagsetLanguageId: Nb

tagsetLanguageName: Norwegian bokmål

theoreticModel: Constraint grammar

annotationMode: automatic

annotationManualUnstructured [ComponentId=‘clarin.eu:cr1:c_1355150532325’]:

role: annotationManual

documentUnstructured: http://www.tekstlab.uio.no/obt-ny/english/index.html

annotationTool [ComponentId=‘clarin.eu:cr1:c_1355150532326’]:

targetResourceNameURI: The Oslo-Bergen Tagger: http://tekstlab.uio.no/obt-ny/english/index.html

classificationInfo [ComponentId=‘clarin.eu:cr1:c_1403588862809’]:

genreInfo [ComponentId=‘clarin.eu:cr1:c_1407745711877’]:

genreType: textGenre

genre: factual prose

genreInfo [ComponentId=‘clarin.eu:cr1:c_1407745711877’]:

genreType: textGenre

genre: fiction and drama

genreInfo [ComponentId=‘clarin.eu:cr1:c_1407745711877’]:

genreType: textGenre

genre: newspaper and magazines

genreInfo [ComponentId=‘clarin.eu:cr1:c_1407745711877’]:

genreType: textGenre

genre: unstandardised

timeCoverageInfo [ComponentId=‘clarin.eu:cr1:c_1447674760358’]:

timeCoverage: 1985 - 2013