CMDI 1.2 Metadata
Header
cmd:MdCreator: Kristin Hagen
cmd:MdCreationDate: 2024-01-08
cmd:MdSelfLink:
cmd:MdProfile: clarin.eu:cr1:p_1422885449331
cmd:MdCollectionDisplayName: Clarino - Textlab
Resources
cmd:ResourceProxyList:
cmd:ResourceProxy [id=‘ndc-parser-lp’]:
cmd:ResourceType [mimetype=‘’]: LandingPage
cmd:ResourceRef: https://tekstlab.uio.no/nota/scandiasyn/treebank.html
cmd:ResourceProxy [id=‘ndc-parser’]:
cmd:ResourceType [mimetype=‘’]: Resource
cmd:ResourceRef: https://github.com/textlab/spoken_norwegian_resources/tree/master/parsers/clarino/ndc
cmd:ResourceProxy [id=‘ndc-treebank’]:
cmd:ResourceType [mimetype=‘’]: Resource
cmd:ResourceRef: https://github.com/textlab/spoken_norwegian_resources/tree/master/treebanks/Norwegian-BokmaalNDC
cmd:JournalFileProxyList:
cmd:ResourceRelationList:
cmd:ResourceRelation:
cmd:RelationType: partOF
cmd:Resource:
cmd:Role:
cmd:Resource:
cmd:Role:
cmd:ResourceRelation:
cmd:RelationType: trainedOn
cmd:Resource:
cmd:Role:
cmd:Resource:
cmd:Role:
Components
cmdp:toolProfile:
cmdp:resourceCommonInfo [cmd:ComponentRef=‘clarin.eu:cr1:c_1396012485126’] [cmd:ref=‘ndc-parser-lp’]:
cmdp:resourceType: toolService
cmdp:identificationInfo [cmd:ComponentRef=‘clarin.eu:cr1:c_1396012485125’] [cmd:ref=‘ndc-parser-lp’]:
cmdp:resourceName [cmd:cmd=‘lia-parser-lp’] [xml:lang=‘en’]: The NDC parser
cmdp:resourceName [xml:lang=‘no’]: NDC-parseren
cmdp:description [xml:lang=‘en’]: The NDC parser is a dependency parser for spoken Norwegian dialects trancribed to Bokmål. The parser is trained on the NDC Treebank.
The NDC parser is a so-called transition-based dependency parser, UUParser, developed at Uppsala University.
cmdp:description [xml:lang=‘no’]: NDC-parseren er en dependensparser for transkripsjoner av norske dialekter på bokmål. Parseren er trent på NDC-trebanken. NDC-parseren er en såkalt transition-based dependensparser, UUparser, utviklet ved Uppsala Universitet.
cmdp:resourceShortName: NDC parser
cmdp:url: https://tekstlab.uio.no/nota/scandiasyn/treebank.html
cmdp:PID: https://hdl.handle.net/11538/32D34B83-2
cmdp:distributionInfo [cmd:ComponentRef=‘clarin.eu:cr1:c_1396012485124’] [cmd:ref=‘ndc-parser-lp’]:
cmdp:licenceInfo [cmd:ComponentRef=‘clarin.eu:cr1:c_1396012485158’] [cmd:ref=‘ndc-parser-lp’]:
cmdp:userCategory: Public
cmdp:distributionAccessMedium: downloadable
cmdp:downloadLocation: https://github.com/textlab/spoken_norwegian_resources/tree/master/parsers/clarino/ndc
cmdp:licence [cmd:ComponentRef=‘clarin.eu:cr1:c_1447674760330’]:
cmdp:licenceFamily: Creative Commons (CC)
cmdp:licenceName: Creative_Commons-BY-NC-SA (CC-BY-NC-SA)
cmdp:licenceURL: https://creativecommons.org/licenses/by-nc-sa/4.0/
cmdp:conditionsOfUse: BY
cmdp:conditionsOfUse: NC
cmdp:conditionsOfUse: SA
cmdp:licensor:
cmdp:actorInfo [cmd:ComponentRef=‘clarin.eu:cr1:c_1396012485194’]:
cmdp:actorType: organization
cmdp:organizationInfo [cmd:ComponentRef=‘clarin.eu:cr1:c_1407745711883’]:
cmdp:organizationName [xml:lang=‘en’]: University of Oslo
cmdp:organizationName [xml:lang=‘no’]: Universitetet i Oslo
cmdp:organizationShortName [xml:lang=‘no’]: UiO
cmdp:organizationShortName [xml:lang=‘en’]: UoO
cmdp:departmentName [xml:lang=‘en’]: Department of Linguistics and Scandinavian Studies
cmdp:departmentName [xml:lang=‘no’]: Institutt for lingvistiske og nordiske studier (ILN)
cmdp:communicationInfo [cmd:ComponentRef=‘clarin.eu:cr1:c_1352813745460’]:
cmdp:email: tekstlab-post@iln.uio.no
cmdp:url: http://www.hf.uio.no/iln/om/organisasjon/tekstlab/
cmdp:address: Box 1102 Blindern
cmdp:zipCode: 0317
cmdp:city: OSLO
cmdp:country: Norway
cmdp:distributionRightsHolder:
cmdp:actorInfo [cmd:ComponentRef=‘clarin.eu:cr1:c_1396012485194’]:
cmdp:actorType: organization
cmdp:organizationInfo [cmd:ComponentRef=‘clarin.eu:cr1:c_1407745711883’]:
cmdp:organizationName [xml:lang=‘en’]: University of Oslo
cmdp:organizationName [xml:lang=‘no’]: Universitetet i Oslo
cmdp:organizationShortName [xml:lang=‘no’]: UiO
cmdp:organizationShortName [xml:lang=‘en’]: UoO
cmdp:departmentName [xml:lang=‘en’]: Department of Linguistics and Scandinavian Studies
cmdp:departmentName [xml:lang=‘no’]: Institutt for lingvistiske og nordiske studier (ILN)
cmdp:communicationInfo [cmd:ComponentRef=‘clarin.eu:cr1:c_1352813745460’]:
cmdp:email: tekstlab-post@iln.uio.no
cmdp:url: http://www.hf.uio.no/iln/english/
cmdp:address: Box 1102 Blindern
cmdp:zipCode: 0317
cmdp:city: OSLO
cmdp:country: Norway
cmdp:iprHolder:
cmdp:actorInfo [cmd:ComponentRef=‘clarin.eu:cr1:c_1396012485194’] [cmd:ref=‘ndc-parser-lp’]:
cmdp:actorType: organization
cmdp:organizationInfo [cmd:ComponentRef=‘clarin.eu:cr1:c_1407745711883’]:
cmdp:organizationName: The Text Laboratory
cmdp:organizationShortName: Textlab
cmdp:departmentName: Department of Linguistics and Scandinavian Studies, University of Oslo
cmdp:contact [cmd:ref=‘ndc-parser-lp’]:
cmdp:actorInfo [cmd:ComponentRef=‘clarin.eu:cr1:c_1396012485194’] [cmd:ref=‘ndc-parser-lp’]:
cmdp:actorType: organization
cmdp:organizationInfo [cmd:ComponentRef=‘clarin.eu:cr1:c_1407745711883’]:
cmdp:organizationName: The Text Laboratory
cmdp:organizationShortName: Textlab
cmdp:departmentName: Department of Linguistics and Scandinavian Studies, University of Oslo
cmdp:communicationInfo [cmd:ComponentRef=‘clarin.eu:cr1:c_1352813745460’]:
cmdp:email: tekstlab-post@iln.uio.no
cmdp:url: http://www.hf.uio.no/iln/om/organisasjon/tekstlab/
cmdp:address: Box 1102 Blindern
cmdp:zipCode: 0317
cmdp:city: OSLO
cmdp:country: Norway
cmdp:metadataInfo [cmd:ComponentRef=‘clarin.eu:cr1:c_1407745711922’] [cmd:ref=‘ndc-parser-lp’]:
cmdp:metadataCreationDate: 2024-04-08
cmdp:metadataLastDateUpdated: 2024-01-11
cmdp:metadataCreator [cmd:ref=‘ndc-parser-lp’]:
cmdp:actorInfo [cmd:ComponentRef=‘clarin.eu:cr1:c_1396012485194’]:
cmdp:actorType: person
cmdp:personInfo [cmd:ComponentRef=‘clarin.eu:cr1:c_1396012485192’]:
cmdp:surname: Hagen
cmdp:givenName: Kristin
cmdp:organizationInfo [cmd:ComponentRef=‘clarin.eu:cr1:c_1407745711883’]:
cmdp:organizationName: The Text Laboratory
cmdp:organizationShortName: Textlab
cmdp:departmentName: Department of Linguistics and Scandinavian Studies, University of Oslo
cmdp:communicationInfo [cmd:ComponentRef=‘clarin.eu:cr1:c_1352813745460’]:
cmdp:email: kristin.hagen@iln.uio.no
cmdp:url: http://www.hf.uio.no/iln/om/organisasjon/tekstlab/
cmdp:address: Box 1102 Blindern
cmdp:zipCode: 0317
cmdp:city: OSLO
cmdp:country: Norway
cmdp:validationInfo [cmd:ComponentRef=‘clarin.eu:cr1:c_1407745711923’] [cmd:ref=‘ndc-parser-lp’]:
cmdp:validated: true
cmdp:validationModeDetails: In order to quantify the parsability i.e. the quality that can be induced by a parser based on the annotations of the treebank; we partitioned the treebank in n folds and performed a n-fold cross validation with n=5 (given the size of the treebank):

UAS (unlabelled attachment score): 84.11
LAS (labelled attachment score): 78.43
cmdp:resourceDocumentationInfo [cmd:ComponentRef=‘clarin.eu:cr1:c_1355150532301’]:
cmdp:resourceCreationInfo [cmd:ComponentRef=‘clarin.eu:cr1:c_1407745711921’] [cmd:ref=‘ndc-parser-lp’]:
cmdp:creationStartDate: 2019
cmdp:creationEndDate: 2024
cmdp:resourceCreator [cmd:ref=‘ndc-parser’]:
cmdp:actorInfo [cmd:ComponentRef=‘clarin.eu:cr1:c_1396012485194’]:
cmdp:actorType: organization
cmdp:personInfo [cmd:ComponentRef=‘clarin.eu:cr1:c_1396012485192’]:
cmdp:surname: Kåsen
cmdp:givenName: Andre
cmdp:affiliation:
cmdp:organizationInfo [cmd:ComponentRef=‘clarin.eu:cr1:c_1407745711883’]:
cmdp:organizationName: Nasjonalbiblioteket
cmdp:organizationShortName: NB
cmdp:communicationInfo [cmd:ComponentRef=‘clarin.eu:cr1:c_1352813745460’]:
cmdp:email: andre.kaasen@gmail.com
cmdp:url: https://www.nb.no/sprakbanken/
cmdp:fundingProject:
cmdp:projectInfo [cmd:ComponentRef=‘clarin.eu:cr1:c_1430905751647’]:
cmdp:projectName: Common Language Resources and Technology Infrastructure Norway +
cmdp:projectShortName: CLARINO +
cmdp:projectID: 295700
cmdp:url: http://clarin.b.uib.no/
cmdp:fundingType: nationalFunds
cmdp:funder: the Research Council of Norway
cmdp:fundingCountry: Norway
cmdp:projectStartDate: 2020-03-01
cmdp:projectEndDate: 2023-12-31
cmdp:toolInfo [cmd:ComponentRef=‘clarin.eu:cr1:c_1422885449327’]:
cmdp:description: The NDC parser is a dependency parser trained on the NDC Treebank. The parser is a so-called transition-based dependency parser, UUParser (https://github.com/UppsalaNLP/uuparser), developed at Uppsala University.
cmdp:inputInfo [cmd:ComponentRef=‘clarin.eu:cr1:c_1360931019804’]:
cmdp:mediaType: text
cmdp:resourceType: corpus
cmdp:modalityType: spokenLanguage
cmdp:languageName: Norwegian
cmdp:languageName: Norwegian Bokmål
cmdp:languageId: No
cmdp:languageId: Nb
cmdp:mimeType: txt, xml
cmdp:characterEncoding: utf-8
cmdp:annotationType: syntacticAnnotation-treebanks
cmdp:tagset: http://www.tekstlab.uio.no/obt-ny/english/tagset.html
cmdp:segmentationLevel: word
cmdp:segmentationLevel: utterance
cmdp:outputInfo [cmd:ComponentRef=‘clarin.eu:cr1:c_1360931019824’]:
cmdp:mediaType: text
cmdp:resourceType: corpus
cmdp:modalityType: spokenLanguage
cmdp:languageName: Norwegian
cmdp:languageName: Norwegian Bokmål
cmdp:languageId: No
cmdp:languageId: Nb
cmdp:mimeType: txt, xml
cmdp:characterEncoding: utf-8
cmdp:tagset: http://www.tekstlab.uio.no/obt-ny/english/tagset.html
cmdp:segmentationLevel: utterance
cmdp:segmentationLevel: word