CMDI 1.1 Metadata
Header
MdCreator: Kristin Hagen
MdCreationDate: 2023-06-20
MdProfile: clarin.eu:cr1:p_1422885449331
MdCollectionDisplayName: Clarino - Textlab
Resources
ResourceProxyList:
ResourceProxy [id=‘lia-parser-lp’]:
ResourceType [mimetype=‘’]: LandingPage
ResourceRef: https://tekstlab.uio.no/LIA/norsk/index_english.html#parser
ResourceProxy [id=‘lia-parser’]:
ResourceType [mimetype=‘’]: Resource
ResourceRef: https://github.com/textlab/spoken_norwegian_resources/tree/master/parsers/clarino/lia
ResourceProxy [id=‘lia-treebank’]:
ResourceType [mimetype=‘’]: Resource
ResourceRef: https://github.com/textlab/spoken_norwegian_resources/tree/master/treebanks/Norwegian-NynorskLIA
JournalFileProxyList:
ResourceRelationList:
ResourceRelation:
RelationType: partOF
Res1 [ref=‘lia-parser-lp’]:
Res2 [ref=‘lia-parser’]:
ResourceRelation:
RelationType: trainedOn
Res1 [ref=‘lia-parser’]:
Res2 [ref=‘lia-treebank’]:
IsPartOfList:
Components
toolProfile:
resourceCommonInfo [ComponentId=‘clarin.eu:cr1:c_1396012485126’]:
resourceType [ref=‘lia-parser-lp’]: toolService
identificationInfo [ComponentId=‘clarin.eu:cr1:c_1396012485125’] [ref=‘lia-parser-lp’]:
resourceName [cmd=‘lia-parser-lp’] [xml:lang=‘en’]: The LIA parser
resourceName [xml:lang=‘no’]: LIA-parseren
description [ref=‘lia-parser-lp’] [xml:lang=‘en’]: The LIA parser is a dependency parser for spoken Norwegian dialects transcribed into Nynorsk. The parser is trained on the LIA Treebank.
The LIA parser uses UUParser, a transition-based dependency parser developed at Uppsala University.
description [ref=‘lia-parser-lp’] [xml:lang=‘no’]: LIA-parseren er ein dependensparser for transkripsjoner av norske dialekter på nynorsk. Parseren er trent på LIA-trebanken. LIA-parseren er ein såkalla transition-based dependensparser, UUparser, utvikla ved Uppsala Universitet.
resourceShortName [ref=‘lia-parser-lp’]: LIA parser
url [ref=‘lia-parser-lp’]: https://tekstlab.uio.no/LIA/norsk/index_english.html#parser
url: https://tekstlab.uio.no/LIA/parser.html
distributionInfo [ComponentId=‘clarin.eu:cr1:c_1396012485124’] [ref=‘lia-parser-lp’]:
licenceInfo [ComponentId=‘clarin.eu:cr1:c_1396012485158’] [ref=‘lia-parser-lp’]:
userCategory: Public
distributionAccessMedium: downloadable
downloadLocation [ref=‘lia-parser’]: https://github.com/textlab/spoken_norwegian_resources/tree/master/parsers/clarino/lia
licence [ComponentId=‘clarin.eu:cr1:c_1447674760330’]:
licenceFamily: Creative Commons (CC)
licenceName: Creative_Commons-BY-NC-SA (CC-BY-NC-SA)
licenceURL: https://creativecommons.org/licenses/by-nc-sa/4.0/
conditionsOfUse: BY
conditionsOfUse: NC
conditionsOfUse: SA
licensor:
actorInfo [ComponentId=‘clarin.eu:cr1:c_1396012485194’]:
actorType: organization
organizationInfo [ComponentId=‘clarin.eu:cr1:c_1407745711883’]:
organizationName [xml:lang=‘en’]: University of Oslo
organizationName [xml:lang=‘no’]: Universitetet i Oslo
organizationShortName [xml:lang=‘no’]: UiO
organizationShortName [xml:lang=‘en’]: UoO
departmentName [xml:lang=‘en’]: Department of Linguistics and Scandinavian Studies
departmentName [xml:lang=‘no’]: Institutt for lingvistiske og nordiske studier (ILN)
communicationInfo [ComponentId=‘clarin.eu:cr1:c_1352813745460’]:
email: tekstlab-post@iln.uio.no
url: http://www.hf.uio.no/iln/om/organisasjon/tekstlab/
address: Box 1102 Blindern
zipCode: 0317
city: OSLO
country: Norway
distributionRightsHolder:
actorInfo [ComponentId=‘clarin.eu:cr1:c_1396012485194’]:
actorType: organization
organizationInfo [ComponentId=‘clarin.eu:cr1:c_1407745711883’]:
organizationName [xml:lang=‘en’]: University of Oslo
organizationName [xml:lang=‘no’]: Universitetet i Oslo
organizationShortName [xml:lang=‘no’]: UiO
organizationShortName [xml:lang=‘en’]: UoO
departmentName [xml:lang=‘en’]: Department of Linguistics and Scandinavian Studies
departmentName [xml:lang=‘no’]: Institutt for lingvistiske og nordiske studier (ILN)
communicationInfo [ComponentId=‘clarin.eu:cr1:c_1352813745460’]:
email: tekstlab-post@iln.uio.no
url: http://www.hf.uio.no/iln/english/
address: Box 1102 Blindern
zipCode: 0317
city: OSLO
country: Norway
iprHolder:
actorInfo [ComponentId=‘clarin.eu:cr1:c_1396012485194’] [ref=‘lia-parser-lp’]:
actorType [ref=‘lia-parser-lp’]: organization
organizationInfo [ComponentId=‘clarin.eu:cr1:c_1407745711883’]:
organizationName: The Text Laboratory
organizationShortName: Textlab
departmentName: Department of Linguistics and Scandinavian Studies, University of Oslo
contact [ref=‘lia-parser-lp’]:
actorInfo [ComponentId=‘clarin.eu:cr1:c_1396012485194’] [ref=‘lia-parser-lp’]:
actorType: organization
organizationInfo [ComponentId=‘clarin.eu:cr1:c_1407745711883’]:
organizationName: The Text Laboratory
organizationShortName: Textlab
departmentName: Department of Linguistics and Scandinavian Studies, University of Oslo
communicationInfo [ComponentId=‘clarin.eu:cr1:c_1352813745460’]:
email: tekstlab-post@iln.uio.no
url: http://www.hf.uio.no/iln/om/organisasjon/tekstlab/
address: Box 1102 Blindern
zipCode: 0317
city: OSLO
country: Norway
metadataInfo [ComponentId=‘clarin.eu:cr1:c_1407745711922’] [ref=‘lia-parser-lp’]:
metadataCreationDate: 2024-04-08
metadataLastDateUpdated: 2024-01-10
metadataCreator [ref=‘lia-parser-lp’]:
actorInfo [ComponentId=‘clarin.eu:cr1:c_1396012485194’]:
actorType: person
personInfo [ComponentId=‘clarin.eu:cr1:c_1396012485192’]:
surname: Hagen
givenName: Kristin
organizationInfo [ComponentId=‘clarin.eu:cr1:c_1407745711883’]:
organizationName: The Text Laboratory
organizationShortName: Textlab
departmentName: Department of Linguistics and Scandinavian Studies, University of Oslo
communicationInfo [ComponentId=‘clarin.eu:cr1:c_1352813745460’]:
email: kristin.hagen@iln.uio.no
url: http://www.hf.uio.no/iln/om/organisasjon/tekstlab/
address: Box 1102 Blindern
zipCode: 0317
city: OSLO
country: Norway
validationInfo [ComponentId=‘clarin.eu:cr1:c_1407745711923’] [ref=‘lia-parser’]:
validated: true
validationModeDetails [ref=‘lia-parser’]: To quantify parsability, i.e. the quality a parser can reach when trained on the annotations of the treebank, we partitioned the treebank into n folds and performed n-fold cross-validation with n=5 (chosen given the size of the treebank):

UAS (unlabelled attachment score): 85.23
LAS (labelled attachment score): 80.01
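The evaluation scheme described above can be illustrated with a minimal sketch. It is not the evaluation script used for the LIA parser: the function names, the contiguous fold assignment, and the toy sentences are assumptions made for illustration only. In a real run, a parser would be trained on each training split and its predictions on the held-out fold scored against the gold annotations.

```python
from typing import List, Tuple

# One token is represented as (head index, dependency label).
# This representation is an assumption for illustration only.
Token = Tuple[int, str]
Sentence = List[Token]


def k_fold_splits(n_items: int, k: int) -> List[Tuple[List[int], List[int]]]:
    """Return (train_indices, test_indices) for each of k contiguous folds."""
    indices = list(range(n_items))
    # Spread the remainder over the first folds so sizes differ by at most 1.
    fold_sizes = [n_items // k + (1 if i < n_items % k else 0) for i in range(k)]
    splits = []
    start = 0
    for size in fold_sizes:
        test = indices[start:start + size]
        train = indices[:start] + indices[start + size:]
        splits.append((train, test))
        start += size
    return splits


def attachment_scores(gold: List[Sentence], pred: List[Sentence]) -> Tuple[float, float]:
    """Return (UAS, LAS) as percentages over all tokens."""
    total = uas_hits = las_hits = 0
    for gold_sent, pred_sent in zip(gold, pred):
        for (g_head, g_label), (p_head, p_label) in zip(gold_sent, pred_sent):
            total += 1
            if g_head == p_head:
                uas_hits += 1  # correct head: counts towards UAS
                if g_label == p_label:
                    las_hits += 1  # correct head and label: counts towards LAS
    if total == 0:
        return 0.0, 0.0
    return 100.0 * uas_hits / total, 100.0 * las_hits / total


if __name__ == "__main__":
    # Toy example: one sentence, three tokens, one wrong label.
    gold = [[(2, "nsubj"), (0, "root"), (2, "obj")]]
    pred = [[(2, "nsubj"), (0, "root"), (2, "obl")]]
    uas, las = attachment_scores(gold, pred)
    print(f"UAS: {uas:.2f}  LAS: {las:.2f}")
    for train, test in k_fold_splits(12, 5):
        print(len(train), len(test))
```

UAS counts tokens whose predicted head is correct; LAS additionally requires the dependency label to match, so LAS can never exceed UAS.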
resourceDocumentationInfo [ComponentId=‘clarin.eu:cr1:c_1355150532301’]:
resourceCreationInfo [ComponentId=‘clarin.eu:cr1:c_1407745711921’] [ref=‘lia-parser-lp’]:
creationStartDate: 2014
creationEndDate: 2024
resourceCreator [ref=‘lia-parser’]:
actorInfo [ComponentId=‘clarin.eu:cr1:c_1396012485194’]:
actorType: person
personInfo [ComponentId=‘clarin.eu:cr1:c_1396012485192’]:
surname: Kåsen
givenName: Andre
affiliation:
organizationInfo [ComponentId=‘clarin.eu:cr1:c_1407745711883’]:
organizationName: Nasjonalbiblioteket
organizationShortName: NB
communicationInfo [ComponentId=‘clarin.eu:cr1:c_1352813745460’]:
email: andre.kaasen@gmail.com
url: https://www.nb.no/sprakbanken/
fundingProject:
projectInfo [ComponentId=‘clarin.eu:cr1:c_1430905751647’]:
projectName [xml:lang=‘en’]: Language Infrastructure made Accessible
projectShortName [xml:lang=‘en’]: LIA
projectID: 22 59 41
url: http://www.hf.uio.no/iln/english/research/projects/language-infrastructure-made-accessible/index.html
fundingType: nationalFunds
funder: The Research Council of Norway
fundingCountry: Norway
projectStartDate: 2014-04-01
projectEndDate: 2019-04-01
toolInfo [ComponentId=‘clarin.eu:cr1:c_1422885449327’]:
description: The LIA parser is a dependency parser trained on the LIA Treebank. The parser uses UUParser (https://github.com/UppsalaNLP/uuparser), a transition-based dependency parser developed at Uppsala University.
inputInfo [ComponentId=‘clarin.eu:cr1:c_1360931019804’]:
mediaType: text
resourceType: corpus
modalityType: spokenLanguage
languageName: Norwegian
languageName: Norwegian Nynorsk
languageId: No
languageId: Nn
mimeType: txt, xml
characterEncoding: utf-8
annotationType: syntacticAnnotation-treebanks
tagset: http://www.tekstlab.uio.no/obt-ny/english/tagset.html
segmentationLevel: word
segmentationLevel: utterance
outputInfo [ComponentId=‘clarin.eu:cr1:c_1360931019824’]:
mediaType: text
resourceType: corpus
modalityType: spokenLanguage
languageName: Norwegian
languageName: Norwegian Nynorsk
languageId: No
languageId: Nn
mimeType: txt, xml
characterEncoding: utf-8
tagset: http://www.tekstlab.uio.no/obt-ny/english/tagset.html
segmentationLevel: utterance
segmentationLevel: word