Show simple item record

dc.contributorOdijk, Jan
dc.contributorMonachesi, Paola
dc.contributor.authorWesterhout, E.N.
dc.date.accessioned2019-10-24T04:16:59Z
dc.date.available2019-10-24T04:16:59Z
dc.date.created2017-01-05 00:57
dc.date.issued2010-07-02
dc.identifieroai:dspace.library.uu.nl:1874/44744
dc.identifierhttp://dspace.library.uu.nl:8080/handle/1874/44744
dc.identifier.urihttp://hdl.handle.net/20.500.12424/837509
dc.description.abstractThe central topic of this thesis is the automatic extraction of definitions from text. Definition extraction can play a role in various applications including the semi-automatic development of glossaries in an eLearning context, which constitutes the main focus of this dissertation. A glossary provides definitions for the most important terms that are discussed in a text. The semi-automatic extraction approach presented in this study consists of two phases. As a first step, a method entirely based on lexico-syntactic patterns has been used to distinguish between definitions and non-definitions. A corpus consisting of 600 definitions has been employed to identify recurrent definition patterns. Since many of these patterns are not unique to definitions, a second step was employed to reduce the number of non-definitions identified. It has been investigated whether other textual characteristics can contribute to the correct classification of definitions, in addition to the lexico-syntactic patterns. The properties that have been examined vary from the importance of the defined word (phrase) within a text to the layout of the definition. Machine learning techniques have been employed to identify which are the most relevant (combinations of) definition properties. The results of this dissertation are relevant for researchers in linguistics and lexicography as well as for the development of language technology applications
dc.format.mediumtext/plain
dc.languageother
dc.publisherLOT
dc.rightsinfo:eu-repo/semantics/OpenAccess
dc.titleDefinition extraction for glossary creation : a study on extracting definitions for semi-automatic glossary creation in Dutch
dc.typeDissertation
ge.collectioncodeOAIDATA
ge.dataimportlabelOAI metadata object
ge.identifier.legacyglobethics:10434336
ge.identifier.permalinkhttps://www.globethics.net/gel/10434336
ge.lastmodificationdate2017-01-05 00:57
ge.lastmodificationuseradmin@pointsoftware.ch (import)
ge.submissions0
ge.oai.exportid148934
ge.oai.repositoryid1826
ge.oai.setnameUtrecht University Repository
ge.oai.setnameUtrecht University Repository
ge.oai.setspeccom_1874_296827
ge.oai.setspeccol_1874_296828
ge.oai.streamid2
ge.setnameGlobeEthicsLib
ge.setspecglobeethicslib
ge.linkhttp://dspace.library.uu.nl:8080/handle/1874/44744


This item appears in the following Collection(s)

Show simple item record