Use of background real-world knowledge in ontologies for word sense disambiguation in the semantic web

The purpose of this thesis is to show how word sense disambiguation (WSD) can be improved with background real-world knowledge encoded in ontologies and, especially, in ontologies based on psychological considerations. Ontologies are used because conceptualized background knowledge is not available...

Bibliographic Details
Main Author: Legrand, Steve
Format: Doctoral dissertation
Language: eng
Published: 2008
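Subjects: kieliteknologia; semantiikka; yksiselitteisyys; merkitykset (semantiikka); sanat; semanttinen web; kieli ja kielet; tietojenkäsittely; tiedonsiirto; tietokonelingvistiikka; tietokoneet; ontologia (filosofia); ontologiat (tiedonhallinta); ontologia; word sense disambiguation; semantic web; datorlingvistik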
Online Access: https://jyx.jyu.fi/handle/123456789/103642
_version_ 1835762656695287808
author Legrand, Steve
author_facet Legrand, Steve
author_sort Legrand, Steve
datasource_str_mv jyx
description The purpose of this thesis is to show how word sense disambiguation (WSD) can be improved with background real-world knowledge encoded in ontologies and, especially, in ontologies based on psychological considerations. Ontologies are used because conceptualized background knowledge is not directly available to WSD systems from texts. Although it is possible to disambiguate text to some extent without ontologies, employing this kind of knowledge for WSD is of great help, especially in an environment like the Semantic Web, which has been the principal motivating factor behind this thesis. Some of the real-world knowledge that is indispensable for human understanding cannot be readily encoded in conventional ontologies either. One fundamental type of this embodied knowledge is basic-level categories. After showing that conventional ontologies can be used, with the help of self-organizing maps, to automatically group and label concepts in a text for disambiguation purposes, the thesis extends the idea to ontological structures based on basic-level categories. The thesis shows that the use of basic-level categories in WSD significantly improves accuracy. It also shows that linguistic phenomena, such as metaphoric expressions, can be manipulated structurally to reduce them to basic-level components, which can potentially be used in WSD. The approach used here proves fruitful and can serve as a starting point for designing an application that not only disambiguates using hybrid systems (including an ontological real-world component) but also selects the best applicable disambiguation system for a particular word.
first_indexed 2025-06-17T20:00:40Z
format Doctoral dissertation
fullrecord [{"key": "dc.contributor.author", "value": "Legrand, Steve", "language": null, "element": "contributor", "qualifier": "author", "schema": "dc"}, {"key": "dc.date.accessioned", "value": "2025-06-17T06:31:44Z", "language": null, "element": "date", "qualifier": "accessioned", "schema": "dc"}, {"key": "dc.date.available", "value": "2025-06-17T06:31:44Z", "language": null, "element": "date", "qualifier": "available", "schema": "dc"}, {"key": "dc.date.issued", "value": "2008", "language": null, "element": "date", "qualifier": "issued", "schema": "dc"}, {"key": "dc.identifier.isbn", "value": "978-952-86-0804-2", "language": null, "element": "identifier", "qualifier": "isbn", "schema": "dc"}, {"key": "dc.identifier.uri", "value": "https://jyx.jyu.fi/handle/123456789/103642", "language": null, "element": "identifier", "qualifier": "uri", "schema": "dc"}, {"key": "dc.description.abstract", "value": "The purpose of this thesis is to show how word sense disambiguation (WSD) can be improved with background real-world knowledge encoded in ontologies and, especially, in ontologies based on psychological considerations. Ontologies are used, because conceptualized background knowledge is not available directly, from texts, to WSD systems. Although it is possible to disambiguate text to some extent without using ontologies, employing this kind of knowledge for WSD is of great help, especially in an environment like the Semantic Web, which has been the principal motivating factor behind this thesis. Some of the real-world knowledge, which is indispensable for human understanding, cannot be readily encoded in conventional ontologies either. One of the fundamental types of this kind of embodied knowledge is basic-level categories. 
After showing that conventional ontologies can be used to automatically group and label concepts in a text for disambiguation purposes with the help of self-organizing maps, the idea is extended to ontological structures based on basic-level categories. The thesis shows that the use of basic-level categories in WSD significantly improves accuracy. It also shows that linguistic phenomena, such as metaphoric expressions, can be manipulated structurally to reduce them to basic-level components with the potential to use them in WSD. The approach used here proves fruitful and can be used as a starting point for designing an application that not only disambiguates using hybrid systems (including ontological real-world component) but also selects the best applicable disambiguation system for a particular word.", "language": "en", "element": "description", "qualifier": "abstract", "schema": "dc"}, {"key": "dc.description.provenance", "value": "Submitted by Harri Hirvi (hirvi@jyu.fi) on 2025-06-17T06:31:44Z\nNo. of bitstreams: 0", "language": "en", "element": "description", "qualifier": "provenance", "schema": "dc"}, {"key": "dc.description.provenance", "value": "Made available in DSpace on 2025-06-17T06:31:44Z (GMT). No. 
of bitstreams: 0\n Previous issue date: 2008", "language": "en", "element": "description", "qualifier": "provenance", "schema": "dc"}, {"key": "dc.format.mimetype", "value": "application/pdf", "language": null, "element": "format", "qualifier": "mimetype", "schema": "dc"}, {"key": "dc.language.iso", "value": "eng", "language": null, "element": "language", "qualifier": "iso", "schema": "dc"}, {"key": "dc.relation.ispartofseries", "value": "Jyv\u00e4skyl\u00e4 studies in computing", "language": null, "element": "relation", "qualifier": "ispartofseries", "schema": "dc"}, {"key": "dc.rights", "value": "In Copyright", "language": null, "element": "rights", "qualifier": null, "schema": "dc"}, {"key": "dc.subject.other", "value": "kieliteknologia", "language": null, "element": "subject", "qualifier": "other", "schema": "dc"}, {"key": "dc.subject.other", "value": "semantiikka", "language": null, "element": "subject", "qualifier": "other", "schema": "dc"}, {"key": "dc.subject.other", "value": "yksiselitteisyys", "language": null, "element": "subject", "qualifier": "other", "schema": "dc"}, {"key": "dc.subject.other", "value": "merkitykset (semantiikka)", "language": null, "element": "subject", "qualifier": "other", "schema": "dc"}, {"key": "dc.subject.other", "value": "sanat", "language": null, "element": "subject", "qualifier": "other", "schema": "dc"}, {"key": "dc.subject.other", "value": "semanttinen web", "language": null, "element": "subject", "qualifier": "other", "schema": "dc"}, {"key": "dc.subject.other", "value": "kieli ja kielet", "language": null, "element": "subject", "qualifier": "other", "schema": "dc"}, {"key": "dc.subject.other", "value": "tietojenk\u00e4sittely", "language": null, "element": "subject", "qualifier": "other", "schema": "dc"}, {"key": "dc.subject.other", "value": "tiedonsiirto", "language": null, "element": "subject", "qualifier": "other", "schema": "dc"}, {"key": "dc.subject.other", "value": "tietokonelingvistiikka", "language": null, 
"element": "subject", "qualifier": "other", "schema": "dc"}, {"key": "dc.subject.other", "value": "tietokoneet", "language": null, "element": "subject", "qualifier": "other", "schema": "dc"}, {"key": "dc.subject.other", "value": "ontologia (filosofia)", "language": null, "element": "subject", "qualifier": "other", "schema": "dc"}, {"key": "dc.subject.other", "value": "ontologiat (tiedonhallinta)", "language": null, "element": "subject", "qualifier": "other", "schema": "dc"}, {"key": "dc.subject.other", "value": "ontologia", "language": null, "element": "subject", "qualifier": "other", "schema": "dc"}, {"key": "dc.subject.other", "value": "word sense disambiguation", "language": null, "element": "subject", "qualifier": "other", "schema": "dc"}, {"key": "dc.subject.other", "value": "semantic web", "language": null, "element": "subject", "qualifier": "other", "schema": "dc"}, {"key": "dc.subject.other", "value": "datorlingvistik", "language": null, "element": "subject", "qualifier": "other", "schema": "dc"}, {"key": "dc.title", "value": "Use of background real-world knowledge in ontologies for world sense disambiguation in the semantic web", "language": null, "element": "title", "qualifier": null, "schema": "dc"}, {"key": "dc.type", "value": "doctoral thesis", "language": null, "element": "type", "qualifier": null, "schema": "dc"}, {"key": "dc.identifier.urn", "value": "URN:ISBN:978-952-86-0804-2", "language": null, "element": "identifier", "qualifier": "urn", "schema": "dc"}, {"key": "dc.type.coar", "value": "http://purl.org/coar/resource_type/c_db06", "language": null, "element": "type", "qualifier": "coar", "schema": "dc"}, {"key": "dc.relation.numberinseries", "value": "87", "language": null, "element": "relation", "qualifier": "numberinseries", "schema": "dc"}, {"key": "dc.rights.copyright", "value": "\u00a9 The Author & University of Jyv\u00e4skyl\u00e4", "language": null, "element": "rights", "qualifier": "copyright", "schema": "dc"}, {"key": 
"dc.rights.accesslevel", "value": "restrictedAccess", "language": null, "element": "rights", "qualifier": "accesslevel", "schema": "dc"}, {"key": "dc.type.publication", "value": "doctoralThesis", "language": null, "element": "type", "qualifier": "publication", "schema": "dc"}, {"key": "dc.format.content", "value": "fulltext", "language": null, "element": "format", "qualifier": "content", "schema": "dc"}, {"key": "dc.rights.url", "value": "https://rightsstatements.org/page/InC/1.0/", "language": null, "element": "rights", "qualifier": "url", "schema": "dc"}, {"key": "dc.rights.accessrights", "value": "Aineistoon p\u00e4\u00e4sy\u00e4 on rajoitettu tekij\u00e4noikeussyist\u00e4. Aineisto on luettavissa Jyv\u00e4skyl\u00e4n yliopiston kirjaston <a href=\"https://www.jyu.fi/fi/osc/kirjasto/tyoskentelytilat/laitteet-ja-tilat#toc-jyx-ty-asema\">arkistoty\u00f6asemalta</a>.", "language": "fi", "element": "rights", "qualifier": "accessrights", "schema": "dc"}, {"key": "dc.rights.accessrights", "value": "<br><br>This material has a restricted access due to copyright reasons. It can be read at the <a href=\"https://www.jyu.fi/fi/osc/kirjasto/tyoskentelytilat/laitteet-ja-tilat#toc-jyx-ty-asema\">workstation</a> at Jyv\u00e4skyl\u00e4 University Library reserved for the use of archival materials.", "language": "en", "element": "rights", "qualifier": "accessrights", "schema": "dc"}, {"key": "dc.date.digitised", "value": "2025", "language": null, "element": "date", "qualifier": "digitised", "schema": "dc"}, {"key": "dc.type.okm", "value": "G4", "language": null, "element": "type", "qualifier": "okm", "schema": "dc"}]
id jyx.123456789_103642
language eng
last_indexed 2025-06-17T20:00:40Z
main_date 2008-01-01T00:00:00Z
main_date_str 2008
publishDate 2008
record_format qdc
source_str_mv jyx
spellingShingle Legrand, Steve Use of background real-world knowledge in ontologies for word sense disambiguation in the semantic web kieliteknologia semantiikka yksiselitteisyys merkitykset (semantiikka) sanat semanttinen web kieli ja kielet tietojenkäsittely tiedonsiirto tietokonelingvistiikka tietokoneet ontologia (filosofia) ontologiat (tiedonhallinta) ontologia word sense disambiguation semantic web datorlingvistik
title Use of background real-world knowledge in ontologies for word sense disambiguation in the semantic web
title_full Use of background real-world knowledge in ontologies for word sense disambiguation in the semantic web
title_fullStr Use of background real-world knowledge in ontologies for word sense disambiguation in the semantic web
title_full_unstemmed Use of background real-world knowledge in ontologies for word sense disambiguation in the semantic web
title_short Use of background real-world knowledge in ontologies for word sense disambiguation in the semantic web
title_sort use of background real world knowledge in ontologies for word sense disambiguation in the semantic web
title_txtP Use of background real-world knowledge in ontologies for word sense disambiguation in the semantic web
topic kieliteknologia semantiikka yksiselitteisyys merkitykset (semantiikka) sanat semanttinen web kieli ja kielet tietojenkäsittely tiedonsiirto tietokonelingvistiikka tietokoneet ontologia (filosofia) ontologiat (tiedonhallinta) ontologia word sense disambiguation semantic web datorlingvistik
topic_facet datorlingvistik kieli ja kielet kieliteknologia merkitykset (semantiikka) ontologia ontologia (filosofia) ontologiat (tiedonhallinta) sanat semantic web semantiikka semanttinen web tiedonsiirto tietojenkäsittely tietokoneet tietokonelingvistiikka word sense disambiguation yksiselitteisyys
url https://jyx.jyu.fi/handle/123456789/103642 http://www.urn.fi/URN:ISBN:978-952-86-0804-2
work_keys_str_mv AT legrandsteve useofbackgroundrealworldknowledgeinontologiesforworldsensedisambiguationinthesemant