Un-polarizing news in social media platform

A person with incorrect information on a given subject/topic mays act against his/her own best interest due to the faulty believes. This is the misinformation problem and the rise of internet and social media has only worsened the problem as false stories are spread six times quicker than the correc...

Full description

Bibliographic Details
Main Author: Le Pham, Minh Duc
Other Authors: Informaatioteknologian tiedekunta, Faculty of Information Technology, Informaatioteknologia, Information Technology, Jyväskylän yliopisto, University of Jyväskylä
Format: Master's thesis
Language:eng
Published: 2019
Subjects:
Online Access: https://jyx.jyu.fi/handle/123456789/64500
_version_ 1826225755487469568
author Le Pham, Minh Duc
author2 Informaatioteknologian tiedekunta Faculty of Information Technology Informaatioteknologia Information Technology Jyväskylän yliopisto University of Jyväskylä
author_facet Le Pham, Minh Duc Informaatioteknologian tiedekunta Faculty of Information Technology Informaatioteknologia Information Technology Jyväskylän yliopisto University of Jyväskylä Le Pham, Minh Duc Informaatioteknologian tiedekunta Faculty of Information Technology Informaatioteknologia Information Technology Jyväskylän yliopisto University of Jyväskylä
author_sort Le Pham, Minh Duc
datasource_str_mv jyx
description A person with incorrect information on a given subject/topic mays act against his/her own best interest due to the faulty believes. This is the misinformation problem and the rise of internet and social media has only worsened the problem as false stories are spread six times quicker than the correct one. Moreover, due to the nature of social platform, users unknowingly lock themselves in their own echo-chamber, amplifying news that strengthen their viewpoints while disregarding the opposition information. With the inspiration and knowledge gained from the public project: "Value from Public Health Data with Cognitive Computing project" at the University of Jyvaskylä (2017), I started this thesis with one main goal: to fight these problems concerning our modern society: misinformation, the spread of misinformation and the echo-chamber in social media platforms. By utilizing different sub-fields of Natural Language Processing (NLP) technology such as: Sentiment Analysis, Named Entity Recognition (NER) and Open Information Extraction (OIE), I created two hypotheses with two different approaches to suggest articles with different points of view to any given article. The main emphasis is that, by showing various news documents from diverse perspectives, a person gets a possibility to identify and discard the misinformation as well as crushing his/her own echo-chamber due to the exposure to the "other sides". With a handcrafted evaluation database and benchmarks, I develop two prototypes to test the correctness and rigidity of our hypotheses. The first approach: the "Sentiment-based" solution achieves a satisfactory benchmark level by finding articles with similar topic/subject to the comparing article as well as suggesting ones with different sentiments/attitudes (negative, positive, neutral) using Sentiment Analysis and NER. The second approach: the "Statement/Triples-based" solution, by suggesting articles with relating or contradicting facts in the form of semantic-triples using OIE and NER, while fails our evaluation tests due to technical issues, has some convincing evident of a promising solution that can reliably detect contradictions spanning throughout multiple news sources. Thus, with a successful solution and many captivating findings, I hope that with the works described below, I could contribute to help battling the echo-chamber and misinformation as well as inspire other scholars and companies to do the same: help creating a better world.
first_indexed 2019-08-19T08:21:24Z
format Pro gradu
free_online_boolean 1
fullrecord [{"key": "dc.contributor.advisor", "value": "Oleksiy, Oleksiy", "language": "", "element": "contributor", "qualifier": "advisor", "schema": "dc"}, {"key": "dc.contributor.author", "value": "Le Pham, Minh Duc", "language": "", "element": "contributor", "qualifier": "author", "schema": "dc"}, {"key": "dc.date.accessioned", "value": "2019-06-10T12:08:20Z", "language": null, "element": "date", "qualifier": "accessioned", "schema": "dc"}, {"key": "dc.date.available", "value": "2019-06-10T12:08:20Z", "language": null, "element": "date", "qualifier": "available", "schema": "dc"}, {"key": "dc.date.issued", "value": "2019", "language": "", "element": "date", "qualifier": "issued", "schema": "dc"}, {"key": "dc.identifier.uri", "value": "https://jyx.jyu.fi/handle/123456789/64500", "language": null, "element": "identifier", "qualifier": "uri", "schema": "dc"}, {"key": "dc.description.abstract", "value": "A person with incorrect information on a given subject/topic mays act against his/her own best interest due to the faulty believes. This is the misinformation problem and the rise of internet and social media has only worsened the problem as false stories are spread six times quicker than the correct one. Moreover, due to the nature of social platform, users unknowingly lock themselves in their own echo-chamber, amplifying news that strengthen their viewpoints while disregarding the opposition information. With the inspiration and knowledge gained from the public project: \"Value from Public Health Data with Cognitive Computing project\" at the University of Jyvaskyla\u0308 (2017), I started this thesis with one main goal: to fight these problems concerning our modern society: misinformation, the spread of misinformation and the echo-chamber in social media platforms. By utilizing different sub-fields of Natural Language Processing (NLP) technology such as: Sentiment Analysis, Named Entity Recognition (NER) and Open Information Extraction (OIE), I created two hypotheses with two different approaches to suggest articles with different points of view to any given article. The main emphasis is that, by showing various news documents from diverse perspectives, a person gets a possibility to identify and discard the misinformation as well as crushing his/her own echo-chamber due to the exposure to the \"other sides\".\n\nWith a handcrafted evaluation database and benchmarks, I develop two prototypes to test the correctness and rigidity of our hypotheses. The first approach: the \"Sentiment-based\" solution achieves a satisfactory benchmark level by finding articles with similar topic/subject to the comparing article as well as suggesting ones with different sentiments/attitudes (negative, positive, neutral) using Sentiment Analysis and NER. The second approach: the \"Statement/Triples-based\" solution, by suggesting articles with relating or contradicting facts in the form of semantic-triples using OIE and NER, while fails our evaluation tests due to technical issues, has some convincing evident of a promising solution that can reliably detect contradictions spanning throughout multiple news sources. Thus, with a successful solution and many captivating findings, I hope that with the works described below, I could contribute to help battling the echo-chamber and misinformation as well as inspire other scholars and companies to do the same: help creating a better world.", "language": "en", "element": "description", "qualifier": "abstract", "schema": "dc"}, {"key": "dc.description.provenance", "value": "Submitted by Miia Hakanen (mihakane@jyu.fi) on 2019-06-10T12:08:20Z\nNo. of bitstreams: 0", "language": "en", "element": "description", "qualifier": "provenance", "schema": "dc"}, {"key": "dc.description.provenance", "value": "Made available in DSpace on 2019-06-10T12:08:20Z (GMT). No. of bitstreams: 0\n Previous issue date: 2019", "language": "en", "element": "description", "qualifier": "provenance", "schema": "dc"}, {"key": "dc.format.extent", "value": "88", "language": "", "element": "format", "qualifier": "extent", "schema": "dc"}, {"key": "dc.format.mimetype", "value": "application/pdf", "language": null, "element": "format", "qualifier": "mimetype", "schema": "dc"}, {"key": "dc.language.iso", "value": "eng", "language": null, "element": "language", "qualifier": "iso", "schema": "dc"}, {"key": "dc.rights", "value": "In Copyright", "language": "en", "element": "rights", "qualifier": null, "schema": "dc"}, {"key": "dc.subject.other", "value": "Natural language processing", "language": "", "element": "subject", "qualifier": "other", "schema": "dc"}, {"key": "dc.subject.other", "value": "Sentiment Analysis", "language": "", "element": "subject", "qualifier": "other", "schema": "dc"}, {"key": "dc.subject.other", "value": "Named Entity Recognition", "language": "", "element": "subject", "qualifier": "other", "schema": "dc"}, {"key": "dc.subject.other", "value": "Open Information Extraction", "language": "", "element": "subject", "qualifier": "other", "schema": "dc"}, {"key": "dc.subject.other", "value": "Social media", "language": "", "element": "subject", "qualifier": "other", "schema": "dc"}, {"key": "dc.subject.other", "value": "Misinformation", "language": "", "element": "subject", "qualifier": "other", "schema": "dc"}, {"key": "dc.subject.other", "value": "Echo-chamber", "language": "", "element": "subject", "qualifier": "other", "schema": "dc"}, {"key": "dc.title", "value": "Un-polarizing news in social media platform", "language": "", "element": "title", "qualifier": null, "schema": "dc"}, {"key": "dc.type", "value": "master thesis", "language": null, "element": "type", "qualifier": null, "schema": "dc"}, {"key": "dc.identifier.urn", "value": "URN:NBN:fi:jyu-201906103112", "language": "", "element": "identifier", "qualifier": "urn", "schema": "dc"}, {"key": "dc.type.ontasot", "value": "Pro gradu -tutkielma", "language": "fi", "element": "type", "qualifier": "ontasot", "schema": "dc"}, {"key": "dc.type.ontasot", "value": "Master\u2019s thesis", "language": "en", "element": "type", "qualifier": "ontasot", "schema": "dc"}, {"key": "dc.contributor.faculty", "value": "Informaatioteknologian tiedekunta", "language": "fi", "element": "contributor", "qualifier": "faculty", "schema": "dc"}, {"key": "dc.contributor.faculty", "value": "Faculty of Information Technology", "language": "en", "element": "contributor", "qualifier": "faculty", "schema": "dc"}, {"key": "dc.contributor.department", "value": "Informaatioteknologia", "language": "fi", "element": "contributor", "qualifier": "department", "schema": "dc"}, {"key": "dc.contributor.department", "value": "Information Technology", "language": "en", "element": "contributor", "qualifier": "department", "schema": "dc"}, {"key": "dc.contributor.organization", "value": "Jyv\u00e4skyl\u00e4n yliopisto", "language": "fi", "element": "contributor", "qualifier": "organization", "schema": "dc"}, {"key": "dc.contributor.organization", "value": "University of Jyv\u00e4skyl\u00e4", "language": "en", "element": "contributor", "qualifier": "organization", "schema": "dc"}, {"key": "dc.subject.discipline", "value": "Kognitiotiede", "language": "fi", "element": "subject", "qualifier": "discipline", "schema": "dc"}, {"key": "dc.subject.discipline", "value": "Cognitive Science", "language": "en", "element": "subject", "qualifier": "discipline", "schema": "dc"}, {"key": "yvv.contractresearch.funding", "value": "0", "language": "", "element": "contractresearch", "qualifier": "funding", "schema": "yvv"}, {"key": "dc.type.coar", "value": "http://purl.org/coar/resource_type/c_bdcc", "language": null, "element": "type", "qualifier": "coar", "schema": "dc"}, {"key": "dc.rights.accesslevel", "value": "openAccess", "language": null, "element": "rights", "qualifier": "accesslevel", "schema": "dc"}, {"key": "dc.type.publication", "value": "masterThesis", "language": null, "element": "type", "qualifier": "publication", "schema": "dc"}, {"key": "dc.subject.oppiainekoodi", "value": "601", "language": "", "element": "subject", "qualifier": "oppiainekoodi", "schema": "dc"}, {"key": "dc.format.content", "value": "fulltext", "language": null, "element": "format", "qualifier": "content", "schema": "dc"}, {"key": "dc.rights.url", "value": "https://rightsstatements.org/page/InC/1.0/", "language": null, "element": "rights", "qualifier": "url", "schema": "dc"}, {"key": "dc.type.okm", "value": "G2", "language": null, "element": "type", "qualifier": "okm", "schema": "dc"}]
id jyx.123456789_64500
language eng
last_indexed 2025-02-18T10:55:04Z
main_date 2019-01-01T00:00:00Z
main_date_str 2019
online_boolean 1
online_urls_str_mv {"url":"https:\/\/jyx.jyu.fi\/bitstreams\/ef75422b-2d4e-4fde-992d-d6693cdbc20f\/download","text":"URN:NBN:fi:jyu-201906103112.pdf","source":"jyx","mediaType":"application\/pdf"}
publishDate 2019
record_format qdc
source_str_mv jyx
spellingShingle Le Pham, Minh Duc Un-polarizing news in social media platform Natural language processing Sentiment Analysis Named Entity Recognition Open Information Extraction Social media Misinformation Echo-chamber Kognitiotiede Cognitive Science 601
title Un-polarizing news in social media platform
title_full Un-polarizing news in social media platform
title_fullStr Un-polarizing news in social media platform Un-polarizing news in social media platform
title_full_unstemmed Un-polarizing news in social media platform Un-polarizing news in social media platform
title_short Un-polarizing news in social media platform
title_sort un polarizing news in social media platform
title_txtP Un-polarizing news in social media platform
topic Natural language processing Sentiment Analysis Named Entity Recognition Open Information Extraction Social media Misinformation Echo-chamber Kognitiotiede Cognitive Science 601
topic_facet 601 Cognitive Science Echo-chamber Kognitiotiede Misinformation Named Entity Recognition Natural language processing Open Information Extraction Sentiment Analysis Social media
url https://jyx.jyu.fi/handle/123456789/64500 http://www.urn.fi/URN:NBN:fi:jyu-201906103112
work_keys_str_mv AT lephamminhduc unpolarizingnewsinsocialmediaplatform