Deduplikoinnin suorituskyvystä

Deduplikointi säästää tallennustilaa. Siinä etsitään datasta identtisiä alueita, joista yksi säilytetään ja loput korvataan viitteellä tähän säilytettävään alueeseen. Tässä tutkielmassa käsiteltiin kirjallisuuteen perustuen deduplikoinnin eri osa-alueita. Erityistä huomiota kiinnitettiin deduplikoin...

Full description

Bibliographic Details
Main Author: Kaiponen, Samuel
Other Authors: Informaatioteknologian tiedekunta, Faculty of Information Technology, Informaatioteknologia, Information Technology, Jyväskylän yliopisto, University of Jyväskylä
Format: Master's thesis
Language:fin
Published: 2022
Subjects:
Online Access: https://jyx.jyu.fi/handle/123456789/81880
_version_ 1826225722345127936
author Kaiponen, Samuel
author2 Informaatioteknologian tiedekunta Faculty of Information Technology Informaatioteknologia Information Technology Jyväskylän yliopisto University of Jyväskylä
author_facet Kaiponen, Samuel Informaatioteknologian tiedekunta Faculty of Information Technology Informaatioteknologia Information Technology Jyväskylän yliopisto University of Jyväskylä Kaiponen, Samuel Informaatioteknologian tiedekunta Faculty of Information Technology Informaatioteknologia Information Technology Jyväskylän yliopisto University of Jyväskylä
author_sort Kaiponen, Samuel
datasource_str_mv jyx
description Deduplikointi säästää tallennustilaa. Siinä etsitään datasta identtisiä alueita, joista yksi säilytetään ja loput korvataan viitteellä tähän säilytettävään alueeseen. Tässä tutkielmassa käsiteltiin kirjallisuuteen perustuen deduplikoinnin eri osa-alueita. Erityistä huomiota kiinnitettiin deduplikoinnin suorituskykyyn ja sen parantamiseen. Katsauksessa selvisi, että deduplikoinnin moninaisiin sovelluskohteisiin tarvitaan hyvin erilaisia deduplikointijärjestelmiä. Niissä tasapainoillaan suorituskyvyn eri alueiden välillä: yhden alueen parantaminen heikentää usein toista. Työssä toteutettiin myös tietokoneohjelma, joka deduplikoi tiedostoja. Sen suoritusaikoja mitattiin kahden muuttujan eri arvoilla. Mittauksissa löydettiin muuttujille arvot, joilla suoritusaika oli yleisesti pienin. Deduplication saves storage space. In deduplication, data is searched for identical sections. One of these sections is stored and the rest are replaced with a reference pointing to the stored section. In this study, various aspects of deduplication were examined based on the literature. Special attention was given to the performance of deduplication and its improvement. In the review it was found that the diverse applications of deduplication require very different deduplication systems. The systems have to balance between the many aspects of performance: improving one aspect often weakens another. A computer program that deduplicates files was also implemented in this work. Its execution times were measured with different values of two variables. Values were found with which the program's execution times were generally the lowest.
first_indexed 2022-06-20T20:05:49Z
format Pro gradu
free_online_boolean 1
fullrecord [{"key": "dc.contributor.advisor", "value": "Valmari, Antti", "language": "", "element": "contributor", "qualifier": "advisor", "schema": "dc"}, {"key": "dc.contributor.author", "value": "Kaiponen, Samuel", "language": "", "element": "contributor", "qualifier": "author", "schema": "dc"}, {"key": "dc.date.accessioned", "value": "2022-06-20T07:34:28Z", "language": null, "element": "date", "qualifier": "accessioned", "schema": "dc"}, {"key": "dc.date.available", "value": "2022-06-20T07:34:28Z", "language": null, "element": "date", "qualifier": "available", "schema": "dc"}, {"key": "dc.date.issued", "value": "2022", "language": "", "element": "date", "qualifier": "issued", "schema": "dc"}, {"key": "dc.identifier.uri", "value": "https://jyx.jyu.fi/handle/123456789/81880", "language": null, "element": "identifier", "qualifier": "uri", "schema": "dc"}, {"key": "dc.description.abstract", "value": "Deduplikointi s\u00e4\u00e4st\u00e4\u00e4 tallennustilaa. Siin\u00e4 etsit\u00e4\u00e4n datasta identtisi\u00e4 alueita, joista yksi s\u00e4ilytet\u00e4\u00e4n ja loput korvataan viitteell\u00e4 t\u00e4h\u00e4n s\u00e4ilytett\u00e4v\u00e4\u00e4n alueeseen. T\u00e4ss\u00e4 tutkielmassa k\u00e4siteltiin kirjallisuuteen perustuen deduplikoinnin eri osa-alueita. Erityist\u00e4 huomiota kiinnitettiin deduplikoinnin suorituskykyyn ja sen parantamiseen. Katsauksessa selvisi, ett\u00e4 deduplikoinnin moninaisiin sovelluskohteisiin tarvitaan hyvin erilaisia deduplikointij\u00e4rjestelmi\u00e4. Niiss\u00e4 tasapainoillaan suorituskyvyn eri alueiden v\u00e4lill\u00e4: yhden alueen parantaminen heikent\u00e4\u00e4 usein toista. Ty\u00f6ss\u00e4 toteutettiin my\u00f6s tietokoneohjelma, joka deduplikoi tiedostoja. Sen suoritusaikoja mitattiin kahden muuttujan eri arvoilla. Mittauksissa l\u00f6ydettiin muuttujille arvot, joilla suoritusaika oli yleisesti pienin.", "language": "fi", "element": "description", "qualifier": "abstract", "schema": "dc"}, {"key": "dc.description.abstract", "value": "Deduplication saves storage space. In deduplication, data is searched for identical sections. One of these sections is stored and the rest are replaced with a reference pointing to the stored section. In this study, various aspects of deduplication were examined based on the literature. Special attention was given to the performance of deduplication and its improvement. In the review it was found that the diverse applications of deduplication require very different deduplication systems. The systems have to balance between the many aspects of performance: improving one aspect often weakens another. A computer program that deduplicates files was also implemented in this work. Its execution times were measured with different values of two variables. Values were found with which the program's execution times were generally the lowest.", "language": "en", "element": "description", "qualifier": "abstract", "schema": "dc"}, {"key": "dc.description.provenance", "value": "Submitted by Miia Hakanen (mihakane@jyu.fi) on 2022-06-20T07:34:28Z\nNo. of bitstreams: 0", "language": "en", "element": "description", "qualifier": "provenance", "schema": "dc"}, {"key": "dc.description.provenance", "value": "Made available in DSpace on 2022-06-20T07:34:28Z (GMT). No. of bitstreams: 0\n Previous issue date: 2022", "language": "en", "element": "description", "qualifier": "provenance", "schema": "dc"}, {"key": "dc.format.extent", "value": "75", "language": "", "element": "format", "qualifier": "extent", "schema": "dc"}, {"key": "dc.format.mimetype", "value": "application/pdf", "language": null, "element": "format", "qualifier": "mimetype", "schema": "dc"}, {"key": "dc.language.iso", "value": "fin", "language": null, "element": "language", "qualifier": "iso", "schema": "dc"}, {"key": "dc.rights", "value": "In Copyright", "language": "en", "element": "rights", "qualifier": null, "schema": "dc"}, {"key": "dc.subject.other", "value": "deduplikointi", "language": "", "element": "subject", "qualifier": "other", "schema": "dc"}, {"key": "dc.subject.other", "value": "tallennustila", "language": "", "element": "subject", "qualifier": "other", "schema": "dc"}, {"key": "dc.subject.other", "value": "tiiviste", "language": "", "element": "subject", "qualifier": "other", "schema": "dc"}, {"key": "dc.subject.other", "value": "suorituskyky", "language": "", "element": "subject", "qualifier": "other", "schema": "dc"}, {"key": "dc.title", "value": "Deduplikoinnin suorituskyvyst\u00e4", "language": "", "element": "title", "qualifier": null, "schema": "dc"}, {"key": "dc.type", "value": "master thesis", "language": null, "element": "type", "qualifier": null, "schema": "dc"}, {"key": "dc.identifier.urn", "value": "URN:NBN:fi:jyu-202206203489", "language": "", "element": "identifier", "qualifier": "urn", "schema": "dc"}, {"key": "dc.type.ontasot", "value": "Pro gradu -tutkielma", "language": "fi", "element": "type", "qualifier": "ontasot", "schema": "dc"}, {"key": "dc.type.ontasot", "value": "Master\u2019s thesis", "language": "en", "element": "type", "qualifier": "ontasot", "schema": "dc"}, {"key": "dc.contributor.faculty", "value": "Informaatioteknologian tiedekunta", "language": "fi", "element": "contributor", "qualifier": "faculty", "schema": "dc"}, {"key": "dc.contributor.faculty", "value": "Faculty of Information Technology", "language": "en", "element": "contributor", "qualifier": "faculty", "schema": "dc"}, {"key": "dc.contributor.department", "value": "Informaatioteknologia", "language": "fi", "element": "contributor", "qualifier": "department", "schema": "dc"}, {"key": "dc.contributor.department", "value": "Information Technology", "language": "en", "element": "contributor", "qualifier": "department", "schema": "dc"}, {"key": "dc.contributor.organization", "value": "Jyv\u00e4skyl\u00e4n yliopisto", "language": "fi", "element": "contributor", "qualifier": "organization", "schema": "dc"}, {"key": "dc.contributor.organization", "value": "University of Jyv\u00e4skyl\u00e4", "language": "en", "element": "contributor", "qualifier": "organization", "schema": "dc"}, {"key": "dc.subject.discipline", "value": "Tietotekniikka", "language": "fi", "element": "subject", "qualifier": "discipline", "schema": "dc"}, {"key": "dc.subject.discipline", "value": "Mathematical Information Technology", "language": "en", "element": "subject", "qualifier": "discipline", "schema": "dc"}, {"key": "yvv.contractresearch.funding", "value": "0", "language": "", "element": "contractresearch", "qualifier": "funding", "schema": "yvv"}, {"key": "dc.type.coar", "value": "http://purl.org/coar/resource_type/c_bdcc", "language": null, "element": "type", "qualifier": "coar", "schema": "dc"}, {"key": "dc.rights.accesslevel", "value": "openAccess", "language": null, "element": "rights", "qualifier": "accesslevel", "schema": "dc"}, {"key": "dc.type.publication", "value": "masterThesis", "language": null, "element": "type", "qualifier": "publication", "schema": "dc"}, {"key": "dc.subject.oppiainekoodi", "value": "602", "language": "", "element": "subject", "qualifier": "oppiainekoodi", "schema": "dc"}, {"key": "dc.subject.yso", "value": "tiedostot", "language": null, "element": "subject", "qualifier": "yso", "schema": "dc"}, {"key": "dc.subject.yso", "value": "kiintolevyt", "language": null, "element": "subject", "qualifier": "yso", "schema": "dc"}, {"key": "dc.subject.yso", "value": "tietokoneohjelmat", "language": null, "element": "subject", "qualifier": "yso", "schema": "dc"}, {"key": "dc.subject.yso", "value": "tietotekniikka", "language": null, "element": "subject", "qualifier": "yso", "schema": "dc"}, {"key": "dc.subject.yso", "value": "muistit (tietotekniikka)", "language": null, "element": "subject", "qualifier": "yso", "schema": "dc"}, {"key": "dc.subject.yso", "value": "tietojenk\u00e4sittely", "language": null, "element": "subject", "qualifier": "yso", "schema": "dc"}, {"key": "dc.subject.yso", "value": "mittaus", "language": null, "element": "subject", "qualifier": "yso", "schema": "dc"}, {"key": "dc.format.content", "value": "fulltext", "language": null, "element": "format", "qualifier": "content", "schema": "dc"}, {"key": "dc.rights.url", "value": "https://rightsstatements.org/page/InC/1.0/", "language": null, "element": "rights", "qualifier": "url", "schema": "dc"}, {"key": "dc.type.okm", "value": "G2", "language": null, "element": "type", "qualifier": "okm", "schema": "dc"}]
id jyx.123456789_81880
language fin
last_indexed 2025-02-18T10:56:05Z
main_date 2022-01-01T00:00:00Z
main_date_str 2022
online_boolean 1
online_urls_str_mv {"url":"https:\/\/jyx.jyu.fi\/bitstreams\/311f24c2-12a0-435a-a9b9-e0b08ebb5e47\/download","text":"URN:NBN:fi:jyu-202206203489.pdf","source":"jyx","mediaType":"application\/pdf"}
publishDate 2022
record_format qdc
source_str_mv jyx
spellingShingle Kaiponen, Samuel Deduplikoinnin suorituskyvystä deduplikointi tallennustila tiiviste suorituskyky Tietotekniikka Mathematical Information Technology 602 tiedostot kiintolevyt tietokoneohjelmat tietotekniikka muistit (tietotekniikka) tietojenkäsittely mittaus
title Deduplikoinnin suorituskyvystä
title_full Deduplikoinnin suorituskyvystä
title_fullStr Deduplikoinnin suorituskyvystä Deduplikoinnin suorituskyvystä
title_full_unstemmed Deduplikoinnin suorituskyvystä Deduplikoinnin suorituskyvystä
title_short Deduplikoinnin suorituskyvystä
title_sort deduplikoinnin suorituskyvystä
title_txtP Deduplikoinnin suorituskyvystä
topic deduplikointi tallennustila tiiviste suorituskyky Tietotekniikka Mathematical Information Technology 602 tiedostot kiintolevyt tietokoneohjelmat tietotekniikka muistit (tietotekniikka) tietojenkäsittely mittaus
topic_facet 602 Mathematical Information Technology Tietotekniikka deduplikointi kiintolevyt mittaus muistit (tietotekniikka) suorituskyky tallennustila tiedostot tietojenkäsittely tietokoneohjelmat tietotekniikka tiiviste
url https://jyx.jyu.fi/handle/123456789/81880 http://www.urn.fi/URN:NBN:fi:jyu-202206203489
work_keys_str_mv AT kaiponensamuel deduplikoinninsuorituskyvystä