Data Anonymization as a Vector Quantization Problem: Control Over Privacy for Health Data

Yoan Miche, Ian Oliver, Silke Holtmanns, Aapo Kalliola, Anton Akusok, Amaury Lendasse, Kaj-Mikael Björk

Research output: Chapter in book/report/conference proceeding, Conference contribution, Scientific, Peer-reviewed

4 Citations (Scopus)

Abstract

This paper addresses data anonymization from a vector quantization point of view. The stated goal of this work is to provide means of anonymizing data so that neither single individuals nor groups can be re-identified from a data set, while preserving data integrity and structure as much as possible (in a very specific sense). The structure of the data is first captured by clustering (with a vector quantization approach), and we propose to use the properties of this vector quantization to anonymize the data. Under some assumptions about the computations to be performed on the data, we give a framework for identifying and “pushing back outliers in the crowd”, in this clustering sense, as well as for anonymizing cluster members while preserving cluster-level statistics and structure as defined by the assumptions (density, pairwise distances, cluster shape and members, etc.).
Original language: English
Title of host publication: CD-ARES 2016: Availability, Reliability, and Security in Information Systems
Number of pages: 11
Place of publication: Cham
Publisher: Springer
Publication date: 23.08.2016
Pages: 193-203
ISBN (print): 978-3-319-45506-8
ISBN (electronic): 978-3-319-45507-5
Status: Published - 23.08.2016
MoE publication type: A4 Article in conference proceedings

Publication series

Name: Lecture Notes in Computer Science book series (LNCS)
Volume: 9817

Keywords

  • 512 Business and management
