SHARE - Sharing data with cryptograpHic and differentiAlly pRivate guarantEes

The modern economy is largely data-driven and relies on the processing and sharing of data across organizations as a key contributor to its success. At the same time, the value, amount, and sensitivity of processed data are steadily increasing, making such data a major target of cyber-attacks. A large fraction of the many reported data breaches happened in the healthcare sector, mostly affecting privacy-sensitive data such as medical records and other patient data. This makes data security technologies a priority on the agenda of many healthcare organizations, such as the Dutch health insurance company Centraal Ziekenfonds (CZ). Practical data protection technologies are lacking in particular for secure data sharing: they mostly focus on securing the link between two organizations and are completely oblivious to what happens with the data after it has been shared. For CZ, searchable encryption (SE) technologies are of particular interest, because they allow data to be shared in encrypted form while still enabling private search on the encrypted data without decrypting it.

Unfortunately, existing efficient SE schemes completely leak the access pattern (the pattern of encrypted search results, e.g. identifiers of retrieved items) and the search pattern (the pattern of search queries, e.g. how often the same query is repeated). This makes them susceptible to leakage-abuse attacks, which exploit the leakage to recover what has been queried for and/or (parts of) the shared data itself. The SHARE project will investigate ways to reduce the leakage in searchable encryption in order to mitigate the impact of leakage-abuse attacks while keeping the performance level high enough for practical use. Concretely, we propose the construction of SE schemes that allow the leakage to be modeled as a statistic released on the queries and the shared dataset in terms of ε-differential privacy. This well-established notion informally says that, after observing the statistic, an observer learns approximately the same amount of information about an individual data item or query as if the item were not present in the dataset or the query had not been performed, where the parameter ε determines how close this approximation is. Naturally, such an approach will produce false positives and false negatives in the querying process, affecting the scheme’s performance. By calibrating the parameter ε, we can achieve various leakage-performance trade-offs tailored to the needs of specific applications. SHARE will explore the idea of differentially private leakage on different parts of SE with different search capabilities. It starts with exact-keyword-match SE schemes with differentially private leakage on the access pattern only, and moves up to schemes with differentially private leakage on the access pattern and the search pattern as well as on the shared dataset itself, allowing for more expressive query types such as fuzzy-match, range, or substring queries.
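To make the leakage-performance trade-off more concrete, the following is a minimal illustrative sketch in Python. It is not the construction proposed by SHARE. It assumes that the access pattern of a single query can be written as one bit per stored item (1 = item matches, 0 = it does not) and applies the textbook randomized-response mechanism to each bit: a bit is reported truthfully with probability e^ε / (1 + e^ε) and flipped otherwise. For two access patterns that differ in one bit, the probabilities of any observed output then differ by a factor of at most e^ε, which is the ε-differential-privacy condition Pr[M(D) ∈ S] ≤ e^ε · Pr[M(D') ∈ S] for neighboring inputs D and D'. The function names and the bit-vector representation are assumptions made for illustration only.

```python
import math
import random


def keep_probability(epsilon: float) -> float:
    """Probability of reporting a match bit truthfully under randomized response."""
    return math.exp(epsilon) / (1.0 + math.exp(epsilon))


def obfuscate_access_pattern(match_bits, epsilon, rng=None):
    """Flip each match bit independently with probability 1 - keep_probability(epsilon).

    match_bits: list of 0/1 values, one per stored item (hypothetical representation).
    A 0 flipped to 1 is a false positive; a 1 flipped to 0 is a false negative.
    """
    rng = rng or random.Random()
    p = keep_probability(epsilon)
    return [bit if rng.random() < p else 1 - bit for bit in match_bits]


def expected_error_rate(epsilon: float) -> float:
    """Expected fraction of flipped bits, i.e. the per-item error rate."""
    return 1.0 - keep_probability(epsilon)


if __name__ == "__main__":
    true_pattern = [1, 0, 0, 1, 0, 0, 0, 1]
    for eps in (0.5, 1.0, 3.0):
        noisy = obfuscate_access_pattern(true_pattern, eps, random.Random(0))
        print(f"epsilon={eps}: error rate {expected_error_rate(eps):.3f}, observed {noisy}")
```

In this sketch a small ε hides the true pattern well but makes roughly half of the reported bits wrong as ε approaches 0, whereas a large ε returns almost exact results and leaks almost the full access pattern. A practical scheme would also need to handle the search pattern and the cost of retrieving false positives, which this sketch ignores.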

SHARE comes with an attack lab in which we investigate existing and new types of leakage-abuse attacks to assess the mitigation potential of our proposed combination of differential privacy with cryptographic guarantees in searchable encryption. To stimulate commercial exploitation of SHARE results, our consortium partners CZ and TNO will take the lead on applying and evaluating our envisioned technologies in various healthcare use cases.



