TU Darmstadt / ULB / TUbiblio

Mainzelliste SecureEpiLinker (MainSEL): privacy-preserving record linkage using secure multi-party computation

Stammler, Sebastian ; Kussel, Tobias ; Schoppmann, Phillipp ; Stampe, Florian ; Tremper, Galina ; Katzenbeisser, Stefan ; Hamacher, Kay ; Lablans, Martin (2022)
Mainzelliste SecureEpiLinker (MainSEL): privacy-preserving record linkage using secure multi-party computation.
In: Bioinformatics (Oxford, England), 38 (6)
doi: 10.1093/bioinformatics/btaa764
Artikel, Bibliographie

Kurzbeschreibung (Abstract)

MOTIVATION

Record Linkage has versatile applications in real-world data analysis contexts, where several datasets need to be linked on the record level in the absence of any exact identifier connecting related records. An example are medical databases of patients, spread across institutions, that have to be linked on personally identifiable entries like name, date of birth or ZIP code. At the same time, privacy laws may prohibit the exchange of this personally identifiable information (PII) across institutional boundaries, ruling out the outsourcing of the record linkage task to a trusted third party. We propose to employ privacy-preserving record linkage (PPRL) techniques that prevent, to various degrees, the leakage of PII while still allowing for the linkage of related records.

RESULTS

We develop a framework for fault-tolerant PPRL using secure multi-party computation with the medical record keeping software Mainzelliste as the data source. Our solution does not rely on any trusted third party and all PII is guaranteed to not leak under common cryptographic security assumptions. Benchmarks show the feasibility of our approach in realistic networking settings: linkage of a patient record against a database of 10 000 records can be done in 48 s over a heavily delayed (100 ms) network connection, or 3.9 s with a low-latency connection.

AVAILABILITY AND IMPLEMENTATION

The source code of the sMPC node is freely available on Github at https://github.com/medicalinformatics/SecureEpilinker subject to the AGPLv3 license. The source code of the modified Mainzelliste is available at https://github.com/medicalinformatics/MainzellisteSEL.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

Typ des Eintrags: Artikel
Erschienen: 2022
Autor(en): Stammler, Sebastian ; Kussel, Tobias ; Schoppmann, Phillipp ; Stampe, Florian ; Tremper, Galina ; Katzenbeisser, Stefan ; Hamacher, Kay ; Lablans, Martin
Art des Eintrags: Bibliographie
Titel: Mainzelliste SecureEpiLinker (MainSEL): privacy-preserving record linkage using secure multi-party computation
Sprache: Englisch
Publikationsjahr: 4 März 2022
Titel der Zeitschrift, Zeitung oder Schriftenreihe: Bioinformatics (Oxford, England)
Jahrgang/Volume einer Zeitschrift: 38
(Heft-)Nummer: 6
DOI: 10.1093/bioinformatics/btaa764
Kurzbeschreibung (Abstract):

MOTIVATION

Record Linkage has versatile applications in real-world data analysis contexts, where several datasets need to be linked on the record level in the absence of any exact identifier connecting related records. An example are medical databases of patients, spread across institutions, that have to be linked on personally identifiable entries like name, date of birth or ZIP code. At the same time, privacy laws may prohibit the exchange of this personally identifiable information (PII) across institutional boundaries, ruling out the outsourcing of the record linkage task to a trusted third party. We propose to employ privacy-preserving record linkage (PPRL) techniques that prevent, to various degrees, the leakage of PII while still allowing for the linkage of related records.

RESULTS

We develop a framework for fault-tolerant PPRL using secure multi-party computation with the medical record keeping software Mainzelliste as the data source. Our solution does not rely on any trusted third party and all PII is guaranteed to not leak under common cryptographic security assumptions. Benchmarks show the feasibility of our approach in realistic networking settings: linkage of a patient record against a database of 10 000 records can be done in 48 s over a heavily delayed (100 ms) network connection, or 3.9 s with a low-latency connection.

AVAILABILITY AND IMPLEMENTATION

The source code of the sMPC node is freely available on Github at https://github.com/medicalinformatics/SecureEpilinker subject to the AGPLv3 license. The source code of the modified Mainzelliste is available at https://github.com/medicalinformatics/MainzellisteSEL.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

ID-Nummer: pmid:32871006
Fachbereich(e)/-gebiet(e): 10 Fachbereich Biologie
10 Fachbereich Biologie > Computational Biology and Simulation
Hinterlegungsdatum: 19 Apr 2022 06:13
Letzte Änderung: 19 Apr 2022 06:13
PPN:
Export:
Suche nach Titel in: TUfind oder in Google
Frage zum Eintrag Frage zum Eintrag

Optionen (nur für Redakteure)
Redaktionelle Details anzeigen Redaktionelle Details anzeigen