TU Darmstadt / ULB / TUbiblio

False Discovery Rate Control for Fast Screening of Large-Scale Genomics Biobanks

Machkour, Jasin ; Muma, Michael ; Palomar, Daniel P. (2023)
False Discovery Rate Control for Fast Screening of Large-Scale Genomics Biobanks.
22nd IEEE Statistical Signal Processing Workshop. Hanoi, Vietnam (02.07. - 05.07.2023)
doi: 10.1109/SSP53291.2023.10207957
Konferenzveröffentlichung, Bibliographie

Kurzbeschreibung (Abstract)

Genomics biobanks are information treasure troves with thousands of phenotypes (e.g., diseases, traits) and millions of single nucleotide polymorphisms (SNPs). The development of methodologies that provide reproducible discoveries is essential for the understanding of complex diseases and precision drug development. Without statistical reproducibility guarantees, valuable efforts are spent on researching false positives. Therefore, scalable multivariate and high-dimensional false discovery rate (FDR)-controlling variable selection methods are urgently needed, especially, for complex polygenic diseases and traits. In this work, we propose the Screen-T-Rex selector, a fast FDR-controlling method based on the recently developed T-Rex selector. The method is tailored to screening large-scale biobanks and it does not require choosing additional parameters (sparsity parameter, target FDR level, etc). Numerical simulations and a real-world HIV-1 drug resistance example demonstrate that the performance of the Screen-T-Rex selector is superior, and its computation time is multiple orders of magnitude lower compared to current benchmark knockoff methods.

Typ des Eintrags: Konferenzveröffentlichung
Erschienen: 2023
Autor(en): Machkour, Jasin ; Muma, Michael ; Palomar, Daniel P.
Art des Eintrags: Bibliographie
Titel: False Discovery Rate Control for Fast Screening of Large-Scale Genomics Biobanks
Sprache: Englisch
Publikationsjahr: 9 August 2023
Ort: Piscataway, NY
Verlag: IEEE
Buchtitel: Proceedings of the 22nd IEEE Statistical Signal Processing Workshop (SSP 2023)
Veranstaltungstitel: 22nd IEEE Statistical Signal Processing Workshop
Veranstaltungsort: Hanoi, Vietnam
Veranstaltungsdatum: 02.07. - 05.07.2023
DOI: 10.1109/SSP53291.2023.10207957
Kurzbeschreibung (Abstract):

Genomics biobanks are information treasure troves with thousands of phenotypes (e.g., diseases, traits) and millions of single nucleotide polymorphisms (SNPs). The development of methodologies that provide reproducible discoveries is essential for the understanding of complex diseases and precision drug development. Without statistical reproducibility guarantees, valuable efforts are spent on researching false positives. Therefore, scalable multivariate and high-dimensional false discovery rate (FDR)-controlling variable selection methods are urgently needed, especially, for complex polygenic diseases and traits. In this work, we propose the Screen-T-Rex selector, a fast FDR-controlling method based on the recently developed T-Rex selector. The method is tailored to screening large-scale biobanks and it does not require choosing additional parameters (sparsity parameter, target FDR level, etc). Numerical simulations and a real-world HIV-1 drug resistance example demonstrate that the performance of the Screen-T-Rex selector is superior, and its computation time is multiple orders of magnitude lower compared to current benchmark knockoff methods.

Fachbereich(e)/-gebiet(e): 18 Fachbereich Elektrotechnik und Informationstechnik
18 Fachbereich Elektrotechnik und Informationstechnik > Institut für Nachrichtentechnik
18 Fachbereich Elektrotechnik und Informationstechnik > Institut für Nachrichtentechnik > Robust Data Science
LOEWE
LOEWE > LOEWE-Zentren
LOEWE > LOEWE-Zentren > emergenCITY
Zentrale Einrichtungen
Zentrale Einrichtungen > Hochschulrechenzentrum (HRZ)
Zentrale Einrichtungen > Hochschulrechenzentrum (HRZ) > Hochleistungsrechner
Hinterlegungsdatum: 03 Apr 2024 11:32
Letzte Änderung: 26 Jun 2024 09:48
PPN: 519389212
Export:
Suche nach Titel in: TUfind oder in Google
Frage zum Eintrag Frage zum Eintrag

Optionen (nur für Redakteure)
Redaktionelle Details anzeigen Redaktionelle Details anzeigen