An adversarial scheme for integrating multi-modal data on protein function

Guardado en:
Detalles Bibliográficos
Publicado en:bioRxiv (Jan 21, 2025)
Autor principal: Nasser, Rami
Otros Autores: Schaffer, Leah, Ideker, Trey, Sharan, Roded
Publicado:
Cold Spring Harbor Laboratory Press
Materias:
Acceso en línea:Citation/Abstract
Full text outside of ProQuest
Etiquetas: Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
Descripción
Resumen:In order to begin to decipher the structure of the cell, we need to integrate multiple types of data of different scales on subcellular organization. Such integration requires dealing with multiple data modalities and with missing data. To this end, we developed MIRAGE, a multi-modal generative model for integrating protein sequence, protein-protein interaction and protein localization data. Our approach successfully learns a joint embedding space that captures the complex relationships between these diverse modalities. We evaluate our model's performance against existing methods, obtaining superior performance in several key tasks, including protein function prediction and module detection. MIRAGE source code is available at https://github.com/raminass/MIRAGE.Competing Interest StatementThe authors have declared no competing interest.Footnotes* School name, Computer Science -> Computer Science and AI
ISSN:2692-8205
DOI:10.1101/2025.01.16.633332
Fuente:Biological Science Database