Тёмный
No video :(

Protein generation with evolutionary diffusion: sequence is all you need 

ML for protein engineering seminar series
Подписаться 3,9 тыс.
Просмотров 4,3 тыс.
50% 1

Protein generation with evolutionary diffusion: sequence is all you need
Tuesday September 19th, 4-5 pm EST | Kevin Yang, PhD - Senior Researcher, Microsoft
Abstract: Deep generative models are increasingly powerful tools for the in silico design of novel proteins. Recently, a family of generative models called diffusion models has demonstrated the ability to generate biologically plausible proteins that are dissimilar to any actual proteins seen in nature, enabling unprecedented capability and control in de novo protein design. However, current state-of-the-art models generate protein structures, which severely limits the scope of their training data and restricts generations to a small and biased subset of protein design space. Here, we introduce a general-purpose diffusion framework, EvoDiff, that combines evolutionary-scale data with the distinct conditioning capabilities of diffusion models for controllable protein generation in sequence space. EvoDiff generates high-fidelity, diverse, and structurally-plausible proteins that cover natural sequence and functional space. Critically, EvoDiff can generate proteins inaccessible to structure-based models, such as those with disordered regions, while maintaining the ability to design scaffolds for functional structural motifs, demonstrating the universality of our sequence-based formulation. We envision that EvoDiff will expand capabilities in protein engineering beyond the structure-function paradigm toward programmable, sequence-first design.
Preprint: www.biorxiv.or...

Опубликовано:

 

28 авг 2024

Поделиться:

Ссылка:

Скачать:

Готовим ссылку...

Добавить в:

Мой плейлист
Посмотреть позже
Комментарии : 4   
@trevy5273
@trevy5273 10 месяцев назад
Many thanks for this summery, very helpful!
@lo8885
@lo8885 10 месяцев назад
In your opinion the viewers : In what sense would it be interesting to train EvoDiff-seq (without modifying its architecture) on the PDB dataset of 200K sequences, these sequences that have been used to train RFDiffusion and the state of the art structure based generative models.
@jakeparker1287
@jakeparker1287 10 месяцев назад
Not very. The whole point is to leverage the huge sequence-only datasets available, of which the pdb is a very small subset.
@castilloh.gianmarco1048
@castilloh.gianmarco1048 8 месяцев назад
niceee!
Далее
New Dyna Skin is OP🥵🔥 | Brawl Stars
00:16
Просмотров 375 тыс.
Why Does Diffusion Work Better than Auto-Regression?
20:18
John Jumper: "Structure Prediction with AlphaFold"
18:58