8K7O | pdb_00008k7o

De novo design protein -T03


Experimental Data Snapshot

  • Method: X-RAY DIFFRACTION
  • Resolution: 1.70 Å
  • R-Value Free: 
    0.246 (Depositor), 0.250 (DCC) 
  • R-Value Work: 
    0.206 (Depositor), 0.208 (DCC) 
  • R-Value Observed: 
    0.210 (Depositor) 

Starting Model: other
View more details

wwPDB Validation   3D Report Full Report


This is version 1.1 of the entry. See complete history


Literature

De novo protein design with a denoising diffusion network independent of pretrained structure prediction models.

Liu, Y.Wang, S.Dong, J.Chen, L.Wang, X.Wang, L.Li, F.Wang, C.Zhang, J.Wang, Y.Wei, S.Chen, Q.Liu, H.

(2024) Nat Methods 21: 2107-2116

  • DOI: https://doi.org/10.1038/s41592-024-02437-w
  • Primary Citation of Related Structures:  
    8K7M, 8K7O, 8K7Z, 8K83, 8K84, 8K8F, 8K8G, 8K8I, 8KA6, 8KA7, 8KAC, 8KC0, 8KC1, 8KC4, 8KC5, 8KC8, 8KCJ, 8KCK, 8KDQ, 8W97, 8WWC, 8WX8

  • PubMed Abstract: 

    The recent success of RFdiffusion, a method for protein structure design with a denoising diffusion probabilistic model, has relied on fine-tuning the RoseTTAFold structure prediction network for protein backbone denoising. Here, we introduce SCUBA-diffusion (SCUBA-D), a protein backbone denoising diffusion probabilistic model freshly trained by considering co-diffusion of sequence representation to enhance model regularization and adversarial losses to minimize data-out-of-distribution errors. While matching the performance of the pretrained RoseTTAFold-based RFdiffusion in generating experimentally realizable protein structures, SCUBA-D readily generates protein structures with not-yet-observed overall folds that are different from those predictable with RoseTTAFold. The accuracy of SCUBA-D was confirmed by the X-ray structures of 16 designed proteins and a protein complex, and by experiments validating designed heme-binding proteins and Ras-binding proteins. Our work shows that deep generative models of images or texts can be fruitfully extended to complex physical objects like protein structures by addressing outstanding issues such as the data-out-of-distribution errors.


  • Organizational Affiliation

    Department of Rheumatology and Immunology, The First Affiliated Hospital of USTC, Division of Life Sciences and Medicine, Hefei National Research Center for Physical Sciences at the Microscale, Center for Advanced Interdisciplinary Science and Biomedicine of IHM, University of Science and Technology of China, Hefei, China.


Macromolecules
Find similar proteins by:  (by identity cutoff)  |  3D Structure
Entity ID: 1
MoleculeChains Sequence LengthOrganismDetailsImage
De novo design protein233synthetic constructMutation(s): 0 
Entity Groups  
Sequence Clusters30% Identity50% Identity70% Identity90% Identity95% Identity100% Identity
Sequence Annotations
Expand
  • Reference Sequence
Experimental Data & Validation

Experimental Data

  • Method: X-RAY DIFFRACTION
  • Resolution: 1.70 Å
  • R-Value Free:  0.246 (Depositor), 0.250 (DCC) 
  • R-Value Work:  0.206 (Depositor), 0.208 (DCC) 
  • R-Value Observed: 0.210 (Depositor) 
Space Group: C 1 2 1
Unit Cell:
Length ( Å )Angle ( ˚ )
a = 86.5α = 90
b = 33.469β = 91.89
c = 71.772γ = 90
Software Package:
Software NamePurpose
PHENIXrefinement
XDSdata reduction
Aimlessdata scaling
PHENIXphasing

Structure Validation

View Full Validation Report



Entry History & Funding Information

Deposition Data


Funding OrganizationLocationGrant Number
Not funded--

Revision History  (Full details and data files)

  • Version 1.0: 2024-07-31
    Type: Initial release
  • Version 1.1: 2025-05-07
    Changes: Database references, Structure summary