Alexander Wlodawer et al.: Duplicate entries in the Protein Data Bank: how to detect and handle them
#DataDuplication
#protein
https://journals.iucr.org/d/issues/2025/04/00/gm5112/index.html

A global analysis of protein crystal structures in the Protein Data Bank (PDB) reveals many pairs with (nearly) identical main-chain coordinates. Such cases are identified and analyzed, leading to a proposal about how the PDB could ameliorate this problem.

Acta Crystallographica Section D