1/19
If the protein or complex was purified from its native host organism:
1. I sequence the map with `model_angelo build_no_seq`.
2. I identify all proteins with `model_angelo hmm_search` against the reference proteome of the host organism (downloaded from #Uniprot).
If the protein or complex was prepared recombinantly, I already know the sequences of all proteins.
So at this stage, either way, I know which proteins are in there.
3/19
Once I'm happy, or I abandoned modeling the last remaining bits of unexplained density or fixing the last subtle model errors in places mostly irrelevant to the biological question the model is meant to answer, then I run `phenix.validation_cryoem` on the half-maps and atomic model. This gives me most of the numbers to put in the refinement table. And I deposit the model into the #PDB and maps into the #EMDB. Fin.
Happy to hear any comments or descriptions of how other people do this!
18/19
19/19
Links to the resources and programs mentioned in this thread:
https://github.com/3dem/model-angelo
https://www.cgl.ucsf.edu/chimerax/
https://github.com/tristanic/isolde
https://github.com/rsanchezgarc/deepEMhancer
https://phenix-online.org/documentation/reference/real_space_refine.html
https://github.com/keitaroyam/servalcat/
https://phenix-online.org/documentation/reference/validation_cryo_em.html