#proteomics_101 Estimate the # of proteoforms (order of magnitude) for human GYPA:p, HBA1:p & PRKDC:p due to splice variants, SAVs, raggedness & S/T/N+glycosyl, S/T+phosphoryl & K+ubiquitinyl/SUMOyl/acetyl. Include p-forms caused by branched glycosyl, ubiquitinyl & SUMOyl moeities. Show your work!
I was tempted to use the notation "O(proteoform)", but since it wasn't in the notes I thought it would be a bit too obscure for most of the class.
Hint: remember that you can use gnomAD to estimate the number of SAVs associated with a gene, for example:
https://gnomad.broadinstitute.org/gene/ENSG00000206172?dataset=gnomad_r4
gnomAD

The Genome Aggregation Database (gnomAD) is a resource developed by an international coalition of investigators, with the goal of aggregating and harmonizing both exome and genome sequencing data from a wide variety of large-scale sequencing projects, and making summary data available for the wider scientific community.