Valverde et al. analyzed 15 years of Gene Ontology evolution, finding that GO changes not just incrementally, but also via major, curator-driven restructurings, underscoring the need for version-aware, multi-layer models to ensure reproducibility and interpretability.
🔗 doi.org/10.1093/molbev/msaf148
Current me: Ugh, human GO terms are annotated with Uniprot, current data has Entrez, when I generate the GO annotation object I'll have to convert as part of the process.
Past me: Human GO terms are annotated with Uniprot, when we ingest this data, we should be able to provide a mapping table so that the final GO annotation object uses whatever gene IDs we want.
Current me: Thanks past me! Glad I looked at the help and the code for that function first!
🧬 Can we map the evolutionary history of plant genes through network analysis?
🔗 Hayai-Annotation: A functional gene prediction tool that integrates orthologs and gene ontology for network analysis in plant species. Computational and Structural Biotechnology Journal, DOI: https://doi.org/10.1016/j.csbj.2024.12.011
📚 CSBJ: https://www.csbj.org/
#PlantGenomics #GeneAnnotation #Bioinformatics #Orthologs #RShiny #GeneOntology #ComparativeGenomics #NetworkBiology #Genetics
Abstract Genic codominant multiallelic markers are essential to identify the genetic variation, population diversity and evolutionary history of a species. Soybean (Glycine max) is a major legume crop having importance in both a protein-rich pulse as well as a high recovery oilseed crop. To date, no genome-wide genic SSR markers have been elucidated in this crop of high importance. This article aims to identify and validate regulatory gene-derived SSR markers in soybean. The coding sequences of Glycine max were downloaded from PlantTFDB and used for the identification, followed by the localization of SSRs by using a Perl 5 script (MISA, MIcroSAtellite identification tool). The flanking primers to SSRs were designed and chromosomal distribution and Gene ontology searches were performed using BLAST2GO. Twenty random SSR markers were validated to check cross-species transferability and genetic diversity study was performed. A set of 1138 simple sequence repeat markers from transcription factor coding genes were designed and designated as TF-derived SSR markers. They were anchored on 20 G. max chromosomes, and the SSR motif frequency was one per 4.64 kb. Trinucleotide repeats were found abundant and tetra, as well as pentanucleotide frequency, was least in soybean. Gene Ontology search revealed the diverse role of SSR-containing TFs in soybean. Eight soybean accessions were analyzed for identified twenty candidates for genic SSR diversification, and a principal co-ordinate analysis, a genic dissimilarity-based unweighted neighbour-joining tree, was constructed. Our findings will serve as a potential functional marker resource for marker-assisted selection and genomic characterization of soybean.
Working with #GeneOntology annotations today for a #bioinformatics analysis of cytokines (we'll see what actually turns up when you just measure 225 cytokines).
So, remap genes to terms using our {gocats} python library, and then read in the new annotations for use in {categoryCompare2}.
Somehow, I always forget that I wrote a function specifically for importing the {gocats} generated stuff into the annotation format. 🤦♂️
https://github.com/MoseleyBioinformaticsLab/categoryCompare2/blob/main/R/gocats.R#L13
Happy #DNADay2024!
Today's @AllianceGenome forum post highlights some of the close ties between the Human Genome Project and the #GeneOntology: both Dr. David Botstein and Dr. Paul Thomas.
https://community.alliancegenome.org/t/happy-dna-day-close-ties-between-the-hgp-go/7642
DNA Day is a celebration of the work of Franklin, Wilkins, Crick & Watson, published on this day in 1953. April is also the month the Human Genome Project (HGP) was declared complete, in 2003. But did you know that several of the people responsible for the HGP are also deeply involved in GO? David Botstein et al. laid down the framework for constructing genetic linkage maps using restriction fragment length polymorphisms in 1980. That work helped enable sequencing for the HGP. Some years late...
As part of a FlyBase and GO consortium project focused on ncRNAs, we have created GO terms to annotate primary GO:0140990 and secondary GO:0140965 piRNA processing, piRNA-mediated retrotransposon silencing by heterochromatin formation GO:0141006 and piRNA-mediated retrotransposon silencing by mRNA destabilization GO:0141009. We welcome any feedback to improve the coverage and accuracy of these annotations.