Christophe Dessimoz

115 Followers
52 Following
229 Posts
Director of the SIB Swiss Institute of Bioinformatics (@SIB) and Professor at the University of Lausanne.
SIB websitehttps://sib.swiss
Lab websitehttps://lab.dessimoz.org
QfO 2025 review: where AI, protein structure, isoforms & domains meet orthology. We outline new methods, benchmarks and integration frameworks—with applications from environmental genomics to agriculture. Open access: http://doi.org/10.1007/s00239-025-10272-6
EdgeHOG reconstructs ancestral gene order at Tree-of-Life scale in linear time. 2,845 genomes → 1,133 ancestors; contigs for LECA (~1.8 Ga). Also dates gene adjacencies & rearrangements. Sexual chromosomes stand out! Led by Charles Bernard (https://research.pasteur.fr/fr/member/charles-bernard) http://doi.org/10.1038/s41559-025-02818-0
Testing the “least-diverged ortholog” conjecture at scale: using >1M structures + expression across 16 animals & 20 plants, the LDO tends to retain ancestral function while the other copy specialises. Led by Irene Julca (https://www.linkedin.com/in/irene-julca-ph-genomics) Paper: http://doi.org/10.1101/gr.280166.124 Preprint: https://www.biorxiv.org/content/10.1101/2024.10.29.620890v1.full.pdf+html
Structure-informed phylogenetic inference with FoldTree outperforms sequence-only trees on divergent proteins, and untangles RRNPPA quorum-sensing receptors across Gram-positive bacteria & phages. Led by David Moi (https://www.linkedin.com/in/david-moi) Paper: http://doi.org/10.1038/s41594-025-01649-8

Really important points by Alex Bateman (@alexbateman1.bsky.social) on the importance of curation, addressing some of the myths out there.

Deeply resonates with the commentary Paul Thomas and I wrote last year https://www.nature.com/articles/s41597-024-03099-1

#ISMBECCB2025

Fantastic keynote by @marcrr at the joint EvolCompGen-Function session of #ISMBECCB2025 on the power of (curated & FAIR) gene expression data from @bgeedb to study functional retention and innovation across animals, summarising 15+ yrs of work

Papers listed here: https://www.unil.ch/dee/en/home/menuguid/people/group-leaders/prof-marc-robinson-rechavi.html

When the @SIB Roadshow Initiative works out, everyone wins: optimized travel, more scientific exchange, and a chance to catch top speakers across Switzerland—this time with Prof. Piyanun Harnpicharnchai on fungal communities & water quality.
/cc @saramitri

Delighted to represent @SIB at the AI-Bioscience Collaborative Summit in Washington DC, organized by the US Department of State & others.

Great to see a growing recognition of curated biodata’s critical role in AI breakthroughs, exemplified by resources like PDB and UniProt enabling AlphaFold.

Now, let’s focus on securing sustainable funding for these invaluable resources to continue advancing AI and bioscience innovation.

At #eccb2024, Alan Bridge, head of Swiss-Prot group at @SIB, presented a highlight talk on a benchmark set for curating enzyme reactions using LLMs. Also made the point that many AI initiatives are overlooking needed investments in benchmark sets. https://www.nature.com/articles/s41597-024-03835-7
EnzChemRED, a rich enzyme chemistry relation extraction dataset - Scientific Data

Expert curation is essential to capture knowledge of enzyme functions from the scientific literature in FAIR open knowledgebases but cannot keep pace with the rate of new discoveries and new publications. In this work we present EnzChemRED, for Enzyme Chemistry Relation Extraction Dataset, a new training and benchmarking dataset to support the development of Natural Language Processing (NLP) methods such as (large) language models that can assist enzyme curation. EnzChemRED consists of 1,210 expert curated PubMed abstracts where enzymes and the chemical reactions they catalyze are annotated using identifiers from the protein knowledgebase UniProtKB and the chemical ontology ChEBI. We show that fine-tuning language models with EnzChemRED significantly boosts their ability to identify proteins and chemicals in text (86.30% F1 score) and to extract the chemical conversions (86.66% F1 score) and the enzymes that catalyze those conversions (83.79% F1 score). We apply our methods to abstracts at PubMed scale to create a draft map of enzyme functions in literature to guide curation efforts in UniProtKB and the reaction knowledgebase Rhea.

Nature

For evolutionary biologists interested in tracing genomes back to their ancient ancestors, we’ve developed EdgeHOG—a tool that reconstructs ancestral gene orders at an unprecedented scale and speed. #Genomics #EvolutionaryBiology #EdgeHOG

For instance, we can identify very highly conserved adjacencies corresponding to histone clusters (arrow) or fast evolving sex chromosomes.

Check out first author Charles Bernard‘s blog post here: https://lab.dessimoz.org/blog/2024/08/30/edgehog

Preprint: https://www.biorxiv.org/content/10.1101/2024.08.28.610045v1