We needed a dataset that was both diverse and large-scale, so we chose the Escherichia genus. We established 2 collections: 403 natural, diverse Escherichia strains and 96 bacteriophages.
We looked in their genomes 💻🧬 for traits related to infection (receptors, capsule, defense systems…).
3/n