Interested in the genetic secrets of your favorite plant?

A genome sequence might reveal them. We wrote down a workflow from DNA extraction & sequencing to genome sequence assembly & annotation. It includes a hands-on guide with example commands.

https://doi.org/10.1186/s12864-026-12623-z

#Genomics #Bioinformatics
@samnm
@boas_pucker

@PuckerLab Love this kind of end-to-end guide. One thing that would be great to see included: how assembly quality differs between long-read (ONT/PacBio) vs short-read approaches for polyploid genomes. The gap between a draft assembly and something annotation-ready is where most people get stuck.
@deepdna_ai Thanks for your comments. Why would you even try to sequence a polyploid genome with short-reads? It should be obvious that long-reads are superior. There is already a comparison by Marks et al., 2021 looking at different assembly metrics.
@PuckerLab Fair point for de novo assembly — long-reads win there, no question. I was thinking more broadly: a lot of polyploid genomics still relies on short-reads for population resequencing (hundreds of wheat/rapeseed accessions), RNA-seq for homeolog expression, variant calling against a reference. Cost is real when you need 30x HiFi on a 16 Gb hexaploid across dozens of samples. Marks et al. is great for assembly metrics, but assembly isn't the whole picture in polyploid research.
@PuckerLab Great workflow overview! The gap between raw reads and a publishable assembly has shrunk dramatically. ONT and PacBio HiFi now give near chromosome-level assemblies with fairly straightforward pipelines. The real bottleneck has shifted from assembly to annotation and functional interpretation.