Mastodawn

ugly, but it works: I can run a maximum of 50 jobs with a maximum of 500 cpus on our server.

So I run 10 # gatk HaplotypeCaller in parallel in one nextflow process using `make -j 10` ===> 500 gatk HC at the same time.

https://gitlab.univ-nantes.fr/pierre.lindenbaum/gazoduc-nf/-/blob/master/modules/gatk/hapcaller2/main.nf?ref_type=heads#L81

#nextflow #gatk #make #makefile #bioinformatics #workflow

modules/gatk/hapcaller2/main.nf · master · Pierre LINDENBAUM / gazoduc-nf · GitLab

bioinformatics workflows using nextflow

GitLab

Pierre Lindenbaum Apr 18

https://github.com/WeTheAgents/markdup8x-wea
"
markdup-wea: Rust rewrite of Picard MarkDuplicates — 3.26x faster, 54x less RAM, byte-identical output on 8 ENCODE RNA-seq samples vs Picard 3.4.0. Plus nf-core/rnaseq Picard memory-tuning proposal"

#bioinformatics #gatk #rust

GitHub - WeTheAgents/markdup8x-wea: markdup-wea: Rust rewrite of Picard MarkDuplicates — 3.26x faster, 54x less RAM, byte-identical output on 8 ENCODE RNA-seq samples vs Picard 3.4.0. Plus nf-core/rnaseq Picard memory-tuning proposal.

markdup-wea: Rust rewrite of Picard MarkDuplicates — 3.26x faster, 54x less RAM, byte-identical output on 8 ENCODE RNA-seq samples vs Picard 3.4.0. Plus nf-core/rnaseq Picard memory-tuning proposal...

GitHub

Pierre Lindenbaum Sep 5, 2025

I asked #biostars : "Parabricks : Number of GPUs requested (2) is more than number of GPUs (0) in the system., exiting."
https://www.biostars.org/p/9614740/

#bioinformatics #parabricks #NVIDIA #haplotypecaller #gatk #gpus

Parabricks : Number of GPUs requested (2) is more than number of GPUs (0) in the system., exiting.

Ben Fulton Apr 7, 2025

Know what would have been cool? If Broad hadn't broken two thirds of the links in the GATK documentation when they redid the website.

#BroadInstitute #GATK

Pierre Lindenbaum Apr 6, 2025

https://xcancel.com/gatk_dev/status/1908902547521257779 "GATK Dev Team :" GATK forums will be community-driven and self-moderated. They will not be moderated or monitored by a GATK team member. We encourage members of the community to continue to engage with each other on these forums." https://gatk.broadinstitute.org/hc/en-us/community/topics

#gatk

Ben Fulton Mar 13, 2025

TIL: You can get a list of Spark-enabled GATK tools with the command

gatk --list | grep Spark

(The website doesn't seem to have a list anywhere)

#bioinformatics #GATK #ApacheSpark

Pierre Lindenbaum Jan 3, 2025

#gatk dependencies graph : https://deps.dev/maven/org.broadinstitute%3Agatk/4.6.1.0/dependencies/graph

Open Source Insights

Pierre Lindenbaum Sep 12, 2024

https://github.com/nf-core/modules/pull/6619 PR accepted for nf-core/modules/gatk/VariantsToTable #gatk 🥳

new module gatk/variants2table by lindenb · Pull Request #6619 · nf-core/modules

This modules implements gatk/variants2table ( was https://nfcore.slack.com/archives/CJRH30T6V/p1725906645564699 ) This comment contains a description of changes (with reason). If you've fixe...

GitHub

Luke Pembleton Jul 1, 2024

So after much community backlash #GATK has reverted back to the ./. representation for no-calls.

The question is, will something like this occur again? and how many people have already switched away to other tools like #bcftools.

I know I have moved all my pipelines back to bcftools and honestly its brilliant.

#genomics #bioinformatics

https://github.com/broadinstitute/gatk/releases/tag/4.6.0.0

Release 4.6.0.0 · broadinstitute/gatk

Download release: gatk-4.6.0.0.zip Docker image: https://hub.docker.com/r/broadinstitute/gatk/ Highlights of the 4.6.0.0 release: We've fixed a serious CRAM writing bug that affects GATK versions...

GitHub

Pierre Lindenbaum May 21, 2024

https://gatk.broadinstitute.org/hc/en-us/articles/6012243429531-GenotypeGVCFs-and-the-death-of-the-dot "GenotypeGVCFs and the death of the dot" #gatk