Happy to report that our latest paper is released: https://doi.org/10.1021/acs.analchem.5c02832. About a program I wrote in #rust to annotate spectra with the ability to handle very complex peptide definitions (eg cross-linked, glycans, ambiguous modifications and much more) thanks to ProForma. We think these complex definitions make it functional for many people within the field of mass spec, we use it already for bottom-up, top-down, glycopeptides, XL, and de novo peptides. Additionally it supports reading in many identified peptidoform files (eg MSFragger) to easily visualise data from other sources. The program itself and the underlying libraries are all open source and centered around #ProteomicsStandardsInitiative formats (for now ProForma, mzTab, mzML, mzSpecLib, & mzPAF) for maximum interoperability.
#OpenScience #Proteomics #MassSpectrometry

Happy to announce the publication of our roadmap for #MzPeak, a truely open, performant format for storage of mass spectrometry data:
Please find the paper at https://pubs.acs.org/doi/full/10.1021/acs.jproteome.5c00435

#OpenScience #TeamMassSpec #Proteomics #ProteomicsStandardsInitiative

mzPeak: Designing a Scalable, Interoperable, and Future-Ready Mass Spectrometry Data Format

Advances in mass spectrometry (MS) instrumentation, including higher resolution, faster scan speeds, and improved sensitivity, have dramatically increased the data volume and complexity. The adoption of imaging and ion mobility further amplifies these challenges in proteomics, metabolomics, and lipidomics. Current open formats such as mzML and imzML struggle to keep pace due to large file sizes, slow data access, and limited metadata support. Vendor-specific formats offer faster access but lack interoperability and long-term archival guarantees. We here lay the groundwork for mzPeak, a next-generation community data format designed to address these challenges and support high-throughput, multidimensional MS workflows. By adopting a hybrid model that combines efficient binary storage for numerical data and both human- and machine-readable metadata storage, mzPeak will reduce file sizes, accelerate data access, and offer a scalable, adaptable solution for evolving MS technologies. For researchers, mzPeak will support complex workflows and regulatory compliance through faster access, improved metadata, and interoperability. For vendors, it offers a streamlined, open alternative to proprietary formats. mzPeak aims to become a cornerstone of MS data management, enabling sustainable, high-performance solutions for future data types and fostering collaboration across the mass spectrometry community.

ACS Publications

We just had the kickoff meeting for the #ProteomicsStandardsInitiative #AI working group yesterday. I think it went well, but we'd also like to get more input on what our scope and mandate should be (and as always more volunteers). You can watch the recording here: https://cloud.samwein.com/s/oyjG4ijeEDbiSRF

#OpenScience #TeamMassSpec #Proteomics

Aug14_Kickoff.mp4

Managed Nextcloud - powered by hosting.de

Managed Nextcloud

It was fun to meet my colleagues from the #ProteomicsStandardsInitiative again in Leiden 🇳🇱 for the PSI Spring Meeting 2023 (https://www.psidev.info/).

Among other productive sessions, we made good progress on the mzQC format for #QualityControl in biological #MassSpectrometry.

HUPO-PSI Working groups and Outputs | HUPO Proteomics Standards Initiative