🎙️ Join Franciszek Job at EuroSciPy as he presents a scalable framework to unify chemical datasets from sources like PubChem, UniChem & COCONUT.
💻 Canonicalize with RDKit
⚡ Scale via Dask
🔁 Deduplicate with InChIKeys
Ideal for ML pretraining, benchmarking, and chemical data analysis.
📅 Schedule: https://lnkd.in/eaAxwUN2
🎟️ Tickets: https://lnkd.in/end9aYzE
#EuroSciPy #cheminformatics #rdkit #dask #openscience #molecularML