Out now! We study genetic structure in large biobanks using topological data analysis via UMAP and HDBSCAN.
This approach is fast, easy to use, fits into existing pipelines, uses data you already have, and is downright fascinating.
Our preprint is on bioRxiv, and our code, with a demo, is up on GitHub: https://github.com/diazale/topstrat



