Looking for people who are analyzing large amounts (TBs ?) of genomic data using k-mer methods. I'm developing some new tools and would really benefit from feedback and testing.
@jtnystrom Do you have a public repo for it?
@baris it's going to be in the next version of https://github.com/jtnystrom/discount - happy to share a preview with you if you want to take it for a spin.
GitHub - jtnystrom/Discount: Distributed k-mer counting and analysis on Apache Spark.

Distributed k-mer counting and analysis on Apache Spark. - GitHub - jtnystrom/Discount: Distributed k-mer counting and analysis on Apache Spark.

GitHub