Querying 3 billion vectors

Requirements are hard

Pick an embedding model that supports binary quantization and then use a SIMD-optimized Hamming Distance function. I'm doing this for Scour and doing about 1.6 billion comparisons per second.

https://scour.ing

https://emschwartz.me/binary-vector-embeddings-are-so-cool/

Scour

Scour interesting reads from noisy feeds you can't keep up with and smaller sites you didn't know to check.