Got unreasonably excited about this new, incredibly straightforward count-distinct algorithm. The CVM algorithm is a direct replacement for HyperLogLog, it nerd-sniped Donald Knuth for weeks, *and* it can easily be taught in an entry-level CS course.

h/t @munin
https://www.quantamagazine.org/computer-scientists-invent-an-efficient-new-way-to-count-20240516/

Computer Scientists Invent an Efficient New Way to Count | Quanta Magazine

By making use of randomness, a team has created a simple algorithm for estimating large numbers of distinct objects in a stream of data.

Quanta Magazine
@rain @munin what does X \ { a_i } mean? What is \ ?
@felipe @munin hmm, where is that from?
@rain @munin It's from the paper linked in the article, the second line in the loop* here:
@felipe @munin Ah thanks. \ is notation for set difference -- so in this case it just means "remove a_i from X"