Not bad for a first try, but I will have to dig further to see whether 1) I can optimize things with some fancy AVX2 instructions, and 2) I can improve cache locality when going parallel.
| Website | https://www.neilhenning.dev |
| GitHub | https://github.com/sheredom |


