Mastodawn

Alexandre Mutel Mar 24, 2023

I have spent my last few evenings optimizing my vectorized exp2 and log2 in C# (.NET 7), improving their precision to 1 ULP (in addition to 3 ULP) and making the precision a parameter, so that codegen gets nicely monomorphized.

I compared it with SLEEF, which I also partly used to optimize further. It's crazy how much code out there implementing exp2 and log2 is sometimes wrong or not as optimized!

I can now continue building higher level blocks for my tensor lib with activation functions for neural networks! 🏎️
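
For readers unfamiliar with the idea of parameterizing over precision, here is a minimal scalar sketch of how it can be done in C# 11 / .NET 7. It is not the author's code: the names `IPrecision`, `LowPrecision`, `HighPrecision` and `ApproxMath` are hypothetical, the polynomial is a plain Taylor expansion rather than a tuned minimax fit, and it makes no attempt to reach the 1 ULP or 3 ULP bounds mentioned above. The point is only that a struct type argument with a static abstract member lets the JIT compile a separate, specialized method body per precision level.

```csharp
using System;

// Hypothetical precision "tag" interface (an assumption for illustration).
public interface IPrecision
{
    // Degree of the polynomial evaluated on the reduced range.
    static abstract int Degree { get; }
}

// Two hypothetical precision levels. Because they are structs, the JIT
// generates one specialized copy of Exp2<T> per type, and Degree becomes
// a constant inside each copy.
public readonly struct LowPrecision : IPrecision { public static int Degree => 3; }
public readonly struct HighPrecision : IPrecision { public static int Degree => 6; }

public static class ApproxMath
{
    // Taylor coefficients of 2^f = exp(f * ln 2): c_k = (ln 2)^k / k!
    private static readonly float[] Coeffs =
    {
        1.0f,
        0.6931471805599453f,
        0.2402265069591007f,
        0.0555041086648216f,
        0.0096181291076285f,
        0.0013333558146428f,
        0.000154035303933816f,
    };

    public static float Exp2<TPrecision>(float x)
        where TPrecision : struct, IPrecision
    {
        // Range reduction: x = i + f with i an integer and f in [-0.5, 0.5].
        float i = MathF.Round(x);
        float f = x - i;

        // Horner evaluation of the polynomial approximating 2^f.
        float p = Coeffs[TPrecision.Degree];
        for (int k = TPrecision.Degree - 1; k >= 0; k--)
        {
            p = p * f + Coeffs[k];
        }

        // Reconstruction: 2^x = 2^i * 2^f.
        return MathF.ScaleB(p, (int)i);
    }
}
```

A caller picks the trade-off at the call site, for example `ApproxMath.Exp2<HighPrecision>(0.5f)` versus `ApproxMath.Exp2<LowPrecision>(0.5f)`, with no runtime branch on the precision inside the method. A vectorized version would apply the same pattern to `Vector128<float>` or `Vector256<float>` operands.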

rastilin

@xoofx

Those are some impressive numbers. If it's broadly applicable you should do a full blog post on it.

Alexandre Mutel Mar 24, 2023

@rastilin That would definitely be interesting! The code will also be OSS under a permissive BSD license, so folks will be able to experiment with it or port it.