Mastodawn

Gapry Jan 31

[#Compiler] Day 3 of #AoCO2025 Study Notes: You can’t fool the optimiser

My notes focus on reproducing and verifying Matt Godbolt’s examples in a local development environment.

This post compares tail recursion with standard recursion.

Study Notes: You can’t fool the optimiser

Gapry's Blog

Gapry Jan 31

[#Compiler] Day 2 of #AoCO2025 Study Notes

My notes focus on reproducing and verifying Matt Godbolt’s examples in a local development environment.

I have also extended the discussion with a hand-written proof of concept in assembly.

Study Notes: Addressing the adding situation

Gapry's Blog

Gapry Jan 1

[#Compiler] Day 1 of #AoCO2025 Study Notes

While the original uses #CompilerExplorer, I wanted to replicate the analysis locally.

In this post, I use #gcc, #clang, llvm-objdump and #LLDB for the analysis.

Study Notes: Why xor eax, eax?

Gapry's Blog

matt godbolt Dec 25

Day 25 of Advent of Compiler Optimisations!

We've reached the end of this journey through compiler magic—from simple arithmetic tricks to mind-bending loop transformations. Thank you for following along! Whether you celebrate Christmas or just enjoy a good compiler optimisation, I hope you've discovered something that made you see your code differently.

#AoCO2025

Thank you — Matt Godbolt’s blog

The end of the 2025 Advent of Compiler Optimisation

matt godbolt Dec 24

Day 24 of Advent of Compiler Optimisations!

A simple loop that sums integers from 0 to n. GCC cleverly unrolls it to process two numbers at once. But clang? The loop completely disappears—replaced by a few multiplies and shifts that compute the answer directly. How does it recognise this pattern and transform O(n) code into O(1)?

#AoCO2025

When compilers surprise you — Matt Godbolt’s blog

Sometimes compilers can surprise and delight even a jaded old engineer like me
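For readers who want to see what "loop to O(1)" means in source terms, here is a minimal sketch. It is not the code from the post: the function names, the exclusive upper bound and the unsigned 32-bit type are all assumptions made for illustration. The closed form uses the standard identity 0 + 1 + ... + (n - 1) = n(n - 1)/2, computed in 64 bits so that the truncated result matches the wrapping loop.

```cpp
#include <cassert>  // only needed for the quick checks mentioned below
#include <cstdint>

// Loop form: adds 0, 1, ..., n-1. The exclusive bound is an assumption;
// the post only says "from 0 to n".
std::uint32_t sum_loop(std::uint32_t n) {
    std::uint32_t total = 0;
    for (std::uint32_t i = 0; i < n; ++i) {
        total += i;  // unsigned arithmetic wraps modulo 2^32
    }
    return total;
}

// Closed form: n * (n - 1) / 2, evaluated in 64 bits so the product cannot
// overflow, then truncated to 32 bits to match the loop's wraparound.
std::uint32_t sum_closed(std::uint32_t n) {
    const std::uint64_t wide = static_cast<std::uint64_t>(n) * (n - 1) / 2;
    return static_cast<std::uint32_t>(wide);
}
```

A quick check such as `assert(sum_loop(10) == sum_closed(10));` passes, since both return 45. Whether a given compiler performs this rewrite depends on the compiler, version and flags.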

matt godbolt Dec 23

Day 23 of Advent of Compiler Optimisations!

Switch statements compile to jump tables, right? Well... sometimes. But what happens when your five-case switch becomes pure arithmetic? Or when checking for whitespace turns into a single mysterious constant and some bit manipulation? Turns out compilers have a whole bag of tricks beyond the textbook answer.

#AoCO2025

Switching it up a bit — Matt Godbolt’s blog

Taking a look at the various ways the compiler can optimise switch statements
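The "single mysterious constant" idea can be sketched by hand. This is an illustration under assumptions, not the code from the post: the set of whitespace characters (space, tab, newline, carriage return) and both function names are hypothetical. Each character code below 64 selects one bit of a 64-bit mask, so the whole switch collapses into a range check, a shift and an AND.

```cpp
#include <cassert>  // only needed for the quick checks mentioned below
#include <cstdint>

// Textbook form: one case label per whitespace character.
bool is_space_switch(char c) {
    switch (c) {
        case ' ':
        case '\t':
        case '\n':
        case '\r':
            return true;
        default:
            return false;
    }
}

// Bitmask form: bit k of the mask is set when character code k is whitespace.
bool is_space_mask(char c) {
    constexpr std::uint64_t mask = (1ULL << ' ') | (1ULL << '\t') |
                                   (1ULL << '\n') | (1ULL << '\r');
    const auto u = static_cast<unsigned char>(c);
    // Codes of 64 and above cannot index the mask, so reject them first.
    return u < 64 && ((mask >> u) & 1ULL) != 0;
}
```

Both functions agree on every input, for example `assert(is_space_mask('\t') == is_space_switch('\t'));`.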

matt godbolt Dec 22

Day 22 of Advent of Compiler Optimisations!

Comparing a string_view against "ABCDEFG" should call memcmp, right? Watch what Clang actually generates — no function call at all, just a handful of inline instructions using some rather cunning tricks. How does it compare 7 bytes so efficiently when they don't fit in a single register?

#AoCO2025

Clever memory tricks — Matt Godbolt’s blog

We learn that compilers have tricks to access memory efficiently
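One trick compilers are known to use for awkward sizes is a pair of overlapping loads: 7 bytes can be covered by two 4-byte reads at offsets 0 and 3, with the middle byte checked twice. The sketch below writes that out by hand. It assumes C++17, the function name is hypothetical, and it is not claimed to match Clang's output instruction for instruction.

```cpp
#include <cassert>  // only needed for the quick checks mentioned below
#include <cstdint>
#include <cstring>
#include <string_view>

// Compares s against "ABCDEFG" using two overlapping 4-byte loads
// (bytes 0..3 and bytes 3..6) instead of calling memcmp.
bool equals_abcdefg(std::string_view s) {
    static constexpr char expected[] = "ABCDEFG";  // 7 characters plus NUL
    if (s.size() != 7) {
        return false;
    }
    std::uint32_t lo, hi, exp_lo, exp_hi;
    // memcpy is the portable way to express an unaligned load.
    std::memcpy(&lo, s.data(), 4);
    std::memcpy(&hi, s.data() + 3, 4);
    std::memcpy(&exp_lo, expected, 4);
    std::memcpy(&exp_hi, expected + 3, 4);
    // XOR is zero only when both words match; OR combines the two tests.
    return ((lo ^ exp_lo) | (hi ^ exp_hi)) == 0;
}
```

Because the expected words are loaded the same way as the input, the comparison does not depend on byte order. For example, `assert(equals_abcdefg("ABCDEFG"));` passes and `equals_abcdefg("ABCxEFG")` returns false.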

matt godbolt Dec 21

Day 21 of Advent of Compiler Optimisations!

Summing an array of integers? The compiler vectorises it beautifully, processing 8 at a time with SIMD. Switch to floats and... the compiler refuses to vectorise, doing each add one by one. Same loop, same code structure — why does the compiler treat floats so differently?

#AoCO2025

When SIMD Fails: Floating Point Associativity — Matt Godbolt’s blog

Why floating point maths doesn't vectorise like integers, and what to do about it
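The reason regrouping is not allowed by default can be shown with three numbers. This is a small sketch, assuming IEEE 754 single precision with round-to-nearest and no fast-math style flags; the function names are hypothetical. Vectorising a sum means adding the elements in a different order, and for floats a different order can give a different answer.

```cpp
#include <cassert>  // only needed for the quick checks mentioned below

// Adds left to right, the order a plain sequential loop uses.
float sum_left(float a, float b, float c) {
    return (a + b) + c;
}

// Same three values, grouped differently, as a reordering would do.
float sum_right(float a, float b, float c) {
    return a + (b + c);
}
```

With `a = 1e8f`, `b = -1e8f` and `c = 1.0f`, `sum_left` returns 1.0f because the two large values cancel first. `sum_right` returns 0.0f because floats near 1e8 are spaced 8 apart, so `-1e8f + 1.0f` rounds back to `-1e8f` and the 1 is lost. Integer addition has no such rounding, which is why the integer loop can be reordered freely.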

matt godbolt Dec 20

Day 20 of Advent of Compiler Optimisations!

Loop over 65,536 integers doing comparisons — that's 65,536 iterations, right? Wrong! With the right flags, the compiler processes 8 integers per iteration using SIMD instructions. Same number of assembly instructions, 8× the throughput. What's the trick that makes this possible?

#AoCO2025

SIMD City: Auto-vectorisation — Matt Godbolt’s blog

Doing more with less: vectorising can speed your code up 8x or more!

matt godbolt Dec 19

Day 19 of Advent of Compiler Optimisations!

Recursive functions need to call themselves over and over — that must mean unbounded stack growth, right? Wrong! When a function ends by calling another function (even itself), the compiler can replace the call with a simple jump. Recursion becomes iteration, no stack overhead at all. How does this transformation work?

#AoCO2025

Chasing your tail — Matt Godbolt’s blog

The art of not (directly) coming back: tail call optimisation
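The call-to-jump transformation can be pictured in source form. The sketch below is an illustration with hypothetical names, not code from the post. In the first function the recursive call is the very last thing that happens, so nothing in the current frame is needed afterwards. The second function is the loop that an optimiser may effectively turn it into.

```cpp
#include <cassert>  // only needed for the quick checks mentioned below
#include <cstdint>

// Tail-recursive sum of 1..n. The running total travels in acc, so the
// recursive call is in tail position: its result is returned unchanged.
std::uint64_t sum_to_tail(std::uint64_t n, std::uint64_t acc = 0) {
    if (n == 0) {
        return acc;
    }
    return sum_to_tail(n - 1, acc + n);  // tail call
}

// Hand-written equivalent: update the arguments, then jump back to the top.
std::uint64_t sum_to_loop(std::uint64_t n, std::uint64_t acc = 0) {
    while (n != 0) {
        acc += n;
        --n;
    }
    return acc;
}
```

Both return 55 for `n = 10`. C++ does not guarantee that the tail call is eliminated, so without optimisation enabled the recursive version can still use one stack frame per call.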