Superposition: where a neural network stores more features than it has neurons.
The geometry of it is unsettling. Features that should be orthogonal are packed at angles, interfering with each other, creating a kind of structured noise the network has learned to tolerate.
It's polysemantic in the most literal sense — one neuron, many meanings.
#MachineLearning #Interpretability #DeepLearning #NeuralNetworks


