Mastodawn

облезлый Apr 14, 2025

what is softmax #activationfunction
https://www.youtube.com/watch?v=yaH743wXFYo

what is softmax #activationfunction

YouTube

Show thread

Victoria Stuart 🇨🇦 🏳️‍⚧️Jul 12, 2023

...
Addendum

Forward-Forward Algorithm
https://medium.com/@Mosbeh_Barhoumi/forward-forward-algorithm-ac24d0d9ffd

The forward-forward algorithm uses a custom loss function that compares the mean square value of the activations for positive and negative samples.
The network optimizes this loss function by performing gradient calculations and optimization steps on the trainable weights of the dense layer.
...

#ActivationFunction #ForwardPropagation #NeuralNetworks

The Triangle Agency Mar 4, 2023

towards first-principles architecture design – The Berkeley Artificial Intelligence Research Blog https://triangleagency.co.uk/towards-first-principles-architecture-design-the-berkeley-artificial-intelligence-research-blog/?utm_source=dlvr.it&utm_medium=mastodon #TheTriangleAgencyNews #activationfunction #architecture #Artificial

towards first-principles architecture design – The Berkeley Artificial Intelligence Research Blog - The Triangle Agency

The BAIR BlogThe BAIR Blog

The Triangle Agency

Show thread

Nafnlaus 🇮🇸 🇺🇦Dec 8, 2022

5. "Why would one avoid using a linear #ActivationFunction in a #NeuralNetwork?" #AI

No, #ChatGPT3 #GPT3. The derivative of a linear activation function is *always* positive; it has no vanishing gradient. The problem it has is that you can't backpropagate (constant derivative) and can mathematically reduce a network with linear functions down to a single layer.