@AdrienB Pretty much, although those values can be multiplied by base constants, and you need a slight tweak if your input and output layers are tied.
Here's my implementation for a regular transformer.