Steerling-8B, the first interpretable model that can trace any token it generates to its input context, concepts a human can understand, and its training data.
https://www.guidelabs.ai/post/steerling-8b-base-model-release/