INTERCAL: We have keywords like PLEASE and IGNORE because we think it's funny
Anthropic: We have PLEASE and IGNORE *baked into our source-code* because we have no fucking idea how this thing works
INTERCAL: We have keywords like PLEASE and IGNORE because we think it's funny
Anthropic: We have PLEASE and IGNORE *baked into our source-code* because we have no fucking idea how this thing works
@pikesley Since we have no idea what the training data actually is, there's a chance it *does* give better results, for some definition of "better" anyway.
Someone should do a study on the quality of results between asking tersely worded questions, asking politely worded questions, and burning the entire industry to the ground.
For science.