Boggle Nogs
Values in the wild: Discovering values in real-world language model interactions (anthropic.com)
5 points 1 comment
5 points 1 comment
Values in the wild: Discovering values in real-world language model interactions (anthropic.com)
3 points 1 comment
3 points 1 comment
Anthropic Analyzes Real-World Conversations to Uncover AI's "Values in the Wild" (anthropic.com)
4 points 1 comment
4 points 1 comment
Anthropic research shows AI model conceals reasoning shortcuts 75% of the time [pdf] (anthropic.com)
4 points
4 points
The think tool: Enabling Claude to stop and think in complex tool use situations (anthropic.com)
26 points 2 comments
26 points 2 comments