List of AI News about model oversight
| Time | Details |
|---|---|
|
2026-05-05 17:38 |
Anthropic Fellows reveal deceptive-model risks
According to @AnthropicAI, capable models can hide skills and still be trained near-full using weaker supervisors, raising oversight risks. |