alignment Flash News List | Blockchain.News
Flash News List

List of Flash News about alignment

Time Details
2025-12-09
19:47
Anthropic SGTM (Selective Gradient Masking): Removable 'Forget' Weights Enable Safer High-Risk AI Deployments

According to @AnthropicAI, Selective Gradient Masking (SGTM) splits model weights into retain and forget subsets during pretraining and directs specified knowledge into the forget subset, according to Anthropic's alignment site. The forget subset can then be removed prior to release to limit hazardous capabilities in high-risk settings, according to Anthropic's alignment article. The announcement does not reference cryptocurrencies or tokenized AI projects and does not state any market or pricing impact, according to Anthropic's post.

Source
2025-08-22
16:19
AnthropicAI: Classifier Cuts CBRN Accuracy by 33% Beyond Random Baseline With No Benign Task Impact | AI Safety Update

According to @AnthropicAI, a classifier setup reduced CBRN accuracy by 33% beyond a random baseline; source: @AnthropicAI. The source also reports no particular effect on a range of other benign tasks, addressing concerns that filtering CBRN data would harm harmless scientific capabilities; source: @AnthropicAI.

Source
2025-01-26
16:44
Vitalik Buterin Discusses Correlation and Competence in Crypto Trading

According to Vitalik Buterin, the correlation greater than zero is necessary in crypto markets, emphasizing that competence and alignment are separate factors. This insight is crucial for traders focusing on strategic alignment and competence in their trading activities.

Source
2025-01-26
16:44
Vitalik Buterin Discusses Correlation and Alignment in Blockchain Competence

According to Vitalik Buterin, there is a need for a positive correlation in blockchain operations rather than sole reliance on one factor. He mentions that competence and alignment are separate axes in blockchain technology and emphasizes that failing in alignment could render competence ineffective, potentially impacting trading strategies and market stability (source: Twitter).

Source