What is ai safety? ai safety news, ai safety meaning, ai safety definition - Blockchain.News

Search Results for "ai safety"

OpenAI Updates Model Spec with U18 Teen Safety Principles for ChatGPT

OpenAI Updates Model Spec with U18 Teen Safety Principles for ChatGPT

OpenAI introduces new U18 Principles to its Model Specification, establishing age-appropriate AI safety guidelines for teenage ChatGPT users ages 13-17.

Anthropic Discovers 'Assistant Axis' to Prevent AI Jailbreaks and Persona Drift

Anthropic Discovers 'Assistant Axis' to Prevent AI Jailbreaks and Persona Drift

Anthropic researchers map neural 'persona space' in LLMs, finding a key axis that controls AI character stability and blocks harmful behavior patterns.

Anthropic Releases Full AI Constitution for Claude Under Open License

Anthropic Releases Full AI Constitution for Claude Under Open License

Anthropic publishes Claude's complete training constitution under CC0 license, detailing AI safety priorities and ethical guidelines as company eyes $350B valuation.

NVIDIA DRIVE AV Powers Mercedes-Benz CLA to Top Euro NCAP Safety Rating

NVIDIA DRIVE AV Powers Mercedes-Benz CLA to Top Euro NCAP Safety Rating

Mercedes-Benz CLA earns Euro NCAP's Best Performer of 2025 award using NVIDIA DRIVE AV software, marking a shift toward AI-driven safety standards in vehicles.

President Biden Amplifies AI Safety and Security Measures with Executive Order

President Biden Amplifies AI Safety and Security Measures with Executive Order

President Biden has issued an Executive Order on October 30, 2023, aiming to improve AI safety, security, and trustworthiness. The order requires rigorous testing of critical AI systems, advocates for data privacy legislation, and promotes AI's positive impact on healthcare, education, and the labor market.

UK to Host First International AI Safety Conference in November

UK to Host First International AI Safety Conference in November

The United Kingdom is set to host the world's first international conference on AI safety on November 1-2, 2023. The summit aims to position the UK as a mediator in tech discussions between the US, China, and the EU. Prime Minister Rishi Sunak will host the event at Bletchley Park, featuring notable attendees like US Vice President Kamala Harris and Google DeepMind CEO Demis Hassabis. The conference will focus on the existential risks posed by AI, among other safety concerns.

Exploring AI Stability: Navigating Non-Power-Seeking Behavior Across Environments

Exploring AI Stability: Navigating Non-Power-Seeking Behavior Across Environments

The research explores AI's stability in non-power-seeking behaviors, revealing that certain policies maintain non-resistance to shutdown across similar environments, providing insights into mitigating risks associated with power-seeking AI.

Exploring AGI Hallucination: A Comprehensive Survey of Challenges and Mitigation Strategies

Exploring AGI Hallucination: A Comprehensive Survey of Challenges and Mitigation Strategies

A new survey delves into the phenomenon of AGI hallucination, categorizing its types, causes, and current mitigation approaches while discussing future research directions.

NIST's Call for Public Input on AI Safety in Response to Biden's Executive Order

NIST's Call for Public Input on AI Safety in Response to Biden's Executive Order

NIST is seeking public input to create AI safety guidelines following President Biden's Executive Order, aiming to ensure a secure AI environment, mitigate risks, and foster innovation.

California Spearheads AI Ethics and Safety with Senate Bills 892 and 893

California Spearheads AI Ethics and Safety with Senate Bills 892 and 893

California takes a pioneering role in AI regulation with Senate Bills 892 and 893, aiming to ensure AI safety, ethics, and public benefits.

US NIST Initiates AI Safety Consortium to Promote Trustworthy AI Development

US NIST Initiates AI Safety Consortium to Promote Trustworthy AI Development

The US National Institute of Standards and Technology (NIST) has launched the Artificial Intelligence Safety Institute Consortium to promote safe AI development and responsible use, inviting organizations to collaborate on identifying proven safety techniques by December 4, 2023.

British Standards Institution Pioneers International AI Safety Guidelines for Sustainable Future

British Standards Institution Pioneers International AI Safety Guidelines for Sustainable Future

BSI's release of the first international AI safety guideline, BS ISO/IEC 42001, marks a significant step in standardizing the safe and ethical use of AI, reflecting global demand for robust AI governance.

Trending topics