Search Results for "ai safety"
Anthropic Expands AI Ethics Talks Amid $380B Valuation
Anthropic opens dialogues with global thought leaders on AI safety as its valuation soars to $380B. Learn how this shapes the future of AI governance.
OpenAI Updates ChatGPT for Context-Aware Safety in Sensitive Talks
OpenAI enhances ChatGPT's ability to detect evolving risks in sensitive conversations, improving safety in scenarios like self-harm and violence.
Open-Source AI Guardrails Removed in Minutes, Raising Regulation Concerns
Tests show open-source AI guardrails can be removed in under 10 minutes, exposing gaps in regulatory frameworks as policymakers scramble to adapt.
OpenAI Outlines Playbook for Third-Party AI Model Evaluations
OpenAI shares detailed guidance for evaluating frontier AI models, emphasizing safeguards, validity, and structured harnesses for capability testing.
OpenAI Pushes for Global Youth AI Safety Standards at G7 Summit
OpenAI urges G7 leaders to establish a global institute for youth AI safety, aiming to standardize protections and promote opportunities.
President Biden Amplifies AI Safety and Security Measures with Executive Order
President Biden has issued an Executive Order on October 30, 2023, aiming to improve AI safety, security, and trustworthiness. The order requires rigorous testing of critical AI systems, advocates for data privacy legislation, and promotes AI's positive impact on healthcare, education, and the labor market.
UK to Host First International AI Safety Conference in November
The United Kingdom is set to host the world's first international conference on AI safety on November 1-2, 2023. The summit aims to position the UK as a mediator in tech discussions between the US, China, and the EU. Prime Minister Rishi Sunak will host the event at Bletchley Park, featuring notable attendees like US Vice President Kamala Harris and Google DeepMind CEO Demis Hassabis. The conference will focus on the existential risks posed by AI, among other safety concerns.
Exploring AI Stability: Navigating Non-Power-Seeking Behavior Across Environments
The research explores AI's stability in non-power-seeking behaviors, revealing that certain policies maintain non-resistance to shutdown across similar environments, providing insights into mitigating risks associated with power-seeking AI.
Exploring AGI Hallucination: A Comprehensive Survey of Challenges and Mitigation Strategies
A new survey delves into the phenomenon of AGI hallucination, categorizing its types, causes, and current mitigation approaches while discussing future research directions.
NIST's Call for Public Input on AI Safety in Response to Biden's Executive Order
NIST is seeking public input to create AI safety guidelines following President Biden's Executive Order, aiming to ensure a secure AI environment, mitigate risks, and foster innovation.
California Spearheads AI Ethics and Safety with Senate Bills 892 and 893
California takes a pioneering role in AI regulation with Senate Bills 892 and 893, aiming to ensure AI safety, ethics, and public benefits.
US NIST Initiates AI Safety Consortium to Promote Trustworthy AI Development
The US National Institute of Standards and Technology (NIST) has launched the Artificial Intelligence Safety Institute Consortium to promote safe AI development and responsible use, inviting organizations to collaborate on identifying proven safety techniques by December 4, 2023.