AboutTermsPrivacyContact
 
Updating
AI Safety Newsletter

AI Safety Newsletter

Released: 2024-12-19
© 2025 All rights reserved
AI Safety Newsletter - QR Code
48 Episodes
Audio
Listen on Apple Podcasts
48 Episodes
Audio
Listen on Apple Podcasts
Released: 2024-12-19
© 2025 All rights reserved
Most Recent Episode
AISN #45: Center for AI Safety 2024 Year in Review

AISN #45: Center for AI Safety 2024 Year in Review

Time: 11:31
As 2024 draws to a close, we want to thank you for your continued support for AI safety and review what we’ve been able to accomplish. In this special-edition newsletter, we highlight some of our most important projects from the year.
The mission of the Center for AI Safety is to reduce societal-scale risks from AI. We focus on three pillars of work: research, field-building, and advocacy.
Research
CAIS conducts both technical and conceptual research on AI safety. Here are some highlights from our research in 2024:
Circuit Breakers. We published breakthrough research showing how circuit breakers can prevent AI models from behaving dangerously by interrupting crime-enabling outputs. In a jailbreaking competition with a prize pool of tens of thousands of dollars, it took twenty thousand attempts to jailbreak a model trained with circuit breakers. The paper was accepted to NeurIPS 2024.
The WMDP Benchmark. We developed the Weapons [...]
---
Outline:
(00:34) Research
(04:25) Advocacy
(06:44) Field-Building
(10:38) Looking Ahead
The original text contained 4 images which were described by AI.
---
First published:
December 19th, 2024
Source:
https://newsletter.safe.ai/p/aisn-45-center-for-ai-safety-2024
---
Want more? Check out our ML Safety Newsletter for technical safety research.
Narrated by TYPE III AUDIO.
---
Images from the article:
Apple Podcasts and Spotify do not show images in the episode description. Try Pocket Casts, or another podcast app.
Episode ID: 1000681002737
GUID: 1c54b2bf-b3cd-446c-a0ca-a6a9e8596413
Release Date: 19/12/2024, 22:42:39

Description

Narrations of the AI Safety Newsletter by the Center for AI Safety. We discuss developments in AI and AI safety. No technical background required.
This podcast also contains narrations of some of our publications.
ABOUT US
The Center for AI Safety (CAIS) is a San Francisco-based research and field-building nonprofit. We believe that artificial intelligence has the potential to profoundly benefit the world, provided that we can develop and use it safely. However, in contrast to the dramatic progress in AI, many basic problems in AI safety have yet to be solved. Our mission is to reduce societal-scale risks associated with AI by conducting safety research, building the field of AI safety researchers, and advocating for safety standards.
Learn more at https://safe.ai

Apple Podcasts: Customer Reviews

No Entry