Afleveringen
-
On seeing and not seeing souls. Text version here: https://joecarlsmith.com/2025/05/21/the-stakes-of-ai-moral-status/
-
It's really important; we've got a real shot; there are a ton of ways to fail.
Text version here: https://joecarlsmith.com/2025/04/30/can-we-safely-automate-alignment-research/.
There's also a video and transcript of a talk I gave on this topic here: https://joecarlsmith.com/2025/04/30/video-and-transcript-of-talk-on-automating-alignment-research/
-
Zijn er afleveringen die ontbreken?
-
We should try extremely hard to use AI labor to help address the alignment problem. Text version here: https://joecarlsmith.com/2025/03/14/ai-for-ai-safety
-
On the structure of the path to safe superintelligence, and some possible milestones along the way. Text version here: https://joecarlsmith.substack.com/p/paths-and-waystations-in-ai-safety
-
Examining the conditions required for rogue AI behavior. Text version here: https://joecarlsmith.substack.com/p/when-should-we-worry-about-ai-power
-
Also: to avoid it? Handle it? Solve it forever? Solve it completely?
Text version here: https://joecarlsmith.substack.com/p/what-is-it-to-solve-the-alignment
-
Introduction to a series of essays about paths to safe and useful superintelligence.
Text version here: https://joecarlsmith.substack.com/p/how-do-we-solve-the-alignment-problem
-
When the line pulls at your hand.
Text version here: https://joecarlsmith.com/2025/01/28/fake-thinking-and-real-thinking/. -
What can we learn from recent empirical demonstrations of scheming in frontier models? Text version here: https://joecarlsmith.com/2024/12/18/takes-on-alignment-faking-in-large-language-models/
-
Extended audio from my conversation with Dwarkesh Patel. This part focuses on the basic story about AI takeover. Transcript available on my website here: https://joecarlsmith.com/2024/09/30/part-2-ai-takeover-extended-audio-transcript-from-my-conversation-with-dwarkesh-patel
-
Extended audio from my conversation with Dwarkesh Patel. This part focuses on my series "Otherness and control in the age of AGI." Transcript available on my website here: https://joecarlsmith.com/2024/09/30/part-1-otherness-extended-audio-transcript-from-my-conversation-with-dwarkesh-patel/
-
This is the introduction and summary for my series "Otherness and control in the age of AGI."
Text version here: https://joecarlsmith.com/2024/01/02/otherness-and-control-in-the-age-of-agi -
Second half of the full audio for my series on how agents with different values should relate to one another, and on the ethics of seeking and sharing power.
First half here: https://joecarlsmithaudio.buzzsprout.com/2034731/15266490-first-half-of-full-audio-for-otherness-and-control-in-the-age-of-agi
PDF of the full series here: https://jc.gatspress.com/pdf/otherness_full.pdf
Summary of the series here: https://joecarlsmith.com/2024/01/02/otherness-and-control-in-the-age-of-agi -
First half of the full audio for my series on how agents with different values should relate to one another, and on the ethics of seeking and sharing power.
Second half here: https://joecarlsmithaudio.buzzsprout.com/2034731/15272132-second-half-of-full-audio-for-otherness-and-control-in-the-age-of-agi
PDF of the full series here: https://jc.gatspress.com/pdf/otherness_full.pdf
Summary of the series here: https://joecarlsmith.com/2024/01/02/otherness-and-control-in-the-age-of-agi -
Garden, campfire, healing water.
Text version here: https://joecarlsmith.com/2024/06/18/loving-a-world-you-dont-trust
This essay is part of a series I'm calling "Otherness and control in the age of AGI." I'm hoping that individual essays can be read fairly well on their own, but see here for brief text summaries of the essays that have been released thus far: https://joecarlsmith.com/2024/01/02/otherness-and-control-in-the-age-of-agi -
Examining a certain kind of meaning-laden receptivity to the world.
Text version here: https://joecarlsmith.com/2024/03/25/on-attunement
This essay is part of a series I'm calling "Otherness and control in the age of AGI." I'm hoping that individual essays can be read fairly well on their own, but see here for brief text summaries of the essays that have been released thus far: https://joecarlsmith.com/2024/01/02/otherness-and-control-in-the-age-of-agi
(Though: note that I haven't put the summary post on the podcast yet.) -
Examining a philosophical vibe that I think contrasts in interesting ways with "deep atheism."
Text version here: https://joecarlsmith.com/2024/03/21/on-green
This essay is part of a series I'm calling "Otherness and control in the age of AGI." I'm hoping that individual essays can be read fairly well on their own, but see here for brief text summaries of the essays that have been released thus far: https://joecarlsmith.com/2024/01/02/otherness-and-control-in-the-age-of-agi
(Though: note that I haven't put the summary post on the podcast yet.) -
What does it take to avoid tyranny towards to the future?
Text version here: https://joecarlsmith.com/2024/01/18/on-the-abolition-of-man
This essay is part of a series I'm calling "Otherness and control in the age of AGI." I'm hoping that individual essays can be read fairly well on their own, but see here for brief text summaries of the essays that have been released thus far: https://joecarlsmith.com/2024/01/02/otherness-and-control-in-the-age-of-agi
(Though: note that I haven't put the summary post on the podcast yet.) -
Let's be the sort of species that aliens wouldn't fear the way we fear paperclippers.
Text version here: https://joecarlsmith.com/2024/01/16/being-nicer-than-clippy/
This essay is part of a series I'm calling "Otherness and control in the age of AGI." I'm hoping that individual essays can be read fairly well on their own, but see here for brief text summaries of the essays that have been released thus far: https://joecarlsmith.com/2024/01/02/otherness-and-control-in-the-age-of-agi
(Though: note that I haven't put the summary post on the podcast yet.) -
Who isn't a paperclipper?
Text version here: https://joecarlsmith.com/2024/01/11/an-even-deeper-atheism
This essay is part of a series I'm calling "Otherness and control in the age of AGI." I'm hoping that individual essays can be read fairly well on their own, but see here for brief summaries of the essays that have been released thus far: https://joecarlsmith.com/2024/01/02/otherness-and-control-in-the-age-of-agi - Laat meer zien