Τα καλύτερα LessWrong Curated podcast (2025)

1
“The Most Forbidden Technique” by Zvi 32:12

1d ago32:12

32:12

The Most Forbidden Technique is training an AI using interpretability techniques. An AI produces a final output [X] via some method [M]. You can analyze [M] using technique [T], to learn what the AI is up to. You could train on that. Never do that. You train on [X]. Only [X]. Never [M], never [T]. Why? Because [T] is how you figure out when the mod…

1
“Trojan Sky” by Richard_Ngo 22:28

2d ago22:28

22:28

You learn the rules as soon as you’re old enough to speak. Don’t talk to jabberjays. You recite them as soon as you wake up every morning. Keep your eyes off screensnakes. Your mother chooses a dozen to quiz you on each day before you’re allowed lunch. Glitchers aren’t human any more; if you see one, run. Before you sleep, you run through the whole…

1
“OpenAI:” by Daniel Kokotajlo 7:21

4d ago7:21

7:21

Exciting Update: OpenAI has released this blog post and paper which makes me very happy. It's basically the first steps along the research agenda I sketched out here. tl;dr: 1.) They notice that their flagship reasoning models do sometimes intentionally reward hack, e.g. literally say "Let's hack" in the CoT and then proceed to hack the evaluation …

1
“How Much Are LLMs Actually Boosting Real-World Programmer Productivity?” by Thane Ruthenis 7:16

6d ago7:16

7:16

LLM-based coding-assistance tools have been out for ~2 years now. Many developers have been reporting that this is dramatically increasing their productivity, up to 5x'ing/10x'ing it. It seems clear that this multiplier isn't field-wide, at least. There's no corresponding increase in output, after all. This would make sense. If you're doing anythin…

1
“So how well is Claude playing Pokémon?” by Julian Bradshaw 9:05

7d ago9:05

9:05

Background: After the release of Claude 3.7 Sonnet,[1] an Anthropic employee started livestreaming Claude trying to play through Pokémon Red. The livestream is still going right now. TL:DR: So, how's it doing? Well, pretty badly. Worse than a 6-year-old would, definitely not PhD-level. Digging in But wait! you say. Didn't Anthropic publish a benchm…

1
“Methods for strong human germline engineering” by TsviBT 0:18

9d ago0:18

0:18

Note: an audio narration is not available for this article. Please see the original text. The original text contained 169 footnotes which were omitted from this narration. The original text contained 79 images which were described by AI. --- First published: March 3rd, 2025 Source: https://www.lesswrong.com/posts/2w6hjptanQ3cDyDw7/methods-for-stron…

1
“Have LLMs Generated Novel Insights?” by abramdemski, Cole Wyeth 3:49

9d ago3:49

3:49

In a recent post, Cole Wyeth makes a bold claim: . . . there is one crucial test (yes this is a crux) that LLMs have not passed. They have never done anything important. They haven't proven any theorems that anyone cares about. They haven't written anything that anyone will want to read in ten years (or even one year). Despite apparently memorizing…

1
“A Bear Case: My Predictions Regarding AI Progress” by Thane Ruthenis 18:47

10d ago18:47

18:47

This isn't really a "timeline", as such – I don't know the timings – but this is my current, fairly optimistic take on where we're heading. I'm not fully committed to this model yet: I'm still on the lookout for more agents and inference-time scaling later this year. But Deep Research, Claude 3.7, Claude Code, Grok 3, and GPT-4.5 have turned out la…

1
“Statistical Challenges with Making Super IQ babies” by Jan Christian Refsgaard 17:33

10d ago17:33

17:33

This is a critique of How to Make Superbabies on LessWrong. Disclaimer: I am not a geneticist[1], and I've tried to use as little jargon as possible. so I used the word mutation as a stand in for SNP (single nucleotide polymorphism, a common type of genetic variation). Background The Superbabies article has 3 sections, where they show: Why: We shou…

1
“Self-fulfilling misalignment data might be poisoning our AI models” by TurnTrout 1:51

10d ago1:51

1:51

This is a link post.Your AI's training data might make it more “evil” and more able to circumvent your security, monitoring, and control measures. Evidence suggests that when you pretrain a powerful model to predict a blog post about how powerful models will probably have bad goals, then the model is more likely to adopt bad goals. I discuss ways t…

1
“Judgements: Merging Prediction & Evidence” by abramdemski 11:13

14d ago11:13

11:13

I recently wrote about complete feedback, an idea which I think is quite important for AI safety. However, my note was quite brief, explaining the idea only to my closest research-friends. This post aims to bridge one of the inferential gaps to that idea. I also expect that the perspective-shift described here has some value on its own. In classica…

1
“The Sorry State of AI X-Risk Advocacy, and Thoughts on Doing Better” by Thane Ruthenis 12:31

17d ago12:31

12:31

First, let me quote my previous ancient post on the topic: Effective Strategies for Changing Public Opinion The titular paper is very relevant here. I'll summarize a few points. The main two forms of intervention are persuasion and framing. Persuasion is, to wit, an attempt to change someone's set of beliefs, either by introducing new ones or by ch…

1
“Power Lies Trembling: a three-book review” by Richard_Ngo 27:11

18d ago27:11

27:11

In a previous book review I described exclusive nightclubs as the particle colliders of sociology—places where you can reliably observe extreme forces collide. If so, military coups are the supernovae of sociology. They’re huge, rare, sudden events that, if studied carefully, provide deep insight about what lies underneath the veneer of normality a…

1
“Emergent Misalignment: Narrow finetuning can produce broadly misaligned LLMs” by Jan Betley, Owain_Evans 7:58

17d ago7:58

7:58

This is the abstract and introduction of our new paper. We show that finetuning state-of-the-art LLMs on a narrow task, such as writing vulnerable code, can lead to misaligned behavior in various different contexts. We don't fully understand that phenomenon. Authors: Jan Betley*, Daniel Tan*, Niels Warncke*, Anna Sztyber-Betley, Martín Soto, Xuchan…

1
“The Paris AI Anti-Safety Summit” by Zvi 42:06

21d ago42:06

42:06

It doesn’t look good. What used to be the AI Safety Summits were perhaps the most promising thing happening towards international coordination for AI Safety. This one was centrally coordination against AI Safety. In November 2023, the UK Bletchley Summit on AI Safety set out to let nations coordinate in the hopes that AI might not kill everyone. Ch…

1
“Eliezer’s Lost Alignment Articles / The Arbital Sequence” by Ruby 2:37

22d ago2:37

2:37

Note: this is a static copy of this wiki page. We are also publishing it as a post to ensure visibility. Circa 2015-2017, a lot of high quality content was written on Arbital by Eliezer Yudkowsky, Nate Soares, Paul Christiano, and others. Perhaps because the platform didn't take off, most of this content has not been as widely read as warranted by …

1
“Arbital has been imported to LessWrong” by RobertM, jimrandomh, Ben Pace, Ruby 8:52

24d ago8:52

8:52

Arbital was envisioned as a successor to Wikipedia. The project was discontinued in 2017, but not before many new features had been built and a substantial amount of writing about AI alignment and mathematics had been published on the website. If you've tried using Arbital.com the last few years, you might have noticed that it was on its last legs …

1
“How to Make Superbabies” by GeneSmith, kman 1:08:04

23d ago1:08:04

1:08:04

We’ve spent the better part of the last two decades unravelling exactly how the human genome works and which specific letter changes in our DNA affect things like diabetes risk or college graduation rates. Our knowledge has advanced to the point where, if we had a safe and reliable means of modifying genes in embryos, we could literally create supe…

1
“A computational no-coincidence principle” by Eric Neyman 13:28

24d ago13:28

13:28

Audio note: this article contains 134 uses of latex notation, so the narration may be difficult to follow. There's a link to the original text in the episode description. In a recent paper in Annals of Mathematics and Philosophy, Fields medalist Timothy Gowers asks why mathematicians sometimes believe that unproved statements are likely to be true.…

1
“A History of the Future, 2025-2040” by L Rudolf L 2:22:38

25d ago2:22:38

2:22:38

This is an all-in-one crosspost of a scenario I originally published in three parts on my blog (No Set Gauge). Links to the originals: A History of the Future, 2025-2027 A History of the Future, 2027-2030 A History of the Future, 2030-2040 Thanks to Luke Drago, Duncan McClements, and Theo Horsley for comments on all three parts. 2025-2027 Below is …

1
“It’s been ten years. I propose HPMOR Anniversary Parties.” by Screwtape 1:54

25d ago1:54

1:54

On March 14th, 2015, Harry Potter and the Methods of Rationality made its final post. Wrap parties were held all across the world to read the ending and talk about the story, in some cases sparking groups that would continue to meet for years. It's been ten years, and think that's a good reason for a round of parties. If you were there a decade ago…

1
“Some articles in ‘International Security’ that I enjoyed” by Buck 7:56

27d ago7:56

7:56

A friend of mine recently recommended that I read through articles from the journal International Security, in order to learn more about international relations, national security, and political science. I've really enjoyed it so far, and I think it's helped me have a clearer picture of how IR academics think about stuff, especially the core power …

1
“The Failed Strategy of Artificial Intelligence Doomers” by Ben Pace 8:39

27d ago8:39

8:39

This is the best sociological account of the AI x-risk reduction efforts of the last ~decade that I've seen. I encourage folks to engage with its critique and propose better strategies going forward. Here's the opening ~20% of the post. I encourage reading it all. In recent decades, a growing coalition has emerged to oppose the development of artif…

1
“Murder plots are infohazards” by Chris Monteiro 3:58

29d ago3:58

3:58

Hi all I've been hanging around the rationalist-sphere for many years now, mostly writing about transhumanism, until things started to change in 2016 after my Wikipedia writing habit shifted from writing up cybercrime topics, through to actively debunking the numerous dark web urban legends. After breaking into what I believe to be the most success…

1
“Why Did Elon Musk Just Offer to Buy Control of OpenAI for $100 Billion?” by garrison 11:41

1M ago11:41

11:41

This is the full text of a post from "The Obsolete Newsletter," a Substack that I write about the intersection of capitalism, geopolitics, and artificial intelligence. I’m a freelance journalist and the author of a forthcoming book called Obsolete: Power, Profit, and the Race to build Machine Superintelligence. Consider subscribing to stay up to da…

Podcasts που αξίζει να ακούσετε

LessWrong Curated Podcasts

Podcasts που αξίζει να ακούσετε

1
LessWrong (Curated & Popular)

LessWrong

1
“The Most Forbidden Technique” by Zvi 32:12

1
“Trojan Sky” by Richard_Ngo 22:28

1
“OpenAI:” by Daniel Kokotajlo 7:21

1
“How Much Are LLMs Actually Boosting Real-World Programmer Productivity?” by Thane Ruthenis 7:16

1
“So how well is Claude playing Pokémon?” by Julian Bradshaw 9:05

1
“Methods for strong human germline engineering” by TsviBT 0:18

1
“Have LLMs Generated Novel Insights?” by abramdemski, Cole Wyeth 3:49

1
“A Bear Case: My Predictions Regarding AI Progress” by Thane Ruthenis 18:47

1
“Statistical Challenges with Making Super IQ babies” by Jan Christian Refsgaard 17:33

1
“Self-fulfilling misalignment data might be poisoning our AI models” by TurnTrout 1:51

1
“Judgements: Merging Prediction & Evidence” by abramdemski 11:13

1
“The Sorry State of AI X-Risk Advocacy, and Thoughts on Doing Better” by Thane Ruthenis 12:31

1
“Power Lies Trembling: a three-book review” by Richard_Ngo 27:11

1
“Emergent Misalignment: Narrow finetuning can produce broadly misaligned LLMs” by Jan Betley, Owain_Evans 7:58

1
“The Paris AI Anti-Safety Summit” by Zvi 42:06

1
“Eliezer’s Lost Alignment Articles / The Arbital Sequence” by Ruby 2:37

1
“Arbital has been imported to LessWrong” by RobertM, jimrandomh, Ben Pace, Ruby 8:52

1
“How to Make Superbabies” by GeneSmith, kman 1:08:04

1
“A computational no-coincidence principle” by Eric Neyman 13:28

1
“A History of the Future, 2025-2040” by L Rudolf L 2:22:38

1
“It’s been ten years. I propose HPMOR Anniversary Parties.” by Screwtape 1:54

1
“Some articles in ‘International Security’ that I enjoyed” by Buck 7:56

1
“The Failed Strategy of Artificial Intelligence Doomers” by Ben Pace 8:39

1
“Murder plots are infohazards” by Chris Monteiro 3:58

1
“Why Did Elon Musk Just Offer to Buy Control of OpenAI for $100 Billion?” by garrison 11:41

Οδηγός γρήγορης αναφοράς