TLDR AI 2025-10-23
Headlines & Launches
Meta lays off 600 from ‘bloated’ AI unit as Wang cements leadership (3 minute read)
Meta will lay off roughly 600 employees to reduce layers and operate more nimbly. The cuts will impact workers across Meta’s AI infrastructure units, the Fundamental Artificial Intelligence Research unit, and other product-related positions. They will not impact employees within TBD Labs. Meta will pay 16 weeks of severance plus two weeks for every completed year of service, minus the notice period.
Why Cohere’s ex-AI research lead is betting against the scaling race (5 minute read)
AI labs are racing to build giant data centers on the belief that adding more compute to existing AI training methods will eventually yield superintelligent systems. A growing group of AI researchers argues that scaling large language models may be reaching its limits and that other breakthroughs will be needed to improve AI performance. Cohere’s former VP of AI Research has launched a new startup, Adaption Labs, to build thinking machines that adapt and continuously learn. If the startup is right about the limits of scaling, the implications could be huge: billions of dollars have already been invested in scaling.
Snapchat makes its first open prompt AI Lens available for free in the US (2 minute read)
Snapchat’s new “Imagine Lens” AI, initially exclusive to paid users, is now free in the US, allowing users to edit or generate Snaps by inputting custom prompts. This expansion comes amid competition from Meta and OpenAI’s advanced AI video features. Snap aims to attract users by offering limited free AI-generated images with plans to expand access to other countries.
Deep Dives & Analysis
Thoughts on the AI buildout (23 minute read)
OpenAI’s Sam Altman wants to create a factory that can produce a gigawatt of new AI infrastructure every week. To make this vision happen would require a lot of work. This article looks at whether the vision is physically feasible and what it could mean for different energy sources, upstream CapEx, and the US vs China competition.
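The scale of that ambition can be sanity-checked with back-of-envelope arithmetic. A minimal sketch, assuming the common rough figure that a large nuclear reactor outputs on the order of 1 GW (an assumption, not a number from the article):

```python
# Back-of-envelope check on "a gigawatt of new AI infrastructure every week".
# ASSUMPTION: a large nuclear reactor outputs roughly 1 GW (rough public figure).
GW_PER_WEEK = 1
WEEKS_PER_YEAR = 52

annual_gw = GW_PER_WEEK * WEEKS_PER_YEAR      # new capacity added per year
reactors_equivalent = annual_gw / 1.0         # ~one large reactor per gigawatt

print(f"{annual_gw} GW/year, roughly {reactors_equivalent:.0f} large reactors' worth")
```

At that pace, a single year of buildout would rival the output of dozens of large power plants, which is why the article examines feasibility across energy sources and upstream CapEx.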
How Well Does RL Scale? (14 minute read)
RL training for LLMs scales poorly; most gains come from allowing LLMs to productively use longer chains of thought. This may be evidence that compute scaling will be less effective for AI progress than previously thought, which could lengthen timelines and affect strategies for AI governance and safety.
Smuggled Intelligence (6 minute read)
GPT-5 Pro has solved complex problems in abstract algebra and aided in quantum computing research, showcasing AI’s growing capability to perform expert-level tasks. Creating these benchmarks involves extensive human input, highlighting the ongoing need for human oversight in AI applications.
Engineering & Research
Helion (GitHub Repo)
Helion is a Python-embedded domain-specific language (DSL) for authoring machine learning kernels that compiles down to Triton. It makes it easier to write correct and efficient kernels while enabling more automation in the autotuning process. Helion combines a familiar, high-level PyTorch-like syntax with a powerful ahead-of-time autotuning engine to provide a unique balance of developer productivity, fine-grained control, and performance portability. It is currently in beta.
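To illustrate the tiled-kernel pattern that DSLs like Helion abstract away, here is a minimal pure-Python sketch. This is not Helion's actual API (real Helion kernels use its own tiling constructs and compile to Triton for GPU execution); it only shows the tile-then-work-within-tile structure:

```python
# Conceptual sketch of a tiled elementwise add -- NOT Helion's API.
# Kernel DSLs let you write this high-level structure while the compiler
# and autotuner pick tile sizes and generate efficient GPU code.
def tiled_add(x, y, tile_size=4):
    out = [0] * len(x)
    for start in range(0, len(x), tile_size):   # iterate over tiles
        end = min(start + tile_size, len(x))
        for i in range(start, end):             # work within one tile
            out[i] = x[i] + y[i]
    return out
```

In a real kernel DSL, the tile size would be a tunable parameter that the autotuning engine searches over rather than a hardcoded default.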
World Models for Embodied Agents (3 minute read)
World-In-World introduces the first open benchmark platform for evaluating world models in closed-loop environments where agents actively interact with their surroundings. It shifts focus from visual fidelity to task performance.
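The closed-loop idea can be sketched generically. The skeleton below is hypothetical (not World-In-World's actual interface): the agent's actions feed back into the world model, and task success, not frame quality, is what gets scored:

```python
# Hypothetical closed-loop evaluation skeleton (not World-In-World's API).
# The agent observes, acts, and the world model returns the next observation;
# the metric is whether the task succeeds, not how realistic frames look.
def closed_loop_eval(agent, world_model, initial_obs, max_steps=10):
    obs, success = initial_obs, False
    for _ in range(max_steps):
        action = agent(obs)                    # agent picks an action
        obs, done, success = world_model(obs, action)  # world responds
        if done:
            break
    return success
```

This contrasts with open-loop evaluation, where a model's predicted rollouts are scored against ground truth without the agent's actions ever influencing what happens next.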
Lightweight Memory for LLM Agents (GitHub Repo)
LightMem is a streamlined memory management system for large language models that offers tools for storing, retrieving, and updating long-term memory in AI agents with minimal overhead.
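In generic terms, such a system exposes store/retrieve/update operations. The sketch below is a hypothetical minimal version, not LightMem's actual API; a real system would use embedding-based retrieval rather than the naive keyword match shown here:

```python
# Hypothetical minimal long-term memory store for an agent -- NOT LightMem's API.
from dataclasses import dataclass, field


@dataclass
class MemoryStore:
    entries: dict = field(default_factory=dict)

    def store(self, key: str, text: str) -> None:
        """Persist a new memory under a key."""
        self.entries[key] = text

    def retrieve(self, query: str) -> list:
        """Return memories matching the query (keyword match stands in
        for the vector similarity search a real system would use)."""
        return [t for t in self.entries.values() if query.lower() in t.lower()]

    def update(self, key: str, text: str) -> None:
        """Overwrite an existing memory, e.g. when a fact changes."""
        self.entries[key] = text
```

The "minimal overhead" claim in such systems typically comes from keeping this layer thin: a small index plus cheap retrieval, rather than a heavyweight database between the agent and its memories.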
Miscellaneous
Reddit sues Perplexity for allegedly ripping its content to feed AI (3 minute read)
Reddit is suing Perplexity, SerpApi, Oxylabs, and AWMProxy to stop them from scraping its data, claiming Perplexity will apparently do anything to get Reddit’s data except enter into an agreement with Reddit directly. Reddit sent Perplexity a cease-and-desist letter in May last year demanding it stop scraping Reddit data, but Perplexity claimed it didn’t use Reddit content. That claim was undercut when content posted to Reddit surfaced in Perplexity’s output within hours of being posted.
This chart is so insane it looks fake (1 minute read)
ChatGPT’s one-month retention has skyrocketed from under 60% two years ago to an unprecedented ~90%. YouTube was the previous best-in-class with ~85%. ChatGPT’s six-month retention is trending to ~80%.
Quick Links
Statement on Superintelligence (5 minute read)
Over 20,000 signatories, including AI pioneers Geoffrey Hinton and Yoshua Bengio, Apple co-founder Steve Wozniak, Richard Branson, Prince Harry, Steve Bannon, Glenn Beck, five Nobel laureates, and Pope Francis’s AI advisor, called for a prohibition on developing superintelligence until it is proven safe and controllable with strong public buy-in.
ChatGPT on WhatsApp Ending (1 minute read)
OpenAI has announced the end of WhatsApp access for ChatGPT users and encouraged linking to a ChatGPT account to continue conversations elsewhere.
Anthropic Is Catching Up With OpenAI (3 minute read)
Anthropic quietly went from $1B to nearly $7B in run-rate revenue this year, a six-fold jump that puts it within striking distance of OpenAI.
Indirect prompt injection is a systematic challenge that the entire category of AI-powered browsers faces.