About Experience Blog Newsletter Publications

About Experience Blog Newsletter Publications

Jun 08 - Jun 14, 2026

AI Weekly: US Gov Pulls Fable 5, Anthropic's Biggest Launch Yet

Models & Releases 3 People & Business 3 Policy & Ethics 3 Products & Hardware 2 Research & Resources 1

Models & Releases

3 stories

Claude Fable 5 + Mythos 5: State-of-the-Art Launch

Anthropic launched Fable 5 (general release) and Mythos 5 (trusted access only) on June 9 — both the same underlying Mythos-class model, priced at $10/M input and $50/M output, less than half the Mythos Preview rate.
Capabilities set new records across nearly every major benchmark: Stripe compressed two months of Ruby codebase migration into a single day, and the model beat all others on Hebbia Finance Benchmark, CursorBench, and ViBench.
Mythos 5 accelerated protein design ~10x and conducted autonomous genomics research for a week, producing a model that outperformed a Science-published paper at 100× smaller size — though both models were suspended by government order just four days after launch.

Apple Intelligence 3.0: AFM 3 Family at WWDC

Apple unveiled AFM 3 at WWDC 2026 — a family of five foundation models co-built with Google, including an on-device 20B sparse model (AFM 3 Core Advanced) that activates only 1–4B params and runs on 16GB Apple silicon via flash storage.
Server-side models on Private Cloud Compute include AFM 3 Cloud Pro for agentic and complex reasoning tasks, running on NVIDIA GPUs in Google Cloud; human preference evals show AFM 3 Cloud preferred at 64.7% vs 8.7% baseline.
Consumer highlights include an entirely new Siri with deep personal context, a Foundation Models framework for third-party developers (free, offline, on-device), and HomeKit Secure Video AI search — details at the Apple Intelligence product page.

Xiaomi MiMo-V2.5-Pro Breaks 1000 TPS on 1T Model

Xiaomi and TileRT achieved 1000–1200 tokens/sec decode on a 1-trillion-parameter MoE model running on a standard 8-GPU node — no wafer-scale (Cerebras) or SRAM-heavy (Groq) hardware required.
A limited-access API launched June 9–23, priced at 3× standard rates but delivering roughly 10× the speed of conventional inference.
The result challenges the assumption that extreme token throughput requires exotic hardware, potentially opening high-speed 1T-scale inference to commodity data center deployments.

People & Business

3 stories

OpenAI Files Confidential S-1 with SEC

OpenAI submitted a confidential draft S-1 to the SEC on June 8, signaling a potential IPO path — though the company notes it hasn’t decided on timing and may remain private longer.
The filing keeps the IPO option open and continues the trajectory established by the Anthropic confidential S-1 filing from last week, suggesting both frontier labs are preparing for eventual public listings.
If it proceeds, an OpenAI IPO would rank among the largest in history given its current valuation trajectory following the $122B raise in March 2026.

OpenAI Acquires Ona for Persistent Codex Execution

OpenAI acquired Ona — a cloud development environment platform with 2M developers — to give Codex persistent, secure cloud execution that continues working even after a user’s laptop is closed.
Codex now has 5M+ weekly active users, up 400% year-over-year, making the acquisition a significant infrastructure bet on agentic coding at scale.
The deal extends OpenAI’s coding agent infrastructure, complementing the Symphony orchestration spec and Codex GA on AWS announced last week.

Anthropic Launches Claude Corps National Fellowship

Anthropic launched Claude Corps on June 11 — a national fellowship program for early-career people passionate about extending AI benefits to communities across America.
The program represents Anthropic’s first structured workforce and civic engagement initiative, distinct from its research and enterprise focus.
The launch comes the same week as the US government’s suspension of Fable 5 and Mythos 5, underscoring the contrast between Anthropic’s public-benefit mission and its regulatory battles.

Policy & Ethics

3 stories

US Government Forces Anthropic to Suspend Fable 5 and Mythos 5

On June 13 at 5:21pm ET, the US government issued an export control directive requiring Anthropic to suspend all access to Fable 5 and Mythos 5 for any foreign national — including Anthropic employees — effectively disabling both models globally.
The government cited a ’narrow, non-universal jailbreak’ consisting of prompting the model to read a codebase and fix flaws; Anthropic publicly disagreed, noting the capability is ‘widely available from other models including GPT-5.5’ and is used daily by defenders, with no universal jailbreak found in thousands of red-teaming hours pre-launch.
Anthropic warned that applying this standard industry-wide would ’essentially halt all new model deployments for all frontier providers’ — a striking turn given that Anthropic’s own policy paper from June 10 had called for government authority to block unsafe deployments, just under fairer principles.

Anthropic Calls for Government Authority to Block Dangerous AI

On June 10 — three days before the government acted — Anthropic published a sweeping policy proposal calling for legal authority for governments to block dangerous AI deployments, covering models trained with >10^25 FLOPs at companies with >$500M AI revenue or >$1B R&D spend.
The proposal covers bio risk, cyber risk, and catastrophic harm categories, and directly informed the framework the government then invoked against Anthropic’s own models.
Anthropic explicitly stated the June 13 government action did not follow ’transparent, fair, clear, technically-grounded’ principles — illustrating the difficult position of a lab that advocated for oversight and was then subject to it under disputed criteria.

Fable 5 Technical Report: Restrictions on Competing LLM Dev

Anthropic’s Fable 5 technical report (page 13) confirms that the model has intentional restrictions when asked to assist in developing competing large language models — a disclosure that generated significant discussion on r/LocalLLaMA this week.
The revelation fueled debate about AI sovereignty and the practical necessity of locally-runnable open models, with many users citing it as justification for preferring open-weight alternatives.
The restriction sits alongside the government-ordered suspension of Fable 5 as a second capability constraint imposed on the same model in the same week, from opposite directions.

Products & Hardware

2 stories

DiffusionGemma: 1000+ Tokens/Sec via Parallel Block Decoding

Google released DiffusionGemma (Apache 2.0, experimental) — a 26B MoE (3.8B active) diffusion-based text model that generates 256-token blocks simultaneously rather than sequentially, achieving 1000+ tokens/sec on H100 and 700+ on RTX 5090 in 18GB VRAM quantized.
The bidirectional attention architecture enables capabilities autoregressive models structurally struggle with: code infilling, amino acid sequence generation, math graphs, and closing complex markdown — a fine-tuned version also solves Sudoku correctly.
Ecosystem integrations include MLX, vLLM (Red Hat), HuggingFace Transformers, Unsloth, and NVIDIA NeMo, with llama.cpp support coming — best suited for speed-critical local and interactive workflows rather than cloud high-QPS serving.

OpenAI Academy Launches AI-at-Work Course Library

OpenAI Academy released a new library of courses on June 12 focused on workplace AI adoption, targeting professionals across roles and industries.
The launch is part of OpenAI’s broader enterprise and skills push, complementing the Codex for Every Role plugins and Codex Sites announced last week.
As AI tool adoption becomes a competitive differentiator, structured upskilling programs from frontier labs represent a new front in the enterprise race alongside Microsoft Copilot and Google Workspace AI.

Research & Resources

1 stories

Anthropic Research: Claude Agents Conducting Autonomous Biology

Anthropic published research on June 8 documenting Claude’s capabilities as an autonomous biology agent — conducting genomics research over multi-day sessions, generating novel molecular hypotheses, and achieving protein design outcomes that matched or beat skilled human operators on 9 of 14 targets.
The most striking result: an autonomous genomics run produced a model that outperformed a Science-published paper at 100× smaller parameter size — a capability highlighted in both the Fable 5 launch and the Mythos 5 trusted access biology program.
The research represents a significant milestone for agentic AI in life sciences, though its implications are now entangled with the government suspension of Fable 5 and Mythos 5 over concerns about AI-assisted technical capabilities.

Previous Issue AI Weekly: Anthropic Files for IPO, Gemma 4 Runs on 16GB Laptops Next Issue AI Weekly: GPT-5.6 Goes Live, Meta Pivots to Closed API, Fable 5 Costs Extra

Deepak Baby

Senior Data Scientist at KBC Bank & Verzekering

Home About Experience Blog Newsletter Publications

© 2026 Deepak Baby

Made with ♥ using Hugo