AI Intelligence Report
Top Stories
Anthropic doubles Claude Code’s rate limits after picking up SpaceX’s Memphis supercluster
The headline AI deal of the week: Anthropic has effectively rented Elon Musk’s “Colossus 1” data centre in Memphis, adding more than 300 megawatts of compute, roughly the equivalent of 220,000 Nvidia GPUs coming online over the next month. The immediate, visible result is that paying Claude Code users are seeing their rate limits doubled. The deeper signal is that the major AI labs are now fighting an arms race for raw electricity and chips, not just talent. For comms purposes, expect “AI’s energy footprint” to escalate as a story over the coming weeks.
OpenAI ships GPT-Realtime-2, plus translation and transcription models: voice agents finally sound human
OpenAI released three new voice models on May 7. GPT-Realtime-2 is the first voice model with GPT-5-class reasoning, handling audio input and output in a single loop (rather than transcribing, thinking, then speaking). Two companion models, GPT-Realtime-Translate (live translation across 70+ input languages and 13 output languages) and GPT-Realtime-Whisper (live transcription), round out the launch. The 128K-token context window is four times the previous version’s, which means longer real conversations are now feasible. This is the model class that powers call-centre and live-translation use cases, directly relevant for any global comms team coordinating across markets.
Bloomberg: Tencent and Alibaba face slowing growth as AI capex doubles ahead of May 13 earnings
Bloomberg Intelligence is forecasting Tencent’s full-year earnings growth to slow into the low-teen percentage range as AI investment roughly doubles. Tencent reports Q1 results on May 13, with sell-side consensus pointing to revenue near RMB 200.1 billion (up about 11.1% year-on-year) and non-IFRS net profit of about RMB 68.1 billion. The narrative arc to watch is whether the recent share-price rebound can survive a closer look at AI spend versus AI return. Worth pre-positioning talking points on monetisation pathways for Hunyuan, WeChat agents, and cloud.
U.S. government adds Microsoft, Google DeepMind and xAI to its pre-release model testing program
The Center for AI Standards and Innovation (CAISI) signed agreements this week with Microsoft, Google DeepMind and xAI giving the U.S. government the right to evaluate frontier AI models before they ship publicly. This builds on prior CAISI deals with OpenAI and Anthropic from 2024, both of which have now been renegotiated under updated White House directives. National Economic Council director Kevin Hassett separately compared the emerging vetting regime to FDA drug approval. For multinational tech communicators, this is the moment when “voluntary” model testing starts to look mandatory, and the precedent will travel.
Anthropic commits $200 billion to Google Cloud and chips over five years
Anthropic disclosed a five-year, $200-billion commitment to Google Cloud and Google’s TPU chips. In the same disclosure window, the company said run-rate revenue has reached roughly $30 billion, up from about $9 billion at the end of 2025: more than a tripling in roughly five months. The combination of the SpaceX/Colossus deal (story above) and this Google commitment positions Anthropic as a multi-cloud, multi-chip operator, hedging across Nvidia GPUs, Google TPUs and now also xAI infrastructure.
AI News Roundup
Substack Highlights
Inoreader AI Folder
GPT-Realtime-2 = voice agents finally don’t suck?
The Neuron’s lead story today is GPT-Realtime-2; they argue this is the model that makes voice agents production-ready for call centres for the first time. Plus a side note that “Anthropic figured out how to read Claude’s mind,” referring to Anthropic’s Natural Language Autoencoder interpretability work.
OpenAI may not be able to IPO in 2026
David Gerard’s Pivot To AI argues OpenAI’s CFO Sarah Friar has been quietly walking back the 2026 IPO timeline. The trouble: going public means publishing your audited financials, and OpenAI’s financials remain “hilariously terrible” by Friar’s own framing. A useful counterweight to the bullish coverage.
Google Chrome force-installs a 4-gigabyte AI model โ and how to get rid of it
Practical, slightly snarky walkthrough of why Chrome is now silently downloading “Gemini Nano” (a 4 GB on-device LLM) for every user, and the steps to opt out. The story is starting to get pickup beyond the AI-skeptic press.
AI just found 15 years’ worth of bugs in Firefox. In weeks!
The Automated newsletter highlights how AI-assisted security tooling, including Anthropic’s Mythos preview and similar work from Microsoft and Google, is uncovering long-dormant Firefox vulnerabilities at a pace human researchers couldn’t match. Reinforces the Glasswing narrative: AI cyber capabilities are usable on both offence and defence.
Anthropic adds ‘dreaming’ feature and uncaps agent limits
AI Breakfast reports on two Anthropic moves: an experimental “dreaming” feature (where Claude consolidates and reorganises long-context memories during idle time) and the rate-limit doubling on Claude Code that followed the SpaceX/Colossus deal.
Anthropic just leased Elon’s Memphis supercluster
Not a Bot’s framing of the same Anthropic-Colossus story emphasises the “depth gap”: they argue OpenAI’s compute lead over Anthropic widens to 3.5x even after this deal. Includes a side note that Wendy’s keeps scaling its AI drive-through deployment while Taco Bell has paused theirs.
Lore Issue #183: Anthropic Taps SpaceX for Massive Compute
Lore Brief’s daily summary leads on the same Anthropic-SpaceX story, plus three sidebars: Google launches “Fitbit Air,” GPT-5.5 Instant reaches all ChatGPT users, and ChatGPT now lives inside Microsoft Excel and Google Sheets natively.
OpenAI wants AI to talk like a human
AI Valley’s analysis of GPT-Realtime-2 frames it as the moment voice AI moves from “annoying script reader” to “credible call-centre operator.” A second story argues a quiet shift is underway in research labs from chatbots toward “world models”, AI systems that simulate physical environments, citing recent work at DeepMind and FAIR.
The SpaceXAI Era
An AI Valley think-piece arguing we are entering a phase where Musk’s xAI/SpaceX compute infrastructure is becoming the shared backbone for multiple frontier labs, with Anthropic’s Colossus deal as exhibit A. Side note: the U.S. government wants to inspect frontier models before release.
The Prompt That Makes AI Less Stupid
Bagel Bots shares a prompt template that forces an LLM to critique, rewrite and sharpen its own answer in three explicit passes, a small workflow trick worth keeping in your prompt library, especially for first drafts of long-form comms copy.
GPT-5.5 remembers (mostly)
Matt Wolfe’s Future Tools roundup highlights GPT-5.5’s improved (but still imperfect) long-term memory, plus a sidebar that Samsung has joined the “trillion-dollar club” off the back of HBM3E memory demand from AI data centres.
Gemini’s theoretical estimates vs actual benchmarks when picking GCE CPU platforms
Kalev Leetaru runs an experiment in which Gemini selects CPU platforms for Google Compute Engine VMs, then benchmarks the chosen VMs against Gemini’s own performance predictions. Useful as a concrete example of where current LLMs over- or under-estimate cost/performance trade-offs in real infrastructure choices.
Advancing youth safety and wellbeing in EMEA
OpenAI’s “European Youth Safety Blueprint” plus EMEA Youth & Wellbeing Grants: pre-positioning ahead of expected EU regulatory pressure on AI products used by minors. Useful comparison material if Tencent develops its own youth-safety messaging.
AI Workflows & Tool Watch
Claude Code 2.1.126 fixes a long list of MCP server reliability bugs
Anthropic’s May 1 Claude Code update quietly addressed several issues that were silently breaking MCP servers, including a memory leak that could push Claude Code’s RAM use past 10 GB when an MCP server wrote non-protocol data to stdout, and a long-running bug where MCP servers connected but failed silently with zero tools. /mcp now shows the tool count for each connected server and flags servers that connected with zero tools. If your MCP setup has been flaky for the last few weeks, this is the release that fixes it.
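The stdout detail is worth internalising if you run your own MCP servers: over stdio, stdout is the protocol channel and everything else belongs on stderr. A minimal Python sketch of that discipline (illustrative only, not Anthropic’s or the MCP SDK’s actual code):

```python
# Illustrative sketch: an MCP server speaking JSON-RPC over stdio must keep
# stdout reserved for protocol frames. A stray print() to stdout is exactly
# the kind of non-protocol data that triggered the bug described above.
import json
import sys

def frame(payload: dict) -> str:
    """Serialise a JSON-RPC message as one newline-terminated line."""
    return json.dumps(payload) + "\n"

def send(payload: dict) -> None:
    """Write a protocol frame to stdout, the channel the client parses."""
    sys.stdout.write(frame(payload))
    sys.stdout.flush()

def log(message: str) -> None:
    """Diagnostics go to stderr so they never interleave with protocol frames."""
    print(message, file=sys.stderr)

log("server starting")  # safe: stderr only
send({"jsonrpc": "2.0", "method": "notifications/initialized"})
# A bare print("debug") here would inject non-JSON text into stdout and can
# wedge clients that parse the stream line by line.
```

The same rule applies in any language: route logging frameworks to stderr (or a file) before wiring a process up as a stdio MCP server.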
Perplexity Comet now ships with Claude Sonnet 4.6 by default for Pro, Opus 4.6 for Max
Comet, Perplexity’s agentic browser, now has a model picker. Pro users get Claude Sonnet 4.6 by default; Max users get Claude Opus 4.6, with Gemini 3.1 Pro available as an alternative. iOS pre-orders are open in the App Store. The “Personal Computer on Mac” feature now lets Comet read and edit local files, do voice orchestration, and browse alongside the desktop app, closer to the agentic-assistant pattern you’ve been tracking.
Reddit MCP server: 36 tools behind Reddit OAuth, three permission tiers
MCPBundles published a hosted Reddit MCP provider that exposes 36+ Reddit API operations as Claude/Claude Code tools, with three explicit permission tiers (read-only, read-write, account-level). For media-monitoring or community-listening workflows, this is the cleanest way to give Claude scoped Reddit access without writing your own scraper. The standalone “reddit-mcp-buddy” project on GitHub is a lighter alternative for one-off subreddit analysis.
n8n + Obsidian: an overnight agent that sorts fleeting notes
An Obsidian community member published a working n8n flow that runs locally overnight, classifies the day’s “fleeting” notes into permanent topic folders using a self-hosted LLM, and drops a daily summary back into Obsidian. The post includes the full n8n blueprint as a downloadable JSON. Worth borrowing the pattern even if you don’t run it overnight; the same structure works for Drafts → Things 3, or DEVONthink → Apple Notes.
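The pattern is easy to replicate outside n8n. A minimal Python sketch of the sort-and-move step, with a hypothetical `classify(text, topics)` callable standing in for the self-hosted LLM (folder names and topics are examples, not from the published flow):

```python
# Sketch of the overnight note-sorting step. `classify` is a placeholder for
# whatever LLM call you use; it just needs to return one of TOPICS.
from pathlib import Path

TOPICS = ["AI", "Comms", "Personal"]

def sort_fleeting_notes(vault: Path, classify) -> list[str]:
    """Move each note in vault/Fleeting into vault/<topic>/ and return a summary."""
    moved = []
    for note in sorted((vault / "Fleeting").glob("*.md")):
        topic = classify(note.read_text(), TOPICS)
        (vault / topic).mkdir(exist_ok=True)
        note.rename(vault / topic / note.name)
        moved.append(f"{note.name} -> {topic}")
    return moved
```

The returned summary list is what you would write back into a daily note, mirroring what the n8n flow drops into Obsidian.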
“Obsidian-as-podcast-feed” via n8n: Have your notes read aloud during commutes
An n8n template now stitches Obsidian → webhook → OpenAI text-to-speech → private podcast feed: every long note you tag becomes an episode you can listen to in your podcast app. Useful for reviewing long internal briefs or PR memos on a flight. The template is one-click installable from the n8n marketplace.
The “self-critique” prompt pattern (via Bagel Bots)
A simple but effective three-pass prompt: (1) write a first draft, (2) critique your own draft against an explicit checklist, (3) rewrite to address the critiques. Bagel Bots packaged it as a copy-paste template; particularly useful for first drafts of statements, FAQs, and internal memos where tone and accuracy both matter.
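The mechanics are simple enough to sketch, assuming only a generic `ask(prompt) -> str` wrapper around your LLM of choice (the function name and checklist below are illustrative, not Bagel Bots’ actual template):

```python
# Three-pass self-critique: draft, critique against a checklist, rewrite.
# `ask` is any callable that sends a prompt to an LLM and returns its reply.

CHECKLIST = """\
- Key message appears in the first sentence
- No unverified claims or superlatives
- Tone: factual and calm"""

def self_critique(ask, brief: str) -> str:
    draft = ask(f"Write a first draft of: {brief}")
    critique = ask(
        "Critique this draft against the checklist.\n"
        f"Checklist:\n{CHECKLIST}\n\nDraft:\n{draft}"
    )
    return ask(
        "Rewrite the draft so every critique point is addressed.\n"
        f"Draft:\n{draft}\n\nCritique:\n{critique}"
    )
```

Swapping in a comms-specific checklist (approved messaging, no forward-looking statements) is where the pattern earns its keep for statements and FAQs.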
OpenAI’s safe-Codex playbook: a useful template for governing internal AI agents
OpenAI’s new “Running Codex safely” doc lays out the four pillars they use internally (sandboxing, approval gates, network egress policies, and “agent-native telemetry”) to let AI coding agents operate without leaking secrets or running unbounded shell commands. Even if you never deploy Codex, the framework is a clean blueprint for any internal comms about how Tencent should govern AI agents on staff laptops or in production.
Live AI workflows powered by Reddit + Claude MCP: an ad-ops case study
A case study circulating on r/ClaudeAI documents an agency that compressed weekly Reddit Ads reporting from four hours to thirty minutes by giving Claude direct MCP access to Reddit Ads + their data warehouse, then asking natural-language questions like “why did our CPA spike on Thursday in r/SaaS?” The transferable insight for global comms: the same “Claude + MCP + first-party data” pattern works for press-coverage analysis, crisis tracking, and weekly executive readouts.
For podcast production: GPT-Realtime-Translate at $0.034/minute makes live multilingual feeds plausible
The new GPT-Realtime-Translate model translates speech across 70+ input languages into 13 output languages in real time, at $0.034 per minute (about $2/hour). For your podcast production workflow with Ecamm, this opens up live English-to-Mandarin (or Mandarin-to-English) translation as an export track, significantly cheaper than human interpreters and faster than post-production subtitling. Worth a small pilot before locking into next quarter’s content plans.
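The per-minute rate is easy to budget against. A quick sanity check on the quoted figures (the rate is from the report above; the episode lengths are examples):

```python
# Back-of-envelope costing for GPT-Realtime-Translate at the quoted rate.
RATE_PER_MIN = 0.034  # USD per minute, as reported

def translation_cost(minutes: float) -> float:
    """Cost in USD for a live-translated audio stream of the given length."""
    return round(minutes * RATE_PER_MIN, 2)

hourly = translation_cost(60)       # 2.04 USD, the "about $2/hour" figure
season = translation_cost(12 * 45)  # e.g. twelve 45-minute episodes: 18.36 USD
```

At those numbers, a full quarter of weekly hour-long episodes translates for under $30, which is why a pilot is low-risk.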
Tencent Mentions
Bloomberg: AI capex doubling could squeeze Tencent earnings into the low-teens
Bloomberg Intelligence projects full-year earnings growth slowing as AI investment doubles. Earnings drop May 13. Sell-side consensus: revenue ≈ RMB 200.1B (+11.1% YoY), non-IFRS net profit ≈ RMB 68.1B (+11.1% YoY).
Cybernews: Hunyuan Hy3-preview launches, smaller and stronger than HY 2.0
The first major model release since the recruitment of Yao Shunyu from OpenAI. Hy3-preview is 295B parameters (down from 400B+ in HY 2.0) but stronger on complex reasoning and code. Notably positioned as “preview”, with the full release pending.
Tencent and Alibaba in talks to invest in DeepSeek at a $20B+ valuation
Reported via WSJ and aggregators. DeepSeek’s first outside fundraise. Worth pre-positioning a “why Tencent is investing in adjacent labs as well as building Hunyuan” line for any reactive press queries.
DimSum Daily: Tencent rebound masks AI doubts ahead of May 13 results
Notes the recent share-price recovery is being driven by AI sentiment but warns the May 13 print could puncture the optimism if AI capex commentary lands badly with analysts. Worth a heads-up note to internal IR / corporate comms partners.
SCMP: Tencent pledges new wave of AI investment, betting on WeChat agents
SCMP coverage from earlier in the year continues to circulate as the dominant English-language framing of Tencent’s AI strategy: “AI wave lifts all boats,” with WeChat agents positioned as the consumer flagship. Useful baseline narrative if a reporter pings about Tencent’s agent strategy.
TrendForce: Tencent Cloud raises AI compute, TKE and EMR list prices ~5% effective May 9
Tencent Cloud announced a roughly 5% price hike on AI compute, Tencent Kubernetes Engine, and Elastic MapReduce, effective today (May 9). Joins similar moves by Alibaba, Baidu and Zhipu. Likely to surface in B2B trade press over the weekend; consider a reactive line on cost-pass-through versus margin protection.
36Kr: ByteDance’s hardware push reportedly making Alibaba and Tencent “anxious”
36Kr argues ByteDance is pulling ahead in custom AI silicon and dedicated training infrastructure, putting both Alibaba and Tencent in a defensive posture. The framing is sharp; worth being aware of even if you don’t engage publicly with it.