AI Intelligence Report

Today’s Signal

Top Stories

Google Blog · I/O 2026

Google goes “all-in” on AI agents at I/O 2026 — Gemini Omni, Spark, and a new Flash model

Google’s I/O developer conference on May 19-20 was almost entirely an AI show. The headline launch was Gemini Spark, a personal AI agent that lives in the Gemini app, runs 24/7 on Google’s own servers (so you don’t have to keep your laptop open), and can take action across connected apps on your behalf. Google also debuted Gemini Omni, a new model family that can generate output in any modality — starting with video and expanding to image and text. The lighter-weight Gemini 3.5 Flash became generally available, offering roughly the intelligence of frontier models at one-third to one-half the price. Together, the moves push agents into Search, Android, YouTube (“Ask YouTube”), Workspace, and shopping — Google’s clearest answer yet to OpenAI and Anthropic.

TechCrunch · Anthropic

Andrej Karpathy joins Anthropic to lead recursive AI research

One of the most respected technical minds in the field — OpenAI co-founder, former Tesla Autopilot lead, and Eureka Labs founder Andrej Karpathy — joined Anthropic this week. He’ll work under pre-training lead Nick Joseph on a team focused on using Claude itself to accelerate Anthropic’s pre-training research. In plain terms, the goal is “recursive self-improvement”: AI that helps train its own successors with progressively less human input. The hire is being read as a major talent coup for Anthropic ahead of its expected October IPO.

CNBC · OpenAI

OpenAI is preparing to confidentially file its IPO this week, targeting a September listing

OpenAI is finalizing a draft S-1 prospectus with Goldman Sachs and Morgan Stanley and could submit the confidential filing to the SEC as early as today (Friday, May 22), with a September listing in view. Private-market investors currently value the company between roughly $850 billion and $1 trillion, which would make it one of the largest IPOs in history. The filing follows a federal jury’s rejection of Elon Musk’s lawsuit against OpenAI and Sam Altman on May 19 — a verdict that cleared a major legal overhang. Anthropic is reportedly targeting an October listing at a $900 billion valuation, setting up back-to-back AI mega-IPOs this fall.

NPR · Meta

Meta cuts 8,000 jobs and links some firings to its own employee-surveillance AI program

Meta began notifying roughly 8,000 employees they are being laid off on May 20, framing the cuts as funding for its AI push. The story has a sharper edge: in April, Reuters reported on Meta’s internal “Model Capability Initiative” (MCI), which records mouse movements, keystrokes, dropdown navigation, and periodic screenshots of employee workstations to train AI agents that can perform white-collar tasks autonomously. Critics note that some employees who contributed behavioural data to MCI are now among those being cut — a pattern likely to draw scrutiny across the industry. Markets reacted negatively, with several of the major “AI layoff” announcers seeing their stocks drop the week the cuts were disclosed.

Axios · OpenAI / xAI

Musk’s $150B lawsuit against OpenAI fails in Oakland federal court

An Oakland jury on May 19 rejected Elon Musk’s claims that OpenAI and CEO Sam Altman had defrauded him and breached founding agreements when the company transitioned to a for-profit structure. The damages sought reportedly exceeded $150 billion. The verdict closes one of the largest pieces of litigation hanging over the AI industry and removes the last major obstacle to OpenAI’s near-term IPO.

InfoQ · Anthropic

Anthropic ships self-hosted sandboxes and MCP tunnels for enterprise agents

At its “Code with Claude” event in London on May 19, Anthropic released two updates that matter for any large organization deploying AI agents. Self-hosted sandboxes (public beta) let tool execution run on the customer’s own infrastructure while the agent’s reasoning loop stays on Anthropic’s servers — useful for regulated industries. MCP tunnels (research preview) let Claude agents reach private internal databases without exposing them to the public internet. Together they answer the most common enterprise objections to letting AI agents touch production data.

CXO Digital Pulse · Tencent

Tencent unveils sweeping Hunyuan upgrade and agent-first strategy at Cloud summit (May 21)

At yesterday’s Tencent Cloud industry summit, the company introduced a broad upgrade to its Hunyuan ecosystem: the fast-thinking Hunyuan Turbo S, the deep-thinking Hunyuan T1, the visual reasoning model T1-Vision, a low-latency voice model Hunyuan Voice, plus Hunyuan Image 2.0, Hunyuan 3D 2.5, and Hunyuan-Game. Tencent also rebranded its knowledge engine as the Tencent Cloud Agent Development Platform (TCADP), formalizing a shift to an agent-first strategy. The summit followed Tencent’s Q1 results, where President Martin Lau confirmed AI spending will more than double in 2026 to over RMB 36 billion.

Axios · Policy

White House postpones AI executive order on voluntary model pre-release sharing

The White House on May 21 postponed an expected executive order that would have established a voluntary framework for AI companies to share frontier models with the U.S. government up to 90 days before public release. The pause is being read as the administration giving more ground to industry lobbying — and as a quiet acknowledgement that the original Biden-era voluntary commitments are losing force. No new timeline was given.

OpenAI

An OpenAI model disproved an 80-year-old conjecture in discrete geometry

OpenAI announced this week that one of its models solved the “unit distance problem,” disproving a major conjecture in discrete geometry that had stood since the 1940s. While narrow in scope, the result is being cited as one of the first cases where an AI system contributed a novel mathematical proof written up to the standard of a refereed paper. That is important context for the broader debate about how much AI can actually “reason.”

VentureBeat · Perplexity

Perplexity pushes “Computer” deeper into enterprise — native Mac app, GPT-5.5 orchestration, structured outputs

Perplexity continued the rapid evolution of its Computer product this week. Personal Computer on Mac (launched May 7) is now positioned as a “personal orchestrator” that can read and edit local files, drive native Mac apps, browse the web, and accept voice commands. GPT-5.5 is rolling out as the default orchestration model for Pro and Max users, with OpenAI’s GPT Image 2 powering image generation. Deep Research and Pro Search now produce presentations, spreadsheets, dashboards, and websites directly. For communications teams, this turns Perplexity into a credible alternative to ChatGPT and Claude for one-off research-to-deliverable workflows.

Around the Industry

AI News Roundup

Model Releases & Performance

Gemini 3.5 Flash now generally available — 4x faster than comparable frontier models, $1.50/$9 per 1M tokens, 1M-token context window; scored 76.2% on Terminal-Bench 2.1.

Google introduced Gemma 4 — latest series of open-weights models built specifically for advanced reasoning and agentic workflows.

Google research unveiled TurboQuant at ICLR 2026 — a new algorithm that significantly reduces the memory cost of running long-context LLMs (it shrinks the “KV cache,” a known bottleneck).
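To put the Gemini 3.5 Flash price quoted above in budget terms, here is a minimal Python sketch. It assumes the two figures ($1.50 and $9) are the input and output prices per 1M tokens, which is the usual convention but is not confirmed in the item. The function name and the token counts are hypothetical.

```python
def request_cost_usd(
    input_tokens: int,
    output_tokens: int,
    input_price_per_m: float = 1.50,   # assumption: $1.50 per 1M input tokens
    output_price_per_m: float = 9.00,  # assumption: $9 per 1M output tokens
) -> float:
    """Estimate the dollar cost of one request from its token counts."""
    total = input_tokens * input_price_per_m + output_tokens * output_price_per_m
    return total / 1_000_000


# Hypothetical long-document job: 200k tokens in, 20k tokens out.
example_cost = request_cost_usd(200_000, 20_000)
print(f"{example_cost:.2f}")  # prints 0.48
```

Under these assumptions, output tokens cost six times as much as input tokens, so long generated reports move the bill far more than long source documents do.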

Industry Moves & Money

OpenAI confidential S-1 filing expected as early as today — September listing targeted at ~$850B–$1T; Anthropic eyes October at ~$900B.

Andrej Karpathy joins Anthropic pre-training team — to lead recursive self-improvement research using Claude.

Tencent more than doubles AI spend to RMB 36B+ in 2026 — Martin Lau details investment plans on Q1 earnings call.

OpenAI–Dell partnership brings Codex to on-premise enterprise — for customers who can’t (or won’t) put sensitive code in the cloud.

Databricks ships GPT-5.5 inside enterprise agent workflows — after the model set a new state-of-the-art on OfficeQA Pro.

Policy, Legal & Regulation

Musk’s $150B OpenAI lawsuit rejected by Oakland jury — clears the path for OpenAI’s IPO.

White House postpones voluntary AI pre-release sharing executive order — no new timeline given.

OpenAI advances Content Credentials and SynthID watermarking — releases a new verification tool to help people identify AI-generated media.

Labour, Layoffs & Surveillance

Meta cuts 8,000 jobs as it pivots to AI — effective May 20, on top of earlier “MCI” employee-tracking program controversy.

Amazon engineers using “MeshClaw” bot to hit managers’ AI token quotas — internal pressure to consume tokens reportedly driving make-work AI usage.

GitHub Copilot AI token charges set to rise 10×–100× — Microsoft confronts the cost reality of running coding agents.

Partnerships & Public-Sector AI

OpenAI launches “OpenAI for Singapore” — multi-year national AI partnership for deployment, talent and public services.

OpenAI and Malta partner to give every citizen ChatGPT Plus — including AI literacy training.

OpenAI advances “Education for Countries” program — new partnerships and teacher training tools for AI in schools.

AdventHealth deploys ChatGPT for Healthcare across its network — clinical workflow automation case study.

Science & Research

OpenAI model disproves 80-year-old unit-distance conjecture — milestone for AI-assisted mathematics.

University of Michigan AI reads brain MRIs in seconds — trained on hundreds of thousands of scans to identify neurological conditions faster than human radiologists.

arXiv bans academic authors for submitting AI-generated “slop” papers — preprint server cracks down on synthetic research.

Consumer & Product

ChatGPT launches personal-finance feature — connect U.S. bank accounts for AI-grounded financial guidance (Pro only for now).

Google teases “intelligent eyewear” with Samsung, Qualcomm, Gentle Monster and Warby Parker — first audio glasses ship this fall.

From Your Inbox

Substack Highlights

From Your AI Feeds

Inoreader AI Folder

Amazon engineers use “MeshClaw” bot to hit managers’ AI token targets

Pivot To AI · May 21

David Gerard reports that Amazon is pushing AI hard across the company with an internal bot called MeshClaw, after managers were impressed by “OpenClaw.” Engineers are being measured on AI token consumption — leading to make-work usage. A cautionary tale on what happens when AI adoption becomes a top-down KPI.

AdventHealth advances whole-person care with OpenAI

OpenAI News · May 21

AdventHealth is rolling out ChatGPT for Healthcare to streamline workflows, reduce administrative load on clinicians, and return more time to patient care. Case study of a major U.S. hospital network going all-in on a single AI vendor.

Scaling GDELT For A New Era: Go, Daemon Proxies, Gemini Advice & Agentic Self-Healing

GDELT Official Blog · May 21

Kalev Leetaru walks through how the GDELT global news-monitoring project re-architected its Google Cloud infrastructure using AI advice from Gemini and agentic self-healing scripts. Useful read for anyone running media-monitoring at scale.

Karpathy joins Anthropic

Not a Bot · May 21

Newsletter framing of the Karpathy hire: Anthropic has now made “recursive AI research” an explicit org structure, with Karpathy running it. Reads it as the biggest signal yet that Anthropic’s IPO story is “we’ll improve faster than anyone else.”

LIVE NOW: Building AI Voice Agents w/ LiveKit’s Ben Cherry

The Neuron · May 21

Live podcast on how voice AI actually works under the hood, hosted by The Neuron with LiveKit’s Ben Cherry. Relevant for any team thinking about adding voice-driven channels (support callbacks, exec briefings, podcast Q&A) into their stack.

An OpenAI model has disproved a central conjecture in discrete geometry

OpenAI News · May 20

OpenAI announces that one of its models solved the “unit distance problem,” disproving an 80-year-old conjecture. A milestone in AI-driven mathematics and a useful talking point for the “are LLMs really reasoning?” debate.

How Ramp engineers accelerate code review with Codex

OpenAI News · May 20

Case study on how Ramp’s engineering team uses Codex with GPT-5.5 to review code, cutting reviewer feedback cycles from hours to minutes. Relevant template if any internal Tencent teams want to standardize AI-assisted code review.

PODCAST: Can AI Solve Math’s Biggest Mystery?

The Neuron · May 20

Long-form interview with Tudor Achim of Harmonic and the Aristotle model team on whether AI can crack the Riemann Hypothesis. Companion piece to the OpenAI geometry result.

The next phase of OpenAI’s Education for Countries

OpenAI News · May 20

OpenAI expands its Education for Countries program with new sovereign partnerships, teacher training and tools targeting school systems. Read against the OpenAI for Singapore and OpenAI for Malta announcements: a deliberate diplomacy-by-product push.

Introducing OpenAI for Singapore

OpenAI News · May 20

Multi-year partnership to expand AI deployment, train local talent, and support businesses and public services in Singapore. Notable that OpenAI is now branding country-level partnerships in the same way governments brand defence treaties.

Advancing content provenance for a safer, more transparent AI ecosystem

OpenAI News · May 19

OpenAI expands its Content Credentials and SynthID work and releases a new verification tool that can help readers identify AI-generated media. Directly relevant for media-relations and crisis-comms teams thinking about provenance defaults.

Musk’s OpenAI lawsuit fails

AI Valley · May 19

Newsletter recap of the Oakland verdict, plus Odyssey’s release of two new world models. Frames the verdict as removing the last obstacle to OpenAI’s IPO.

The strongest agent finished under 4% of real work last week

Not a Bot · May 19

Important reality check: even with six vendors claiming “agents are the primary user” of their tools, new METR benchmark data shows the best public agent completed fewer than 4% of real-world workflows end-to-end. The vendor claims and the low completion rate can both be true at once.

Elon Musk’s $150B lawsuit just went up in smoke

The Automated · May 19

Plain-English summary of the Oakland verdict, plus a sidebar arguing LinkedIn is now algorithmically suppressing AI-generated posts. Relevant for comms teams using AI to draft thought-leadership content.

Measurement Myth Busted: 5 Takeaways from Exec Connect London

Signal AI · May 20

Signal AI’s senior-comms gathering on May 13 explored whether comms teams can put a financial value on their work. Five takeaways from the panel — directly relevant for anyone building out comms-ROI dashboards.

Discoveries

AI Workflows & Tool Watch

★

Self-hosted Claude sandboxes & MCP tunnels (Anthropic)

For any enterprise that wants to use Claude agents on sensitive internal systems without exposing them to the public internet, this week’s “Code with Claude” releases are the most important practical update of the month. Self-hosted sandboxes (public beta) keep tool execution on your own servers while the agent’s reasoning stays on Anthropic. MCP tunnels (research preview) let Claude reach private databases through a secure tunnel rather than an open endpoint.

Directly relevant: Claude, Claude Code, Cowork mode, MCP server stack

Source: Testing Catalog · InfoQ
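The split described above, with reasoning hosted by the vendor and tool execution kept on your own servers, can be pictured with a toy Python sketch. This is not Anthropic's API: every name here (ToolRequest, remote_reasoning_step, run_locally, the count_rows tool and the sample table) is hypothetical, and the "remote" step is a local stub standing in for a hosted reasoning loop.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Optional


@dataclass
class ToolRequest:
    tool: str      # name of the tool the reasoning loop wants to run
    argument: str  # single string argument, kept simple for the sketch


# Data that never leaves the customer's side in this sketch.
LOCAL_TABLES: Dict[str, List[str]] = {"customers": ["alice", "bob", "carol"]}


def count_rows(table: str) -> str:
    """Local tool: returns only a summary, not the rows themselves."""
    return str(len(LOCAL_TABLES[table]))


# Allowlist of tools the local sandbox is willing to execute.
LOCAL_TOOLS: Dict[str, Callable[[str], str]] = {"count_rows": count_rows}


def run_locally(request: ToolRequest) -> str:
    """Execute a requested tool on local infrastructure, or refuse it."""
    if request.tool not in LOCAL_TOOLS:
        return f"error: tool {request.tool!r} is not allowed"
    return LOCAL_TOOLS[request.tool](request.argument)


def remote_reasoning_step(history: List[str]) -> Optional[ToolRequest]:
    """Stub for the hosted reasoning loop: asks for one tool call, then stops."""
    if not history:
        return ToolRequest("count_rows", "customers")
    return None


def agent_loop() -> List[str]:
    """Alternate between remote reasoning and local execution until done."""
    history: List[str] = []
    while (request := remote_reasoning_step(history)) is not None:
        history.append(run_locally(request))
    return history


print(agent_loop())  # prints ['3']
```

In this sketch only the tool result (a row count) crosses back to the reasoning side, and anything outside the allowlist is refused locally. That is the property regulated teams tend to care about.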

★

Perplexity Personal Computer on Mac

Perplexity’s upgraded Mac app turns it into a “personal orchestrator” — it can read and edit local files, drive native Mac apps, browse the web, and take voice commands. Combined with new GPT-5.5 orchestration and the ability to output presentations, spreadsheets, dashboards and websites directly, it’s now a serious contender for one-off research-to-deliverable workflows where you’d previously have copy-pasted between Perplexity, Word and Keynote.

Directly relevant: Claude (as comparison), Perplexity, Mac, Keynote/PowerPoint replacement workflows

Source: 9to5Mac · Perplexity changelog

★

Hybrid Perplexity Computer + Claude Code workflow (The AI Maker)

Cam’s paid Substack The AI Maker published a step-by-step setup this week for routing browser-heavy research through Perplexity Computer and handing the structured results to Claude Code for writing, file editing and longer reasoning. It’s a clean answer to the “which tool replaces which” question — keep both, split by job.

Directly relevant: Claude Code, Perplexity, comms-team research workflows

Source: The AI Maker

📝

One-prompt “remove the ChatGPT voice” filter (Bagel Bots)

A short system prompt that strips robotic phrasing, fake enthusiasm and AI filler from generated copy. Useful as a pre-publish pass on ghostwritten thought-leadership and exec briefings, or as a baseline edit before handing copy to a human reviewer.

Directly relevant: ChatGPT, Claude, executive communications, ghostwriting workflows

Source: Bagel Bots

📱

Codex now manageable from your phone (OpenAI)

OpenAI shipped a mobile-management surface for Codex, so you can spin up, monitor and approve long-running coding agent runs from an iOS or Android device. Less interesting for engineers — more interesting as a template for what a “manage your AI agents from your phone” interface looks like across other categories.

Directly relevant: OpenAI Codex, mobile agent management patterns

Source: AI Breakfast

💻

n8n May 2026 release: AI Builder draft workflows in lists, better reliability

The May release of n8n surfaces in-progress AI Builder draft workflows in the main workflow list and improves reliability for the Sheets and workflow-lookup integrations. Small but useful if you’re running an n8n instance for comms automations (briefings, monitoring digests, Slack/WeChat bridges).

Directly relevant: n8n, Slack, Notion, automation pipelines

Source: n8n release notes

🎔

Claude Code reliability + background-session improvements (May 19)

Claude Code’s May release adds /resume support for background sessions started via claude --bg (they now show up in the agent view), a “last updated” timestamp in the /plugin browse pane, and roughly 2-second faster MCP/SDK startup for slow servers. Tiny on paper, real impact for long-running comms automations.

Directly relevant: Claude Code, MCP servers, background agent runs

Source: Claude Code changelog

⚠

The strongest agent finished under 4% of real work (METR)

New METR benchmark data, summarized in the Not a Bot newsletter, found that even the strongest publicly available AI agent today completes fewer than 4% of representative real-world workflows end-to-end. A useful reality check to keep in the back pocket when vendors pitch full autonomy.

Directly relevant: vendor evaluation, agent procurement decisions

Source: Not a Bot

🎮

Reddit MCP integration patterns (Composio)

Composio published a working pattern for connecting Reddit to Claude Code (and the Claude Agent SDK) via an MCP server. Practical use cases: monitoring r/ClaudeAI / r/macapps / r/n8n for tool changes that affect your stack, or auto-summarizing community sentiment around a brand or topic.

Directly relevant: Claude Code, MCP servers, media monitoring

Source: Composio

📣

Voice-agent platforms maturing fast — LiveKit, PollyReach

Voice-capable agents that can actually place and receive real phone calls are stabilising. LiveKit (covered in this week’s Neuron live podcast) and PollyReach (a Product Hunt launch) are both worth watching for inbound/outbound voice use cases. Early credible applications: appointment bookings, support callbacks, and outbound qualification.

Directly relevant: media outreach, exec briefings, podcast Q&A, comms automation

Source: Ben’s Bites

📊

Token-spend visibility (Exponential View)

If your team is starting to use Claude or ChatGPT at scale, tokens are now a real budget line — Uber’s CTO disclosed his 5,000 engineers’ usage and the numbers are growing exponentially. The Exponential View piece is a good primer for getting AI cost reporting into your monthly leadership review before it becomes a problem.

Directly relevant: Claude, ChatGPT, finance/ops reporting, team automation budgets

Source: Exponential View
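For teams that want token spend in a monthly leadership review, a rollup along the following lines is one possible starting point. It is a minimal Python sketch under stated assumptions: the usage log, team names, prices and function names are all hypothetical, and real figures would come from your vendor's billing or usage export.

```python
from collections import defaultdict
from typing import Dict, List, Tuple

# Hypothetical prices in USD per 1M tokens; substitute your vendor's real rates.
PRICE_PER_M = {"input": 3.00, "output": 15.00}

# Hypothetical usage log, one row per export batch.
USAGE_LOG: List[dict] = [
    {"team": "comms", "month": "2026-04", "input_tokens": 2_000_000, "output_tokens": 400_000},
    {"team": "comms", "month": "2026-05", "input_tokens": 1_000_000, "output_tokens": 200_000},
    {"team": "comms", "month": "2026-05", "input_tokens": 2_000_000, "output_tokens": 400_000},
    {"team": "engineering", "month": "2026-05", "input_tokens": 10_000_000, "output_tokens": 2_000_000},
]


def monthly_spend(log: List[dict]) -> Dict[Tuple[str, str], float]:
    """Sum USD spend per (team, month) across all rows in the log."""
    totals: Dict[Tuple[str, str], float] = defaultdict(float)
    for row in log:
        cost = (
            row["input_tokens"] * PRICE_PER_M["input"]
            + row["output_tokens"] * PRICE_PER_M["output"]
        ) / 1_000_000
        totals[(row["team"], row["month"])] += cost
    return dict(totals)


def growth(spend: Dict[Tuple[str, str], float], team: str, prev: str, cur: str) -> float:
    """Month-over-month growth as a fraction (0.5 means +50%)."""
    return spend[(team, cur)] / spend[(team, prev)] - 1


spend = monthly_spend(USAGE_LOG)
print(spend[("comms", "2026-05")])                   # prints 18.0
print(growth(spend, "comms", "2026-04", "2026-05"))  # prints 0.5
```

Tracking the growth rate alongside the absolute figure is the design choice that matters here: a small line item that grows 50% a month becomes a large one within a couple of quarters.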

Company Watch

Tencent Mentions

Tencent unveils sweeping Hunyuan ecosystem upgrade at Cloud summit (May 21)

Two new tiers — fast-thinking Hunyuan Turbo S and deep-thinking Hunyuan T1 — were introduced alongside the visual-reasoning T1-Vision, low-latency Hunyuan Voice, Hunyuan Image 2.0, Hunyuan 3D 2.5, and Hunyuan-Game. The bigger story is the agent-first pivot: Tencent has rebranded its knowledge engine as the Tencent Cloud Agent Development Platform (TCADP).

Tencent doubles down on agentic AI with latest Hunyuan updates

KrAsia’s framing of this week’s summit: a formal strategic shift from “model-first” to “agent-first,” with TCADP positioned as the developer surface and Hunyuan models as the underlying engines.

Q1 2026 results context: AI spend more than doubling to RMB 36B+

From the May 13 earnings call: President Martin Lau confirmed AI spending will more than double in 2026 to over RMB 36 billion, up from RMB 18 billion in 2025 (Hunyuan foundation model plus Yuanbao AI assistant). Tencent reported its slowest revenue growth in six quarters, putting the AI pivot under sharper scrutiny.

China tech stocks and the AI revolution: Tencent as a strategic 2026 bet

Industry analysis framing Tencent and Alibaba as the two best-positioned Chinese names to capture the next phase of the AI buildout, especially as Chinese models’ share of usage on developer platform OpenRouter rose from 1% in 2024 to over 60% by May 2026.

Azeem Azhar on the “Chinese AI efficiency moat”

Exponential View’s lead essay this week argues Chinese AI labs, “suffocated by export controls,” have developed an efficiency advantage that will reshape global model pricing over the next 24 months — a tailwind for Tencent’s Hunyuan strategy if it holds.

Substack Highlights

The Signal: “6 AI tools for the people still doing everything in ChatGPT”

The Signal: “Anthropic Pulls Away, OpenAI Strikes Back, and Google’s Gemini Rising”

The AI Maker: “How I Am Testing Perplexity Computer Without Replacing Claude Code”

The AI-Augmented Engineer: “How to use AI to plan a new app”

Ben’s Bites: “Google’s take on OpenClaw”

Ben’s Bites: “Can I get my agents on the phone?”

Exponential View (Azeem Azhar): “Data to start your week: The cost of tokenmaxxing”

Exponential View: “EV #574: Inside Anthropic’s rocket ship; AI pluralism; love commoditized”

The Neuron: “Meta used staff as AI training data. Then cut them.”

AI Valley: “Google unveils Omni, Spark, and 3.5 Flash”

Not a Bot: “Gemini moved into every surface Google owns”

AI Breakfast: “Every Announcement from Google’s I/O”

The Automated: “How one fake blog post broke Google’s trillion-dollar AI”

Pivot to AI (David Gerard): “Google is replacing search results with only the AI”

Bagel Bots: “The Prompt That Removes the ‘ChatGPT Voice’”

The Neuron: “Google Search is dead. Long live agents?”

Future Tools: “Forget you”

Last Week in AI Podcast: “TML-Interaction, Claude For Legal, Sam Altman on Stand”

One More Thing in AI: “Microsoft’s Bold Move: Secret Hunt for New AI Partners”

WhatPlugin.ai: “I checked how long 135 AI deployments took”

Pivot to AI: “Tucson Project Blue: data centres lie about water again”

Superpower Daily: “OpenAI launches ChatGPT for personal finance, will let you connect bank accounts”
