Tech Digest – May 16, 2026

The Agent Platform Takes Shape

OpenAI Fuses ChatGPT and Codex Into a Single Agent Layer — Brockman Takes the Helm

Greg Brockman has taken control of OpenAI’s product organisation with one mandate: merge ChatGPT and Codex into a unified experience. Codex lead Tibo Sottiaux nicknamed the result “CochatGPTex” — shorthand for an apparent bid to become Anthropic faster than Anthropic can become OpenAI. The ambition is already visible in production: Peter Steinberger, now at OpenAI working on OpenClaw, describes running roughly 100 cloud Codex instances that review every pull request and commit simultaneously.

The economic case is sharpening in parallel. Artificial Analysis’s GDPval-AA benchmark now gives GPT-5.5 a 98% win rate over last year’s leading model on realistic economic tasks — a year’s worth of competitive advantage compressed into a single product generation.

Note: When the interface becomes the agent and the agent becomes the interface, software procurement stops being about features and starts being about which system you trust to act on your behalf. The “CochatGPTex” joke is funny until you realise it describes your future IT vendor relationship.

Sources: Wired, Tibo Sottiaux (X), Peter Steinberger (X), Artificial Analysis

Google Publishes Official Playbook for Appearing in AI Search Results

Google has issued formal developer guidance on how websites should optimise for AI Overviews and AI Mode — the generative features that increasingly replace traditional search results. The guide confirms that crawlability, content quality, and trust signals remain dominant ranking factors for AI citation, while explicitly debunking tactics like micro-fragmenting content, relying on llms.txt files, or loading pages with schema markup to “unlock” AI responses.

Note: Every public institution with a website now has a second optimisation target. The question is no longer just “do we rank in Google?” but “does the AI cite us when a citizen asks a question?” Communications teams that haven’t read this guide are already falling behind.

Sources: Google Developers, Search Engine Land

Singapore’s Foreign Minister Runs Parliamentary Affairs Through a Personal AI Agent

Singapore’s foreign minister disclosed that he runs his parliamentary affairs — scheduling, briefing preparation, legislative tracking — through a personal agent built on Nanoclaw and a Raspberry Pi 5. The system is not a pilot or a demonstration; it is his operational workflow for managing ministerial duties.

Note: A sitting cabinet minister using an AI agent for actual governance work, not a press event. The hardware cost is under €100. The barrier to institutional AI adoption is not budget — it’s permission.

Sources: Minister’s Facebook (video)

Infrastructure at Breaking Point

Samsung’s 61,000 Workers Vote to Strike as AI Memory Shortage Reaches Crisis

Samsung Electronics faces its largest-ever strike after 61,000 unionised workers approved an 18-day walkout with 93.1% support, following collapsed pay negotiations. The dispute is driven by a widening gap: SK Hynix engineers are expected to receive bonuses of 700 million won in 2026 while Samsung’s remain capped at roughly half that. Over 200 core memory engineers have already left in four months. JPMorgan estimates an 18-day stoppage would cost Samsung over 4 trillion won in direct revenue.

The demand side shows why Samsung is desperate to retain talent. Kioxia and Dell have packed 9.8 petabytes of flash storage into a single 2U server — 40 NVMe SSDs at 245 TB each — built explicitly for generative AI workloads. Meanwhile, Bank of America’s Michael Hartnett notes that AI chip stocks now trade 62% above their 200-day moving average, more stretched than the 2000 dot-com peak. The power bill is arriving too: data centre demand drove a 76% jump in first-quarter electricity prices on PJM, the largest US grid.

Note: Four pressure points in one supply chain: labour walking out, storage density racing to keep up, equity valuations exceeding historical bubbles, and electricity costs repricing. Any institution planning a major digital project on a 2027-2028 timeline should stress-test its assumptions about hardware availability and energy costs.

Sources: Reuters, Bloomberg, BusinessWire (Kioxia), Benzinga (BofA), Bloomberg (PJM)

Workforce & Wealth Divide

US Employment in 18 AI-Exposed Occupations Falls for a Second Straight Year

Bureau of Labor Statistics data shows employment across 18 occupations flagged as AI-exposed — accounting for roughly 10 million jobs — fell for the second consecutive year. Customer service representatives lost 130,180 positions (−4.8%) in the year through May 2025. Since May 2022, the hardest-hit roles include credit authorisers (−26.2%), broadcast announcers (−20.8%), and sales engineers (−13.2%). The gains, meanwhile, concentrate sharply: an estimated 10,000 AI lab employees and founders in San Francisco have crossed $20 million in personal wealth over the past five years while layoffs continue around them.

Note: The data no longer show “exposure” — they show contraction. Two years of consecutive decline across 18 job categories is not a forecast; it’s a trend with compounding momentum. Workforce planning models built on pre-2023 replacement rates are already wrong.

Sources: Bloomberg, Deedy Das (X)

Cybersecurity & Institutional Risk

AI Models Now Achieve Arbitrary Code Execution on Production Chrome’s V8 Engine

ExploitBench, a new capability-graded benchmark, measures how far AI agents can climb from reaching vulnerable code to achieving arbitrary code execution. Tested against production V8 (the engine inside Chrome, Edge, Node.js, and Cloudflare Workers) with the security sandbox enabled, Claude Mythos Preview achieved 69% success across 41 real bugs. GPT-5.5 also reaches full arbitrary code execution. The benchmark decomposes exploitation into 16 measurable flags — crash, primitives, control-flow hijack, sandbox escape — and finds that while triggering crashes is routine for most models, the full exploit chain remains concentrated in the frontier two.

Note: The engine these models exploit runs in every Chromium browser on every government desktop. The question for CISOs is no longer whether AI-assisted attacks are theoretically possible but how to defend infrastructure when the attacker has a 69% success rate against hardened production code.

Sources: ExploitBench, arXiv paper

EY Withdraws Study After GPTZero Finds 60% of References Are AI Hallucinations

Ernst & Young has pulled a 44-page cybersecurity report on loyalty-programme fraud after GPTZero’s investigation found that 60% of its references were hallucinated — broken URLs, non-existent pages, and a citation to a McKinsey report that does not exist. EY Canada had been using the report to promote consulting services. The firm removed the document and stated it was unconnected to client work.

Note: A Big Four firm publishing AI-generated content with fake citations and getting caught by a detection tool. The reputational cost is obvious, but the systemic risk is worse: how many institutional reports — internal strategy documents, procurement justifications, policy briefs — have already been written this way and not yet checked?

Sources: Financial Times, GPTZero investigation

Biotech Acceleration

Clinical Trials Double in a Decade — AI-Designed Drug Shows First Evidence of Clearing Atherosclerosis Root Cause

Nearly 5,000 trials for innovative drugs began last year, more than double the figure a decade ago, with roughly half now starting in China. That accelerating pipeline is already reaching the clinic: Cyclarity Therapeutics presented the first clinical evidence that UDP-003, a compound identified by its AI platform, can safely excrete 7-ketocholesterol — the oxidised cholesterol considered the root cause of atherosclerotic plaque. If confirmed in larger trials, this would represent true plaque reversal rather than the symptom management that defines current cardiovascular treatment.

Note: Cardiovascular disease remains the leading cause of death in the EU. A shift from management to reversal would reshape health system economics over a decade — fewer interventions, different drug budgets, changed long-term care projections. Worth tracking as a procurement signal, not just a science headline.

Sources: Samuel Hume (X), GlobeNewsWire (Cyclarity)


Today’s threads converge on a single uncomfortable question: who is actually in control? The agent layer is consolidating into the platforms institutions already depend on. The infrastructure beneath it is straining at every physical limit — labour, silicon, power, and financial gravity. The workforce data says the transition is no longer theoretical. And the EY episode shows that even the firms hired to provide institutional judgment are outsourcing it to machines that fabricate sources. The pace isn’t the story anymore. The story is that the systems meant to provide oversight are being reshaped by the same forces they’re supposed to govern.

Similar Posts