Review · May 17, 2026

Agentic Modeling, World Models & Safety Adaptation

3 papers · 3 labs · auto-generated

TL;DR

Focus

No flagship model launch surfaced in the 36-hour window between the May 16 review and this one — Google’s I/O keynote (with the rumoured Gemini 3.2 Flash) is still pencilled for May 19–20, and Anthropic’s May 14 partnership announcements (PwC, Gates Foundation $200M) are non-technical. The page collects three Tier 2 frontier-lab research papers that landed on Hugging Face Daily Papers on May 14–15. Google’s LiSA (Lifelong Safety Adaptation) treats deployment-time guardrail failures as a structured-memory problem and converts sparse user reports into reusable policy abstractions gated by a Beta-posterior lower bound on accumulated evidence. Microsoft Research’s Orchard open-sources the missing piece of agentic training infrastructure — a Kubernetes-native environment service plus three training recipes that push Qwen3-30B-A3B-Thinking to a new open-source SOTA of 67.5% on SWE-bench Verified. NVIDIA’s SANA-WM introduces a 2.6B-parameter open-source world model that generates 60-second 720p videos with precise 6-DoF camera control on a single GPU, beating prior open baselines on action-following at 36× throughput.

Competitiveness

Orchard-SWE is the eye-catcher: 67.5% on SWE-bench Verified from a 30B-active-parameter MoE backbone with a fully open-source recipe. The current SWE-bench Verified leaders are Claude Opus 4.7 at 78.4% and GPT-5.5 at the same tier, both proprietary — Orchard closes the open/closed gap to roughly 11 points at a comparable activation budget to a single H100 inference node. Orchard-GUI’s 4B computer-use agent hits 74.1% on WebVoyager / 67.0% on Online-Mind2Web, beating every prior open-source computer-use baseline and within striking distance of the OpenAI Operator family. SANA-WM’s 36× throughput advantage over LingBot-World on a 60-second 720p generation budget — trained on only 213K public video clips in 15 days on 64 H100s — is the first open-source minute-scale world model that is realistic to retrain at a research budget; Genie 3 and Veo 3 remain proprietary. LiSA pushes the safety-helpfulness frontier past pure backbone scaling on PrivacyLens+ / ConFaide+ / AgentHarm and survives a 20% label-flip rate on user feedback — the closest external comparison is Anthropic’s contextual-integrity guardrails work from late 2025, which assumed offline-trained policies rather than online adaptation.

New frontier releases

No new flagship models in the window. Most recent launches remain Anthropic’s Claude Opus 4.7 (Apr 16) and OpenAI’s GPT-5.5 (Apr 23). DeepSeek-V4 Pro (Apr 24) is the most recent open-source flagship. Google’s I/O event on May 19–20 is the next expected launch window.

Google

LiSA: Lifelong Safety Adaptation via Conservative Policy Induction

Tier 2 · Research Paper arXiv:2605.14454 2026-05-14 Safety · Guardrails · Agentic deployment · Online adaptation · Memory

Overview

Methodology

Evaluation & results

Ablations

Other

Microsoft AI

Orchard: An Open-Source Agentic Modeling Framework

Tier 2 · Research Paper arXiv:2605.15040 2026-05-14 Agentic training · RL · SWE-bench · Computer use · Open-source infra

Overview

Methodology

Evaluation & results

Ablations

Other

NVIDIA Research

SANA-WM: Efficient Minute-Scale World Modeling with Hybrid Linear Diffusion Transformer

Tier 2 · Technical Report arXiv:2605.15178 2026-05-14 World model · Video diffusion · Linear attention · Camera control · Open source

Overview

Architecture

Training

Evaluation & results

Ablations

Other