Auto Lit Review

Frontier lab papers,
read for you, every day.

An automated daily digest of research papers and technical reports from frontier AI labs around the world. Synthesized into reviews you can actually read.

Reviews

May 17, 2026

Agentic Modeling, World Models & Safety Adaptation

Three Tier 2 frontier-lab papers: Google’s LiSA conservative policy induction for adaptive guardrails, Microsoft Research’s Orchard open-source agentic modeling framework (67.5% on SWE-bench Verified at 30B-A3B size), and NVIDIA’s SANA-WM 2.6B minute-scale world model with hybrid linear attention.
May 16, 2026

Multimodal Foundations & Context Optimization

Three Tier 2 frontier-lab papers plus one Tier 3 OpenAI engineering note: Alibaba’s Qwen-Image-VAE-2.0 high-compression VAE suite (f16c128 beats FLUX.1-dev on text legibility despite 2× compression), ByteDance Seed’s MMProLong 5B-token long-context VLM recipe (32K→128K, generalizes to 512K), Google DeepMind’s BeamSearch-IS context optimizer that makes external tools actually useful, and OpenAI’s Codex mobile relay.
May 15, 2026

AI Geopolitics & Codex Sandboxing

Two Tier 3 frontier-lab posts: Anthropic’s “2028: Two scenarios for global AI leadership” policy essay on US-China compute and distillation policy, and OpenAI’s engineering note on the Windows-native Codex sandbox (CodexSandboxOffline/Online split, DPAPI, firewall, four-layer execution).
May 14, 2026

Flow Map Distillation

NVIDIA + NUS Show Lab introduce AnyFlow (arXiv:2605.13724): the first any-step video diffusion distillation framework based on flow maps, with Flow Map Backward Simulation for on-policy distillation that preserves test-time scaling from 1.3B to 14B.
May 13, 2026

Qwen-Image-2.0 Launch

Alibaba’s Qwen team ships a unified 7B image-generation-and-editing foundation model (arXiv:2605.10730): Qwen3-VL encoder + MMDiT, native 2K, 1K-token prompts, #1 on AI Arena in both T2I and editing.
May 12, 2026

RL, Byte-Level & Long Video

Three Tier 2 frontier-lab papers: Tencent Hunyuan’s Listwise Policy Optimization, Meta FAIR + Stanford + UW’s Fast Byte Latent Transformer, and Google’s A²RD long-video diffusion architecture.
May 10, 2026

The April 2026 frontier-model wave

Cold-start backfill: eight Tier 1 flagship launches in 19 days — DeepSeek-V4, Qwen3.5-Omni, GPT-5.5, Claude Opus 4.7, Kimi K2.6, MiniMax M2.7, Grok 4.3, Mistral Medium 3.5.

About

This repository hosts daily reviews of newly released papers and technical reports from frontier AI labs. Each review is generated by a scheduled agent that scans arXiv, Semantic Scholar, Hugging Face Daily Papers, X, the Emergent Mind newsletter, and other sources, then writes a structured summary covering methodology, evaluation, and results.

Only papers from a curated list of frontier labs are included. Reviews are deduplicated against prior entries — each paper appears at most once.

Reviews

Agentic Modeling, World Models & Safety Adaptation

Multimodal Foundations & Context Optimization

AI Geopolitics & Codex Sandboxing

Flow Map Distillation

Qwen-Image-2.0 Launch

RL, Byte-Level & Long Video

The April 2026 frontier-model wave

About