Review · May 14, 2026

Flow Map Distillation

1 paper · 1 lab · auto-generated

TL;DR

Focus

One Tier 2 frontier-lab paper qualified after dedup in the 36-hour window: AnyFlow from NVIDIA Research and NUS Show Lab (arXiv:2605.13724, submitted Wed, 13 May 2026 16:06 UTC). The paper introduces the first any-step video diffusion distillation framework based on flow maps, replacing the consistency-distillation objective with flow-map transition learning over arbitrary time intervals and adding an on-policy distillation procedure called Flow Map Backward Simulation. No Tier 1 flagship launches and no other qualifying Tier 2/3 frontier-lab papers surfaced from the May 12–14 window after dedup against prior reviews; the May 11 Qwen-Image-2.0 launch and the May 12 A²RD / Fast-BLT / LPO papers are covered upstream.

Competitiveness

The relevant axis is open-weights video diffusion distillation under any-step inference. AnyFlow targets the regime where consistency-distilled models (CM, sCM, rCM, Self-Forcing) sit on top of strong flow-matching teachers such as Wan2.1-T2V-1.3B and Wan2.1-T2V-14B, and a parallel line of community-distilled checkpoints (Krea-Realtime-Wan2.1-14B, LightX2V-Wan2.1-14B-CausVid, FastVideo-CausalWan2.2-A14B-Preview). On the bidirectional Wan2.1-T2V-14B backbone, AnyFlow matches or surpasses rCM in the few-step regime and continues to improve quality as more sampling steps are allocated — the test-time scaling axis where consistency-distilled models degrade. On the causal FAR-Wan2.1-14B backbone, AnyFlow-FAR at 4 NFEs reaches I2V quality comparable to the undistilled Wan2.1-I2V-14B teacher at 50×2 NFEs, and beats the three community 14B consistency-distilled baselines on T2V. The closed-source frontier on video generation is Veo 3 (Google DeepMind), Sora 2 (OpenAI), and Kling 2.5 (Kuaishou); AnyFlow does not benchmark against these — the comparison is within the open Wan-family ecosystem and against prior distillation paradigms.

New frontier releases

No new flagship model launches in the past 36 hours. The latest LLM-side flagships remain GPT-5.5 (April 23), Claude Opus 4.7 (April 16), DeepSeek-V4 (April 24), and Grok 4.3 (May 6); the latest image-generation flagship is Qwen-Image-2.0 (May 11) — all covered upstream.

NVIDIA

AnyFlow: Any-Step Video Diffusion Model with On-Policy Flow Map Distillation

Tier 2 · Research Paper arXiv:2605.13724 2026-05-13 Video diffusion · Distillation · Flow matching · On-policy

Overview

Consistency distillation vs. flow map distillation paradigm comparison
Figure 1 (from the project page, nvlabs.github.io/AnyFlow). Consistency distillation (top) replaces the original Euler sampling trajectory with a consistency-sampling trajectory and applies a truncated gradient. Flow map distillation (bottom) preserves the Euler trajectory and decomposes it into shortcut segments for on-policy distillation.

Methodology

Evaluation & results

AnyFlow test-time scaling curves vs. consistency baselines
Figure 2 (from the project page, nvlabs.github.io/AnyFlow). Quantitative test-time scaling. AnyFlow lifts the flow-matching teacher curve across all step budgets and preserves the upward scaling shape that consistency-distilled baselines lose.

Ablations

Pipeline for continued training on a downstream dataset after AnyFlow distillation
Figure 3 (from the project page, nvlabs.github.io/AnyFlow). Continued-training pipeline. The AnyFlow-distilled checkpoint is fine-tuned on a specialized I2V dataset; few-step sampling is retained while identity preservation and trajectory accuracy improve.

Other