ExplorerComputer VisionComputer Vision
Research PaperResearchia:202602.04020

PerpetualWonder: Long-Horizon Action-Conditioned 4D Scene Generation

Jiahao Zhan

Abstract

We introduce PerpetualWonder, a hybrid generative simulator that enables long-horizon, action-conditioned 4D scene generation from a single image. Current works fail at this task because their physical state is decoupled from their visual representation, which prevents generative refinements to update the underlying physics for subsequent interactions. PerpetualWonder solves this by introducing the first true closed-loop system. It features a novel unified representation that creates a bidirecti...

Submitted: February 4, 2026Subjects: Computer Vision; Computer Vision

Description / Details

We introduce PerpetualWonder, a hybrid generative simulator that enables long-horizon, action-conditioned 4D scene generation from a single image. Current works fail at this task because their physical state is decoupled from their visual representation, which prevents generative refinements to update the underlying physics for subsequent interactions. PerpetualWonder solves this by introducing the first true closed-loop system. It features a novel unified representation that creates a bidirectional link between the physical state and visual primitives, allowing generative refinements to correct both the dynamics and appearance. It also introduces a robust update mechanism that gathers supervision from multiple viewpoints to resolve optimization ambiguity. Experiments demonstrate that from a single image, PerpetualWonder can successfully simulate complex, multi-step interactions from long-horizon actions, maintaining physical plausibility and visual consistency.


Source: arXiv:2602.04876v1 - http://arxiv.org/abs/2602.04876v1 PDF: https://arxiv.org/pdf/2602.04876v1 Original Article: View on arXiv

Please sign in to join the discussion.

No comments yet. Be the first to share your thoughts!

Access Paper
View Source PDF
Submission Info
Date:
Feb 4, 2026
Topic:
Computer Vision
Area:
Computer Vision
Comments:
0
Bookmark