World-Shaper: A Unified Framework for 360° Panoramic Editing

Create panoramic content interactively with geometry-aware editing. Maintain spherical consistency, enable distortion-adaptive reasoning, and create coherent 360° visual experiences through our unified ERP-domain framework.

Editing panoramic images is crucial for creating realistic 360° visual experiences, yet existing perspective-based image editing methods fail to model the spatial structure of panoramas. Conventional cube-map decompositions attempt to bypass this issue but inevitably break global consistency due to their mismatch with spherical geometry.

Motivated by this insight, we reformulate panoramic editing directly in the equirectangular projection (ERP) domain and present World-Shaper, a unified geometry-aware framework for panoramic image editing.

To overcome the scarcity of paired data, we adopt a novel generate-then-edit paradigm, in which a controllable generation model synthesizes panoramic pairs for supervised editing learning.

To address geometric distortion, we design a geometry‑aware learning strategy that enables distortion‑adaptive reasoning and consistent manipulation across latitudes through spatially adaptive supervision and progressive curriculum training.
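As a rough illustration of what latitude-dependent supervision can look like in ERP, the sketch below weights a per-pixel L1 loss by the cosine of each row's latitude, so that the heavily stretched polar rows contribute in proportion to the area they cover on the sphere. This is a standard area weighting for equirectangular images and is an assumption made for illustration only; the function names `latitude_weights` and `weighted_l1` are hypothetical, and the actual spatially adaptive supervision used by World-Shaper may differ.

```python
import math

def latitude_weights(height):
    """Per-row ERP weights proportional to cos(latitude), normalized to mean 1.

    Assumption: row 0 is the top of the panorama (latitude near +pi/2).
    """
    raw = [math.cos((0.5 - (r + 0.5) / height) * math.pi) for r in range(height)]
    mean = sum(raw) / height
    return [w / mean for w in raw]

def weighted_l1(pred, target):
    """Latitude-weighted mean absolute error between two ERP maps.

    pred and target are lists of rows with identical shape (single channel).
    """
    height, width = len(pred), len(pred[0])
    weights = latitude_weights(height)
    total = sum(
        weights[r] * abs(pred[r][c] - target[r][c])
        for r in range(height)
        for c in range(width)
    )
    return total / (height * width)
```

Because the weights are normalized to mean 1, a uniform error keeps the same loss value as an unweighted L1, while errors near the poles count less than errors near the equator.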

Extensive experiments on our new benchmark PEBench demonstrate that World‑Shaper achieves superior geometric consistency, editing fidelity, and text controllability compared to state‑of‑the‑art methods, enabling coherent and flexible 360° visual world creation.

Our method supports diverse applications, including 3D world generation and indoor design.

3D World Generation

Users can begin by generating a panorama from either a text prompt or a local-view input image. A pre-trained depth estimation method is then applied to obtain the corresponding depth map of the panorama. Using this depth information, the 2D pixels are lifted into 3D points, and a sequence of camera poses is defined. The panorama is then rendered along these camera trajectories, and our method is employed to inpaint any missing regions in the rendered views. Finally, a panoramic Gaussian Splatting (GS) representation is optimized using the inpainted panorama frames.
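The lifting step above can be sketched as follows, assuming the depth map stores radial distance from the camera center for each ERP pixel. Each pixel is mapped to a longitude and latitude, converted to a unit direction on the sphere, and scaled by its depth. The axis convention (y up, z forward) and the names `erp_pixel_to_direction` and `lift_erp_depth` are illustrative assumptions, not the project's actual code.

```python
import math

def erp_pixel_to_direction(row, col, height, width):
    """Unit viewing direction for the center of ERP pixel (row, col).

    Assumed convention: longitude spans [-pi, pi) left to right,
    latitude runs from +pi/2 (top row) to -pi/2 (bottom row),
    with y pointing up and z pointing forward.
    """
    lon = ((col + 0.5) / width - 0.5) * 2.0 * math.pi
    lat = (0.5 - (row + 0.5) / height) * math.pi
    return (
        math.cos(lat) * math.sin(lon),
        math.sin(lat),
        math.cos(lat) * math.cos(lon),
    )

def lift_erp_depth(depth):
    """Lift an ERP depth map (list of rows of radial distances) to 3D points.

    Returns a list of rows of (x, y, z) tuples with the same layout as depth.
    """
    height, width = len(depth), len(depth[0])
    points = []
    for r, row in enumerate(depth):
        out_row = []
        for c, d in enumerate(row):
            dx, dy, dz = erp_pixel_to_direction(r, c, height, width)
            out_row.append((d * dx, d * dy, d * dz))
        points.append(out_row)
    return points
```

With a constant depth map, every lifted point lies on a sphere of that radius, which is a quick sanity check before rendering along new camera poses.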

Indoor Design

Users select a piece of furniture from a catalog and specify a location in the room; our method then seamlessly integrates the object into the panoramic scene, adapting to the spherical geometry and lighting conditions for a photorealistic visualization.

Video Demo

3D World Generation visualization: the left column shows the panorama image, and the right column shows the reconstructed 3D world video.

