BEGIN:VCALENDAR
VERSION:2.0
PRODID:Linklings LLC
BEGIN:VTIMEZONE
TZID:Asia/Tokyo
X-LIC-LOCATION:Asia/Tokyo
BEGIN:STANDARD
TZOFFSETFROM:+0900
TZOFFSETTO:+0900
TZNAME:JST
DTSTART:18871231T000000
END:STANDARD
END:VTIMEZONE
BEGIN:VEVENT
DTSTAMP:20250110T023313Z
LOCATION:Hall B7 (1)\, B Block\, Level 7
DTSTART;TZID=Asia/Tokyo:20241206T144500
DTEND;TZID=Asia/Tokyo:20241206T145600
UID:siggraphasia_SIGGRAPH Asia 2024_sess150_papers_938@linklings.com
SUMMARY:360-degree Human Video Generation with 4D Diffusion Transformer
DESCRIPTION:Technical Papers\n\nRuizhi Shao, Youxin Pang, Zerong Zheng, Jingxiang Sun, and Yebin Liu (Tsinghua University)\n\nWe present a novel approach for generating 360-degree high-quality, spatio-temporally coherent human videos from a single image. Our framework combines the strengths of diffusion transformers for capturing global correlations across viewpoints and time, and CNNs for accurate condition injection. The core is a hierarchical 4D transformer architecture that factorizes self-attention across views, time steps, and spatial dimensions, enabling efficient modeling of the 4D space. Precise conditioning is achieved by injecting human identity, camera parameters, and temporal signals into the respective transformers. To train this model, we collect a multi-dimensional dataset spanning images, videos, multi-view data, and limited 4D footage, along with a tailored multi-dimensional training strategy. Our approach overcomes the limitations of previous methods based on generative adversarial networks or vanilla diffusion models, which struggle with complex motions, viewpoint changes, and generalization. Through extensive experiments, we demonstrate our method's ability to synthesize 360-degree realistic, coherent human motion videos, paving the way for advanced multimedia applications in areas such as virtual reality and animation.\n\nRegistration Category: Full Access, Full Access Supporter\n\nLanguage Format: English Language\n\nSession Chair: Li-Yi Wei (Adobe Research)
URL:https://asia.siggraph.org/2024/program/?id=papers_938&sess=sess150
END:VEVENT
END:VCALENDAR