BEGIN:VCALENDAR
VERSION:2.0
PRODID:Linklings LLC
BEGIN:VTIMEZONE
TZID:Asia/Tokyo
X-LIC-LOCATION:Asia/Tokyo
BEGIN:STANDARD
TZOFFSETFROM:+0900
TZOFFSETTO:+0900
TZNAME:JST
DTSTART:18871231T000000
END:STANDARD
END:VTIMEZONE
BEGIN:VEVENT
DTSTAMP:20250110T023313Z
LOCATION:Hall B5 (2)\, B Block\, Level 5
DTSTART;TZID=Asia/Tokyo:20241206T130000
DTEND;TZID=Asia/Tokyo:20241206T131100
UID:siggraphasia_SIGGRAPH Asia 2024_sess146_papers_359@linklings.com
SUMMARY:Correlation-aware Encoder-Decoder with Adapters for SVBRDF Acquisi
 tion
DESCRIPTION:Technical Papers\n\nDi Luo and Hanxiao Sun (Nankai University)
 , Lei Ma (Peking University), Jian Yang (Nankai University), and Beibei Wa
 ng (Nanjing University)\n\nCapturing materials from the real world avoids 
 laborious manual material authoring. However, recovering high-fidelity Spa
 tially Varying Bidirectional Reflectance Distribution Function (SVBRDF) ma
 ps from a few captured images is challenging due to its ill-posed nature. 
 Existing approaches have made extensive efforts to alleviate this ambiguit
 y issue by leveraging generative models with latent space optimization or 
 extracting features with variant encoder-decoders. Albeit the rendered ima
 ges at input views can match input images, the problematic decomposition a
 mong maps leads to significant differences when rendered under novel views
 /lighting. We observe that for human eyes, besides individual images, the 
 correlation (or the highlights variation) among input images also serves a
 s an important hint to recognize the materials of objects. Hence, our key 
 insight is to explicitly model this correlation in the SVBRDF acquisition 
 network. To this end, we propose a correlation-aware encoder-decoder netwo
 rk to model the correlation features among the input images via a graph co
 nvolutional network by treating channel features from each image as a grap
 h node. This way, the ambiguity among the maps has been reduced significan
 tly. However, several SVBRDF maps still tend to be over-smooth, leading to
  a mismatch in the novel-view rendering. The main reason is the uneven upd
 ate of different maps caused by a single decoder for map interpretation. T
 o address this issue, we further design an adapter-equipped decoder consis
 ting of a main decoder and four tiny per-map adapters, where adapters are 
 employed for individual maps interpretation, together with fine-tuning, to
  enhance flexibility. As a result, our framework allows the optimization o
 f the latent space with the input image feature embeddings as the initial 
 latent vector and the fine-tuning of per-map adapters. Consequently, our m
 ethod can outperform existing approaches both visually and quantitatively 
 on synthetic and real data.\n\nRegistration Category: Full Access, Full Ac
 cess Supporter\n\nLanguage Format: English Language\n\nSession Chair: Vale
 ntin Deschaintre (Adobe Research)
URL:https://asia.siggraph.org/2024/program/?id=papers_359&sess=sess146
END:VEVENT
END:VCALENDAR
