BEGIN:VCALENDAR
VERSION:2.0
PRODID:Linklings LLC
BEGIN:VTIMEZONE
TZID:Asia/Tokyo
X-LIC-LOCATION:Asia/Tokyo
BEGIN:STANDARD
TZOFFSETFROM:+0900
TZOFFSETTO:+0900
TZNAME:JST
DTSTART:18871231T000000
END:STANDARD
END:VTIMEZONE
BEGIN:VEVENT
DTSTAMP:20250110T023309Z
LOCATION:Hall B5 (2)\, B Block\, Level 5
DTSTART;TZID=Asia/Tokyo:20241203T135600
DTEND;TZID=Asia/Tokyo:20241203T141000
UID:siggraphasia_SIGGRAPH Asia 2024_sess104_papers_905@linklings.com
SUMMARY:MVImgNet2.0: A Larger-scale Dataset of Multi-view Images
DESCRIPTION:Technical Papers\n\nXiaoguang Han (SSE, The Chinese University
  of Hong Kong, Shenzhen; FNii, The Chinese University of Hong Kong, Shenzh
 en); Yushuang Wu, Luyue Shi, Haolin Liu, Hongjie Liao, and Lingteng Qiu (F
 Nii, The Chinese University of Hong Kong, Shenzhen; SSE, The Chinese Unive
 rsity of Hong Kong, Shenzhen); Weihao Yuan, Xiaodong Gu, and Zilong Dong (
 Alibaba); and Shuguang Cui (SSE, The Chinese University of Hong Kong, Shen
 zhen; FNii, The Chinese University of Hong Kong, Shenzhen)\n\nMVImgNet is 
 a large-scale dataset that contains multi-view images of ~220k real-world 
 objects in 238 classes. As a counterpart of ImageNet, it introduces 3D vis
 ual signals via multi-view shooting, making a soft bridge between 2D and 3
 D vision. This paper constructs the MVImgNet2.0 dataset that expands MVImg
 Net into a total of ~520k objects and 515 categories, which derives a 3D d
 ataset with a larger scale that is more comparable to ones in the 2D domai
 n. In addition to the expanded dataset scale and category range, MVImgNet2
 .0 is of a higher quality than MVImgNet owing to four new features: (i) mo
 st shoots capture 360-degree views of the objects, which can support the l
 earning of object reconstruction with completeness; (ii) the segmentation 
 manner is advanced to produce foreground object masks of higher accuracy; 
 (iii) a more powerful structure-from-motion method is adopted to derive th
 e camera pose for each frame of a lower estimation error; (iv) higher-qual
 ity dense point clouds are reconstructed via advanced methods for objects 
 captured in 360-degree views, which can serve for downstream applications.
  Extensive experiments confirm the value of the proposed MVImgNet2.0 in bo
 osting the performance of large 3D reconstruction models. MVImgNet2.0 will
  be public at luyues.github.io/mvimgnet2, including multi-view images of a
 ll 520k objects, the reconstructed high-quality point clouds, and data ann
 otation codes, hoping to inspire the broader vision community.\n\nRegistra
 tion Category: Full Access, Full Access Supporter\n\nLanguage Format: Engl
 ish Language\n\nSession Chair: Bernhard Kerbl (Technical University of Vie
 nna)
URL:https://asia.siggraph.org/2024/program/?id=papers_905&sess=sess104
END:VEVENT
END:VCALENDAR