BEGIN:VCALENDAR
VERSION:2.0
PRODID:Linklings LLC
BEGIN:VTIMEZONE
TZID:Australia/Melbourne
X-LIC-LOCATION:Australia/Melbourne
BEGIN:DAYLIGHT
TZOFFSETFROM:+1000
TZOFFSETTO:+1100
TZNAME:AEDT
DTSTART:20081005T020000
RRULE:FREQ=YEARLY;BYMONTH=10;BYDAY=1SU
END:DAYLIGHT
BEGIN:STANDARD
DTSTART:20080406T030000
TZOFFSETFROM:+1100
TZOFFSETTO:+1000
TZNAME:AEST
RRULE:FREQ=YEARLY;BYMONTH=4;BYDAY=1SU
END:STANDARD
END:VTIMEZONE
BEGIN:VEVENT
DTSTAMP:20260114T163641Z
LOCATION:Meeting Room C4.11\, Level 4 (Convention Centre)
DTSTART;TZID=Australia/Melbourne:20231215T151500
DTEND;TZID=Australia/Melbourne:20231215T152500
UID:siggraphasia_SIGGRAPH Asia 2023_sess138_papers_433@linklings.com
SUMMARY:UVDoc: Neural Grid-based Document Unwarping
DESCRIPTION:Floor Verhoeven, Tanguy Magne, and Olga Sorkine-Hornung (ETH Z
 urich)\n\nRestoring the original, flat appearance of a printed document fr
 om casual photographs of bent and wrinkled pages is a common everyday prob
 lem. In this paper we propose a novel method for grid-based single-image d
 ocument unwarping. Our method performs geometric distortion correction via
  a fully convolutional deep neural network that learns to predict the 3D g
 rid mesh of the document and the corresponding 2D unwarping grid in a mult
 i-task fashion, implicitly encoding the coupling between the shape of a 3D
  piece of paper and its 2D image. In order to allow unwarping models to tr
 ain on data that is more realistic in appearance than the commonly used sy
 nthetic Doc3D dataset we create and publish our own dataset, called UVDoc,
  which combines pseudo-photorealistic document images with physically accu
 rate 3D shape and unwarping function annotations. Our dataset is labeled w
 ith all the information necessary to train our unwarping network, without 
 having to engineer separate loss functions that can deal with the lack of 
 ground truth typically found in document-in-the-wild datasets. We perform 
 an in-depth evaluation that demonstrates that with the inclusion of our no
 vel pseudo-photorealistic dataset, our relatively small network architectu
 re achieves state-of-the-art results on the DocUNet benchmark. We show tha
 t the pseudo-photorealistic nature of our UVDoc dataset allows for new and
  better evaluation methods, such as lighting-corrected MS-SSIM. We provide
  a novel benchmark dataset that facilitates such evaluations, and propose 
 a metric that quantifies line straightness after unwarping. Our code, resu
 lts and UVDoc dataset will be made publicly available upon publication.\n\
 nRegistration Category: Full Access\n\nSession Chair: Seung-Hwan Baek (POS
 TECH)\n\n
URL:https://asia.siggraph.org/2023/full-program?id=papers_433&sess=sess138
END:VEVENT
END:VCALENDAR