ICCV Workshop MVR3D 2017

The recent improvements in the 3D sensing technologies have caused a remarkable amplification in the utilization of 3D data. 3D information has found tremendous use in Autonomous Driving, 3D Mapping, Quality Control, Drones and UAVs or Robot Guidance to name but a few applicative domains. These applications typically fuse different modalities such as range images, stereo triangulations, structure-from-motion reconstructions or laser scans. A common flexible representation governing all these are point clouds. Thus, in many of the applications that rely on multiple 3D acquisitions, good registration of point clouds is a prerequisite. Yet, when unstructured dense scans of large scenes are of concern, establishing the alignment in a fully automatic manner is far from being trivial -- a difficulty that is exacerbated when the scans in question are allowed to undergo locally non-rigid deformations due to miscalibration of the capturing device or object movement.

In such a complex scenario, researchers are now taking on the challenge of accurately auto-stitching tens of millions of unstructured/structured points that include symmetries, self-similarities and that do not admit scan-order constraints. This workshop will be dedicated to exploring the theoretical and practical aspects of obtaining multi-view global alignment and registration of scans captured by any 3D data modality. Our main objective is to gather together industry experts, academic researchers, and practitioners of 3D data acquisition and scene reconstruction into a lively environment for discussing methodologies and challenges raised by the emergence of large-scale 3D reconstruction applications; as a targeted topic venue, this workshop will offer participants a unique opportunity to network with a diverse but focused research community.

8 Speakers
1 Day

Call for Papers

The goal of this workshop is to push the frontier in the area of global multi-scan alignment. Focal points for discussions and solicited submissions include but are not limited to:
  • Global point cloud alignment
  • Multiview registration using scene priors
  • Learning methods for multiview correspondence estimation
  • 3D Object reconstruction from multiple views
  • Joint registration and segmentation of multiple scans
  • Joint matching of multiple non-rigid surfaces
  • Multiview object detection
  • Multi-object Instance reconstruction
  • Feature descriptors for multiview 3D matching
  • Multiview pose estimation
  • Joint processing of multiple point clouds
  • Pose averaging and error diffusion on graphs
  • Multiview stitching of 3D scans on mobile and embedded devices
  • Practical applications of multiple scan registration on large scale settings
  • Datasets and dataset methods for ground truth acquisition

An official call-for-papers is found here:

Download Call For Papers (CfP) PDF

Our speakers

Our invited speakers come from top research institutions and companies around the globe, and are leading figures in the topics covered by the workshop. This diverse selection will prove valuable for academic as well as industry researchers and practitioners. Both practical and theoretical aspects of multiview 3D computer vision will be covered by the invited lecturers.

Alex Bronstein

Alex
Bronstein

Associate Professor
Technion

Andrew Fitzgibbon

Andrew
Fitzgibbon

Partner Scientist
Microsoft Research

Radu Horaud

Radu
Horaud

Senior Research Scientist
INRIA

Vladlen Koltun

Vladlen
Koltun

Principal Scientist
Intel

Andreas Nüchter

Andreas
Nüchter

Professor
JMU Würtzburg

Luc Robert

Luc
Robert

Software Architect, Fellow
Bentley Systems

Konrad Schindler

Konrad
Schindler

Professor
ETH Zurich

Christopher Zach

Christopher
Zach

Research Scientist
Toshiba Europe

Dates

    Paper submission:
    July 27, 2017
    Notification of acceptance:
    August 18, 2017
    Camera ready:
    August 23, 2017
    Workshop date:
    October 29, 2017

* With these days, we hope to allow a sufficient time slot for re-submission of the unlucky ICCV papers.

Awards

Thanks to our generous supporters, we are able to grant several awards:

  • First author of the Best Paper will receive a software license for 3DF Zephyr Pro.
  • First author of the Best Student Paper will receive an Intel RealSense camera.
  • Authors of All Accepted Papers will receive licenses for 3DF Zephyr Lite software.

Papers

Paper submission is through Conference Management Toolkit. We are looking forward to submissions, following the enlisted guidelines:

  • Submitted papers should not have been published, accepted or under review elsewhere. Non-peer reviewed media, such as Arxiv do not violate the terms.
  • The submissions can be up to 8 pages (excluding references).
  • All the papers must be submitted using the provided templates.
  • All the submissions will be subject to double-blind review process. Therefore author names, affiliations, email addresses, personal acknowledgements, etc. should be removed from the paper.
  • Every accepted paper requires that at least one author has a workshop-inclusive ICCV registration
  • Here are further detailed guidelines.

All published papers will be included in ICCV 2017 proceedings. We accept manuscripts in Latex or Word, in the same format as ICCV 2017:

Download Latex and Word Templates

The schedule

We have a packed and exciting day ahead of us!

  • 0900

    Opening Remarks

    Good morning everybody! We gladly welcome you to our first workshop, MVR3D 2017.

  • 0915
    Christopher Zach

    Invited talk by Christopher Zach: The importance of good matches for 3D reconstruction

    Christopher Zach (PhD 2007 TU Graz) is currently a principal research scientist in the computer vision group at Toshiba Research Europe. Previous to that he had post-doctoral and senior researcher positions at UNC-Chapel Hill (2008–2009), ETH Zürich (2009–2011) and Microsoft Research Cambridge (2012–2014). His main research interests are structure from motion, dense 3D reconstruction from images, convex methods in computer vision and real-time computer vision.

    Abstract

    In the first part of my presentation I review several methods to detect false positive matches between images before they lead to distorted 3D models, which are difficult to rectify afterwards. These false positive matches are due to "perceptual aliasing" and occur frequently in man-made environments. Several complementary cues allow to identify these false positives. I also describe situations where perceptual aliasing actually can help to improve the reconstructed model. In the second part I will describe recent efforts to bypass several stages of a typical 3D reconstruction pipeline, and how to push the envelope of initialization-free bundle adjustment. Empirical evidence (and early-stage theoretical understanding) strongly suggests that the secret of obtaining faithful 3D models lies mainly in using mostly outlier-free correspondences. Thus, the often theoretically non-satisfactory steps to obtain a sufficiently good initial 3D model (and camera poses) from pairwise image matches might not be necessary in many cases, and one can directly apply a suitable version of bundle adjustment from random initial values.

    Speakers:
    Christopher Zach

    Christopher Zach

  • 0955
    Vladlen Koltun

    Invited talk by Vladlen Koltun: TBD

    Vladlen Koltun is the Director of the Intel Visual Computing Lab. He received a PhD in 2002 for new results in theoretical computational geometry, spent three years at UC Berkeley as a postdoc in the theory group, and joined the Stanford Computer Science faculty in 2005 as a theoretician. He switched to research in visual computing in 2007 and joined Intel as a Principal Researcher in 2015 to establish the Visual Computing Lab.

    Abstract

    TBD.

    Speakers:
    Vladlen Koltun

    Vladlen Koltun

  • 1035

    Morning Coffee Break

    We are proudly serving Italian coffee!

  • 1050

    Oral Session

    We look forward to novel and exciting oral presentations of a subset of the accepted papers. A slot of 15 minutes is allocated per presentation and we advise to spare 3 minutes for questions.

    Edge SLAM: Edge Points Based Monocular Visual SLAM
    Arindam Saha, Soumyadip Maity, Brojeshwar Bhowmick
    Probabilistic Surfel Fusion for Dense LiDAR Mapping
    Chanoh Park, Soohwan Kim, Peyman Moghadam, Clinton Fookes, Sridha Sridharan
    Computer Vision Meets Geometric Modeling: Multi-view Reconstruction of Surface Points and Normals using Affine Correspondences
    Iván Eichhardt, Levente Hajder
    Camera Pose Filtering with Local Regression Geodesics on the Riemannian Manifold of Dual Quaternions
    Benjamin Busam, Tolga Birdal, Nassir Navab
  • 1150
    Andrew Fitzgibbon

    Invited talk by Andrew Fitzgibbon: Optimization for reconstruction problems

    Andrew Fitzgibbon is a scientist with HoloLens at Microsoft, Cambridge, UK. He is best known for his work on 3D vision, having been a core contributor to the Emmy-award-winning 3D camera tracker “boujou” (www.boujou.com) and Kinect for Xbox 360, but his interests are broad, spanning computer vision, graphics, machine learning, and even a little neuroscience. He has published numerous highly-cited papers, and received many awards for his work, including ten “best paper” prizes at various venues, the Silver medal of the Royal Academy of Engineering, and the BCS Roger Needham award. He is a fellow of the Royal Academy of Engineering, the British Computer Society, and the International Association for Pattern Recognition. Before joining Microsoft in 2005, he was a Royal Society University Research Fellow at Oxford University, having previously studied at Edinburgh University, Heriot-Watt University, and University College, Cork.

    Abstract

    Sparse Gauss-Newton optimization, or “Bundle Adjustment”, is a crucial tool of 3D reconstruction. It is considered common knowledge that bundle adjustment has a small basin of convergence and needs a good initialization. For example, there are well known benchmark sequences where initialization from a random starting point using any of the mainstream bundle adjustment packages leads to essentially 0% chance of convergence to the known best optima. However, this situation is changing. In 2011, Okatani and others re-introduced the VarPro method to matrix factorization (after some “dark ages”, for which I take some blame for promulgating in 2005). This conferred remarkable improvements on the basin of convergence of that problem, which is, of course, the same problem as affine bundle adjustment. More recently, my student John Hong, working with Zach, Cipolla, and me, has shown how to bring the advantages of VarPro to projective bundle adjustment. I shall describe the main components of this work, and then speculate on future directions.

    Speakers:
    Andrew Fitzgibbon

    Andrew Fitzgibbon

  • 1230

    Lunch Break

    The smell of that delicious Italian cusine is irresistable.

  • 1320
    Konrad Schindler

    Invited talk by Konrad Schindler: Image-based reconstruction of semantically interpreted 3D city models

    Konrad Schindler received the Diplomingenieur (M.tech) degree in photogrammetry from Vienna University of Technology, Austria in 1999, and a PhD in computer science from Graz University of Technology, Austria, in 2003. He has worked as a photogrammetric engineer in the private industry, and has held researcher positions at Graz University of Technology, Monash University, and ETH Zurich. He became assistant professor of Image Understanding at TU Darmstadt in 2009, and since 2010 has been a tenured professor of Photogrammetry and Remote Sensing at ETH Zurich. His research interests lie in the field of computer vision, photogrammetry, and remote sensing, with a focus on image understanding and 3d reconstruction. Konrad has been president of ISPRS Technical Commission III “Photogrammetric Computer Vision and Image Analysis” for the period 2012-2016. He has received several awards, including the U. V. Helava Award 2012 for the best paper published in the ISPRS Journal 2008-2011 (with A. Ess, B. Leibe and L. Van Gool), and a honorable mentions for the Marr Prize at ICCV 2013 (with C. Vogel and S. Roth).

    Abstract

    TBD.

    Speakers:
    Konrad Schindler

    Konrad Schindler

  • 1400

    Poster Session

    Those heated discussions and further networking...

    A Use-Case Study on Multi-View Hypothesis Fusion for 3D Object Classification
    Panagiotis Papadakis
    Accurate Depth Map Estimation from Small Motions
    Hossein Javidnia, Peter Corcoran
    On Tablet 3D Structured Light Reconstruction and Registration
    Matea Donlic, Tomislav Petkovic, Tomislav Pribanic
    Multiview Absolute Pose Using 3D -- 2D Perspective Line Correspondences and Vertical Direction
    Nora Horanyi, Zoltan Kato
    Reference-based Light Field Super-resolution Using a Hybrid Imaging System
    Haitian Zheng, Minghao Guo, Haoqian Wang, Yebin Liu, Lu Fang
    A Content-aware Metric for Stitched Panoramic Image Quality Assessment
    Luyu Yang, Zhigang Tan, Zhe Huang, Gene Cheung
    KPPF: Keypoint-based Point-Pair-Feature for scalable automatic global registration of large RGB-D scans
    Lucas Malleus, Thomas Fisichella, Diane Lingrand, Frédéric Precioso, Nicolas Gros, Yann Noutary, Luc Robert, Lirone Samoun
    Edge SLAM: Edge Points Based Monocular Visual SLAM
    Arindam Saha, Soumyadip Maity, Brojeshwar Bhowmick
    Probabilistic Surfel Fusion for Dense LiDAR Mapping
    Chanoh Park, Soohwan Kim, Peyman Moghadam, Clinton Fookes, Sridha Sridharan
    Computer Vision Meets Geometric Modeling: Multi-view Reconstruction of Surface Points and Normals using Affine Correspondences
    Iván Eichhardt, Levente Hajder
    Camera Pose Filtering with Local Regression Geodesics on the Riemannian Manifold of Dual Quaternions
    Benjamin Busam, Tolga Birdal, Nassir Navab
  • 1500
    Alex Bronstein

    Invited talk by Alex Bronstein: Geometry and learning in 3D correspondence problems

    Dr. Alex Bronstein was born in 1980. He received the B.Sc. and M.Sc. (both summa cum laude) from the Department of Electrical Engineering in 2002 and 2005, and Ph.D. from the Department of Computer Science, Technion in 2007. Until 2016, Dr. Alex Bronstein was an Associate Professor in the School of Electrical Engineering at Tel Aviv University. In 2016, he has joined the Department of Computer Science at the Technion also as an Associate Professor. His main research interests are theoretical and computational methods in metric geometry and their application to problems in computer vision, pattern recognition, shape analysis, computer graphics, imaging and image processing, and machine learning. He has authored over 120 publications in leading journals and conferences, over two dozens of patents and patent applications, and the book Numerical geometry of non-rigid shapes (published by Springer). His h-index is 38. Alex Bronstein is the alumnus of the Technion Excellence Program and the Academy of Achievement, and a member of the IEEE. His research was recognized by numerous awards, including the Kasher prize (2002), Thomas Schwartz award (2002), Hershel Rich Technion Innovation award (2003), Gensler counter-terrorism prize (2003), the Copper Mountain Conference on Multigrid Methods Best Paper award (2005) and the Adams Fellowship (2006), the Krill Prize by Wolf Foundation (2012), and the European Research Council (ERC) Startup Grant (2013). Highlights of his research were featured in CNN, SIAM News, and in Prof. Guillermo Sapiro's Science Lecture "One small step for Gromov, one giant leap for shape analysis" that he gave in Oslo on the occasion of awarding Prof. Mikhail Gromov the 2009 Abel Prize, considered the "Nobel of Math". Besides scientific awards, Alex received the Technion Humanities and Arts Department prize (2001) for the translation of Shakespearean sonnets into Italian. He co-chaired the IEEE International Workshop on Non-rigid shapes and deformable image alignment (NORDIA) in 2008-2011, the International Conference on n Scale Space and Variational Methods in Computer Vision (SSVM) in 2011, served as the program chair of the Eurographics Workshop on 3D Object Retrieval (3DOR) in 2012, area chair of the IEEE Asian Conference on Computer Vision (ACCV) in 2010, and participated in program committees of major conferences in his field. Dr. Bronstein held visiting appointments in Politecnico di Milano (2008), Stanford university (2009), Verona University (2010,2014), and Duke University (from 2014). In addition to his academic activities, he was a co-founder of a Silicon Valley startup Novafora, Inc., where he served from 2004 till 2009 as a scientist and a Vice President of video technology, leading a group of researchers and engineers in developing novel Internet-scale video analysis technologies. Dr. Bronstein was one of the inventors and developers of the 3D sensing technology in the foundation of the Israeli startup Invision, subsequently acquired by Intel Corporation in 2012 and distributed under the RealSense brand.

    Abstract

    The need to compute correspondence between three-dimensional objects is a fundamental ingredient in numerous computer vision and graphics tasks. In this talk, I will show how several geometric notions related to the Laplacian spectrum provide a set of tools for efficiently calculating correspondence between deformable shapes. I will also show how this framework combined with recent ideas in deep learning promises to bring correspondence problems to new levels of accuracy.

    Speakers:
    Alex Bronstein

    Alex Bronstein

  • 1540

    Afternoon Coffee Break

    We are proudly serving Italian coffee!

  • 1555
    Radu Horaud

    Invited talk by Radu Horaud: Towards high-resolution reconstruction of people using a very large number of RGB and depth cameras

    Radu Patrice Horaud holds a position of director of research at INRIA Grenoble Rhône-Alpes, France. He is the founder and director of the PERCEPTION team. Radu’s research interests cover computational vision, audio signal processing, audio-visual scene analysis, machine learning, and robotics. He is the author of over 200 scientific publications. Radu pioneered work in computer vision using range data (or depth images) and developed a number of principles and methods at the cross-roads of computer vision and robotics. In 2006, he started to develop audio-visual fusion and recognition techniques in conjunction with human-robot interaction. He is an area editor for the CVIU (Elsevier), a member of the advisory board for the International JRR (Sage), and an associated editor for the IJCV (Kluwer-Springer). In 2001 he was program co-chair of the IEEE Eighth ICCV and in 2015 he was program co-chair of the 17th ACM International Conference on Multimodal Interaction (ICMI’15). Radu Horaud was the scientific coordinator of the European Marie Curie network VISIONTRAIN (2005-2009), STREP projects POP (2006-2008) and HUMAVIPS (2010-2013), and the principal investigator of a collaborative project between INRIA and Samsung’s Advanced Institute of Technology (SAIT) on computer vision algorithms for 3D television (2010-2013). In 2013 he was awarded an ERC Advanced Grant for his five year project VHIA (2014-2019). In 2015 he received a three year grant (jointly with Florence Forbes) from Xerox University Affairs Committee.

    Abstract

    TBD.

    Speakers:
    Radu Horaud

    Radu Horaud

  • 1635
    Andreas Nüchter

    Invited talk by Andreas Nüchter: Continuous-time SLAM for high-precise kinematic mapping of 3D point clouds

    Andreas Nüchter is professor of computer science (telematics) at University of Würzburg. Before summer 2013 he headed as assistant professor the Automation group at Jacobs University Bremen. Prior he was a research associate at University of Osnabrück. Further past affiliations were with the Fraunhofer Institute for Autonomous Intelligent Systems (AIS, Sankt Augustin), the University of Bonn, from which he received the diploma degree in computer science in 2002 (best paper award by the German society of informatics (GI) for his thesis) and the Washington State University. He holds a doctorate degree (Dr. rer. nat) from University of Bonn. His thesis was shortlisted for the EURON PhD award. Andreas works on robotics and automation, cognitive systems and artificial intelligence. His main research interests include reliable robot control, 3D environment mapping, 3D vision, and laser scanning technologies, resulting in fast 3D scan matching algorithms that enable robots to perceive and map their environment in 3D representing the pose with 6 degrees of freedom. The capabilities of these robotic SLAM approaches were demonstrated at RoboCup Rescue competitions, ELROB and several other events. He is a member of the GI and the IEEE.

    Abstract

    Mobile laser scanning puts high requirements on the accuracy of the positioning systems and the calibration of the measurement system. The talk describes a general framework for calibrating mobile sensor platforms that estimates all configuration parameters for any arrangement of positioning sensors. In addition, we present a novel Continuous-time Simultaneous Localization and Mapping (SLAM) algorithm that corrects the system position at every point in time along its trajectory, while simultaneously improving the quality and precision of the entire acquired point cloud. Using this algorithm. The talk demonstrate the capabilities of the algorithms on a wide variety of datasets, ranging from underground mining to improving Google's Cartographer result.

    Speakers:
    Andreas Nüchter

    Andreas Nüchter

  • 1705
    Luc Robert

    Invited talk by Luc Robert: Context Capture : Solving industry problems using multi-view relationships

    Luc Robert graduated from Ecole Polytechnique in 1988, and obtained his PhD in 3D computer vision, in 1993, from the National Research Institute for Computer Sciences and Automatics (Inria, France). After a 1-year post-doc at Carnegie Mellon University (Pittsburgh, USA) working on 3D vision for autonomous vehicles, he joined Inria as a research scientist in 1995. In 1998 he co-founded REALVIZ, a startup bringing technology from the lab to the industry of digital content creation. During the following ten years he led the development of the REALVIZ technology and products, that became industry leaders on the markets of 3D digital effects and panoramic photography. After the acquisition of REALVIZ by Autodesk in 2008, Luc drove reality capture technology development for the 123D Catch and ReCap products. In 2016, he joined the Bentley Systems team in charge of products and technology related to reality capture.

    Abstract

    TBD.

    Speakers:
    Luc Robert

    Luc Robert

  • 1745

    Closing Remarks

    Untill next time!

Our partners

MVR3D 2017 is enabled by our generous sponsors.

Siemens
Intel
3D Flow
DAIS - Dipartimento di Scienze Ambientali Informatica Statistica


Computer Vision Online

Program Committee

Your papers are in great hands! MVR3D 2017 is proudly backed by the following program committee composed of very influential computer vision researchers:


Ali Osman Ulusoy
Andrea Fusiello
Andrea Gasparetto
Andreas Geiger
Benjamin Busam
Bertram Drost
Edmond Boyer
Federico Tombari
Juergen Sturm
Luca Cosmo
Matthew Trager
Miroslava Slavcheva
Zorah Laehner
Srikumar Ramalingam
Stefan Hinterstoisser
Venu Madhav Govindu
Vincent Lepetit
Wadim Kehl


Of course this is not the entire list. Stay tuned for more.

Organizers

Here are the diligent people behind MVR3D 2017.

Tolga Birdal

Tolga
Birdal

PhD Candidate
TU Munich

Emanuele Rodolà

Emanuele
Rodolà

PostDoc Researcher
USI Lugano

Gul Varol

Gül
Varol

PhD Candidate
INRIA

Slobodan Ilic

Slobodan
Ilic

Principal Scientist
Siemens AG

Umberto Castellani

Umberto
Castellani

Assistant Professor
University of Verona

Andrea Torsello

Andrea
Torsello

Associate Professor
CF University Venice

FAQ

Latest news

Follow the recent hapennings here.

MVR3D 2017 Venue

  • Lungomare Guglielmo Marconi, 30, 30126 Lido di Venezia - Venice, Italy
  • E-mail : multiview.workshop@gmail.com
-->