Keynote Speakers

 

Monday, November 3

 

Title: Deep Convolution Networks as Geometric Image Representations


Until recently, computer vision algorithms mostly resulted from geometric considerations over shapes, textures, motion and 3D perspective projections. Deep learning algorithms seem to ignore these geometric considerations, while providing state of the art classification results on complex image data bases. It is thus time to wonder what type of image information is extracted by these deep neural networks and why do they work so well.


We show that deep convolution networks provide remarkable architectures to represent geometric image information, including shapes, textures and complex structured objects. They compute descriptors having appropriate invariance and stability to geometric transformations. The properties of these deep representations have strong similarities with Gestalt perception. However, the network filters do not need to be learned and can adapted to prior information on geometry, with multiscale wavelets. This brings us back from learning everything to more traditional computer vision approaches. Classification results will be shown on multiple image data bases, with linear SVM applied to network outputs.


Stéphane Mallat
École Normale Supérieure


Biography


Stéphane Mallat received the Ph.D. degree in electrical engineering from the University of Pennsylvania, in 1988. He was then Professor at the Courant Institute of Mathematical Sciences. In 1995, he became Professor in Applied Mathematics at Ecole Polytechnique, Paris. From 2001 to 2007 he was co-founder and CEO of a semiconductor start-up company. In 2012 he joined the Computer Science Department of Ecole Normale Supérieure, in Paris.


Stéphane Mallat’s research interests include signal processing, computer vision, harmonic analysis and learning. He wrote a book entitled “Wavelet tour of signal processing: the sparse way”. In 1997, he received the Outstanding Achievement Award from the SPIE Society and was a plenary lecturer at the International Congress of Mathematicians in 1998. He also received the 2004 European IST Grand prize, the 2004 INIST-CNRS prize for most cited French researcher in engineering and computer science, and the 2007 EADS grand prize of the French Academy of Sciences.

 

Tuesday, November 4

 

Title: How Changing Mobile and Media Technologies is Changing The Way We Create Innovations


According to Schumpeter's definition of "Innovation," all the innovation instances are combinations of technologies that already exist. In that context, this talk covers the combination of progress of mobile network technologies and media understanding technologies. When the first mobile phones of 2nd Generation came out, people thought it was only for speech communication and texting. Then the phones got cameras, GPS. accelerometers, near filed communication devices together with fat communication pipes, people in all walks of life started using mobile phones in various opportunities. It is noteworthy that the progress of media understanding applications remarkably creates big data in a way of virtuous cycle by service and technology developments. This talk also highlights big-data driven service developments and API strategies for mobile innovations as well as technologies.


Minoru Etoh
NTT Docomo


Biography


Dr. Minoru “Mick” Etoh has several professional and academic roles. He is a Senior Vice President of NTT DOCOMO in charge of innovation management. He is also President and CEO of NTT DOCOMO Ventures (the VC firm of NTT Group), and of DOCOMO Capital, Inc. (Palo Alto, California).. He has written several books and more than a hundred journal papers on network architecture, terminal software, coding technologies, media transport, information retrieval, and data mining. Through those activities, he is recognized as one of major contributors to H.264 standard for Engineering Emmy Award 2008.

 

Wednesday, November 5

 

Title: RGB-D Perception in Robotics


RGB-D cameras provide per pixel color and depth information at high frame rate and resolution. Gaming and entertainment applications such as the Microsoft Kinect system resulted in the mass production of RGB-D cameras at extremely low cost, also making them available for a wide range of robotics applications. In this talk, I will provide an overview of depth camera research done in the Robotics and State Estimation Lab over the last five years. This work includes 3D mapping, autonomous object modeling, unsupervised feature learning for object recognition, and articulated object tracking.


Dieter Fox
University of Washington


Biography


Dieter Fox is a Professor in the Department of Computer Science & Engineering at the University of Washington, where he heads the UW Robotics and State Estimation Lab. From 2009 to 2011, he was also Director of the Intel Research Labs Seattle. He currently serves as the academic PI of the Intel Science and Technology Center for Pervasive Computing hosted at UW. Dieter obtained his Ph.D. from the University of Bonn, Germany. Before going to UW, he spent two years as a postdoctoral researcher at the CMU Robot Learning Lab. Fox's research is in artificial intelligence, with a focus on state estimation applied to robotics and activity recognition. He has published over 150 technical papers and is co-author of the text book "Probabilistic Robotics". He is a fellow of the AAAI and received several best paper awards at major robotics and AI conferences. He is an editor of the IEEE Transactions on Robotics, was program co-chair of the 2008 AAAI Conference on Artificial Intelligence, and served as the program chair of the 2013 Robotics: Science and Systems conference.