LNCS Homepage
ContentsAuthor IndexSearch

Simultaneous Detection and Segmentation

Bharath Hariharan1, Pablo Arbeláez1, 2, Ross Girshick1, and Jitendra Malik1

1University of California, Berkeley, USA
bharath2@eecs.berkeley.edu
arbelaez@eecs.berkeley.edu
rbg@eecs.berkeley.edu
malik@eecs.berkeley.edu

2Universidad de los Andes, Colombia

Abstract. We aim to detect all instances of a category in an image and, for each instance, mark the pixels that belong to it. We call this task Simultaneous Detection and Segmentation (SDS). Unlike classical bounding box detection, SDS requires a segmentation and not just a box. Unlike classical semantic segmentation, we require individual object instances. We build on recent work that uses convolutional neural networks to classify category-independent region proposals (R-CNN [16]), introducing a novel architecture tailored for SDS. We then use category-specific, top-down figure-ground predictions to refine our bottom-up proposals. We show a 7 point boost (16% relative) over our baselines on SDS, a 5 point boost (10% relative) over state-of-the-art on semantic segmentation, and state-of-the-art performance in object detection. Finally, we provide diagnostic tools that unpack performance and provide directions for future work.

Keywords: detection, segmentation, convolutional networks

LNCS 8695, p. 297 ff.

Full article in PDF | BibTeX


lncs@springer.com
© Springer International Publishing Switzerland 2014