Paper / Citation
We propose to learn pixel-level segmentations of objects from weakly labeled (tagged) internet videos. Specifically, given a large collection of raw YouTube content, along with potentially noisy tags, our goal is to automatically generate spatiotemporal masks for each tagged object, such as a “dog”, without employing any pre-trained object detectors. We formulate this problem as learning weakly supervised classifiers for a set of independent spatiotemporal segments. The object seeds obtained using segment-level classifiers are further refined using graph cuts to generate high-precision object masks. Our results, obtained by training on a dataset of 20,000 YouTube videos weakly tagged into 15 classes, demonstrate the automatic extraction of pixel-level object masks. Evaluated against a ground-truthed subset of 50,000 frames with pixel-level annotations, we confirm that our proposed methods can learn good object masks just by watching YouTube.
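The seed-then-refine idea in the abstract can be sketched as a binary s-t min-cut problem: segment-level classifier scores act as unary costs, and a smoothness term between neighboring segments pulls uncertain segments toward the label of their neighbors. The following is a minimal toy illustration (not the paper's actual model): the scores, the 1-D neighbor layout, and the smoothness weight are all assumed for demonstration.

```python
from collections import deque

def min_cut_mask(scores, smooth):
    """Binary foreground/background labeling by s-t min cut (Edmonds-Karp).

    scores: per-segment classifier scores in [0, 1] ("is this the object?").
    smooth: smoothness weight between adjacent segments (Potts-style penalty).
    """
    n = len(scores)
    SRC, SINK = n, n + 1
    cap = [[0.0] * (n + 2) for _ in range(n + 2)]
    for i, s in enumerate(scores):
        cap[SRC][i] = s          # cut cost if segment i is labeled background
        cap[i][SINK] = 1.0 - s   # cut cost if segment i is labeled foreground
    for i in range(n - 1):       # penalty when neighbors get different labels
        cap[i][i + 1] = smooth
        cap[i + 1][i] = smooth
    # Edmonds-Karp: push flow along shortest augmenting paths until none remain.
    while True:
        parent = {SRC: None}
        q = deque([SRC])
        while q and SINK not in parent:
            u = q.popleft()
            for v in range(n + 2):
                if v not in parent and cap[u][v] > 1e-12:
                    parent[v] = u
                    q.append(v)
        if SINK not in parent:
            break
        path, v = [], SINK
        while parent[v] is not None:
            path.append((parent[v], v))
            v = parent[v]
        bottleneck = min(cap[u][v] for u, v in path)
        for u, v in path:        # update residual capacities
            cap[u][v] -= bottleneck
            cap[v][u] += bottleneck
    # Foreground = segments still reachable from SRC in the residual graph.
    seen = {SRC}
    q = deque([SRC])
    while q:
        u = q.popleft()
        for v in range(n + 2):
            if v not in seen and cap[u][v] > 1e-12:
                seen.add(v)
                q.append(v)
    return [1 if i in seen else 0 for i in range(n)]

# Toy scores (assumed): naive thresholding at 0.5 would drop segment 2,
# but the smoothness term pulls it into the object mask.
scores = [0.9, 0.8, 0.45, 0.7, 0.1, 0.2]
print(min_cut_mask(scores, smooth=0.5))  # → [1, 1, 1, 1, 0, 0]
```

In the paper's setting the unary terms would come from the weakly supervised segment classifiers and the neighborhood structure from spatiotemporal adjacency; the toy chain above only illustrates how graph-cut refinement turns noisy seeds into a coherent mask.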
- Presented at: ECCV 2012 Workshop on Web-scale Vision and Social Media, October 7–12, 2012, Florence, Italy.
- Awarded the Best Paper Award!