Today, two of my Ph. D. Students defended their Dissertations. Back to back. Congrats to both as they are both done. Thesis title: Surgical Skill Assessment Using Motion Texture analysis Student: Yachna Sharma, Ph. D. Candidate in ECE http://users.ece.gatech.edu/~ysharma3/ Date/Time : 2nd April, 1:00 pm Title : Temporally Consistent Semantic Segmentation in Videos S. Hussain […]
Paper in CVIU 2013 "A Visualization Framework for Team Sports Captured using Multiple Static Cameras"
Abstract We present a novel approach for robust localization of multiple people observed using a set of static cameras. We use this location information to generate a visualization of the virtual offside line in soccer games. To compute the position of the offside line, we need to localize players′ positions, and identify their team roles. […]
Paper in ACM Ubicomp 2013 "Technological approaches for addressing privacy concerns when recognizing eating behaviors with wearable cameras"
Abstract First-person point-of-view (FPPOV) images taken by wearable cameras can be used to better understand people’s eating habits. Human computation is a way to provide effective analysis of FPPOV images in cases where algorithmic approaches currently fail. However, privacy is a serious concern. We provide a framework, the privacy-saliency matrix, for understanding the balance between […]
Paper in ACM KDD 2013 "Detecting insider threats in a real corporate database of computer usage activity"
Abstract This paper reports on methods and results of an applied research project by a team consisting of SAIC and four universities to develop, integrate, and evaluate new approaches to detect the weak signals characteristic of insider threats on organizations’ information systems. Our system combines structural and semantic information from a real corporate database of […]
Paper in IEEE CVPR 2013 "Augmenting Bag-of-Words: Data-Driven Discovery of Temporal and Structural Information for Activity Recognition"
Abstract We present data-driven techniques to augment Bag of Words (BoW) models, which allow for more robust modeling and recognition of complex long-term activities, especially when the structure and topology of the activities are not known a priori. Our approach specifically addresses the limitations of standard BoW approaches, which fail to represent the underlying temporal […]
Paper in ECCV Workshop 2012: “Weakly Supervised Learning of Object Segmentations from Web-Scale Videos”
Citation Abstract We propose to learn pixel-level segmentations of objects from weakly labeled (tagged) internet videos. Especially, given a large collection of raw YouTube content, along with potentially noisy tags, our goal is to automatically generate spatiotemporal masks for each object, such as dog”, without employing any pre-trained object detectors. We formulate this problem as […]
Citation Abstract We demonstrate the automatic transfer of an assembly task from human to robot. This work extends efforts showing the utility of linguistic models in veriﬁable robot control policies by now performing real visual analysis of human demonstrations to automatically extract a policy for the task. This method tokenizes each human demonstration into a […]
At IEEE CVPR 2012 is in Providence RI, from Jun 16 – 21, 2012. Busy week ahead meeting good friends and colleagues. Here are some highlights of what my group is involved with. Paper in Main Conference K. Kim, D. Lee, and I. Essa (2012), “Detecting Regions of Interest in Dynamic Scenes with Camera Motions,” in […]
Detecting Regions of Interest in Dynamic Scenes with Camera Motions Abstract We present a method to detect the regions of interests in moving camera views of dynamic scenes with multiple mov- ing objects. We start by extracting a global motion tendency that reflects the scene context by tracking movements of objects in the scene. We […]
Presentation at CVPR 2012 workshop on Large Scale Video Search and Mining "Extracting Content and Context from Video."
Extracting Content and Context from Video. (Presentation at CVPR 2012 workshop on Large Scale Video Search and Mining 2012, June 21, 2012) Irfan Essa GEORGIA Tech prof.irfanessa.com Abstract In this talk, I will describe various efforts aimed at extracting context and content from video. I will highlight some of our recent work in extracting spatio-temporal […]