Abstract We introduce HowToCut, an automatic approach that converts a Markdown-formatted tutorial into an interactive video that presents the visual instructions with a synthesized voiceover for narration. HowToCut extracts instructional content from a multimedia document that describes a step-by-step procedure. Our method selects and converts text instructions to a voiceover. It makes automatic editing decisions […]
Paper in ACM CHI 2021 on “Automatic Generation of Two-Level Hierarchical Tutorials from Instructional Makeup Videos”
We present a multi-modal approach for automatically generating hierarchical tutorials from instructional makeup videos. Our approach is inspired by prior research in cognitive psychology, which suggests that people mentally segment procedural tasks into event hierarchies, where coarse-grained events focus on objects while fine-grained events focus on actions. In the instructional makeup domain, we find that coarse-grained events correspond to facial parts while fine-grained steps correspond to actions on those facial parts. Given an input instructional makeup video, we apply a set of heuristics that combine computer vision techniques with transcript text analysis to automatically identify the fine-level action steps and group these steps by facial part to form the coarse-level events. We provide a voice-enabled, mixed-media UI to visualize the resulting hierarchy and allow users to efficiently navigate the tutorial (e.g., skip ahead, return to previous steps) at their own pace. Users can navigate the hierarchy at both the facial-part and action-step levels using click-based interactions and voice commands. We demonstrate the effectiveness of our segmentation algorithms and the resulting mixed-media UI on a variety of input makeup videos. A user study shows that users prefer following instructional makeup videos in our mixed-media format to the standard video UI, and that they find our format much easier to navigate.
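The grouping step described above can be sketched in a few lines: given a list of fine-level action steps (each already labeled with a facial part by the paper's vision-and-transcript heuristics), consecutive steps on the same facial part are merged into one coarse-level event. The step data, field names, and function below are hypothetical illustrations, not the paper's actual implementation.

```python
from itertools import groupby

# Hypothetical fine-level steps: (start_sec, end_sec, facial_part, action).
# In the paper these come from vision + transcript heuristics; here they
# are hard-coded purely for illustration.
steps = [
    (0, 12, "eyes", "apply primer"),
    (12, 30, "eyes", "blend eyeshadow"),
    (30, 41, "eyes", "draw eyeliner"),
    (41, 55, "lips", "line lips"),
    (55, 70, "lips", "apply lipstick"),
    (70, 88, "cheeks", "apply blush"),
]

def group_by_facial_part(steps):
    """Merge consecutive fine-level steps on the same facial part
    into one coarse-level event spanning their combined interval."""
    events = []
    for part, run in groupby(steps, key=lambda s: s[2]):
        run = list(run)
        events.append({
            "facial_part": part,
            "start": run[0][0],
            "end": run[-1][1],
            "steps": [s[3] for s in run],
        })
    return events

for ev in group_by_facial_part(steps):
    print(f"{ev['facial_part']}: {ev['start']}-{ev['end']}s, {len(ev['steps'])} steps")
# → eyes: 0-41s, 3 steps / lips: 41-70s, 2 steps / cheeks: 70-88s, 1 steps
```

Note that `groupby` only merges *consecutive* runs, which matches the two-level hierarchy here: if the presenter returns to a facial part later in the video, that becomes a separate coarse event rather than being folded into the earlier one.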
Paper Abstract The massive growth of sports videos has resulted in a need for automatic generation of sports highlights that are comparable in quality to the hand-edited highlights produced by broadcasters such as ESPN. Unlike previous works that mostly use audio-visual cues derived from the video, we propose an approach that additionally leverages contextual cues […]
Paper in Ubicomp 2015: "A Practical Approach for Recognizing Eating Moments with Wrist-Mounted Inertial Sensing"
Paper Abstract Recognizing when eating activities take place is one of the key challenges in automated food intake monitoring. Despite progress over the years, most proposed approaches have been largely impractical for everyday usage, requiring multiple on-body sensors or specialized devices such as neck collars for swallow detection. In this paper, we describe the implementation […]
Paper in ACM IUI 2015: “Inferring Meal Eating Activities in Real-World Settings from Ambient Sounds: A Feasibility Study”
Abstract Dietary self-monitoring has been shown to be an effective method for weight loss, but it remains an onerous task despite recent advances in food journaling systems. Semi-automated food journaling can reduce the effort of logging, but often requires that eating activities be detected automatically. In this work we describe results from a feasibility study […]
Paper in ACM Ubicomp 2013 "Technological approaches for addressing privacy concerns when recognizing eating behaviors with wearable cameras"
Abstract First-person point-of-view (FPPOV) images taken by wearable cameras can be used to better understand people’s eating habits. Human computation is a way to provide effective analysis of FPPOV images in cases where algorithmic approaches currently fail. However, privacy is a serious concern. We provide a framework, the privacy-saliency matrix, for understanding the balance between […]
Paper in ACM KDD 2013 “Detecting insider threats in a real corporate database of computer usage activity”
Abstract This paper reports on methods and results of an applied research project by a team consisting of SAIC and four universities to develop, integrate, and evaluate new approaches to detect the weak signals characteristic of insider threats on organizations’ information systems. Our system combines structural and semantic information from a real corporate database of […]
At the ACM-sponsored 14th International Conference on Ubiquitous Computing (Ubicomp 2012), Pittsburgh, PA, September 5–7, 2012. Here are the highlights of my group’s participation in Ubicomp 2012. E. Thomaz, V. Bettadapura, G. Reyes, M. Sandesh, G. Schindler, T. Ploetz, G. D. Abowd, and I. Essa (2012), “Recognizing Water-Based Activities in the Home Through Infrastructure-Mediated […]
Paper (2009) ACM CHI: "Videolyzer: Quality Analysis of Online Informational Video for Bloggers and Journalists"
N. Diakopoulos, S. Goldenberg, I. Essa (2009). “Videolyzer: Quality Analysis of Online Informational Video for Bloggers and Journalists.” ACM Conference on Human Factors in Computing Systems (CHI). April, 2009. [PDF] [Project Site] [Video] Abstract Tools to aid people in making sense of the information quality […]
Matthew Flagg, Atsushi Nakazawa, Qiushuang Zhang, Sing Bing Kang, Young Kee Ryu, Irfan Essa, James M. Rehg (2009), Human Video Textures In Proceedings of the ACM Symposium on Interactive 3D Graphics and Games 2009 (I3D ’09), Boston, MA, February 27–March 1 (Fri–Sun), 2009 [PDF (see Copyright) | Video in DivX | Website] Abstract This paper describes a data-driven approach […]