Skip to the content Skip to the Navigation

Irfan Essa

  • Home
  • Blog
  • Publications
  • Team
  • Videos
  • Teaching
  • FAQ
  • Contact

Blog

  1. HOME
  2. Blog
September 10, 2015 / Last updated : March 25, 2023 irfan Presentations

Presentation at Max-Planck-Institute for Intelligent Systems in Tübingen (2015): "Data-Driven Methods for Video Analysis and Enhancement"

Irfan EssaGeorgia Institute of Technology Thursday, September 10, 2 pm,Max Planck House Lecture Hall (Spemannstr. 36)Hosted by Max-Planck-Institute for Intelligent Systems (Michael Black, Director of Perceiving Systems) Abstract In this talk, I will start with describing the pervasiveness of image and video content, and how such content is growing with the ubiquity of cameras.  I will use […]

September 8, 2015 / Last updated : March 25, 2023 irfan Ubicomp

Paper in Ubicomp 2015: "A Practical Approach for Recognizing Eating Moments with Wrist-Mounted Inertial Sensing"

Paper Abstract Recognizing when eating activities take place is one of the key challenges in automated food intake monitoring. Despite progress over the years, most proposed approaches have been largely impractical for everyday usage, requiring multiple onbody sensors or specialized devices such as neck collars for swallow detection. In this paper, we describe the implementation […]

September 7, 2015 / Last updated : March 25, 2023 irfan ISWC

Paper in ISWC 2015: "Predicting Daily Activities from Egocentric Images Using Deep Learning"

Paper Abstract We present a method to analyze images taken from a passive egocentric wearable camera along with contextual information, such as time and day of the week, to learn and predict the everyday activities of an individual. We collected a dataset of 40,103 egocentric images over 6 months with 19 activity classes and demonstrate […]

August 15, 2015 / Last updated : March 25, 2023 irfan Teaching

Fall 2015 Teaching: Computer Vision and Computational Photography for Online MSCS.

In the fall 2015 term, I am teaching two classes. Both for Georgia Tech’s Online MSCS program.

April 1, 2015 / Last updated : March 25, 2023 irfan IUI

Paper in ACM IUI15: “Inferring Meal Eating Activities in Real-World Settings from Ambient Sounds: A Feasibility Study”

Citation Abstract Dietary self-monitoring has been shown to be an effective method for weight loss, but it remains an onerous task despite recent advances in food journaling systems. Semi-automated food journaling can reduce the effort of logging but often requires that eating activities be detected automatically. In this work, we describe results from a feasibility […]

March 1, 2015 / Last updated : March 25, 2023 irfan Presentations

Participated in the KAUST Conference on Computational Imaging and Vision 2015

I was invited to participate and present at the King Abdullah University of Science & Technology Conference on Computational Imaging and Vision (CIV) March 1-4, 2015, Visual Computing Center (VCC) Invited Speakers included This event was hosted by the Visual Computing Center (Wolfgang Heidrich, Bernard Ghanem, Ganesh Sundaramoorthi). Daniel Castro also attended and presented a poster at the meeting.

January 6, 2015 / Last updated : February 22, 2021 irfan WACV

Paper in WACV (2015): “Semantic Instance Labeling Leveraging Hierarchical Segmentation"

Paper Abstract Most of the approaches for indoor RGBD semantic labeling focus on using pixels or superpixels to train a classifier. In this paper, we implement a higher level segmentation using a hierarchy of superpixels to obtain a better segmentation for training our classifier. By focusing on meaningful segments that conform more directly to objects, […]

January 6, 2015 / Last updated : February 22, 2021 irfan WACV

Paper in IEEE WACV (2015): “Finding Temporally Consistent Occlusion Boundaries using Scene Layout”

Citation Abstract We present an algorithm for finding temporally consistent occlusion boundaries in videos to support the segmentation of dynamic scenes. We learn occlusion boundaries in a pairwise Markov random field (MRF) framework. We first estimate the probability of a spatiotemporal edge being an occlusion boundary by using appearance, flow, and geometric features. Next, we […]

January 6, 2015 / Last updated : February 22, 2021 irfan WACV

Paper in IEEE WACV (2015): “Leveraging Context to Support Automated Food Recognition in Restaurants”

Citation Abstract The pervasiveness of mobile cameras has resulted in a dramatic increase in food photos, which are pictures reflecting what people eat. In this paper, we study how taking pictures of what we eat in restaurants can be used for the purpose of automating food journaling. We propose to leverage the context of where […]

January 6, 2015 / Last updated : February 22, 2021 irfan WACV

Paper in WACV (2015): “Egocentric Field-of-View Localization Using First-Person Point-of-View Devices”

Paper Abstract We present a technique that uses images, videos and sensor data taken from first-person point-of-view devices to perform egocentric field-of-view (FOV) localization. We define egocentric FOV localization as capturing the visual information from a person’s field-of-view in a given environment and transferring this information onto a reference corpus of images and videos of […]

Posts pagination

  • «
  • Page 1
  • …
  • Page 6
  • Page 7
  • Page 8
  • …
  • Page 26
  • »

Recent Posts

ACL 2025 paper (Awarded the “Best Social Impact Award”) on “AfriMed-QA: A Pan-African, Multi-Specialty, Medical Question-Answering Benchmark Dataset”
August 1, 2025
Sphere-WoZ
Wizard of Oz at the Las Vegas Sphere, using Google AI
June 13, 2025
CVPR 2025 paper on “Cropper: Vision-Language Model for Image Cropping through In-Context Learning”
June 13, 2025
CVPR 2025 paper on “Calibrated Multi-Preference Optimization for Aligning Diffusion Models”
June 13, 2025
Award-winning paper in ICML 2024 on “VideoPoet: A large language model for zero-shot video generation.”
July 22, 2024
ACM SIGGRAPH Seminal Graphics Papers, Volume 2. Published as part of SIGGRAPH 50th Anniversary Meeting in 2023
August 9, 2023
Paper in UIST 2023 on “Slide Gestalt: Automatic Structure Extraction in Slide Decks for Non-Visual Access”
April 23, 2023
Award-winning paper in ICLR 2023 on “Emergence of Maps in the Memories of Blind Navigation Agents”
March 22, 2023
Paper in ICLR 2023 on “Discrete Predictor-Corrector Diffusion Models for Image Synthesis”
March 10, 2023
Some recent publications for 2023
March 10, 2023

Tags

ACM (20) Activity Recognition (52) Affective Computing (9) Aging-in-place (5) AI (20) Audio Analysis (9) Awards (15) Aware Home (15) Behavioral Imaging (11) Best Paper Award (12) Computational Journalism (36) Computational Photography (62) Computational Video (71) Computer Animation (10) Computer Graphics (9) Computer Vision (117) CVPR (30) DVFX (9) ECCV (5) Events (7) Faces (12) Funding (7) Gesture (6) Google (24) HCI (8) Health (8) ICCV (8) ICLR (5) IEEE (30) Machine Learning (39) Medical (10) ML@GT (5) News (17) NSF (16) PhD Thesis (12) Presentations (28) Robotics (10) SIGGRAPH (7) Sports Visualization (6) Teaching (21) UIST (5) Video Segmentation (7) Video Stabilization (14) WACV (8) Wearable Computing (9)

More about this Website

  • About
    • Tags & Categories
    • Archives
    • Copyright
    • Privacy Policy

Meta

  • Log in
  • Entries feed
  • Comments feed
  • WordPress.org

Copyright © Irfan Essa All Rights Reserved.

Powered by WordPress with Lightning Theme & VK All in One Expansion Unit

MENU
  • Home
  • Blog
  • Publications
  • Team
  • Videos
  • Teaching
  • FAQ
  • Contact
PAGE TOP