Irfan Essa

First-Person Vision

January 6, 2015 / Last updated: February 22, 2021 / irfan / WACV

Paper in IEEE WACV (2015): “Leveraging Context to Support Automated Food Recognition in Restaurants”

Abstract: The pervasiveness of mobile cameras has resulted in a dramatic increase in food photos, which are pictures reflecting what people eat. In this paper, we study how taking pictures of what we eat in restaurants can be used for the purpose of automating food journaling. We propose to leverage the context of where […]

January 6, 2015 / Last updated: February 22, 2021 / irfan / WACV

Paper in WACV (2015): “Egocentric Field-of-View Localization Using First-Person Point-of-View Devices”

Abstract: We present a technique that uses images, videos and sensor data taken from first-person point-of-view devices to perform egocentric field-of-view (FOV) localization. We define egocentric FOV localization as capturing the visual information from a person’s field-of-view in a given environment and transferring this information onto a reference corpus of images and videos of […]

Copyright © Irfan Essa All Rights Reserved.
