Skip to the content Skip to the Navigation

Irfan Essa

  • Home
  • Blog
  • Publications
  • Team
  • Videos
  • Teaching
  • FAQ
  • Contact
Blog
  1. HOME
  2. Blog
  3. June 2019

June 2019

June 17, 2019 / Last updated : February 21, 2021 irfan CVPR

Paper in CVPR 2019 on “Embodied Question Answering in Photorealistic Environments with Point Cloud Perception”

Abstract To help bridge the gap between internet vision-style problems and the goal of vision for embodied perception we instantiate a large-scale navigation task – Embodied Question Answering in photo-realistic environments (Matterport 3D). We thoroughly study navigation policies that utilize 3D point clouds, RGB images, or their combination. Our analysis of these models reveals several […]

June 17, 2019 / Last updated : February 21, 2021 irfan CVPR

Paper in CVPR 2019 on “Audio visual scene-aware dialog”

Abstract We introduce the task of scene-aware dialog. Our goal is to generate a complete and natural response to a question about a scene, given video and audio of the scene and the history of previous turns in the dialog. To answer successfully, agents must ground concepts from the question in the video while leveraging […]

Recent Posts

Paper in ACM CHI 2021 on “Automatic Generation of Two-Level Hierarchical Tutorials from Instructional Makeup Videos”
February 25, 2021
Research Opportunities at Google Atlanta
February 15, 2021
Paper in AAAI 2021 on “Semantic MapNet: Building Allocentric Semantic Maps and Representations from Egocentric Views”
February 2, 2021
Paper in ACM UIST 2020 on “Automatic Video Creation From a Web Page”
October 28, 2020
Paper in ECCV 2020 on “Neural Design Network: Graphic Layout Generation with Constraints”
August 25, 2020
Panel of ML@GT Researchers working on Covid-19 Relief
June 24, 2020
Invited Speaker at CVPR 2020 Workshop on “AI for Content Creation”
June 15, 2020
Paper in ICLR 2020 on “DD-PPO: Learning Near-Perfect PointGoal Navigators from 2.5 Billion Frames”
April 27, 2020
Keynote Speaker at CNS/AANS Spine Summit 2020, Las Vegas, Nevada, March 6, 2020, on the topic of “Data-driven Innovation”
March 6, 2020
Paper in ICCV Workshop on Geometry Meets Deep Learning Workshop on “Floors are Flat: Leveraging Semantics for Real-Time Surface Normal Prediction”
November 2, 2019

Tags

ACM (22) Activity Assessment (5) Activity Recognition (52) Affective Computing (9) Artificial Intelligence (18) Audio Analysis (9) Awards (11) Aware Home (17) Behavioral Imaging (11) Best Paper Award (8) Computational Journalism (43) Computational Photography (67) Computational Video (69) Computer Animation (10) Computer Graphics (9) Computer Vision (107) Crowdsourcing (8) CVPR (27) DVFX (10) Events (9) Faces (12) Funding (7) Gesture (7) Google (14) HCI (6) Health (7) ICCV (8) IEEE (30) Machine Learning (30) Medical (10) News (21) NSF (17) NSF-0205507 (10) PhD Thesis (14) Presentations (29) Robotics (7) SIGGRAPH (8) Sports Visualization (6) Teaching (22) Ubiquitous Computing (5) Video Segmentation (7) Video Stabilization (14) Video Textures (5) WACV (7) Wearable Computing (7)

More about this Website

  • About
    • Tags & Categories
    • Archives
    • Copyright
    • Privacy Policy

Meta

  • Log in
  • Entries feed
  • Comments feed
  • WordPress.org

Copyright © Irfan Essa All Rights Reserved.

Powered by WordPress with Lightning Theme & VK All in One Expansion Unit by Vektor,Inc. technology.

PAGE TOP
MENU
  • Home
  • Blog
  • Publications
  • Team
  • Videos
  • Teaching
  • FAQ
  • Contact