A searchable list of some of my publications is below. You can also access my publications from the following sites.

My ORCID is ORCID iD iconhttps://orcid.org/0000-0002-6236-2969

Publications:

Show all

1.

Kihyuk Sohn, Nataniel Ruiz, Kimin Lee, Daniel Castro Chin, Irina Blok, Huiwen Chang, Jarred Barber, Lu Jiang, Glenn Entis, Yuanzhen Li, Yuan Hao, Irfan Essa, Michael Rubinstein, Dilip Krishnan

StyleDrop: Text-to-Image Generation in Any Style Proceedings Article

In: Advances in Neural Information Processing Systems (NeurIPS), 2023.

Abstract | Links | BibTeX | Tags: arXiv, computer vision, generative AI, google, NeurIPS

2.

Lijun Yu, Yong Cheng, Zhiruo Wang, Vivek Kumar, Wolfgang Macherey, Yanping Huang, David A. Ross, Irfan Essa, Yonatan Bisk, Ming-Hsuan Yang, Kevin Murphy, Alexander G. Hauptmann, Lu Jiang

SPAE: Semantic Pyramid AutoEncoder for Multimodal Generation with Frozen LLMs Proceedings Article

In: Advances in Neural Information Processing Systems (NeurIPS), 2023.

Abstract | Links | BibTeX | Tags: arXiv, computational video, computer vision, generative AI, NeurIPS

3.

Nikolai Warner, Meera Hahn, Jonathan Huang, Irfan Essa, Vighnesh Birodkar

Text and Click inputs for unambiguous open vocabulary instance segmentation Proceedings Article

In: Proeedings of British Conference for Machine Vision (BMVC), 2023.

Abstract | Links | BibTeX | Tags: arXiv, BMVC, computer vision, google, image segmentation

4.

K. Niranjan Kumar, Irfan Essa, Sehoon Ha

Words into Action: Learning Diverse Humanoid Robot Behaviors using Language Guided Iterative Motion Refinement Proceedings Article

In: CoRL Workshop on Language and Robot Learning Language as Grounding (with CoRL 2023), 2023.

Abstract | Links | BibTeX | Tags: arXiv, CoRL, robotics, vision & language

5.

Kihyuk Sohn, Albert Shaw, Yuan Hao, Han Zhang, Luisa Polania, Huiwen Chang, Lu Jiang, Irfan Essa

Learning Disentangled Prompts for Compositional Image Synthesis Technical Report

2023.

Abstract | Links | BibTeX | Tags: arXiv, computer vision, generative AI, google, prompt engineering

6.

Harish Haresamudram, Irfan Essa, Thomas Ploetz

Towards Learning Discrete Representations via Self-Supervision for Wearables-Based Human Activity Recognition Technical Report

2023.

Abstract | Links | BibTeX | Tags: activity recognition, arXiv, wearable computing

7.

Apoorva Beedu, Zhile Ren, Varun Agrawal, Irfan Essa

VideoPose: Estimating 6D object pose from videos Technical Report

2021.

Abstract | Links | BibTeX | Tags: arXiv, computer vision, object detection, pose estimation

8.

Karan Samel, Zelin Zhao, Binghong Chen, Shuang Li, Dharmashankar Subramanian, Irfan Essa, Le Song

Neural Temporal Logic Programming Technical Report

2021.

Abstract | Links | BibTeX | Tags: activity recognition, arXiv, machine learning, openreview

9.

Dan Scarafoni, Irfan Essa, Thomas Ploetz

PLAN-B: Predicting Likely Alternative Next Best Sequences for Action Prediction Technical Report

no. arXiv:2103.15987, 2021.

Abstract | Links | BibTeX | Tags: activity recognition, arXiv, computer vision

10.

Erik Wijmans, Julian Straub, Dhruv Batra, Irfan Essa, Judy Hoffman, Ari Morcos

Analyzing Visual Representations in Embodied Navigation Tasks Technical Report

no. arXiv:2003.05993, 2020.

Abstract | Links | BibTeX | Tags: arXiv, embodied agents, navigation

11.

Jonathan C Balloch, Varun Agrawal, Irfan Essa, Sonia Chernova

Unbiasing Semantic Segmentation For Robot Perception using Synthetic Data Feature Transfer Technical Report

no. arXiv:1809.03676, 2018.

Abstract | Links | BibTeX | Tags: arXiv, robotics, scene understanding

12.

Steven Hickson, Anelia Angelova, Irfan Essa, Rahul Sukthankar

Object category learning and retrieval with weak supervision Technical Report

no. arXiv:1801.08985, 2018.

Abstract | Links | BibTeX | Tags: arXiv, computer vision, machine learning, object detection

13.

Huda Alamri, Vincent Cartillier, Raphael Gontijo Lopes, Abhishek Das, Jue Wang, Irfan Essa, Dhruv Batra, Devi Parikh, Anoop Cherian, Tim K Marks, Chiori Hori

Audio Visual Scene-Aware Dialog (AVSD) Challenge at DSTC7 Technical Report

no. arXiv:1806.00525, 2018.

Abstract | Links | BibTeX | Tags: arXiv, embodied agents, multimedia, vision & language

Other Publication Sites

A few more sites that aggregate research publications: Academic.edu, Bibsonomy, CiteULike, Mendeley.

      Copyright/About

      [Please see the Copyright Statement that may apply to the content listed here.]

      This list of publications is produced by using the teachPress plugin for WordPress.

      Leave a Reply

      Your email address will not be published. Required fields are marked *

      This site uses Akismet to reduce spam. Learn how your comment data is processed.