A searchable list of some of my publications is below. You can also access my publications from the following sites.

My ORCID is ORCID iD iconhttps://orcid.org/0000-0002-6236-2969

Publications:

247 entries « 1 of 13 »
1.

Seung Hyun Lee, Jijun jiang, Yiran Xu, Zhuofang Li, Junjie Ke, Yinxiao Li, Junfeng He, Steven Hickson, Katie Datsenko, Sangpil Kim, Ming-Hsuan Yang, Irfan Essa, Feng Yang

Cropper: Vision-Language Model for Image Cropping through In-Context Learning Proceedings Article

In: IEEE/CVF Computer Vision and Pattern Recognition Conference (CVPR), 2025.

Abstract | Links | BibTeX | Tags: computer vision, CVPR, generative AI, generative media, google

2.

Kyungmin Lee, Xiaohang Li, Qifei Wang, Junfeng He, Junjie Ke, Ming-Hsuan Yang, Irfan Essa, Jinwoo Shin, Feng Yang, Yinxiao Li

Calibrated Multi-Preference Optimization for Aligning Diffusion Models Proceedings Article

In: IEEE/CVF Computer Vision and Pattern Recognition Conference (CVPR), 2025.

Abstract | Links | BibTeX | Tags: computer vision, CVPR, generative media, google, reinforcement learning

3.

Gong Zhang, Kihyuk Sohn, Meera Hahn, Humphrey Shi, Irfan Essa

FineStyle: Fine-grained Controllable Style Personalization for Text-to-image Models Proceedings Article

In: Advances in Neural Information Processing Systems (NeurIPS), 2024.

Abstract | Links | BibTeX | Tags: computer vision, generative AI, generative media, machine learning, NeurIPS

4.

Agrim Gupta, Lijun Yu, Kihyuk Sohn, Xiuye Gu, Meera Hahn, Li Fei-Fei, Irfan Essa, Lu Jiang, José Lezama

Photorealistic Video Generation with Diffusion Models Proceedings Article

In: European Conference on Computer Vision (ECCV), 2024.

Abstract | Links | BibTeX | Tags: arXiv, computational video, computer vision, generative AI, google

5.

Seung Hyun Lee, Yinxiao Li, Junjie Ke, Innfarn Yoo, Han Zhang, Jiahui Yu, Qifei Wang, Fei Deng, Glenn Entis, Junfeng He, Gang Li, Sangpil Kim, Irfan Essa, Feng Yang

Parrot: Pareto-optimal multi-reward reinforcement learning framework for text-to-image generation (inproceedings) Proceedings Article

In: Proceedings of European Conference on Computer Vision (ECCV) , 2024.

Abstract | Links | BibTeX | Tags: arXiv, computer vision, ECCV, generative AI, google, reinforcement learning

6.

Dan Kondratyuk, Lijun Yu, Xiuye Gu, José Lezama, Jonathan Huang, Grant Schindler, Rachel Hornung, Vighnesh Birodkar, Jimmy Yan, Ming-Chang Chiu, Krishna Somandepalli, Hassan Akbari, Yair Alon, Yong Cheng, Josh Dillon, Agrim Gupta, Meera Hahn, Anja Hauth, David Hendon, Alonso Martinez, David Minnen, Mikhail Sirotenko, Kihyuk Sohn, Xuan Yang, Hartwig Adam, Ming-Hsuan Yang, Irfan Essa, Huisheng Wang, David A. Ross, Bryan Seybold, Lu Jiang

VideoPoet: A large language model for zero-shot video generation Best Paper Proceedings Article

In: Proceedings of International Conference on Machine Learning (ICML), 2024.

Abstract | Links | BibTeX | Tags: arXiv, best paper award, computational video, computer vision, generative AI, google, ICML

7.

Xingqian Xu, Jiayi Guo, Zhangyang Wang, Gao Huang, Irfan Essa, Humphrey Shi

Prompt-Free Diffusion: Taking "Text" out of Text-to-Image Diffusion Models Proceedings Article

In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) , pp. 8682–8692, 2024.

Abstract | Links | BibTeX | Tags: arXiv, computer vision, CVPR, generative AI

8.

Harish Haresamudram, Irfan Essa, Thomas Plötz

A Washing Machine is All You Need? On the Feasibility of Machine Data for Self-Supervised Human Activity Recognition Proceedings Article

In: International Conference on Activity and Behavior Computing (ABC) 2024 , 2024.

Abstract | Links | BibTeX | Tags: activity recognition, behavioral imaging, wearable computing

9.

Lijun Yu, José Lezama, Nitesh B. Gundavarapu, Luca Versari, Kihyuk Sohn, David Minnen, Yong Cheng, Vighnesh Birodkar, Agrim Gupta, Xiuye Gu, Alexander G. Hauptmann, Boqing Gong, Ming-Hsuan Yang, Irfan Essa, David A. Ross, Lu Jiang

Language Model Beats Diffusion -- Tokenizer is Key to Visual Generation Proceedings Article

In: Proceedings of International Conference on Learning Representations (ICLR) , 2024.

Abstract | Links | BibTeX | Tags: AI, arXiv, computer vision, generative AI, google, ICLR

10.

Harish Haresamudram, Irfan Essa, Thomas Ploetz

Towards Learning Discrete Representations via Self-Supervision for Wearables-Based Human Activity Recognition Journal Article

In: Sensors, vol. 24, no. 4, 2024.

Abstract | Links | BibTeX | Tags: activity recognition, arXiv, wearable computing

11.

Lijun Yu, Yong Cheng, Zhiruo Wang, Vivek Kumar, Wolfgang Macherey, Yanping Huang, David A. Ross, Irfan Essa, Yonatan Bisk, Ming-Hsuan Yang, Kevin Murphy, Alexander G. Hauptmann, Lu Jiang

SPAE: Semantic Pyramid AutoEncoder for Multimodal Generation with Frozen LLMs Proceedings Article

In: Advances in Neural Information Processing Systems (NeurIPS), 2023.

Abstract | Links | BibTeX | Tags: arXiv, computational video, computer vision, generative AI, NeurIPS

12.

Kihyuk Sohn, Nataniel Ruiz, Kimin Lee, Daniel Castro Chin, Irina Blok, Huiwen Chang, Jarred Barber, Lu Jiang, Glenn Entis, Yuanzhen Li, Yuan Hao, Irfan Essa, Michael Rubinstein, Dilip Krishnan

StyleDrop: Text-to-Image Generation in Any Style Proceedings Article

In: Advances in Neural Information Processing Systems (NeurIPS), 2023.

Abstract | Links | BibTeX | Tags: arXiv, computer vision, generative AI, google, NeurIPS

13.

Nikolai Warner, Meera Hahn, Jonathan Huang, Irfan Essa, Vighnesh Birodkar

Text and Click inputs for unambiguous open vocabulary instance segmentation Proceedings Article

In: Proeedings of British Conference for Machine Vision (BMVC), 2023.

Abstract | Links | BibTeX | Tags: arXiv, BMVC, computer vision, google, image segmentation

14.

K. Niranjan Kumar, Irfan Essa, Sehoon Ha

Words into Action: Learning Diverse Humanoid Robot Behaviors using Language Guided Iterative Motion Refinement Proceedings Article

In: CoRL Workshop on Language and Robot Learning Language as Grounding (with CoRL 2023), 2023.

Abstract | Links | BibTeX | Tags: arXiv, CoRL, robotics, vision & language

15.

K. Niranjan Kumar, Irfan Essa, Sehoon Ha

Cascaded Compositional Residual Learning for Complex Interactive Behaviors Journal Article

In: IEEE Robotics and Automation Letters, vol. 8, iss. 8, pp. 4601–4608, 2023.

Abstract | Links | BibTeX | Tags: IEEE, reinforcement learning, robotics

16.

Kihyuk Sohn, Albert Shaw, Yuan Hao, Han Zhang, Luisa Polania, Huiwen Chang, Lu Jiang, Irfan Essa

Learning Disentangled Prompts for Compositional Image Synthesis Technical Report

2023.

Abstract | Links | BibTeX | Tags: arXiv, computer vision, generative AI, google, prompt engineering

17.

Kihyuk Sohn, Yuan Hao, José Lezama, Luisa Polania, Huiwen Chang, Han Zhang, Irfan Essa, Lu Jiang

Visual Prompt Tuning for Generative Transfer Learning Proceedings Article

In: IEEE/CVF Computer Vision and Pattern Recognition Conference (CVPR), 2023.

Abstract | Links | BibTeX | Tags: computer vision, CVPR, generative AI, generative media, google

18.

Lijun Yu, Yong Cheng, Kihyuk Sohn, José Lezama, Han Zhang, Huiwen Chang, Alexander G. Hauptmann, Ming-Hsuan Yang, Yuan Hao, Irfan Essa, Lu Jiang

MAGVIT: Masked Generative Video Transformer Proceedings Article

In: IEEE/CVF Computer Vision and Pattern Recognition Conference (CVPR), 2023.

Abstract | Links | BibTeX | Tags: computational video, computer vision, CVPR, generative AI, generative media, google

19.

Dina Bashkirova, José Lezama, Kihyuk Sohn, Kate Saenko, Irfan Essa

MaskSketch: Unpaired Structure-guided Masked Image Generation Proceedings Article

In: IEEE/CVF Computer Vision and Pattern Recognition Conference (CVPR), 2023.

Abstract | Links | BibTeX | Tags: computer vision, CVPR, generative AI, generative media, google

20.

Erik Wijmans, Manolis Savva, Irfan Essa, Stefan Lee, Ari S. Morcos, Dhruv Batra

Emergence of Maps in the Memories of Blind Navigation Agents Best Paper Proceedings Article

In: Proceedings of International Conference on Learning Representations (ICLR), 2023.

Abstract | Links | BibTeX | Tags: awards, best paper award, computer vision, google, ICLR, machine learning, robotics

247 entries « 1 of 13 »

Other Publication Sites

A few more sites that aggregate research publications: Academic.edu, Bibsonomy, CiteULike, Mendeley.

Copyright/About

[Please see the Copyright Statement that may apply to the content listed here.]

This list of publications is produced by using the teachPress plugin for WordPress.

Leave a Reply

Your email address will not be published. Required fields are marked *

This site uses Akismet to reduce spam. Learn how your comment data is processed.