A searchable list of some of my publications is below. You can also access my publications from the following sites.

My ORCID is ORCID iD iconhttps://orcid.org/0000-0002-6236-2969

Publications:

Show all

30 entries « 1 of 2 »
1.

Kihyuk Sohn, Nataniel Ruiz, Kimin Lee, Daniel Castro Chin, Irina Blok, Huiwen Chang, Jarred Barber, Lu Jiang, Glenn Entis, Yuanzhen Li, Yuan Hao, Irfan Essa, Michael Rubinstein, Dilip Krishnan

StyleDrop: Text-to-Image Generation in Any Style Proceedings Article

In: Advances in Neural Information Processing Systems (NeurIPS), 2023.

Abstract | Links | BibTeX | Tags: arXiv, computer vision, generative AI, google, NeurIPS

2.

Nikolai Warner, Meera Hahn, Jonathan Huang, Irfan Essa, Vighnesh Birodkar

Text and Click inputs for unambiguous open vocabulary instance segmentation Proceedings Article

In: Proeedings of British Conference for Machine Vision (BMVC), 2023.

Abstract | Links | BibTeX | Tags: arXiv, BMVC, computer vision, google, image segmentation

3.

Dina Bashkirova, José Lezama, Kihyuk Sohn, Kate Saenko, Irfan Essa

MaskSketch: Unpaired Structure-guided Masked Image Generation Proceedings Article

In: IEEE/CVF Computer Vision and Pattern Recognition Conference (CVPR), 2023.

Abstract | Links | BibTeX | Tags: computer vision, CVPR, generative AI, generative media, google

4.

Lijun Yu, Yong Cheng, Kihyuk Sohn, José Lezama, Han Zhang, Huiwen Chang, Alexander G. Hauptmann, Ming-Hsuan Yang, Yuan Hao, Irfan Essa, Lu Jiang

MAGVIT: Masked Generative Video Transformer Proceedings Article

In: IEEE/CVF Computer Vision and Pattern Recognition Conference (CVPR), 2023.

Abstract | Links | BibTeX | Tags: computational video, computer vision, CVPR, generative AI, generative media, google

5.

Kihyuk Sohn, Yuan Hao, José Lezama, Luisa Polania, Huiwen Chang, Han Zhang, Irfan Essa, Lu Jiang

Visual Prompt Tuning for Generative Transfer Learning Proceedings Article

In: IEEE/CVF Computer Vision and Pattern Recognition Conference (CVPR), 2023.

Abstract | Links | BibTeX | Tags: computer vision, CVPR, generative AI, generative media, google

6.

Kihyuk Sohn, Albert Shaw, Yuan Hao, Han Zhang, Luisa Polania, Huiwen Chang, Lu Jiang, Irfan Essa

Learning Disentangled Prompts for Compositional Image Synthesis Technical Report

2023.

Abstract | Links | BibTeX | Tags: arXiv, computer vision, generative AI, google, prompt engineering

7.

José Lezama, Tim Salimans, Lu Jiang, Huiwen Chang, Jonathan Ho, Irfan Essa

Discrete Predictor-Corrector Diffusion Models for Image Synthesis Proceedings Article

In: International Conference on Learning Representations (ICLR), 2023.

Abstract | Links | BibTeX | Tags: computer vision, generative AI, generative media, google, ICLR, machine learning

8.

Erik Wijmans, Manolis Savva, Irfan Essa, Stefan Lee, Ari S. Morcos, Dhruv Batra

Emergence of Maps in the Memories of Blind Navigation Agents Best Paper Proceedings Article

In: Proceedings of International Conference on Learning Representations (ICLR), 2023.

Abstract | Links | BibTeX | Tags: awards, best paper award, computer vision, google, ICLR, machine learning, robotics

9.

Yi-Hao Peng, Peggy Chi, Anjuli Kannan, Meredith Morris, Irfan Essa

Slide Gestalt: Automatic Structure Extraction in Slide Decks for Non-Visual Access Proceedings Article

In: ACM Symposium on User Interface Software and Technology (UIST), 2023.

Abstract | Links | BibTeX | Tags: accessibility, CHI, google, human-computer interaction

10.

Tianhao Zhang, Weilong Yang, Honglak Lee, Hung-Yu Tseng, Irfan Essa, Lu Jiang

Image manipulation by text instruction Patent

2023.

Abstract | Links | BibTeX | Tags: content creation, generative AI, google, media generation, patents

11.

José Lezama, Huiwen Chang, Lu Jiang, Irfan Essa

Improved Masked Image Generation with Token-Critic Proceedings Article

In: European Conference on Computer Vision (ECCV), arXiv, 2022, ISBN: 978-3-031-20050-2.

Abstract | Links | BibTeX | Tags: computer vision, ECCV, generative AI, generative media, google

12.

Xiang Kong, Lu Jiang, Huiwen Chang, Han Zhang, Yuan Hao, Haifeng Gong, Irfan Essa

BLT: Bidirectional Layout Transformer for Controllable Layout Generation Proceedings Article

In: European Conference on Computer Vision (ECCV), 2022, ISBN: 978-3-031-19789-5.

Abstract | Links | BibTeX | Tags: computer vision, ECCV, generative AI, generative media, google, vision transformer

13.

Peggy Chi, Tao Dong, Christian Frueh, Brian Colonna, Vivek Kwatra, Irfan Essa

Synthesis-Assisted Video Prototyping From a Document Proceedings Article

In: Proceedings of the 35th Annual ACM Symposium on User Interface Software and Technology, pp. 1–10, 2022.

Abstract | Links | BibTeX | Tags: computational video, generative media, google, human-computer interaction, UIST, video editing

14.

Chengzhi Mao, Lu Jiang, Mostafa Dehghani, Carl Vondrick, Rahul Sukthankar, Irfan Essa

Discrete Representations Strengthen Vision Transformer Robustness Proceedings Article

In: Proceedings of International Conference on Learning Representations (ICLR), 2022.

Abstract | Links | BibTeX | Tags: computer vision, google, machine learning, vision transformer

15.

Steven Hickson, Karthik Raveendran, Irfan Essa

Sharing Decoders: Network Fission for Multi-Task Pixel Prediction Proceedings Article

In: IEEE/CVF Winter Conference on Applications of Computer Vision, pp. 3771–3780, 2022.

Abstract | Links | BibTeX | Tags: computer vision, google, machine learning

16.

Tianhao Zhang, Hung-Yu Tseng, Lu Jiang, Weilong Yang, Honglak Lee, Irfan Essa

Text as Neural Operator: Image Manipulation by Text Instruction Proceedings Article

In: ACM International Conference on Multimedia (ACM-MM), ACM Press, 2021.

Abstract | Links | BibTeX | Tags: computer vision, generative media, google, multimedia

17.

Peggy Chi, Nathan Frey, Katrina Panovich, Irfan Essa

Automatic Instructional Video Creation from a Markdown-Formatted Tutorial Proceedings Article

In: ACM Symposium on User Interface Software and Technology (UIST), ACM Press, 2021.

Abstract | Links | BibTeX | Tags: google, human-computer interaction, UIST, video editting

18.

Nathan Frey, Peggy Chi, Weilong Yang, Irfan Essa

Automatic Style Transfer for Non-Linear Video Editing Proceedings Article

In: Proceedings of CVPR Workshop on AI for Content Creation (AICC), 2021.

Links | BibTeX | Tags: computational video, CVPR, google, video editing

19.

AJ Piergiovanni, Anelia Angelova, Michael S. Ryoo, Irfan Essa

Unsupervised Discovery of Actions in Instructional Videos Proceedings Article

In: British Machine Vision Conference (BMVC), 2021.

Abstract | Links | BibTeX | Tags: activity recognition, computational video, computer vision, google

20.

Anh Truong, Peggy Chi, David Salesin, Irfan Essa, Maneesh Agrawala

Automatic Generation of Two-Level Hierarchical Tutorials from Instructional Makeup Videos Proceedings Article

In: ACM CHI Conference on Human factors in Computing Systems, 2021.

Abstract | Links | BibTeX | Tags: CHI, computational video, google, human-computer interaction, video summarization

30 entries « 1 of 2 »

Other Publication Sites

A few more sites that aggregate research publications: Academic.edu, Bibsonomy, CiteULike, Mendeley.

      Copyright/About

      [Please see the Copyright Statement that may apply to the content listed here.]

      This list of publications is produced by using the teachPress plugin for WordPress.

      Leave a Reply

      Your email address will not be published. Required fields are marked *

      This site uses Akismet to reduce spam. Learn how your comment data is processed.