Some recent publications for 2023

Here is a list of some recent works accepted for publication that I am honored to be part of. These will be appearing in CHI, ICLR, and CVPR. Excited to share these new efforts.

Dina Bashkirova, José Lezama, Kihyuk Sohn, Kate Saenko, Irfan Essa

MaskSketch: Unpaired Structure-guided Masked Image Generation Inproceedings Forthcoming

In: IEEE/CVF Computer Vision and Pattern Recognition Conference (CVPR), Forthcoming.

Abstract | Links | BibTeX | Tags: computer vision, CVPR, generative media, google

Lijun Yu, Yong Cheng, Kihyuk Sohn, José Lezama, Han Zhang, Huiwen Chang, Alexander G. Hauptmann, Ming-Hsuan Yang, Yuan Hao, Irfan Essa, Lu Jiang

MAGVIT: Masked Generative Video Transformer Inproceedings Forthcoming

In: IEEE/CVF Computer Vision and Pattern Recognition Conference (CVPR), Forthcoming.

Abstract | Links | BibTeX | Tags: computational video, computer vision, CVPR, generative media, google

Kihyuk Sohn, Yuan Hao, José Lezama, Luisa Polania, Huiwen Chang, Han Zhang, Irfan Essa, Lu Jiang

Visual Prompt Tuning for Generative Transfer Learning Inproceedings Forthcoming

In: IEEE/CVF Computer Vision and Pattern Recognition Conference (CVPR), Forthcoming.

Abstract | Links | BibTeX | Tags: computer vision, CVPR, generative media, google

José Lezama, Tim Salimans, Lu Jiang, Huiwen Chang, Jonathan Ho, Irfan Essa

Discrete Predictor-Corrector Diffusion Models for Image Synthesis Inproceedings Forthcoming

In: International Conference on Learning Representations (ICLR), Forthcoming.

Abstract | Links | BibTeX | Tags: computer vision, generative media, google, ICLR, machine learning

Erik Wijmans, Manolis Savva, Irfan Essa, Stefan Lee, Ari S. Morcos, Dhruv Batra

Emergence of Maps in the Memories of Blind Navigation Agents Inproceedings Forthcoming

In: Proceedings of International Conference on Learning Representations (ICLR), Forthcoming.

Abstract | Links | BibTeX | Tags: awards, best paper award, computer vision, google, ICLR, machine learning, robotics

Yi-Hao Peng, Peggy Chi, Anjuli Kannan, Meredith Morris, Irfan Essa

Slide Gestalt: Automatic Structure Extraction in Slide Decks for Non-Visual Access Inproceedings Forthcoming

In: ACM Symposium on User Interface Software and Technology (UIST), Forthcoming.

Abstract | Links | BibTeX | Tags: accessibility, CHI, google, human-computer interaction

Karan Samel, Jun Ma, Zhengyang Wang, Tong Zhao, Irfan Essa

Knowledge Relevance BERT: Integrating Noisy Knowledge into Language Representation. Inproceedings

In: AAAI workshop on Knowledge Augmented Methods for NLP (KnowledgeNLP-AAAI 2023), 2023.

Abstract | Links | BibTeX | Tags: AI, knowledge representation, NLP

Tianhao Zhang, Weilong Yang, Honglak Lee, Hung-Yu Tseng, Irfan Essa, Lu Jiang

Image manipulation by text instruction Patent

2023.

Abstract | Links | BibTeX | Tags: content creation, google, media generation, patents

Leave a Reply

Your email address will not be published. Required fields are marked *

This site uses Akismet to reduce spam. Learn how your comment data is processed.