Presentation Schedule
All times are in Central time zone
Date: Thursday, June 23, 2022 8:30AM – 10:18AM
Session Title: Image & Video Synthesis and Generation (I)
Session Chairs: Sharon Xiaolei Huang (Pennsylvania State Univ.), Shaodi You (Univ. of Amsterdam)
| Poster ID | Title | Authors |
| 1a | Diffusion Autoencoders: Toward a Meaningful and Decodable Representation |
Konpat Preechakul; Nattanat Chatthee; Suttisak Wizadwongsa; Supasorn Suwajanakorn |
| 2a | Polymorphic-GAN: Generating Aligned Samples Across Multiple Domains With Learned Morph Maps |
Seung Wook Kim; Karsten Kreis; Daiqing Li; Antonio Torralba; Sanja Fidler |
| 3a | Polarity Sampling: Quality and Diversity Control of Pre-Trained Generative Networks via Singular Values |
Ahmed Imtiaz Humayun; Randall Balestriero; Richard Baraniuk |
| 4a | Ensembling Off-the-Shelf Models for GAN Training |
Nupur Kumari; Richard Zhang; Eli Shechtman; Jun-Yan Zhu |
| 5a | Marginal Contrastive Correspondence for Guided Image Generation |
Fangneng Zhan; Yingchen Yu; Rongliang Wu; Jiahui Zhang; Shijian Lu; Changgong Zhang |
| 6a | GRAM: Generative Radiance Manifolds for 3D-Aware Image Generation |
Yu Deng; Jiaolong Yang; Jianfeng Xiang; Xin Tong |
| 7a | High-Resolution Image Synthesis With Latent Diffusion Models |
Robin Rombach; Andreas Blattmann; Dominik Lorenz; Patrick Esser; Björn Ommer |
| 8a | Vector Quantized Diffusion Model for Text-to-Image Synthesis |
Shuyang Gu; Dong Chen; Jianmin Bao; Fang Wen; Bo Zhang; Dongdong Chen; Lu Yuan; Baining Guo |
| 9a | ManiTrans: Entity-Level Text-Guided Image Manipulation via Token-Wise Semantic Alignment and Generation |
Jianan Wang; Guansong Lu; Hang Xu; Zhenguo Li; Chunjing Xu; Yanwei Fu |
| 10a | Dataset Distillation by Matching Training Trajectories |
George Cazenavette; Tongzhou Wang; Antonio Torralba; Alexei A. Efros; Jun-Yan Zhu |
| 11a | Continual Predictive Learning From Videos |
Geng Chen; Wendong Zhang; Han Lu; Siyu Gao; Yunbo Wang; Mingsheng Long; Xiaokang Yang |
| 12a | Motion-Adjustable Neural Implicit Video Representation | Long Mai; Feng Liu |
| 13a | Splicing ViT Features for Semantic Appearance Transfer |
Narek Tumanyan; Omer Bar-Tal; Shai Bagon; Tali Dekel |
| 14a | MAT: Mask-Aware Transformer for Large Hole Image Inpainting |
Wenbo Li; Zhe Lin; Kun Zhou; Lu Qi; Yi Wang; Jiaya Jia |
| 15a | Day-to-Night Image Synthesis for Training Nighttime Neural ISPs |
Abhijith Punnappurath; Abdullah Abuolaim; Abdelrahman Abdelhamed; Alex Levinshtein; Michael S. Brown |
| 16a | Smooth-Swap: A Simple Enhancement for Face-Swapping With Smoothness | Jiseob Kim; Jihoon Lee; Byoung-Tak Zhang |
| 17a | Few-Shot Head Swapping in the Wild |
Changyong Shu; Hemao Wu; Hang Zhou; Jiaming Liu; Zhibin Hong; Changxing Ding; Junyu Han; Jingtuo Liu; Errui Ding; Jingdong Wang |
| 18a | ClothFormer: Taming Video Virtual Try-On in All Module | Jianbin Jiang; Tan Wang; He Yan; Junhui Liu |
Date: Thursday, June 23, 2022 8:30AM – 10:18AM
Session Title: Deep Learning Architectures & Techniques
Session Chairs: Saining Xie (Facebook AI Research), Hao Su (UCSD)
| Poster ID | Title | Authors |
| 19a | A-ViT: Adaptive Tokens for Efficient Vision Transformer |
Hongxu Yin; Arash Vahdat; Jose M. Alvarez; Arun Mallya; Jan Kautz; Pavlo Molchanov |
| 20a | MetaFormer Is Actually What You Need for Vision |
Weihao Yu; Mi Luo; Pan Zhou; Chenyang Si; Yichen Zhou; Xinchao Wang; Jiashi Feng; Shuicheng Yan |
| 21a | Reversible Vision Transformers |
Karttikeya Mangalam; Haoqi Fan; Yanghao Li; Chao-Yuan Wu; Bo Xiong; Christoph Feichtenhofer; Jitendra Malik |
| 22a | Learned Queries for Efficient Local Attention | Moab Arar; Ariel Shamir; Amit H. Bermano |
| 23a | Shunted Self-Attention via Multi-Scale Token Aggregation |
Sucheng Ren; Daquan Zhou; Shengfeng He; Jiashi Feng; Xinchao Wang |
| 24a | Automatic Relation-Aware Graph Network Proliferation |
Shaofei Cai; Liang Li; Xinzhe Han; Jiebo Luo; Zheng-Jun Zha; Qingming Huang |
| 25a | β-DARTS: Beta-Decay Regularization for Differentiable Architecture Search |
Peng Ye; Baopu Li; Yikang Li; Tao Chen; Jiayuan Fan; Wanli Ouyang |
| 26a | Distribution Consistent Neural Architecture Search |
Junyi Pan; Chong Sun; Yizhou Zhou; Ying Zhang; Chen Li |
| 27a | Training-Free Transformer Architecture Search |
Qinqin Zhou; Kekai Sheng; Xiawu Zheng; Ke Li; Xing Sun; Yonghong Tian; Jie Chen; Rongrong Ji |
| 28a | TeachAugment: Data Augmentation Optimization Using Teacher Knowledge | Teppei Suzuki |
| 29a | Knowledge Distillation via the Target-Aware Transformer |
Sihao Lin; Hongwei Xie; Bing Wang; Kaicheng Yu; Xiaojun Chang; Xiaodan Liang; Gang Wang |
| 30a | Knowledge Distillation: A Good Teacher Is Patient and Consistent |
Lucas Beyer; Xiaohua Zhai; Amélie Royer; Larisa Markeeva; Rohan Anil; Alexander Kolesnikov |
| 31a | An Image Patch Is a Wave: Phase-Aware Vision MLP |
Yehui Tang; Kai Han; Jianyuan Guo; Chang Xu; Yanxi Li; Chao Xu; Yunhe Wang |
| 32a | Dynamic MLP for Fine-Grained Image Classification by Leveraging Geographical and Temporal Information |
Lingfeng Yang; Xiang Li; Renjie Song; Borui Zhao; Juntian Tao; Shihao Zhou; Jiajun Liang; Jian Yang |
| 33a | Controllable Dynamic Multi-Task Architectures |
Dripta S. Raychaudhuri; Yumin Suh; Samuel Schulter; Xiang Yu; Masoud Faraki; Amit K. Roy-Chowdhury; Manmohan Chandraker |
| 34a | Grounded Language-Image Pre-Training |
Liunian Harold Li; Pengchuan Zhang; Haotian Zhang; Jianwei Yang; Chunyuan Li; Yiwu Zhong; Lijuan Wang; Lu Yuan; Lei Zhang; Jenq-Neng Hwang; Kai-Wei Chang; Jianfeng Gao |
| 35a | ZZ-Net: A Universal Rotation Equivariant Architecture for 2D Point Clouds | Georg Bökman; Fredrik Kahl; Axel Flinth |
| 36a | CADTransformer: Panoptic Symbol Spotting Transformer for CAD Drawings |
Zhiwen Fan; Tianlong Chen; Peihao Wang; Zhangyang Wang |
Date: Thursday, June 23, 2022 8:30AM – 10:18AM
Session Title: Human Pose Estimation & Tracking, Localization, and Object Pose Estimation
Session Chairs: Leonid Sigal (Univ. of British Columbia), Georgios Pavlakos (UC Berkeley), Angela Yao (National Univ. of Singapore)
| Poster ID | Title | Authors |
| 37a | Adversarial Parametric Pose Prior |
Andrey Davydov; Anastasia Remizova; Victor Constantin; Sina Honari; Mathieu Salzmann; Pascal Fua |
| 38a | Temporal Feature Alignment and Mutual Information Maximization for Video-Based Human Pose Estimation |
Zhenguang Liu; Runyang Feng; Haoming Chen; Shuang Wu; Yixing Gao; Yunjun Gao; Xiang Wang |
| 39a | PoseTriplet: Co-Evolving 3D Human Pose Estimation, Imitation, and Hallucination Under Self-Supervision |
Kehong Gong; Bingbing Li; Jianfeng Zhang; Tao Wang; Jing Huang; Michael Bi Mi; Jiashi Feng; Xinchao Wang |
| 40a | Generalizable Human Pose Triangulation |
Kristijan Bartol; David Bojanić; Tomislav Petković; Tomislav Pribanić |
| 41a | GLAMR: Global Occlusion-Aware Human Mesh Recovery With Dynamic Cameras |
Ye Yuan; Umar Iqbal; Pavlo Molchanov; Kris Kitani; Jan Kautz |
| 42a | Bailando: 3D Dance Generation by Actor-Critic GPT With Choreographic Memory |
Li Siyao; Weijiang Yu; Tianpei Gu; Chunze Lin; Quan Wang; Chen Qian; Chen Change Loy; Ziwei Liu |
| 43a | Contextual Instance Decoupling for Robust Multi-Person Pose Estimation | Dongkai Wang; Shiliang Zhang |
| 44a | End-to-End Multi-Person Pose Estimation With Transformers |
Dahu Shi; Xing Wei; Liangqi Li; Ye Ren; Wenming Tan |
| 45a | Meta Agent Teaming Active Learning for Pose Estimation |
Jia Gong; Zhipeng Fan; Qiuhong Ke; Hossein Rahmani; Jun Liu |
| 46a | Keypoint Transformer: Solving Joint Identification in Challenging Hands and Object Interactions for Accurate 3D Pose Estimation |
Shreyas Hampali; Sayan Deb Sarkar; Mahdi Rad; Vincent Lepetit |
| 47a | Not All Tokens Are Equal: Human-Centric Visual Analysis via Token Clustering Transformer |
Wang Zeng; Sheng Jin; Wentao Liu; Chen Qian; Ping Luo; Wanli Ouyang; Xiaogang Wang |
| 48a | Occlusion-Robust Face Alignment Using a Viewpoint-Invariant Hierarchical Network Architecture |
Congcong Zhu; Xintong Wan; Shaorong Xie; Xiaoqiang Li; Yinzheng Gu |
| 49a | LASER: LAtent SpacE Rendering for 2D Visual Localization |
Zhixiang Min; Naji Khosravan; Zachary Bessinger; Manjunath Narayana; Sing Bing Kang; Enrique Dunn; Ivaylo Boyadzhiev |
| 50a | Learning To Detect Scene Landmarks for Camera Localization |
Tien Do; Ondrej Miksik; Joseph DeGol; Hyun Soo Park; Sudipta N. Sinha |
| 51a | Geometric Transformer for Fast and Robust Point Cloud Registration |
Zheng Qin; Hao Yu; Changjian Wang; Yulan Guo; Yuxing Peng; Kai Xu |
| 52a | ARCS: Accurate Rotation and Correspondence Search |
Liangzu Peng; Manolis C. Tsakiris; René Vidal |
| 53a | FisherMatch: Semi-Supervised Rotation Regression via Entropy-Based Filtering |
Yingda Yin; Yingcheng Cai; He Wang; Baoquan Chen |
| 54a | Uni6D: A Unified CNN Framework Without Projection Breakdown for 6D Pose Estimation |
Xiaoke Jiang; Donghai Li; Hao Chen; Ye Zheng; Rui Zhao; Liwei Wu |