Presentation Schedule
All times are in Central time zone
Date: Thursday, June 23, 2022 8:30AM – 10:18AM
Session Title: Image & Video Synthesis and Generation (I)
Session Chairs: Sharon Xiaolei Huang (Pennsylvania State Univ.), Shaodi You (Univ. of Amsterdam)
Poster ID | Title | Authors |
1a | Diffusion Autoencoders: Toward a Meaningful and Decodable Representation |
Konpat Preechakul; Nattanat Chatthee; Suttisak Wizadwongsa; Supasorn Suwajanakorn |
2a | Polymorphic-GAN: Generating Aligned Samples Across Multiple Domains With Learned Morph Maps |
Seung Wook Kim; Karsten Kreis; Daiqing Li; Antonio Torralba; Sanja Fidler |
3a | Polarity Sampling: Quality and Diversity Control of Pre-Trained Generative Networks via Singular Values |
Ahmed Imtiaz Humayun; Randall Balestriero; Richard Baraniuk |
4a | Ensembling Off-the-Shelf Models for GAN Training |
Nupur Kumari; Richard Zhang; Eli Shechtman; Jun-Yan Zhu |
5a | Marginal Contrastive Correspondence for Guided Image Generation |
Fangneng Zhan; Yingchen Yu; Rongliang Wu; Jiahui Zhang; Shijian Lu; Changgong Zhang |
6a | GRAM: Generative Radiance Manifolds for 3D-Aware Image Generation |
Yu Deng; Jiaolong Yang; Jianfeng Xiang; Xin Tong |
7a | High-Resolution Image Synthesis With Latent Diffusion Models |
Robin Rombach; Andreas Blattmann; Dominik Lorenz; Patrick Esser; Björn Ommer |
8a | Vector Quantized Diffusion Model for Text-to-Image Synthesis |
Shuyang Gu; Dong Chen; Jianmin Bao; Fang Wen; Bo Zhang; Dongdong Chen; Lu Yuan; Baining Guo |
9a | ManiTrans: Entity-Level Text-Guided Image Manipulation via Token-Wise Semantic Alignment and Generation |
Jianan Wang; Guansong Lu; Hang Xu; Zhenguo Li; Chunjing Xu; Yanwei Fu |
10a | Dataset Distillation by Matching Training Trajectories |
George Cazenavette; Tongzhou Wang; Antonio Torralba; Alexei A. Efros; Jun-Yan Zhu |
11a | Continual Predictive Learning From Videos |
Geng Chen; Wendong Zhang; Han Lu; Siyu Gao; Yunbo Wang; Mingsheng Long; Xiaokang Yang |
12a | Motion-Adjustable Neural Implicit Video Representation | Long Mai; Feng Liu |
13a | Splicing ViT Features for Semantic Appearance Transfer |
Narek Tumanyan; Omer Bar-Tal; Shai Bagon; Tali Dekel |
14a | MAT: Mask-Aware Transformer for Large Hole Image Inpainting |
Wenbo Li; Zhe Lin; Kun Zhou; Lu Qi; Yi Wang; Jiaya Jia |
15a | Day-to-Night Image Synthesis for Training Nighttime Neural ISPs |
Abhijith Punnappurath; Abdullah Abuolaim; Abdelrahman Abdelhamed; Alex Levinshtein; Michael S. Brown |
16a | Smooth-Swap: A Simple Enhancement for Face-Swapping With Smoothness | Jiseob Kim; Jihoon Lee; Byoung-Tak Zhang |
17a | Few-Shot Head Swapping in the Wild |
Changyong Shu; Hemao Wu; Hang Zhou; Jiaming Liu; Zhibin Hong; Changxing Ding; Junyu Han; Jingtuo Liu; Errui Ding; Jingdong Wang |
18a | ClothFormer: Taming Video Virtual Try-On in All Module | Jianbin Jiang; Tan Wang; He Yan; Junhui Liu |
Date: Thursday, June 23, 2022 8:30AM – 10:18AM
Session Title: Deep Learning Architectures & Techniques
Session Chairs: Saining Xie (Facebook AI Research), Hao Su (UCSD)
Poster ID | Title | Authors |
19a | A-ViT: Adaptive Tokens for Efficient Vision Transformer |
Hongxu Yin; Arash Vahdat; Jose M. Alvarez; Arun Mallya; Jan Kautz; Pavlo Molchanov |
20a | MetaFormer Is Actually What You Need for Vision |
Weihao Yu; Mi Luo; Pan Zhou; Chenyang Si; Yichen Zhou; Xinchao Wang; Jiashi Feng; Shuicheng Yan |
21a | Reversible Vision Transformers |
Karttikeya Mangalam; Haoqi Fan; Yanghao Li; Chao-Yuan Wu; Bo Xiong; Christoph Feichtenhofer; Jitendra Malik |
22a | Learned Queries for Efficient Local Attention | Moab Arar; Ariel Shamir; Amit H. Bermano |
23a | Shunted Self-Attention via Multi-Scale Token Aggregation |
Sucheng Ren; Daquan Zhou; Shengfeng He; Jiashi Feng; Xinchao Wang |
24a | Automatic Relation-Aware Graph Network Proliferation |
Shaofei Cai; Liang Li; Xinzhe Han; Jiebo Luo; Zheng-Jun Zha; Qingming Huang |
25a | β-DARTS: Beta-Decay Regularization for Differentiable Architecture Search |
Peng Ye; Baopu Li; Yikang Li; Tao Chen; Jiayuan Fan; Wanli Ouyang |
26a | Distribution Consistent Neural Architecture Search |
Junyi Pan; Chong Sun; Yizhou Zhou; Ying Zhang; Chen Li |
27a | Training-Free Transformer Architecture Search |
Qinqin Zhou; Kekai Sheng; Xiawu Zheng; Ke Li; Xing Sun; Yonghong Tian; Jie Chen; Rongrong Ji |
28a | TeachAugment: Data Augmentation Optimization Using Teacher Knowledge | Teppei Suzuki |
29a | Knowledge Distillation via the Target-Aware Transformer |
Sihao Lin; Hongwei Xie; Bing Wang; Kaicheng Yu; Xiaojun Chang; Xiaodan Liang; Gang Wang |
30a | Knowledge Distillation: A Good Teacher Is Patient and Consistent |
Lucas Beyer; Xiaohua Zhai; Amélie Royer; Larisa Markeeva; Rohan Anil; Alexander Kolesnikov |
31a | An Image Patch Is a Wave: Phase-Aware Vision MLP |
Yehui Tang; Kai Han; Jianyuan Guo; Chang Xu; Yanxi Li; Chao Xu; Yunhe Wang |
32a | Dynamic MLP for Fine-Grained Image Classification by Leveraging Geographical and Temporal Information |
Lingfeng Yang; Xiang Li; Renjie Song; Borui Zhao; Juntian Tao; Shihao Zhou; Jiajun Liang; Jian Yang |
33a | Controllable Dynamic Multi-Task Architectures |
Dripta S. Raychaudhuri; Yumin Suh; Samuel Schulter; Xiang Yu; Masoud Faraki; Amit K. Roy-Chowdhury; Manmohan Chandraker |
34a | Grounded Language-Image Pre-Training |
Liunian Harold Li; Pengchuan Zhang; Haotian Zhang; Jianwei Yang; Chunyuan Li; Yiwu Zhong; Lijuan Wang; Lu Yuan; Lei Zhang; Jenq-Neng Hwang; Kai-Wei Chang; Jianfeng Gao |
35a | ZZ-Net: A Universal Rotation Equivariant Architecture for 2D Point Clouds | Georg Bökman; Fredrik Kahl; Axel Flinth |
36a | CADTransformer: Panoptic Symbol Spotting Transformer for CAD Drawings |
Zhiwen Fan; Tianlong Chen; Peihao Wang; Zhangyang Wang |
Date: Thursday, June 23, 2022 8:30AM – 10:18AM
Session Title: Human Pose Estimation & Tracking, Localization, and Object Pose Estimation
Session Chairs: Leonid Sigal (Univ. of British Columbia), Georgios Pavlakos (UC Berkeley), Angela Yao (National Univ. of Singapore)
Poster ID | Title | Authors |
37a | Adversarial Parametric Pose Prior |
Andrey Davydov; Anastasia Remizova; Victor Constantin; Sina Honari; Mathieu Salzmann; Pascal Fua |
38a | Temporal Feature Alignment and Mutual Information Maximization for Video-Based Human Pose Estimation |
Zhenguang Liu; Runyang Feng; Haoming Chen; Shuang Wu; Yixing Gao; Yunjun Gao; Xiang Wang |
39a | PoseTriplet: Co-Evolving 3D Human Pose Estimation, Imitation, and Hallucination Under Self-Supervision |
Kehong Gong; Bingbing Li; Jianfeng Zhang; Tao Wang; Jing Huang; Michael Bi Mi; Jiashi Feng; Xinchao Wang |
40a | Generalizable Human Pose Triangulation |
Kristijan Bartol; David Bojanić; Tomislav Petković; Tomislav Pribanić |
41a | GLAMR: Global Occlusion-Aware Human Mesh Recovery With Dynamic Cameras |
Ye Yuan; Umar Iqbal; Pavlo Molchanov; Kris Kitani; Jan Kautz |
42a | Bailando: 3D Dance Generation by Actor-Critic GPT With Choreographic Memory |
Li Siyao; Weijiang Yu; Tianpei Gu; Chunze Lin; Quan Wang; Chen Qian; Chen Change Loy; Ziwei Liu |
43a | Contextual Instance Decoupling for Robust Multi-Person Pose Estimation | Dongkai Wang; Shiliang Zhang |
44a | End-to-End Multi-Person Pose Estimation With Transformers |
Dahu Shi; Xing Wei; Liangqi Li; Ye Ren; Wenming Tan |
45a | Meta Agent Teaming Active Learning for Pose Estimation |
Jia Gong; Zhipeng Fan; Qiuhong Ke; Hossein Rahmani; Jun Liu |
46a | Keypoint Transformer: Solving Joint Identification in Challenging Hands and Object Interactions for Accurate 3D Pose Estimation |
Shreyas Hampali; Sayan Deb Sarkar; Mahdi Rad; Vincent Lepetit |
47a | Not All Tokens Are Equal: Human-Centric Visual Analysis via Token Clustering Transformer |
Wang Zeng; Sheng Jin; Wentao Liu; Chen Qian; Ping Luo; Wanli Ouyang; Xiaogang Wang |
48a | Occlusion-Robust Face Alignment Using a Viewpoint-Invariant Hierarchical Network Architecture |
Congcong Zhu; Xintong Wan; Shaorong Xie; Xiaoqiang Li; Yinzheng Gu |
49a | LASER: LAtent SpacE Rendering for 2D Visual Localization |
Zhixiang Min; Naji Khosravan; Zachary Bessinger; Manjunath Narayana; Sing Bing Kang; Enrique Dunn; Ivaylo Boyadzhiev |
50a | Learning To Detect Scene Landmarks for Camera Localization |
Tien Do; Ondrej Miksik; Joseph DeGol; Hyun Soo Park; Sudipta N. Sinha |
51a | Geometric Transformer for Fast and Robust Point Cloud Registration |
Zheng Qin; Hao Yu; Changjian Wang; Yulan Guo; Yuxing Peng; Kai Xu |
52a | ARCS: Accurate Rotation and Correspondence Search |
Liangzu Peng; Manolis C. Tsakiris; René Vidal |
53a | FisherMatch: Semi-Supervised Rotation Regression via Entropy-Based Filtering |
Yingda Yin; Yingcheng Cai; He Wang; Baoquan Chen |
54a | Uni6D: A Unified CNN Framework Without Projection Breakdown for 6D Pose Estimation |
Xiaoke Jiang; Donghai Li; Hao Chen; Ye Zheng; Rui Zhao; Liwei Wu |