Presentation Schedule
All times are in Central time zone
Date: Tuesday, June 21, 2022 1:30PM – 3:00PM
Session Title: Segmentation, Grouping and Shape Analysis
Session Chairs: Yi Fang (New York Univ.), Benjamin Kimia (Brown Univ.), Chao Chen (Stony Brook Univ.)
Poster ID | Title | Authors |
1b | CMT-DeepLab: Clustering Mask Transformers for Panoptic Segmentation |
Qihang Yu; Huiyu Wang; Dahun Kim; Siyuan Qiao; Maxwell Collins; Yukun Zhu; Hartwig Adam; Alan Yuille; Liang-Chieh Chen |
2b | Unsupervised Hierarchical Semantic Segmentation With Multiview Cosegmentation and Clustering Transformers |
Tsung-Wei Ke; Jyh-Jing Hwang; Yunhui Guo; Xudong Wang; Stella X. Yu |
3b | Rethinking Semantic Segmentation: A Prototype View |
Tianfei Zhou; Wenguan Wang; Ender Konukoglu; Luc Van Gool |
4b | Semantic-Aware Domain Generalized Segmentation |
Duo Peng; Yinjie Lei; Munawar Hayat; Yulan Guo; Wen Li |
5b | Adaptive Early-Learning Correction for Segmentation From Noisy Annotations |
Sheng Liu; Kangning Liu; Weicheng Zhu; Yiqiu Shen; Carlos Fernandez-Granda |
6b | Pointly-Supervised Instance Segmentation |
Bowen Cheng; Omkar Parkhi; Alexander Kirillov |
7b | Joint Forecasting of Panoptic Segmentations With Difference Attention |
Colin Graber; Cyril Jazra; Wenjie Luo; Liangyan Gui; Alexander G. Schwing |
8b | FocusCut: Diving Into a Focus View in Interactive Segmentation |
Zheng Lin; Zheng-Peng Duan; Zhao Zhang; Chun-Le Guo; Ming-Ming Cheng |
9b | Human Instance Matting via Mutual Guidance and Multi-Instance Refinement | Yanan Sun; Chi-Keung Tang; Yu-Wing Tai |
10b | Deformable Sprites for Unsupervised Video Decomposition |
Vickie Ye; Zhengqi Li; Richard Tucker; Angjoo Kanazawa; Noah Snavely |
11b | Eigencontours: Novel Contour Descriptors Based on Low-Rank Approximation |
Wonhui Park; Dongkwon Jin; Chang-Su Kim |
12b | Robust and Accurate Superquadric Recovery: A Probabilistic Approach |
Weixiao Liu; Yuwei Wu; Sipu Ruan; Gregory S. Chirikjian |
13b | Medial Spectral Coordinates for 3D Shape Analysis |
Morteza Rezanejad; Mohammad Khodadad; Hamidreza Mahyar; Herve Lombaert; Michael Gruninger; Dirk Walther; Kaleem Siddiqi |
14b | Scribble-Supervised LiDAR Semantic Segmentation | Ozan Unal; Dengxin Dai; Luc Van Gool |
15b | SoftGroup for 3D Instance Segmentation on Point Clouds |
Thang Vu; Kookhoi Kim; Tung M. Luu; Thanh Nguyen; Chang D. Yoo |
Date: Tuesday, June 21, 2022 1:30PM – 3:00PM
Session Title: 3D From Single Images
Session Chairs: Angjoo Kanazawa (Univ. of California Berkeley), Pascal Fua (EPFL)
Poster ID | Title | Authors |
16b | Accurate 3D Body Shape Regression Using Metric and Semantic Attributes |
Vasileios Choutas; Lea Müller; Chun-Hao P. Huang; Siyu Tang; Dimitrios Tzionas; Michael J. Black |
17b | JIFF: Jointly-Aligned Implicit Face Function for High Quality Single View Clothed Human Reconstruction |
Yukang Cao; Guanying Chen; Kai Han; Wenqi Yang; Kwan-Yee K. Wong |
18b | Tracking People by Predicting 3D Appearance, Location and Pose |
Jathushan Rajasegaran; Georgios Pavlakos; Angjoo Kanazawa; Jitendra Malik |
19b | ArtiBoost: Boosting Articulated 3D Hand-Object Pose Estimation via Online Exploration and Synthesis |
Lixin Yang; Kailin Li; Xinyu Zhan; Jun Lv; Wenqiang Xu; Jiefeng Li; Cewu Lu |
20b | Interacting Attention Graph for Single Image Two-Hand Reconstruction |
Mengcheng Li; Liang An; Hongwen Zhang; Lianpeng Wu; Feng Chen; Tao Yu; Yebin Liu |
21b | 3D Human Tongue Reconstruction From Single “In-the-Wild” Images |
Stylianos Ploumpis; Stylianos Moschoglou; Vasileios Triantafyllou; Stefanos Zafeiriou |
22b | EPro-PnP: Generalized End-to-End Probabilistic Perspective-N-Points for Monocular Object Pose Estimation |
Hansheng Chen; Pichao Wang; Fan Wang; Wei Tian; Lu Xiong; Hao Li |
23b | Diversity Matters: Fully Exploiting Depth Clues for Reliable Monocular 3D Object Detection |
Zhuoling Li; Zhan Qu; Yang Zhou; Jianzhuang Liu; Haoqian Wang; Lihui Jiang |
24b | OmniFusion: 360 Monocular Depth Estimation via Geometry-Aware Fusion |
Yuyan Li; Yuliang Guo; Zhixin Yan; Xinyu Huang; Ye Duan; Liu Ren |
25b | Gated2Gated: Self-Supervised Depth Estimation From Gated Images |
Amanpreet Walia; Stefanie Walz; Mario Bijelic; Fahim Mannan; Frank Julca-Aguilar; Michael Langer; Werner Ritter; Felix Heide |
26b | IRISformer: Dense Vision Transformers for Single-Image Inverse Rendering in Indoor Scenes |
Rui Zhu; Zhengqin Li; Janarbek Matai; Fatih Porikli; Manmohan Chandraker |
27b | Egocentric Scene Understanding via Multimodal Spatial Rectifier | Tien Do; Khiem Vuong; Hyun Soo Park |
28b | Multi-View Depth Estimation by Fusing Single-View Depth Probability With Multi-View Geometry |
Gwangbin Bae; Ignas Budvytis; Roberto Cipolla |
29b | The Implicit Values of a Good Hand Shake: Handheld Multi-Frame Neural Depth Refinement |
Ilya Chugunov; Yuxuan Zhang; Zhihao Xia; Xuaner Zhang; Jiawen Chen; Felix Heide |
30b | BANMo: Building Animatable 3D Neural Models From Many Casual Videos |
Gengshan Yang; Minh Vo; Natalia Neverova; Deva Ramanan; Andrea Vedaldi; Hanbyul Joo |
Date: Tuesday, June 21, 2022 1:30PM – 3:00PM
Session Title: Video Analysis & Understanding
Session Chairs: Nicu Sebe (Univ. of Trento), YingLi Tian (City Univ. of New York), Concetto Spampinato (Univ. of Catania)
Poster ID | Title | Authors |
31b | Self-Supervised Video Transformer |
Kanchana Ranasinghe; Muzammal Naseer; Salman Khan; Fahad Shahbaz Khan; Michael S. Ryoo |
32b | Temporally Efficient Vision Transformer for Video Instance Segmentation |
Shusheng Yang; Xinggang Wang; Yu Li; Yuxin Fang; Jiemin Fang; Wenyu Liu; Xun Zhao; Ying Shan |
33b | VISOLO: Grid-Based Space-Time Aggregation for Efficient Online Video Instance Segmentation |
Su Ho Han; Sukjun Hwang; Seoung Wug Oh; Yeonchool Park; Hyunwoo Kim; Min-Jung Kim; Seon Joo Kim |
34b | Temporal Alignment Networks for Long-Term Video | Tengda Han; Weidi Xie; Andrew Zisserman |
35b | Revisiting the “Video” in Video-Language Understanding |
Shyamal Buch; Cristóbal Eyzaguirre; Adrien Gaidon; Jiajun Wu; Li Fei-Fei; Juan Carlos Niebles |
36b | Invariant Grounding for Video Question Answering |
Yicong Li; Xiang Wang; Junbin Xiao; Wei Ji; Tat-Seng Chua |
37b | P3IV: Probabilistic Procedure Planning From Instructional Videos With Weak Supervision |
He Zhao; Isma Hadji; Nikita Dvornik; Konstantinos G. Derpanis; Richard P. Wildes; Allan D. Jepson |
38b | FineDiving: A Fine-Grained Dataset for Procedure-Aware Action Quality Assessment |
Jinglin Xu; Yongming Rao; Xumin Yu; Guangyi Chen; Jie Zhou; Jiwen Lu |
39b | Cross-Model Pseudo-Labeling for Semi-Supervised Action Recognition |
Yinghao Xu; Fangyun Wei; Xiao Sun; Ceyuan Yang; Yujun Shen; Bo Dai; Bolei Zhou; Stephen Lin |
40b | Revisiting Skeleton-Based Action Recognition |
Haodong Duan; Yue Zhao; Kai Chen; Dahua Lin; Bo Dai |
41b | OpenTAL: Towards Open Set Temporal Action Localization | Wentao Bao; Qi Yu; Yu Kong |
42b | Dual-AI: Dual-Path Actor Interaction Learning for Group Activity Recognition |
Mingfei Han; David Junhao Zhang; Yali Wang; Rui Yan; Lina Yao; Xiaojun Chang; Yu Qiao |
43b | TransRank: Self-Supervised Video Representation Learning via Ranking-Based Transformation Recognition |
Haodong Duan; Nanxuan Zhao; Kai Chen; Dahua Lin |
44b | Revealing Occlusions With 4D Neural Fields |
Basile Van Hoorick; Purva Tendulkar; Dídac Surís; Dennis Park; Simon Stent; Carl Vondrick |
45b | HODOR: High-Level Object Descriptors for Object Re-Segmentation in Video Learned From Static Images |
Ali Athar; Jonathon Luiten; Alexander Hermans; Deva Ramanan; Bastian Leibe |