Presentation Schedule
All times are in Central time zone
Date: Friday, June 24, 2022 10:00AM – 12:30PM
Session Title | Poster ID | Title | Authors |
Representation Learning | 55a | On the Importance of Asymmetry for Siamese Representation Learning |
Xiao Wang; Haoqi Fan; Yuandong Tian; Daisuke Kihara; Xinlei Chen |
56a | Leverage Your Local and Global Representations: A New Self-Supervised Learning Strategy |
Tong Zhang; Congpei Qiu; Wei Ke; Sabine Süsstrunk; Mathieu Salzmann |
|
|
57a | Exploring Set Similarity for Dense Self-Supervised Representation Learning |
Zhaoqing Wang; Qiang Li; Guoxin Zhang; Pengfei Wan; Wen Zheng; Nannan Wang; Mingming Gong; Tongliang Liu |
58a | Align Representations With Base: A New Approach to Self-Supervised Learning |
Shaofeng Zhang; Lyn Qiu; Feng Zhu; Junchi Yan; Hengrui Zhang; Rui Zhao; Hongyang Li; Xiaokang Yang |
|
59a | Identifying Ambiguous Similarity Conditions via Semantic Matching | Han-Jia Ye; Yi Shi; De-Chuan Zhan | |
60a | Node Representation Learning in Graph via Node-to-Neighbourhood Mutual Information Maximization |
Wei Dong; Junsheng Wu; Yi Luo; Zongyuan Ge; Peng Wang |
|
61a | Instance-Dependent Label-Noise Learning With Manifold-Regularized Transition Matrix Estimation |
De Cheng; Tongliang Liu; Yixiong Ning; Nannan Wang; Bo Han; Gang Niu; Xinbo Gao; Masashi Sugiyama |
|
62a | Unsupervised Visual Representation Learning by Online Constrained K-Means |
Qi Qian; Yuanhong Xu; Juhua Hu; Hao Li; Rong Jin |
|
63a | Rethinking the Augmentation Module in Contrastive Learning: Learning Hierarchical Augmentation Invariance With Expanded Views | Junbo Zhang; Kaisheng Ma | |
64a | Use All the Labels: A Hierarchical Multi-Label Contrastive Learning Framework |
Shu Zhang; Ran Xu; Caiming Xiong; Chetan Ramaiah |
|
65a | Robust Contrastive Learning Against Noisy Views |
Ching-Yao Chuang; R Devon Hjelm; Xin Wang; Vibhav Vineet; Neel Joshi; Antonio Torralba; Stefanie Jegelka; Yale Song |
|
66a | On Learning Contrastive Representations for Learning With Noisy Labels |
Li Yi; Sheng Liu; Qi She; A. Ian McLeod; Boyu Wang |
|
67a | Directional Self-Supervised Learning for Heavy Image Augmentations | Yalong Bai; Yifan Yang; Wei Zhang; Tao Mei | |
68a | Continual Learning for Visual Search With Backward Consistent Feature Embedding |
Timmy S. T. Wan; Jun-Cheng Chen; Tzer-Yi Wu; Chu-Song Chen |
|
69a | Probing Representation Forgetting in Supervised and Unsupervised Continual Learning |
MohammadReza Davari; Nader Asadi; Sudhir Mudur; Rahaf Aljundi; Eugene Belilovsky |
|
70a | Mimicking the Oracle: An Initial Phase Decorrelation Approach for Class Incremental Learning |
Yujun Shi; Kuangqi Zhou; Jian Liang; Zihang Jiang; Jiashi Feng; Philip H.S. Torr; Song Bai; Vincent Y. F. Tan |
|
71a | Bring Evanescent Representations to Life in Lifelong Class Incremental Learning | Marco Toldo; Mete Ozay | |
72a | Unsupervised Learning of Debiased Representations With Pseudo-Attributes |
Seonguk Seo; Joon-Young Lee; Bohyung Han |
|
73a | A Conservative Approach for Unbiased Learning on Unknown Biases |
Myeongho Jeon; Daekyung Kim; Woochul Lee; Myungjoo Kang; Joonseok Lee |
|
74a | Evading the Simplicity Bias: Training a Diverse Set of Models Discovers Solutions With Superior OOD Generalization |
Damien Teney; Ehsan Abbasnejad; Simon Lucey; Anton van den Hengel |
|
75a | Co-Advise: Cross Inductive Bias Distillation |
Sucheng Ren; Zhengqi Gao; Tianyu Hua; Zihui Xue; Yonglong Tian; Shengfeng He; Hang Zhao |
|
76a | PixMix: Dreamlike Pictures Comprehensively Improve Safety Measures |
Dan Hendrycks; Andy Zou; Mantas Mazeika; Leonard Tang; Bo Li; Dawn Song; Jacob Steinhardt |
|
77a | RegionCLIP: Region-Based Language-Image Pretraining |
Yiwu Zhong; Jianwei Yang; Pengchuan Zhang; Chunyuan Li; Noel Codella; Liunian Harold Li; Luowei Zhou; Xiyang Dai; Lu Yuan; Yin Li; Jianfeng Gao |
|
78a | Uni-Perceiver: Pre-Training Unified Architecture for Generic Perception for Zero-Shot and Few-Shot Tasks |
Xizhou Zhu; Jinguo Zhu; Hao Li; Xiaoshi Wu; Hongsheng Li; Xiaohua Wang; Jifeng Dai |
|
79a | Conditional Prompt Learning for Vision-Language Models |
Kaiyang Zhou; Jingkang Yang; Chen Change Loy; Ziwei Liu |
|
Scene Analysis & Understanding | 80a | Noisy Boundaries: Lemon or Lemonade for Semi-Supervised Instance Segmentation? | Zhenyu Wang; Yali Li; Shengjin Wang |
81a | Partial Class Activation Attention for Semantic Segmentation |
Sun-Ao Liu; Hongtao Xie; Hai Xu; Yongdong Zhang; Qi Tian |
|
82a | Learning Affinity From Attention: End-to-End Weakly-Supervised Semantic Segmentation With Transformers |
Lixiang Ru; Yibing Zhan; Baosheng Yu; Bo Du |
|
83a | Towards Noiseless Object Contours for Weakly Supervised Semantic Segmentation | Jing Li; Junsong Fan; Zhaoxiang Zhang | |
84a | Class Similarity Weighted Knowledge Distillation for Continual Semantic Segmentation |
Minh Hieu Phan; The-Anh Ta; Son Lam Phung; Long Tran-Thanh; Abdesselam Bouzerdoum |
|
85a | Structural and Statistical Texture Knowledge Distillation for Semantic Segmentation |
Deyi Ji; Haoran Wang; Mingyuan Tao; Jianqiang Huang; Xian-Sheng Hua; Hongtao Lu |
|
86a | L2G: A Simple Local-to-Global Knowledge Transfer Framework for Weakly Supervised Semantic Segmentation |
Peng-Tao Jiang; Yuqi Yang; Qibin Hou; Yunchao Wei |
|
87a | Weakly Supervised Semantic Segmentation Using Out-of-Distribution Data |
Jungbeom Lee; Seong Joon Oh; Sangdoo Yun; Junsuk Choe; Eunji Kim; Sungroh Yoon |
|
88a | Tree Energy Loss: Towards Sparsely Annotated Semantic Segmentation |
Zhiyuan Liang; Tiancai Wang; Xiangyu Zhang; Jian Sun; Jianbing Shen |
|
89a | Bending Reality: Distortion-Aware Transformers for Adapting to Panoramic Semantic Segmentation |
Jiaming Zhang; Kailun Yang; Chaoxiang Ma; Simon Reiß; Kunyu Peng; Rainer Stiefelhagen |
|
90a | MM-TTA: Multi-Modal Test-Time Adaptation for 3D Semantic Segmentation |
Inkyu Shin; Yi-Hsuan Tsai; Bingbing Zhuang; Samuel Schulter; Buyu Liu; Sparsh Garg; In So Kweon; Kuk-Jin Yoon |
|
91a | NightLab: A Dual-Level Architecture With Hardness Detection for Segmentation at Night |
Xueqing Deng; Peng Wang; Xiaochen Lian; Shawn Newsam |
|
92a | Fast Point Transformer |
Chunghyun Park; Yoonwoo Jeong; Minsu Cho; Jaesik Park |
|
93a | RigidFlow: Self-Supervised Scene Flow Learning on Point Clouds by Local Rigidity Prior |
Ruibo Li; Chi Zhang; Guosheng Lin; Zhe Wang; Chunhua Shen |
|
94a | ConDor: Self-Supervised Canonicalization of 3D Pose for Partial Shapes |
Rahul Sajnani; Adrien Poulenard; Jivitesh Jain; Radhika Dua; Leonidas J. Guibas; Srinath Sridhar |
|
95a | DisARM: Displacement Aware Relation Module for 3D Detection |
Yao Duan; Chenyang Zhu; Yuqing Lan; Renjiao Yi; Xinwang Liu; Kai Xu |
|
96a | Learning Object Context for Novel-View Scene Layout Generation |
Xiaotian Qiao; Gerhard P. Hancke; Rynson W.H. Lau |
|
97a | Weakly but Deeply Supervised Occlusion-Reasoned Parametric Road Layouts |
Buyu Liu; Bingbing Zhuang; Manmohan Chandraker |
|
98a | Beyond Cross-View Image Retrieval: Highly Accurate Vehicle Localization Using Satellite Image | Yujiao Shi; Hongdong Li | |
99a | Raw High-Definition Radar for Multi-Task Learning |
Julien Rebut; Arthur Ouaknine; Waqas Malik; Patrick Pérez |
|
100a | Zero Experience Required: Plug & Play Modular Transfer Learning for Semantic Visual Navigation |
Ziad Al-Halah; Santhosh Kumar Ramakrishnan; Kristen Grauman |
|
101a | UKPGAN: A General Self-Supervised Keypoint Detector |
Yang You; Wenhai Liu; Yanjie Ze; Yong-Lu Li; Weiming Wang; Cewu Lu |
|
102a | Cannot See the Forest for the Trees: Aggregating Multiple Viewpoints To Better Classify Objects in Videos |
Sukjun Hwang; Miran Heo; Seoung Wug Oh; Seon Joo Kim |
|
Navigation & Autonomous Driving | 103a | Rethinking Efficient Lane Detection via Curve Modeling |
Zhengyang Feng; Shaohua Guo; Xin Tan; Ke Xu; Min Wang; Lizhuang Ma |
104a | Exploiting Temporal Relations on Radar Perception for Autonomous Driving |
Peizhao Li; Pu Wang; Karl Berntorp; Hongfu Liu |
|
105a | Towards Robust and Adaptive Motion Forecasting: A Causal Representation Perspective |
Yuejiang Liu; Riccardo Cadei; Jonas Schweizer; Sherwin Bahmani; Alexandre Alahi |
|
106a | BE-STI: Spatial-Temporal Integrated Network for Class-Agnostic Motion Prediction With Bidirectional Enhancement |
Yunlong Wang; Hongyu Pan; Jun Zhu; Yu-Huan Wu; Xin Zhan; Kun Jiang; Diange Yang |
|
107a | ScePT: Scene-Consistent, Policy-Based Trajectory Predictions for Planning | Yuxiao Chen; Boris Ivanovic; Marco Pavone | |
108a | Stochastic Trajectory Prediction via Motion Indeterminacy Diffusion |
Tianpei Gu; Guangyi Chen; Junlong Li; Chunze Lin; Yongming Rao; Jie Zhou; Jiwen Lu |
|
109a | Vehicle Trajectory Prediction Works, but Not Everywhere |
Mohammadhossein Bahari; Saeed Saadatnejad; Ahmad Rahimi; Mohammad Shaverdikondori; Amir Hossein Shahidzadeh; Seyed-Mohsen Moosavi-Dezfooli; Alexandre Alahi |
|
110a | LTP: Lane-Based Trajectory Prediction for Autonomous Driving |
Jingke Wang; Tengju Ye; Ziqing Gu; Junbo Chen |
|
111a | ONCE-3DLanes: Building Monocular 3D Lane Detection |
Fan Yan; Ming Nie; Xinyue Cai; Jianhua Han; Hang Xu; Zhen Yang; Chaoqiang Ye; Yanwei Fu; Michael Bi Mi; Li Zhang |
|
112a | Towards Driving-Oriented Metric for Lane Detection Models | Takami Sato; Qi Alfred Chen | |
113a | Eigenlanes: Data-Driven Lane Descriptors for Structurally Diverse Lanes |
Dongkwon Jin; Wonhui Park; Seong-Gyun Jeong; Heeyeon Kwon; Chang-Su Kim |
|
114a | LIFT: Learning 4D LiDAR Image Fusion Transformer for 3D Object Detection |
Yihan Zeng; Da Zhang; Chunwei Wang; Zhenwei Miao; Ting Liu; Xin Zhan; Dayang Hao; Chao Ma |
|
115a | DeepFusion: Lidar-Camera Deep Fusion for Multi-Modal 3D Object Detection |
Yingwei Li; Adams Wei Yu; Tianjian Meng; Ben Caine; Jiquan Ngiam; Daiyi Peng; Junyang Shen; Yifeng Lu; Denny Zhou; Quoc V. Le; Alan Yuille; Mingxing Tan |
|
116a | A Versatile Multi-View Framework for LiDAR-Based 3D Object Detection With Guidance From Panoptic Segmentation |
Hamidreza Fazlali; Yixuan Xu; Yuan Ren; Bingbing Liu |
|
117a | Forecasting From LiDAR via Future Object Detection |
Neehar Peri; Jonathon Luiten; Mengtian Li; Aljoša Ošep; Laura Leal-Taixé; Deva Ramanan |
|
118a | RIDDLE: Lidar Data Compression With Range Image Deep Delta Encoding |
Xuanyu Zhou; Charles R. Qi; Yin Zhou; Dragomir Anguelov |
|
119a | Learning From All Vehicles | Dian Chen; Philipp Krähenbühl | |
120a | Is Mapping Necessary for Realistic PointGoal Navigation? |
Ruslan Partsey; Erik Wijmans; Naoki Yokoyama; Oles Dobosevych; Dhruv Batra; Oleksandr Maksymets |
|
121a | Symmetry-Aware Neural Architecture for Embodied Visual Exploration | Shuang Liu; Takayuki Okatani | |
122a | Coopernaut: End-to-End Driving With Cooperative Perception for Networked Vehicles |
Jiaxun Cui; Hang Qiu; Dian Chen; Peter Stone; Yuke Zhu |
|
123a | Topology Preserving Local Road Network Estimation From Single Onboard Camera Image |
Yigit Baran Can; Alexander Liniger; Danda Pani Paudel; Luc Van Gool |
|
124a | Coupling Vision and Proprioception for Navigation of Legged Robots |
Zipeng Fu; Ashish Kumar; Ananye Agarwal; Haozhi Qi; Jitendra Malik; Deepak Pathak |
|
125a | Pyramid Architecture for Multi-Scale Processing in Point Cloud Segmentation |
Dong Nie; Rui Lan; Ling Wang; Xiaofeng Ren |
|
126a | 3D-VField: Adversarial Augmentation of Point Clouds for Domain Generalization in 3D Object Detection |
Alexander Lehner; Stefano Gasperini; Alvaro Marcos-Ramiro; Michael Schmidt; Mohammad-Ali Nikouei Mahani; Nassir Navab; Benjamin Busam; Federico Tombari |
|
127a | Generating Useful Accident-Prone Driving Scenarios via a Learned Traffic Prior |
Davis Rempe; Jonah Philion; Leonidas J. Guibas; Sanja Fidler; Or Litany |
|
128a | SelfD: Self-Learning Large-Scale Driving Policies From the Web |
Jimuyang Zhang; Ruizhao Zhu; Eshed Ohn-Bar |
|
129a | Towards Real-World Navigation With Deep Differentiable Planners | Shu Ishida; João F. Henriques | |
130a | Privacy Preserving Partial Localization |
Marcel Geppert; Viktor Larsson; Johannes L. Schönberger; Marc Pollefeys |
|
131a | Efficient Large-Scale Localization by Global Instance Recognition |
Fei Xue; Ignas Budvytis; Daniel Olmeda Reino; Roberto Cipolla |
|
132a | CrossLoc: Scalable Aerial Localization Assisted by Multimodal Synthetic Data |
Qi Yan; Jianhao Zheng; Simon Reding; Shanci Li; Iordan Doytchinov |
|
Low-Level Vision | 133a | Bilateral Video Magnification Filter |
Shoichiro Takeda; Kenta Niwa; Mariko Isogawa; Shinya Shimizu; Kazuki Okami; Yushi Aono |
134a | Neural Data-Dependent Transform for Learned Image Compression |
Dezhao Wang; Wenhan Yang; Yueyu Hu; Jiaying Liu |
|
135a | Towards Bidirectional Arbitrary Image Rescaling: Joint Optimization and Cycle Idempotence |
Zhihong Pan; Baopu Li; Dongliang He; Mingde Yao; Wenhao Wu; Tianwei Lin; Xin Li; Errui Ding |
|
136a | Deep Generalized Unfolding Networks for Image Restoration | Chong Mou; Qian Wang; Jian Zhang | |
137a | Look Back and Forth: Video Super-Resolution With Explicit Temporal Difference Modeling |
Takashi Isobe; Xu Jia; Xin Tao; Changlin Li; Ruihuang Li; Yongjie Shi; Jing Mu; Huchuan Lu; Yu-Wing Tai |
|
138a | XYDeblur: Divide and Conquer for Single Image Deblurring |
Seo-Won Ji; Jeongmin Lee; Seung-Wook Kim; Jun-Pyo Hong; Seung-Jin Baek; Seung-Won Jung; Sung-Jea Ko |
|
139a | Abandoning the Bayer-Filter To See in the Dark |
Xingbo Dong; Wanyan Xu; Zhihui Miao; Lan Ma; Chao Zhang; Jiewen Yang; Zhe Jin; Andrew Beng Jin Teoh; Jiajun Shen |
|
140a | RSTT: Real-Time Spatial Temporal Transformer for Space-Time Video Super-Resolution |
Zhicheng Geng; Luming Liang; Tianyu Ding; Ilya Zharkov |
|
141a | All-in-One Image Restoration for Unknown Corruption |
Boyun Li; Xiao Liu; Peng Hu; Zhongqin Wu; Jiancheng Lv; Xi Peng |
|
142a | Modeling sRGB Camera Noise With Normalizing Flows |
Shayan Kousha; Ali Maleky; Michael S. Brown; Marcus A. Brubaker |
|
143a | A Differentiable Two-Stage Alignment Scheme for Burst Image Reconstruction With Large Shift |
Shi Guo; Xi Yang; Jianqi Ma; Gaofeng Ren; Lei Zhang |
|
144a | Video Frame Interpolation Transformer |
Zhihao Shi; Xiangyu Xu; Xiaohong Liu; Jun Chen; Ming-Hsuan Yang |
|
145a | The Devil Is in the Details: Window-Based Attention for Image Compression |
Renjie Zou; Chunfeng Song; Zhaoxiang Zhang |
|
146a | Mask-Guided Spectral-Wise Transformer for Efficient Hyperspectral Image Reconstruction |
Yuanhao Cai; Jing Lin; Xiaowan Hu; Haoqian Wang; Xin Yuan; Yulun Zhang; Radu Timofte; Luc Van Gool |
|
147a | RestoreFormer: High-Quality Blind Face Restoration From Undegraded Key-Value Pairs |
Zhouxia Wang; Jiawei Zhang; Runjian Chen; Wenping Wang; Ping Luo |
|
148a | AdaInt: Learning Adaptive Intervals for 3D Lookup Tables on Real-Time Image Enhancement |
Canqian Yang; Meiguang Jin; Xu Jia; Yi Xu; Ying Chen |
|
149a | HerosNet: Hyperspectral Explicable Reconstruction and Optimal Sampling Deep Network for Snapshot Compressive Imaging |
Xuanyu Zhang; Yongbing Zhang; Ruiqin Xiong; Qilin Sun; Jian Zhang |
|
150a | HDNet: High-Resolution Dual-Domain Learning for Spectral Compressive Imaging |
Xiaowan Hu; Yuanhao Cai; Jing Lin; Haoqian Wang; Xin Yuan; Yulun Zhang; Radu Timofte; Luc Van Gool |
|
151a | Learning To Zoom Inside Camera Imaging Pipeline |
Chengzhou Tang; Yuqiang Yang; Bing Zeng; Ping Tan; Shuaicheng Liu |
|
152a | Towards an End-to-End Framework for Flow-Guided Video Inpainting |
Zhen Li; Cheng-Ze Lu; Jianhua Qin; Chun-Le Guo; Ming-Ming Cheng |
|
153a | Context-Aware Video Reconstruction for Rolling Shutter Cameras |
Bin Fan; Yuchao Dai; Zhiyuan Zhang; Qi Liu; Mingyi He |
|
154a | CVF-SID: Cyclic Multi-Variate Function for Self-Supervised Image Denoising by Disentangling Noise From Image |
Reyhaneh Neshatavar; Mohsen Yavartanoo; Sanghyun Son; Kyoung Mu Lee |
|
155a | Global Matching With Overlapping Attention for Optical Flow Estimation |
Shiyu Zhao; Long Zhao; Zhixing Zhang; Enyu Zhou; Dimitris Metaxas |
|
156a | CRAFT: Cross-Attentional Flow Transformer for Robust Optical Flow |
Xiuchao Sui; Shaohua Li; Xue Geng; Yan Wu; Xinxing Xu; Yong Liu; Rick Goh; Hongyuan Zhu |
|
157a | Unified Multivariate Gaussian Mixture for Efficient Neural Image Compression |
Xiaosu Zhu; Jingkuan Song; Lianli Gao; Feng Zheng; Heng Tao Shen |
|
158a | Video Demoiréing With Relation-Based Temporal Consistency |
Peng Dai; Xin Yu; Lan Ma; Baoheng Zhang; Jia Li; Wenbo Li; Jiajun Shen; Xiaojuan Qi |
|
159a | Noise2NoiseFlow: Realistic Camera Noise Modeling Without Clean Images |
Ali Maleky; Shayan Kousha; Michael S. Brown; Marcus A. Brubaker |
|
160a | Deep Constrained Least Squares for Blind Image Super-Resolution |
Ziwei Luo; Haibin Huang; Lei Yu; Youwei Li; Haoqiang Fan; Shuaicheng Liu |
|
161a | Learning Multiple Adverse Weather Removal via Two-Stage Knowledge Learning and Multi-Contrastive Regularization: Toward a Unified Model |
Wei-Ting Chen; Zhi-Kai Huang; Cheng-Che Tsai; Hao-Hsiang Yang; Jian-Jiun Ding; Sy-Yen Kuo |
|
162a | Unsupervised Homography Estimation With Coplanarity-Aware GAN |
Mingbo Hong; Yuhang Lu; Nianjin Ye; Chunyu Lin; Qijun Zhao; Shuaicheng Liu |
|
Computational Photography | 163a | Attentive Fine-Grained Structured Sparsity for Image Restoration |
Junghun Oh; Heewon Kim; Seungjun Nah; Cheeun Hong; Jonghyun Choi; Kyoung Mu Lee |
164a | Uformer: A General U-Shaped Transformer for Image Restoration |
Zhendong Wang; Xiaodong Cun; Jianmin Bao; Wengang Zhou; Jianzhuang Liu; Houqiang Li |
|
165a | Bringing Old Films Back to Life |
Ziyu Wan; Bo Zhang; Dongdong Chen; Jing Liao |
|
166a | Learning sRGB-to-Raw-RGB De-Rendering With Content-Aware Metadata |
Seonghyeon Nam; Abhijith Punnappurath; Marcus A. Brubaker; Michael S. Brown |
|
167a | SNR-Aware Low-Light Image Enhancement |
Xiaogang Xu; Ruixing Wang; Chi-Wing Fu; Jiaya Jia |
|
168a | AP-BSN: Self-Supervised Denoising for Real-World Images via Asymmetric PD and Blind-Spot Network |
Wooseok Lee; Sanghyun Son; Kyoung Mu Lee |
|
169a | Synthetic Aperture Imaging With Events and Frames |
Wei Liao; Xiang Zhang; Lei Yu; Shijie Lin; Wen Yang; Ning Qiao |
|
170a | Ev-TTA: Test-Time Adaptation for Event-Based Object Recognition | Junho Kim; Inwoo Hwang; Young Min Kim | |
171a | Time Lens++: Event-Based Frame Interpolation With Parametric Non-Linear Flow and Multi-Scale Fusion |
Stepan Tulyakov; Alfredo Bochicchio; Daniel Gehrig; Stamatios Georgoulis; Yuanyou Li; Davide Scaramuzza |
|
172a | Unifying Motion Deblurring and Frame Interpolation With Events | Xiang Zhang; Lei Yu | |
173a | EvUnroll: Neuromorphic Events Based Rolling Shutter Image Correction | Xinyu Zhou; Peiqi Duan; Yi Ma; Boxin Shi | |
174a | Learning Adaptive Warping for Real-World Rolling Shutter Correction |
Mingdeng Cao; Zhihang Zhong; Jiahao Wang; Yinqiang Zheng; Yujiu Yang |
|
175a | Neural Global Shutter: Learn To Restore Video From a Rolling Shutter Camera With Global Reset Feature |
Zhixiang Wang; Xiang Ji; Jia-Bin Huang; Shin'ichi Satoh; Xiao Zhou; Yinqiang Zheng |
|
176a | TimeReplayer: Unlocking the Potential of Event Cameras for Video Interpolation |
Weihua He; Kaichao You; Zhendong Qiao; Xu Jia; Ziyang Zhang; Wenhui Wang; Huchuan Lu; Yaoyuan Wang; Jianxing Liao |
|
177a | Optimizing Video Prediction via Video Frame Interpolation | Yue Wu; Qiang Wen; Qifeng Chen | |
178a | Reference-Based Video Super-Resolution Using Multi-Camera Video Triplets |
Junyong Lee; Myeonghee Lee; Sunghyun Cho; Seungyong Lee |
|
179a | Memory-Augmented Non-Local Attention for Video Super-Resolution | Jiyang Yu; Jingen Liu; Liefeng Bo; Tao Mei | |
180a | Optical Flow Estimation for Spiking Camera |
Liwen Hu; Rui Zhao; Ziluo Ding; Lei Ma; Boxin Shi; Ruiqin Xiong; Tiejun Huang |
|
181a | Compressive Single-Photon 3D Cameras |
Felipe Gutierrez-Barragan; Atul Ingle; Trevor Seets; Mohit Gupta; Andreas Velten |
|
182a | Single-Photon Structured Light |
Varun Sundar; Sizhuo Ma; Aswin C. Sankaranarayanan; Mohit Gupta |
|
183a | All-Photon Polarimetric Time-of-Flight Imaging | Seung-Hwan Baek; Felix Heide | |
184a | Holocurtains: Programming Light Curtains via Binary Holography |
Dorian Chan; Srinivasa G. Narasimhan; Matthew O'Toole |
|
Vision & Language | 185a | Towards Implicit Text-Guided 3D Shape Generation |
Zhengzhe Liu; Yi Wang; Xiaojuan Qi; Chi-Wing Fu |
186a | Towards Language-Free Training for Text-to-Image Generation |
Yufan Zhou; Ruiyi Zhang; Changyou Chen; Chunyuan Li; Chris Tensmeyer; Tong Yu; Jiuxiang Gu; Jinhui Xu; Tong Sun |
|
187a | ZeroCap: Zero-Shot Image-to-Text Generation for Visual-Semantic Arithmetic |
Yoad Tewel; Yoav Shalev; Idan Schwartz; Lior Wolf |
|
188a | EMScore: Evaluating Video Captioning via Coarse-Grained and Fine-Grained Embedding Matching |
Yaya Shi; Xu Yang; Haiyang Xu; Chunfeng Yuan; Bing Li; Weiming Hu; Zheng-Jun Zha |
|
189a | Hierarchical Modular Network for Video Captioning |
Hanhua Ye; Guorong Li; Yuankai Qi; Shuhui Wang; Qingming Huang; Ming-Hsuan Yang |
|
190a | SwinBERT: End-to-End Transformers With Sparse Attention for Video Captioning |
Kevin Lin; Linjie Li; Chung-Ching Lin; Faisal Ahmed; Zhe Gan; Zicheng Liu; Yumao Lu; Lijuan Wang |
|
191a | End-to-End Generative Pretraining for Multimodal Video Captioning |
Paul Hongsuck Seo; Arsha Nagrani; Anurag Arnab; Cordelia Schmid |
|
192a | Beyond a Pre-Trained Object Detector: Cross-Modal Textual and Visual Context for Image Captioning | Chia-Wen Kuo; Zsolt Kira | |
193a | Scaling Up Vision-Language Pre-Training for Image Captioning |
Xiaowei Hu; Zhe Gan; Jianfeng Wang; Zhengyuan Yang; Zicheng Liu; Yumao Lu; Lijuan Wang |
|
194a | Comprehending and Ordering Semantics for Image Captioning | Yehao Li; Yingwei Pan; Ting Yao; Tao Mei | |
195a | NOC-REK: Novel Object Captioning With Retrieved Vocabulary From External Knowledge |
Duc Minh Vo; Hong Chen; Akihiro Sugimoto; Hideki Nakayama |
|
196a | Injecting Semantic Concepts Into End-to-End Image Captioning |
Zhiyuan Fang; Jianfeng Wang; Xiaowei Hu; Lin Liang; Zhe Gan; Lijuan Wang; Yezhou Yang; Zicheng Liu |
|
197a | DIFNet: Boosting Visual Information Flow for Image Captioning |
Mingrui Wu; Xuying Zhang; Xiaoshuai Sun; Yiyi Zhou; Chao Chen; Jiaxin Gu; Xing Sun; Rongrong Ji |
|
198a | VisualGPT: Data-Efficient Adaptation of Pretrained Language Models for Image Captioning |
Jun Chen; Han Guo; Kai Yi; Boyang Li; Mohamed Elhoseiny |
|
199a | Show, Deconfound and Tell: Image Captioning With Causal Inference |
Bing Liu; Dong Wang; Xu Yang; Yong Zhou; Rui Yao; Zhiwen Shao; Jiaqi Zhao |
|
200a | EI-CLIP: Entity-Aware Interventional Contrastive Learning for E-Commerce Cross-Modal Retrieval |
Haoyu Ma; Handong Zhao; Zhe Lin; Ajinkya Kale; Zhangyang Wang; Tong Yu; Jiuxiang Gu; Sunav Choudhary; Xiaohui Xie |
|
201a | CLIPstyler: Image Style Transfer With a Single Text Condition | Gihyun Kwon; Jong Chul Ye | |
202a | HairCLIP: Design Your Hair by Text and Reference Image |
Tianyi Wei; Dongdong Chen; Wenbo Zhou; Jing Liao; Zhentao Tan; Lu Yuan; Weiming Zhang; Nenghai Yu |
|
203a | DenseCLIP: Language-Guided Dense Prediction With Context-Aware Prompting |
Yongming Rao; Wenliang Zhao; Guangyi Chen; Yansong Tang; Zheng Zhu; Guan Huang; Jie Zhou; Jiwen Lu |
|
204a | On Guiding Visual Attention With Language Specification |
Suzanne Petryk; Lisa Dunlap; Keyan Nasseri; Joseph Gonzalez; Trevor Darrell; Anna Rohrbach |
|
205a | UTC: A Unified Transformer With Inter-Task Contrastive Learning for Visual Dialog |
Cheng Chen; Zhenshan Tan; Qingrong Cheng; Xin Jiang; Qun Liu; Yudong Zhu; Xiaodong Gu |
|
206a | Text-to-Image Synthesis Based on Object-Guided Joint-Decoding Transformer |
Fuxiang Wu; Liu Liu; Fusheng Hao; Fengxiang He; Jun Cheng |
|
207a | LiT: Zero-Shot Transfer With Locked-Image Text Tuning |
Xiaohua Zhai; Xiao Wang; Basil Mustafa; Andreas Steiner; Daniel Keysers; Alexander Kolesnikov; Lucas Beyer |
|
208a | GroupViT: Semantic Segmentation Emerges From Text Supervision |
Jiarui Xu; Shalini De Mello; Sifei Liu; Wonmin Byeon; Thomas Breuel; Jan Kautz; Xiaolong Wang |
|
209a | ReSTR: Convolution-Free Referring Image Segmentation Using Transformers |
Namyup Kim; Dongwon Kim; Cuiling Lan; Wenjun Zeng; Suha Kwak |
|
210a | LAVT: Language-Aware Vision Transformer for Referring Image Segmentation |
Zhao Yang; Jiaqi Wang; Yansong Tang; Kai Chen; Hengshuang Zhao; Philip H.S. Torr |
|
211a | An Empirical Study of Training End-to-End Vision-and-Language Transformers |
Zi-Yi Dou; Yichong Xu; Zhe Gan; Jianfeng Wang; Shuohang Wang; Lijuan Wang; Chenguang Zhu; Pengchuan Zhang; Lu Yuan; Nanyun Peng; Zicheng Liu; Michael Zeng |
|
212a | Are Multimodal Transformers Robust to Missing Modality? |
Mengmeng Ma; Jian Ren; Long Zhao; Davide Testuggine; Xi Peng |
|
Image & Video Synthesis and Generation | 213a | Text to Image Generation With Semantic-Spatial Aware GAN |
Wentong Liao; Kai Hu; Michael Ying Yang; Bodo Rosenhahn |
214a | StyleT2I: Toward Compositional and High-Fidelity Text-to-Image Synthesis |
Zhiheng Li; Martin Renqiang Min; Kai Li; Chenliang Xu |
|
215a | Blended Diffusion for Text-Driven Editing of Natural Images |
Omri Avrahami; Dani Lischinski; Ohad Fried |
|
216a | Make It Move: Controllable Image-to-Video Generation With Text Descriptions | Yaosi Hu; Chong Luo; Zhenzhong Chen | |
217a | Predict, Prevent, and Evaluate: Disentangled Text-Driven Image Manipulation Empowered by Pre-Trained Vision-Language Model |
Zipeng Xu; Tianwei Lin; Hao Tang; Fu Li; Dongliang He; Nicu Sebe; Radu Timofte; Luc Van Gool; Errui Ding |
|
218a | A Style-Aware Discriminator for Controllable Image Translation |
Kunhee Kim; Sanghun Park; Eunyeong Jeon; Taehun Kim; Daijin Kim |
|
219a | Alleviating Semantics Distortion in Unsupervised Low-Level Image-to-Image Translation via Structure Consistency Constraint |
Jiaxian Guo; Jiachen Li; Huan Fu; Mingming Gong; Kun Zhang; Dacheng Tao |
|
220a | Exploring Patch-Wise Semantic Relation for Contrastive Learning in Image-to-Image Translation Tasks | Chanyong Jung; Gihyun Kwon; Jong Chul Ye | |
221a | FlexIT: Towards Flexible Semantic Image Translation |
Guillaume Couairon; Asya Grechka; Jakob Verbeek; Holger Schwenk; Matthieu Cord |
|
222a | Modulated Contrast for Versatile Image Synthesis |
Fangneng Zhan; Jiahui Zhang; Yingchen Yu; Rongliang Wu; Shijian Lu |
|
223a | QS-Attn: Query-Selected Attention for Contrastive Learning in I2I Translation |
Xueqi Hu; Xinyue Zhou; Qiusheng Huang; Zhengyi Shi; Li Sun; Qingli Li |
|
224a | Self-Supervised Dense Consistency Regularization for Image-to-Image Translation |
Minsu Ko; Eunju Cha; Sungjoo Suh; Huijin Lee; Jae-Joon Han; Jinwoo Shin; Bohyung Han |
|
225a | Maximum Spatial Perturbation Consistency for Unpaired Image-to-Image Translation |
Yanwu Xu; Shaoan Xie; Wenhao Wu; Kun Zhang; Mingming Gong; Kayhan Batmanghelich |
|
226a | InstaFormer: Instance-Aware Image-to-Image Translation With Transformer |
Soohyun Kim; Jongbeom Baek; Jihye Park; Gyeongnyeon Kim; Seungryong Kim |
|
227a | Unsupervised Image-to-Image Translation With Generative Prior |
Shuai Yang; Liming Jiang; Ziwei Liu; Chen Change Loy |
|
228a | StylizedNeRF: Consistent 3D Scene Stylization As Stylized NeRF via 2D-3D Mutual Learning |
Yi-Hua Huang; Yue He; Yu-Jie Yuan; Yu-Kun Lai; Lin Gao |
|
229a | NeRF-Editing: Geometry Editing of Neural Radiance Fields |
Yu-Jie Yuan; Yang-Tian Sun; Yu-Kun Lai; Yuewen Ma; Rongfei Jia; Lin Gao |
|
230a | GeoNeRF: Generalizing NeRF With Geometry Priors |
Mohammad Mahdi Johari; Yann Lepoittevin; François Fleuret |
|
231a | Ray Priors Through Reprojection: Improving Neural Radiance Fields for Novel View Extrapolation |
Jian Zhang; Yuanqing Zhang; Huan Fu; Xiaowei Zhou; Bowen Cai; Jinchi Huang; Rongfei Jia; Binqiang Zhao; Xing Tang |
|
232a | AR-NeRF: Unsupervised Learning of Depth and Defocus Effects From Natural Images With Aperture Rendering Neural Radiance Fields | Takuhiro Kaneko | |
233a | HDR-NeRF: High Dynamic Range Neural Radiance Fields |
Xin Huang; Qi Zhang; Ying Feng; Hongdong Li; Xuan Wang; Qing Wang |
|
234a | NeRFReN: Neural Radiance Fields With Reflections |
Yuan-Chen Guo; Di Kang; Linchao Bao; Yu He; Song-Hai Zhang |
|
235a | Neural Point Light Fields |
Julian Ost; Issam Laradji; Alejandro Newell; Yuval Bahat; Felix Heide |
|
236a | 3D-Aware Image Synthesis via Learning Structural and Textural Representations |
Yinghao Xu; Sida Peng; Ceyuan Yang; Yujun Shen; Bolei Zhou |
|
237a | GIRAFFE HD: A High-Resolution 3D-Aware Generative Model |
Yang Xue; Yuheng Li; Krishna Kumar Singh; Yong Jae Lee |
|
238a | Multi-View Consistent Generative Adversarial Networks for 3D-Aware Image Synthesis |
Xuanmeng Zhang; Zhedong Zheng; Daiheng Gao; Bang Zhang; Pan Pan; Yi Yang |
|
239a | Bi-Level Doubly Variational Learning for Energy-Based Latent Variable Models |
Ge Kan; Jinhu Lü; Tian Wang; Baochang Zhang; Aichun Zhu; Lei Huang; Guodong Guo; Hichem Snoussi |
|
240a | High-Resolution Image Harmonization via Collaborative Dual Transformations |
Wenyan Cong; Xinhao Tao; Li Niu; Jing Liang; Xuesong Gao; Qihao Sun; Liqing Zhang |
|
241a | Brain-Supervised Image Editing |
Keith M. Davis III; Carlos de la Torre-Ortiz; Tuukka Ruotsalo |
|
Vision & Graphics | 242a | De-Rendering 3D Objects in the Wild |
Felix Wimbauer; Shangzhe Wu; Christian Rupprecht |
243a | Neural Fields As Learnable Kernels for 3D Reconstruction |
Francis Williams; Zan Gojcic; Sameh Khamis; Denis Zorin; Joan Bruna; Sanja Fidler; Or Litany |
|
244a | HyperStyle: StyleGAN Inversion With HyperNetworks for Real Image Editing |
Yuval Alaluf; Omer Tov; Ron Mokady; Rinon Gal; Amit Bermano |
|
245a | 3PSDF: Three-Pole Signed Distance Function for Learning Surfaces With Arbitrary Topologies |
Weikai Chen; Cheng Lin; Weiyang Li; Bo Yang |
|
246a | Pop-Out Motion: 3D-Aware Image Deformation via Learning the Shape Laplacian |
Jihyun Lee; Minhyuk Sung; Hyunjin Kim; Tae-Kyun Kim |
|
247a | Deep Image-Based Illumination Harmonization |
Zhongyun Bao; Chengjiang Long; Gang Fu; Daquan Liu; Yuanzhen Li; Jiaming Wu; Chunxia Xiao |
|
248a | GLASS: Geometric Latent Augmentation for Shape Spaces |
Sanjeev Muralikrishnan; Siddhartha Chaudhuri; Noam Aigerman; Vladimir G. Kim; Matthew Fisher; Niloy J. Mitra |
|
249a | PhotoScene: Photorealistic Material and Lighting Transfer for Indoor Scenes |
Yu-Ying Yeh; Zhengqin Li; Yannick Hold-Geoffroy; Rui Zhu; Zexiang Xu; Miloš Hašan; Kalyan Sunkavalli; Manmohan Chandraker |
|
250a | Neural Template: Topology-Aware Reconstruction and Disentangled Generation of 3D Meshes |
Ka-Hei Hui; Ruihui Li; Jingyu Hu; Chi-Wing Fu |
|
251a | Neural Mesh Simplification |
Rolandos Alexandros Potamias; Stylianos Ploumpis; Stefanos Zafeiriou |
|
252a | SkinningNet: Two-Stream Graph Convolutional Neural Network for Skinning Prediction of Synthetic Characters |
Albert Mosella-Montoro; Javier Ruiz-Hidalgo |
|
253a | CLIP-Forge: Towards Zero-Shot Text-To-Shape Generation |
Aditya Sanghi; Hang Chu; Joseph G. Lambourne; Ye Wang; Chin-Yi Cheng; Marco Fumero; Kamal Rahimi Malekshan |
|
254a | UNIST: Unpaired Neural Implicit Shape Translation Network |
Qimin Chen; Johannes Merz; Aditya Sanghi; Hooman Shayani; Ali Mahdavi-Amiri; Hao Zhang |
|
255a | CoNeRF: Controllable Neural Radiance Fields |
Kacper Kania; Kwang Moo Yi; Marek Kowalski; Tomasz Trzciński; Andrea Tagliasacchi |
|
256a | Neural Points: Point Cloud Representation With Neural Fields for Arbitrary Upsampling |
Wanquan Feng; Jin Li; Hongrui Cai; Xiaonan Luo; Juyong Zhang |
|
257a | Modeling Indirect Illumination for Inverse Rendering |
Yuanqing Zhang; Jiaming Sun; Xingyi He; Huan Fu; Rongfei Jia; Xiaowei Zhou |
|
258a | Neural Head Avatars From Monocular RGB Videos |
Philip-William Grassal; Malte Prinzler; Titus Leistner; Carsten Rother; Matthias Nießner; Justus Thies |
|
259a | DeepCurrents: Learning Implicit Representations of Shapes With Boundaries |
David Palmer; Dmitriy Smirnov; Stephanie Wang; Albert Chern; Justin Solomon |