Presentation Schedule
All times are in Central time zone
Date: Thursday, June 23, 2022 10:00AM – 12:30PM
Session Title | Poster ID | Title | Authors |
Image & Video Synthesis and Generation | 55a | OSSGAN: Open-Set Semi-Supervised Image Generation |
Kai Katsumata; Duc Minh Vo; Hideki Nakayama |
56a | Attribute Group Editing for Reliable Few-Shot Image Generation |
Guanqi Ding; Xinzhe Han; Shuhui Wang; Shuzhe Wu; Xin Jin; Dandan Tu; Qingming Huang |
|
|
57a | Few Shot Generative Model Adaption via Relaxed Spatial Structural Alignment |
Jiayu Xiao; Liang Li; Chaofei Wang; Zheng-Jun Zha; Qingming Huang |
58a | Semantic-Shape Adaptive Feature Modulation for Semantic Image Synthesis |
Zhengyao Lv; Xiaoming Li; Zhenxing Niu; Bing Cao; Wangmeng Zuo |
|
59a | Retrieval-Based Spatially Adaptive Normalization for Semantic Image Synthesis |
Yupeng Shi; Xiao Liu; Yuxiang Wei; Zhongqin Wu; Wangmeng Zuo |
|
60a | Generative Flows With Invertible Attentions |
Rhea Sanjay Sukthanker; Zhiwu Huang; Suryansh Kumar; Radu Timofte; Luc Van Gool |
|
61a | Style-Structure Disentangled Features and Normalizing Flows for Diverse Icon Colorization |
Yuan-kui Li; Yun-Hsuan Lien; Yu-Shuen Wang |
|
62a | SemanticStyleGAN: Learning Compositional Generative Priors for Controllable Image Synthesis and Editing |
Yichun Shi; Xiao Yang; Yangyue Wan; Xiaohui Shen |
|
63a | Manifold Learning Benefits GANs |
Yao Ni; Piotr Koniusz; Richard Hartley; Richard Nock |
|
64a | DO-GAN: A Double Oracle Framework for Generative Adversarial Networks |
Aye Phyu Phyu Aung; Xinrun Wang; Runsheng Yu; Bo An; Senthilnath Jayavelu; Xiaoli Li |
|
65a | Improving GAN Equilibrium by Raising Spatial Awareness |
Jianyuan Wang; Ceyuan Yang; Yinghao Xu; Yujun Shen; Hongdong Li; Bolei Zhou |
|
66a | Feature Statistics Mixing Regularization for Generative Adversarial Networks | Junho Kim; Yunjey Choi; Youngjung Uh | |
67a | StyleSwin: Transformer-Based GAN for High-Resolution Image Generation |
Bowen Zhang; Shuyang Gu; Bo Zhang; Jianmin Bao; Dong Chen; Fang Wen; Yong Wang; Baining Guo |
|
68a | MaskGIT: Masked Generative Image Transformer |
Huiwen Chang; Han Zhang; Lu Jiang; Ce Liu; William T. Freeman |
|
69a | StyTr2: Image Style Transfer With Transformers |
Yingying Deng; Fan Tang; Weiming Dong; Chongyang Ma; Xingjia Pan; Lei Wang; Changsheng Xu |
|
70a | Style Transformer for Image Inversion and Editing |
Xueqi Hu; Qiusheng Huang; Zhengyi Shi; Siyuan Li; Changxin Gao; Li Sun; Qingli Li |
|
71a | Reduce Information Loss in Transformers for Pluralistic Image Inpainting |
Qiankun Liu; Zhentao Tan; Dongdong Chen; Qi Chu; Xiyang Dai; Yinpeng Chen; Mengchen Liu; Lu Yuan; Nenghai Yu |
|
72a | Incremental Transformer Structure Enhanced Image Inpainting With Masking Positional Encoding | Qiaole Dong; Chenjie Cao; Yanwei Fu | |
73a | UniCoRN: A Unified Conditional Image Repainting Network |
Jimeng Sun; Shuchen Weng; Zheng Chang; Si Li; Boxin Shi |
|
74a | High-Fidelity GAN Inversion for Image Attribute Editing |
Tengfei Wang; Yong Zhang; Yanbo Fan; Jue Wang; Qifeng Chen |
|
75a | HyperInverter: Improving StyleGAN Inversion via Hypernetwork |
Tan M. Dinh; Anh Tuan Tran; Rang Nguyen; Binh-Son Hua |
|
76a | Spatially-Adaptive Multilayer Selection for GAN Inversion and Editing |
Gaurav Parmar; Yijun Li; Jingwan Lu; Richard Zhang; Jun-Yan Zhu; Krishna Kumar Singh |
|
77a | On Aliased Resizing and Surprising Subtleties in GAN Evaluation |
Gaurav Parmar; Richard Zhang; Jun-Yan Zhu |
|
78a | Dual-Path Image Inpainting With Auxiliary GAN Inversion |
Wentao Wang; Li Niu; Jianfu Zhang; Xue Yang; Liqing Zhang |
|
79a | InOut: Diverse Image Outpainting via GAN Inversion |
Yen-Chi Cheng; Chieh Hubert Lin; Hsin-Ying Lee; Jian Ren; Sergey Tulyakov; Ming-Hsuan Yang |
|
80a | Diverse Plausible 360-Degree Image Outpainting for Efficient 3DCG Background Creation |
Naofumi Akimoto; Yuhi Matsuo; Yoshimitsu Aoki |
|
81a | Contextual Outpainting With Object-Level Contrastive Learning | Jiacheng Li; Chang Chen; Zhiwei Xiong | |
82a | RePaint: Inpainting Using Denoising Diffusion Probabilistic Models |
Andreas Lugmayr; Martin Danelljan; Andres Romero; Fisher Yu; Radu Timofte; Luc Van Gool |
|
83a | Perception Prioritized Training of Diffusion Models |
Jooyoung Choi; Jungbeom Lee; Chaehun Shin; Sungwon Kim; Hyunwoo Kim; Sungroh Yoon |
|
84a | Dynamic Dual-Output Diffusion Models | Yaniv Benny; Lior Wolf | |
85a | Generating High Fidelity Data From Low-Density Regions Using Diffusion Models |
Vikash Sehwag; Caner Hazirbas; Albert Gordo; Firat Ozgenel; Cristian Canton |
|
86a | Global Context With Discrete Diffusion in Vector Quantised Modelling for Image Generation |
Minghui Hu; Yujie Wang; Tat-Jen Cham; Jianfei Yang; P.N. Suganthan |
|
87a | Bridging Global Context Interactions for High-Fidelity Image Completion |
Chuanxia Zheng; Tat-Jen Cham; Jianfei Cai; Dinh Phung |
|
88a | Autoregressive Image Generation Using Residual Quantization |
Doyup Lee; Chiheon Kim; Saehoon Kim; Minsu Cho; Wook-Shin Han |
|
89a | Arbitrary-Scale Image Synthesis |
Evangelos Ntavelis; Mohamad Shahbazi; Iason Kastanis; Radu Timofte; Martin Danelljan; Luc Van Gool |
|
90a | Cluster-Guided Image Synthesis With Unconditional Models |
Markos Georgopoulos; James Oldfield; Grigorios G. Chrysos; Yannis Panagakis |
|
Segmentation, Grouping and Shape Analysis | 91a | Dynamic Prototype Convolution Network for Few-Shot Semantic Segmentation |
Jie Liu; Yanqi Bao; Guo-Sen Xie; Huan Xiong; Jan-Jakob Sonke; Efstratios Gavves |
92a | Generalized Few-Shot Semantic Segmentation |
Zhuotao Tian; Xin Lai; Li Jiang; Shu Liu; Michelle Shu; Hengshuang Zhao; Jiaya Jia |
|
93a | Learning Non-Target Knowledge for Few-Shot Semantic Segmentation |
Yuanwei Liu; Nian Liu; Qinglong Cao; Xiwen Yao; Junwei Han; Ling Shao |
|
94a | Decoupling Zero-Shot Semantic Segmentation |
Jian Ding; Nan Xue; Gui-Song Xia; Dengxin Dai |
|
95a | Class-Balanced Pixel-Level Self-Labeling for Domain Adaptive Semantic Segmentation |
Ruihuang Li; Shuai Li; Chenhang He; Yabin Zhang; Xu Jia; Lei Zhang |
|
96a | ContrastMask: Contrastive Learning To Segment Every Thing |
Xuehui Wang; Kai Zhao; Ruixin Zhang; Shouhong Ding; Yan Wang; Wei Shen |
|
97a | The Neurally-Guided Shape Parser: Grammar-Based Labeling of 3D Shape Regions With Approximate Inference |
R. Kenny Jones; Aalia Habib; Rana Hanocka; Daniel Ritchie |
|
98a | AutoGPart: Intermediate Supervision Search for Generalizable 3D Part Segmentation |
Xueyi Liu; Xiaomeng Xu; Anyi Rao; Chuang Gan; Li Yi |
|
99a | APES: Articulated Part Extraction From Sprite Sheets |
Zhan Xu; Matthew Fisher; Yang Zhou; Deepali Aneja; Rushikesh Dudhat; Li Yi; Evangelos Kalogerakis |
|
100a | GASP, a Generalized Framework for Agglomerative Clustering of Signed Graphs and Its Application to Instance Segmentation |
Alberto Bailoni; Constantin Pape; Nathan Hütsch; Steffen Wolf; Thorsten Beier; Anna Kreshuk; Fred A. Hamprecht |
|
101a | CycleMix: A Holistic Strategy for Medical Image Segmentation From Scribble Supervision | Ke Zhang; Xiahai Zhuang | |
102a | Cross-Patch Dense Contrastive Learning for Semi-Supervised Segmentation of Cellular Nuclei in Histopathologic Images |
Huisi Wu; Zhaoze Wang; Youyi Song; Lin Yang; Jing Qin |
|
103a | C-CAM: Causal CAM for Weakly Supervised Semantic Segmentation on Medical Image |
Zhang Chen; Zhiqiang Tian; Jihua Zhu; Ce Li; Shaoyi Du |
|
104a | CRIS: CLIP-Driven Referring Image Segmentation |
Zhaoqing Wang; Yu Lu; Qiang Li; Xunqiang Tao; Yandong Guo; Mingming Gong; Tongliang Liu |
|
105a | MatteFormer: Transformer-Based Image Matting via Prior-Tokens |
GyuTae Park; SungJoon Son; JaeYoung Yoo; SeHo Kim; Nojun Kwak |
|
106a | Boosting Robustness of Image Matting With Context Assembling and Strong Data Augmentation |
Yutong Dai; Brian Price; He Zhang; Chunhua Shen |
|
107a | Pyramid Grafting Network for One-Stage High Resolution Saliency Detection |
Chenxi Xie; Changqun Xia; Mingcan Ma; Zhirui Zhao; Xiaowu Chen; Jia Li |
|
108a | Multi-Source Uncertainty Mining for Deep Unsupervised Saliency Detection |
Yifan Wang; Wenbo Zhang; Lijun Wang; Ting Liu; Huchuan Lu |
|
109a | Modeling Motion With Multi-Modal Features for Text-Based Video Segmentation |
Wangbo Zhao; Kai Wang; Xiangxiang Chu; Fuzhao Xue; Xinchao Wang; Yang You |
|
110a | GAT-CADNet: Graph Attention Network for Panoptic Symbol Spotting in CAD Drawings |
Zhaohua Zheng; Jianfang Li; Lingjie Zhu; Honghua Li; Frank Petzold; Ping Tan |
|
111a | Bending Graphs: Hierarchical Shape Matching Using Gated Optimal Transport |
Mahdi Saleh; Shun-Cheng Wu; Luca Cosmo; Nassir Navab; Benjamin Busam; Federico Tombari |
|
112a | CAPRI-Net: Learning Compact CAD Shapes With Adaptive Primitive Assembly |
Fenggen Yu; Zhiqin Chen; Manyi Li; Aditya Sanghi; Hooman Shayani; Ali Mahdavi-Amiri; Hao Zhang |
|
113a | RIM-Net: Recursive Implicit Fields for Unsupervised Learning of Hierarchical Shape Structures | Chengjie Niu; Manyi Li; Kai Xu; Hao Zhang | |
114a | Discovering Objects That Can Move |
Zhipeng Bao; Pavel Tokmakov; Allan Jabri; Yu-Xiong Wang; Adrien Gaidon; Martial Hebert |
|
115a | PatchFormer: An Efficient Point Transformer With Patch Attention |
Cheng Zhang; Haocheng Wan; Xinyi Shen; Zizhao Wu |
|
116a | Panoptic-PHNet: Towards Real-Time and High-Precision LiDAR Panoptic Segmentation via Clustering Pseudo Heatmap |
Jinke Li; Xiao He; Yang Wen; Yuan Gao; Xiaoqiang Cheng; Dan Zhang |
|
117a | SemAffiNet: Semantic-Affine Transformation for Point Cloud Segmentation |
Ziyi Wang; Yongming Rao; Xumin Yu; Jie Zhou; Jiwen Lu |
|
118a | An MIL-Derived Transformer for Weakly Supervised Point Cloud Segmentation |
Cheng-Kun Yang; Ji-Jia Wu; Kai-Syun Chen; Yung-Yu Chuang; Yen-Yu Lin |
|
119a | Weakly Supervised Segmentation on Outdoor 4D Point Clouds With Temporal Matching and Spatial Graph Propagation |
Hanyu Shi; Jiacheng Wei; Ruibo Li; Fayao Liu; Guosheng Lin |
|
120a | Point2Cyl: Reverse Engineering 3D Objects From Point Clouds to Extrusion Cylinders |
Mikaela Angelina Uy; Yen-Yu Chang; Minhyuk Sung; Purvi Goel; Joseph G. Lambourne; Tolga Birdal; Leonidas J. Guibas |
|
Deep Learning Architectures & Techniques | 121a | Demystifying the Neural Tangent Kernel From a Practical Perspective: Can It Be Trusted for Neural Architecture Search Without Training? |
Jisoo Mok; Byunggook Na; Ji-Hoon Kim; Dongyoon Han; Sungroh Yoon |
122a | BaLeNAS: Differentiable Architecture Search via the Bayesian Learning Rule |
Miao Zhang; Shirui Pan; Xiaojun Chang; Steven Su; Jilin Hu; Gholamreza (Reza) Haffari; Bin Yang |
|
123a | Arch-Graph: Acyclic Architecture Relation Predictor for Task-Transferable Neural Architecture Search |
Minbin Huang; Zhijian Huang; Changlin Li; Xin Chen; Hang Xu; Zhenguo Li; Xiaodan Liang |
|
124a | Shapley-NAS: Discovering Operation Contribution for Neural Architecture Search |
Han Xiao; Ziwei Wang; Zheng Zhu; Jie Zhou; Jiwen Lu |
|
125a | GreedyNASv2: Greedier Search With a Greedy Path Filter |
Tao Huang; Shan You; Fei Wang; Chen Qian; Changshui Zhang; Xiaogang Wang; Chang Xu |
|
126a | Neural Architecture Search With Representation Mutual Information |
Xiawu Zheng; Xiang Fei; Lei Zhang; Chenglin Wu; Fei Chao; Jianzhuang Liu; Wei Zeng; Yonghong Tian; Rongrong Ji |
|
127a | Performance-Aware Mutual Knowledge Distillation for Improving Neural Architecture Search | Pengtao Xie; Xuefeng Du | |
128a | Knowledge Distillation With the Reused Teacher Classifier |
Defang Chen; Jian-Ping Mei; Hailin Zhang; Can Wang; Yan Feng; Chun Chen |
|
129a | Self-Distillation From the Last Mini-Batch for Consistency Regularization |
Yiqing Shen; Liwu Xu; Yuzhe Yang; Yaqian Li; Yandong Guo |
|
130a | Decoupled Knowledge Distillation |
Borui Zhao; Quan Cui; Renjie Song; Yiyu Qiu; Jiajun Liang |
|
131a | Scaling Up Your Kernels to 31x31: Revisiting Large Kernel Design in CNNs |
Xiaohan Ding; Xiangyu Zhang; Jungong Han; Guiguang Ding |
|
132a | A ConvNet for the 2020s |
Zhuang Liu; Hanzi Mao; Chao-Yuan Wu; Christoph Feichtenhofer; Trevor Darrell; Saining Xie |
|
133a | Beyond Fixation: Dynamic Window Visual Transformer |
Pengzhen Ren; Changlin Li; Guangrun Wang; Yun Xiao; Qing Du; Xiaodan Liang; Xiaojun Chang |
|
134a | Lite Vision Transformer With Enhanced Self-Attention |
Chenglin Yang; Yilin Wang; Jianming Zhang; He Zhang; Zijun Wei; Zhe Lin; Alan Yuille |
|
135a | Swin Transformer V2: Scaling Up Capacity and Resolution |
Ze Liu; Han Hu; Yutong Lin; Zhuliang Yao; Zhenda Xie; Yixuan Wei; Jia Ning; Yue Cao; Zheng Zhang; Li Dong; Furu Wei; Baining Guo |
|
136a | The Principle of Diversity: Training Stronger Vision Transformers Calls for Reducing All Levels of Redundancy |
Tianlong Chen; Zhenyu Zhang; Yu Cheng; Ahmed Awadallah; Zhangyang Wang |
|
137a | MulT: An End-to-End Multitask Learning Transformer |
Deblina Bhattacharjee; Tong Zhang; Sabine Süsstrunk; Mathieu Salzmann |
|
138a | Towards Robust Vision Transformer |
Xiaofeng Mao; Gege Qi; Yuefeng Chen; Xiaodan Li; Ranjie Duan; Shaokai Ye; Yuan He; Hui Xue |
|
139a | DearKD: Data-Efficient Early Knowledge Distillation for Vision Transformers |
Xianing Chen; Qiong Cao; Yujie Zhong; Jing Zhang; Shenghua Gao; Dacheng Tao |
|
140a | MSG-Transformer: Exchanging Local Spatial Information by Manipulating Messenger Tokens |
Jiemin Fang; Lingxi Xie; Xinggang Wang; Xiaopeng Zhang; Wenyu Liu; Qi Tian |
|
141a | NomMer: Nominate Synergistic Context in Vision Transformer for Visual Recognition |
Hao Liu; Xinghua Jiang; Xin Li; Zhimin Bao; Deqiang Jiang; Bo Ren |
|
142a | TopFormer: Token Pyramid Transformer for Mobile Semantic Segmentation |
Wenqiang Zhang; Zilong Huang; Guozhong Luo; Tao Chen; Xinggang Wang; Wenyu Liu; Gang Yu; Chunhua Shen |
|
143a | Multi-Scale High-Resolution Vision Transformer for Semantic Segmentation |
Jiaqi Gu; Hyoukjun Kwon; Dilin Wang; Wei Ye; Meng Li; Yu-Hsin Chen; Liangzhen Lai; Vikas Chandra; David Z. Pan |
|
144a | Scaling Vision Transformers |
Xiaohua Zhai; Alexander Kolesnikov; Neil Houlsby; Lucas Beyer |
|
145a | Bridged Transformer for Vision and Point Cloud 3D Object Detection |
Yikai Wang; TengQi Ye; Lele Cao; Wenbing Huang; Fuchun Sun; Fengxiang He; Dacheng Tao |
|
146a | CSWin Transformer: A General Vision Transformer Backbone With Cross-Shaped Windows |
Xiaoyi Dong; Jianmin Bao; Dongdong Chen; Weiming Zhang; Nenghai Yu; Lu Yuan; Dong Chen; Baining Guo |
|
147a | TransMix: Attend To Mix for Vision Transformers |
Jie-Neng Chen; Shuyang Sun; Ju He; Philip H.S. Torr; Alan Yuille; Song Bai |
|
148a | MiniViT: Compressing Vision Transformers With Weight Multiplexing |
Jinnian Zhang; Houwen Peng; Kan Wu; Mengchen Liu; Bin Xiao; Jianlong Fu; Lu Yuan |
|
149a | Fine-Tuning Image Transformers Using Learnable Memory |
Mark Sandler; Andrey Zhmoginov; Max Vladymyrov; Andrew Jackson |
|
150a | Patch Slimming for Efficient Vision Transformers |
Yehui Tang; Kai Han; Yunhe Wang; Chang Xu; Jianyuan Guo; Chao Xu; Dacheng Tao |
|
151a | CMT: Convolutional Neural Networks Meet Vision Transformers |
Jianyuan Guo; Kai Han; Han Wu; Yehui Tang; Xinghao Chen; Yunhe Wang; Chang Xu |
|
152a | Multimodal Token Fusion for Vision Transformers |
Yikai Wang; Xinghao Chen; Lele Cao; Wenbing Huang; Fuchun Sun; Yunhe Wang |
|
Efficient Learning & Inference | 153a | CAFE: Learning To Condense Dataset by Aligning Features |
Kai Wang; Bo Zhao; Xiangyu Peng; Zheng Zhu; Shuo Yang; Shuo Wang; Guan Huang; Hakan Bilen; Xinchao Wang; Yang You |
154a | Lite-MDETR: A Lightweight Multi-Modal Detector |
Qian Lou; Yen-Chang Hsu; Burak Uzkent; Ting Hua; Yilin Shen; Hongxia Jin |
|
155a | DeeCap: Dynamic Early Exiting for Efficient Image Captioning |
Zhengcong Fei; Xu Yan; Shuhui Wang; Qi Tian |
|
156a | Searching the Deployable Convolution Neural Networks for GPUs |
Linnan Wang; Chenhan Yu; Satish Salian; Slawomir Kierat; Szymon Migacz; Alex Fit Florea |
|
157a | Active Learning by Feature Mixing |
Amin Parvaneh; Ehsan Abbasnejad; Damien Teney; Gholamreza (Reza) Haffari; Anton van den Hengel; Javen Qinfeng Shi |
|
158a | When To Prune? A Policy Towards Early Structural Pruning |
Maying Shen; Pavlo Molchanov; Hongxu Yin; Jose M. Alvarez |
|
159a | Contrastive Dual Gating: Learning Sparse Features With Contrastive Learning |
Jian Meng; Li Yang; Jinwoo Shin; Deliang Fan; Jae-sun Seo |
|
160a | How Well Do Sparse ImageNet Models Transfer? |
Eugenia Iofinova; Alexandra Peste; Mark Kurtz; Dan Alistarh |
|
161a | Rep-Net: Efficient On-Device Learning via Feature Reprogramming | Li Yang; Adnan Siraj Rakin; Deliang Fan | |
162a | CHEX: CHannel EXploration for CNN Model Compression |
Zejiang Hou; Minghai Qin; Fei Sun; Xiaolong Ma; Kun Yuan; Yi Xu; Yen-Kuang Chen; Rong Jin; Yuan Xie; Sun-Yuan Kung |
|
163a | HODEC: Towards Efficient High-Order DEcomposed Convolutional Neural Networks |
Miao Yin; Yang Sui; Wanzhao Yang; Xiao Zang; Yu Gong; Bo Yuan |
|
164a | AdaViT: Adaptive Vision Transformers for Efficient Image Recognition |
Lingchen Meng; Hengduo Li; Bor-Chun Chen; Shiyi Lan; Zuxuan Wu; Yu-Gang Jiang; Ser-Nam Lim |
|
165a | Cross-Image Relational Knowledge Distillation for Semantic Segmentation |
Chuanguang Yang; Helong Zhou; Zhulin An; Xue Jiang; Yongjun Xu; Qian Zhang |
|
166a | Mr.BiQ: Post-Training Non-Uniform Quantization Based on Minimizing the Reconstruction Error |
Yongkweon Jeon; Chungman Lee; Eulrang Cho; Yeonju Ro |
|
167a | IntraQ: Learning Synthetic Images With Intra-Class Heterogeneity for Zero-Shot Network Quantization |
Yunshan Zhong; Mingbao Lin; Gongrui Nan; Jianzhuang Liu; Baochang Zhang; Yonghong Tian; Rongrong Ji |
|
168a | DECORE: Deep Compression With Reinforcement Learning |
Manoj Alwani; Yang Wang; Vashisht Madhavan |
|
169a | Towards Efficient and Scalable Sharpness-Aware Minimization |
Yong Liu; Siqi Mai; Xiangning Chen; Cho-Jui Hsieh; Yang You |
|
170a | AEGNN: Asynchronous Event-Based Graph Neural Networks |
Simon Schaefer; Daniel Gehrig; Davide Scaramuzza |
|
171a | DiSparse: Disentangled Sparsification for Multitask Model Compression |
Xinglong Sun; Ali Hassani; Zhangyang Wang; Gao Huang; Humphrey Shi |
|
172a | Multi-Modal Extreme Classification |
Anshul Mittal; Kunal Dahiya; Shreya Malani; Janani Ramaswamy; Seba Kuruvilla; Jitendra Ajmera; Keng-hao Chang; Sumeet Agarwal; Purushottam Kar; Manik Varma |
|
173a | A Sampling-Based Approach for Efficient Clustering in Large Datasets |
Georgios Exarchakis; Omar Oubari; Gregor Lenz |
|
174a | Come-Closer-Diffuse-Faster: Accelerating Conditional Diffusion Models for Inverse Problems Through Stochastic Contraction |
Hyungjin Chung; Byeongsu Sim; Jong Chul Ye |
|
175a | Learnable Lookup Table for Neural Network Quantization |
Longguang Wang; Xiaoyu Dong; Yingqian Wang; Li Liu; Wei An; Yulan Guo |
|
176a | Instance-Aware Dynamic Neural Network Quantization |
Zhenhua Liu; Yunhe Wang; Kai Han; Siwei Ma; Wen Gao |
|
177a | Training High-Performance Low-Latency Spiking Neural Networks by Differentiation on Spike Representation |
Qingyan Meng; Mingqing Xiao; Shen Yan; Yisen Wang; Zhouchen Lin; Zhi-Quan Luo |
|
178a | Fire Together Wire Together: A Dynamic Pruning Approach With Self-Supervised Mask Prediction |
Sara Elkerdawy; Mostafa Elhoushi; Hong Zhang; Nilanjan Ray |
|
179a | Wavelet Knowledge Distillation: Towards Efficient Image-to-Image Translation |
Linfeng Zhang; Xin Chen; Xiaobing Tu; Pengfei Wan; Ning Xu; Kaisheng Ma |
|
180a | PokeBNN: A Binary Pursuit of Lightweight Accuracy | Yichi Zhang; Zhiru Zhang; Lukasz Lew | |
181a | Automated Progressive Learning for Efficient Training of Vision Transformers |
Changlin Li; Bohan Zhuang; Guangrun Wang; Xiaodan Liang; Xiaojun Chang; Yi Yang |
|
182a | DeltaCNN: End-to-End CNN Inference of Sparse Frame Differences in Videos |
Mathias Parger; Chengcheng Tang; Christopher D. Twigg; Cem Keskin; Robert Wang; Markus Steinberger |
|
183a | Channel Balancing for Accurate Quantization of Winograd Convolutions | Vladimir Chikin; Vladimir Kryzhanovskiy | |
184a | ClusterGNN: Cluster-Based Coarse-To-Fine Graph Neural Network for Efficient Feature Matching |
Yan Shi; Jun-Xiong Cai; Yoli Shavit; Tai-Jiang Mu; Wensen Feng; Kai Zhang |
|
185a | Interspace Pruning: Using Adaptive Filter Representations To Improve Training of Sparse CNNs |
Paul Wimmer; Jens Mehnert; Alexandru Condurache |
|
186a | AlignQ: Alignment Quantization With ADMM-Based Correlation Preservation |
Ting-An Chen; De-Nian Yang; Ming-Syan Chen |
|
187a | TVConv: Efficient Translation Variant Convolution for Layout-Aware Visual Processing |
Jierun Chen; Tianlang He; Weipeng Zhuo; Li Ma; Sangtae Ha; S.-H. Gary Chan |
|
188a | SplitNets: Designing Neural Architectures for Efficient Distributed Computing on Head-Mounted Systems |
Xin Dong; Barbara De Salvo; Meng Li; Chiao Liu; Zhongnan Qu; H.T. Kung; Ziyun Li |
|
189a | TO-FLOW: Efficient Continuous Normalizing Flows With Temporal Optimization Adjoint With Moving Speed |
Shian Du; Yihong Luo; Wei Chen; Jian Xu; Delu Zeng |
|
Physics-Based Vision and Shape-From-X | 190a | DiLiGenT102: A Photometric Stereo Benchmark Dataset With Controlled Shape and Material Variation |
Jieji Ren; Feishi Wang; Jiahao Zhang; Qian Zheng; Mingjun Ren; Boxin Shi |
191a | Universal Photometric Stereo Network Using Global Lighting Contexts | Satoshi Ikehata | |
192a | Uncertainty-Aware Deep Multi-View Photometric Stereo |
Berk Kaya; Suryansh Kumar; Carlos Oliveira; Vittorio Ferrari; Luc Van Gool |
|
193a | Fast Light-Weight Near-Field Photometric Stereo |
Daniel Lichy; Soumyadip Sengupta; David W. Jacobs |
|
194a | Glass Segmentation Using Intensity and Spectral Polarization Cues |
Haiyang Mei; Bo Dong; Wen Dong; Jiaxi Yang; Seung-Hwan Baek; Felix Heide; Pieter Peers; Xiaopeng Wei; Xin Yang |
|
195a | Shape From Polarization for Complex Scenes in the Wild |
Chenyang Lei; Chenyang Qi; Jiaxin Xie; Na Fan; Vladlen Koltun; Qifeng Chen |
|
196a | Deep Depth From Focus With Differential Focus Volume | Fengting Yang; Xiaolei Huang; Zihan Zhou | |
197a | Optimal LED Spectral Multiplexing for NIR2RGB Translation |
Lei Liu; Yuze Chen; Junchi Yan; Yinqiang Zheng |
|
198a | Shape From Thermal Radiation: Passive Ranging Using Multi-Spectral LWIR Measurements |
Yasuto Nagase; Takahiro Kushida; Kenichiro Tanaka; Takuya Funatomi; Yasuhiro Mukaigawa |
|
199a | NAN: Noise-Aware NeRFs for Burst-Denoising | Naama Pearl; Tali Treibitz; Simon Korman | |
200a | Estimating Fine-Grained Noise Model via Contrastive Learning | Yunhao Zou; Ying Fu | |
201a | Real-Time Hyperspectral Imaging in Hardware via Trained Metasurface Encoders |
Maksim Makarenko; Arturo Burguete-Lopez; Qizhou Wang; Fedor Getman; Silvio Giancola; Bernard Ghanem; Andrea Fratalocchi |
|
202a | MNSRNet: Multimodal Transformer Network for 3D Surface Super-Resolution |
Wuyuan Xie; Tengcong Huang; Miaohui Wang |
|
203a | PhyIR: Physics-Based Inverse Rendering for Panoramic Indoor Images |
Zhen Li; Lingli Wang; Xiang Huang; Cihui Pan; Jiaqi Yang |
|
Visual Reasoning | 204a | Neural Shape Mating: Self-Supervised Object Assembly With Adversarial Shape Priors |
Yun-Chun Chen; Haoda Li; Dylan Turpin; Alec Jacobson; Animesh Garg |
205a | Learning To Anticipate Future With Dynamic Context Removal | Xinyu Xu; Yong-Lu Li; Cewu Lu | |
206a | Self-Supervised Spatial Reasoning on Multi-View Line Drawings |
Siyuan Xiang; Anbang Yang; Yanfei Xue; Yaoqing Yang; Chen Feng |
|
207a | Contextual Debiasing for Visual Recognition With Causal Mechanisms |
Ruyang Liu; Hao Liu; Ge Li; Haodi Hou; TingHao Yu; Tao Yang |
|
3D From Multi-View & Sensors | 208a | Relative Pose From a Calibrated and an Uncalibrated Smartphone Image |
Yaqing Ding; Daniel Barath; Jian Yang; Zuzana Kukelova |
209a | Exploiting Rigidity Constraints for LiDAR Scene Flow Estimation |
Guanting Dong; Yueyi Zhang; Hanlin Li; Xiaoyan Sun; Zhiwei Xiong |
|
210a | NICE-SLAM: Neural Implicit Scalable Encoding for SLAM |
Zihan Zhu; Songyou Peng; Viktor Larsson; Weiwei Xu; Hujun Bao; Zhaopeng Cui; Martin R. Oswald; Marc Pollefeys |
|
211a | NinjaDesc: Content-Concealing Visual Descriptors via Adversarial Learning |
Tony Ng; Hyo Jin Kim; Vincent T. Lee; Daniel DeTone; Tsun-Yi Yang; Tianwei Shen; Eddy Ilg; Vassileios Balntas; Krystian Mikolajczyk; Chris Sweeney |
|
212a | ScaleNet: A Shallow Architecture for Scale Estimation |
Axel Barroso-Laguna; Yurun Tian; Krystian Mikolajczyk |
|
213a | Camera Pose Estimation Using Implicit Distortion Models | Linfei Pan; Marc Pollefeys; Viktor Larsson | |
214a | GIFS: Neural Implicit Function for General Shape Representation |
Jianglong Ye; Yuntao Chen; Naiyan Wang; Xiaolong Wang |
|
215a | Learning Deep Implicit Functions for 3D Shapes With Dynamic Code Clouds |
Tianyang Li; Xin Wen; Yu-Shen Liu; Hua Su; Zhizhong Han |
|
216a | SPAMs: Structured Implicit Parametric Models |
Pablo Palafox; Nikolaos Sarafianos; Tony Tung; Angela Dai |
|
217a | Deblur-NeRF: Neural Radiance Fields From Blurry Images |
Li Ma; Xiaoyu Li; Jing Liao; Qi Zhang; Xuan Wang; Jue Wang; Pedro V. Sander |
|
218a | Panoptic Neural Fields: A Semantic Object-Aware Neural Scene Representation |
Abhijit Kundu; Kyle Genova; Xiaoqi Yin; Alireza Fathi; Caroline Pantofaru; Leonidas J. Guibas; Andrea Tagliasacchi; Frank Dellaert; Thomas Funkhouser |
|
219a | Depth-Supervised NeRF: Fewer Views and Faster Training for Free |
Kangle Deng; Andrew Liu; Jun-Yan Zhu; Deva Ramanan |
|
220a | Dense Depth Priors for Neural Radiance Fields From Sparse Input Views |
Barbara Roessle; Jonathan T. Barron; Ben Mildenhall; Pratul P. Srinivasan; Matthias Nießner |
|
221a | EfficientNeRF Efficient Neural Radiance Fields |
Tao Hu; Shu Liu; Yilun Chen; Tiancheng Shen; Jiaya Jia |
|
222a | InfoNeRF: Ray Entropy Minimization for Few-Shot Neural Volume Rendering | Mijeong Kim; Seonguk Seo; Bohyung Han | |
223a | Mega-NERF: Scalable Construction of Large-Scale NeRFs for Virtual Fly-Throughs |
Haithem Turki; Deva Ramanan; Mahadev Satyanarayanan |
|
224a | Urban Radiance Fields |
Konstantinos Rematas; Andrew Liu; Pratul P. Srinivasan; Jonathan T. Barron; Andrea Tagliasacchi; Thomas Funkhouser; Vittorio Ferrari |
|
225a | Hallucinated Neural Radiance Fields in the Wild |
Xingyu Chen; Qi Zhang; Xiaoyu Li; Yue Chen; Ying Feng; Xuan Wang; Jue Wang |
|
226a | Towards Multimodal Depth Estimation From Light Fields |
Titus Leistner; Radek Mackowiak; Lynton Ardizzone; Ullrich Köthe; Carsten Rother |
|
227a | Degradation-Agnostic Correspondence From Resolution-Asymmetric Stereo |
Xihao Chen; Zhiwei Xiong; Zhen Cheng; Jiayong Peng; Yueyi Zhang; Zheng-Jun Zha |
|
228a | Uniform Subdivision of Omnidirectional Camera Space for Efficient Spherical Stereo Matching |
Donghun Kang; Hyeonjoong Jang; Jungeon Lee; Chong-Min Kyung; Min H. Kim |
|
229a | Attention Concatenation Volume for Accurate and Efficient Stereo Matching |
Gangwei Xu; Junda Cheng; Peng Guo; Xin Yang |
|
230a | Generalized Binary Search Network for Highly-Efficient Multi-View Stereo | Zhenxing Mi; Chang Di; Dan Xu | |
231a | Revisiting Domain Generalized Stereo Matching Networks From a Feature Consistency Perspective |
Jiawei Zhang; Xiang Wang; Xiao Bai; Chen Wang; Lei Huang; Yimin Chen; Lin Gu; Jun Zhou; Tatsuya Harada; Edwin R. Hancock |
|
232a | GraftNet: Towards Domain Generalized Stereo Matching With a Broad-Spectrum and Task-Oriented Feature | Biyang Liu; Huimin Yu; Guodong Qi | |
233a | ITSA: An Information-Theoretic Approach to Automatic Shortcut Avoidance and Domain Generalization in Stereo Matching Networks |
WeiQin Chuah; Ruwan Tennakoon; Reza Hoseinnezhad; Alireza Bab-Hadiashar; David Suter |
|
234a | ActiveZero: Mixed Domain Learning for Active Stereovision With Zero Annotation |
Isabella Liu; Edward Yang; Jianyu Tao; Rui Chen; Xiaoshuai Zhang; Qing Ran; Zhu Liu; Hao Su |
|
235a | FoggyStereo: Stereo Matching With Fog Volume Representation | Chengtang Yao; Lidong Yu | |
Pose Estimation & Tracking | 236a | Multi-Person Extreme Motion Prediction |
Wen Guo; Xiaoyu Bie; Xavier Alameda-Pineda; Francesc Moreno-Noguer |
237a | Learning Local-Global Contextual Adaptation for Multi-Person Pose Estimation |
Nan Xue; Tianfu Wu; Gui-Song Xia; Liangpei Zhang |
|
238a | AdaptPose: Cross-Dataset Adaptation for 3D Human Pose Estimation by Learnable Motion Generation |
Mohsen Gholami; Bastian Wandt; Helge Rhodin; Rabab Ward; Z. Jane Wang |
|
239a | Single-Stage Is Enough: Multi-Person Absolute 3D Pose Estimation |
Lei Jin; Chenyang Xu; Xiaojuan Wang; Yabo Xiao; Yandong Guo; Xuecheng Nie; Jian Zhao |
|
240a | Distribution-Aware Single-Stage Models for Multi-Person 3D Pose Estimation |
Zitian Wang; Xuecheng Nie; Xiaochao Qu; Yunpeng Chen; Si Liu |
|
241a | Trajectory Optimization for Physics-Based Reconstruction of 3D Human Pose From Monocular Video |
Erik Gärtner; Mykhaylo Andriluka; Hongyi Xu; Cristian Sminchisescu |
|
242a | Ray3D: Ray-Based 3D Human Pose Estimation for Monocular Absolute 3D Localization |
Yu Zhan; Fenghai Li; Renliang Weng; Wongun Choi |
|
243a | Lite Pose: Efficient Architecture Design for 2D Human Pose Estimation |
Yihan Wang; Muyang Li; Han Cai; Wei-Ming Chen; Song Han |
|
244a | Location-Free Human Pose Estimation |
Xixia Xu; Yingguo Gao; Ke Yan; Xue Lin; Qi Zou |
|
245a | MHFormer: Multi-Hypothesis Transformer for 3D Human Pose Estimation |
Wenhao Li; Hong Liu; Hao Tang; Pichao Wang; Luc Van Gool |
|
246a | Estimating Egocentric 3D Human Pose in the Wild With External Weak Supervision |
Jian Wang; Lingjie Liu; Weipeng Xu; Kripasindhu Sarkar; Diogo Luvizon; Christian Theobalt |
|
247a | Physical Inertial Poser (PIP): Physics-Aware Real-Time Human Motion Tracking From Sparse Inertial Sensors |
Xinyu Yi; Yuxiao Zhou; Marc Habermann; Soshi Shimada; Vladislav Golyanik; Christian Theobalt; Feng Xu |
|
248a | PoseKernelLifter: Metric Lifting of 3D Human Pose Using Sound |
Zhijian Yang; Xiaoran Fan; Volkan Isler; Hyun Soo Park |
|
249a | Differentiable Dynamics for Articulated 3D Human Motion Reconstruction |
Erik Gärtner; Mykhaylo Andriluka; Erwin Coumans; Cristian Sminchisescu |
|
250a | COAP: Compositional Articulated Occupancy of People |
Marko Mihajlovic; Shunsuke Saito; Aayush Bansal; Michael Zollhöfer; Siyu Tang |
|
251a | Capturing Humans in Motion: Temporal-Attentive 3D Human Pose and Shape Estimation From Monocular Video |
Wen-Li Wei; Jen-Chun Lin; Tyng-Luh Liu; Hong-Yuan Mark Liao |
|
252a | SC2-PCR: A Second Order Spatial Compatibility for Efficient and Robust Point Cloud Registration | Zhi Chen; Kun Sun; Fan Yang; Wenbing Tao | |
253a | MixSTE: Seq2seq Mixed Spatio-Temporal Encoder for 3D Human Pose Estimation in Video |
Jinlu Zhang; Zhigang Tu; Jianyu Yang; Yujin Chen; Junsong Yuan |
|
254a | Putting People in Their Place: Monocular Regression of 3D People in Depth |
Yu Sun; Wu Liu; Qian Bao; Yili Fu; Tao Mei; Michael J. Black |
|
255a | FLAG: Flow-Based 3D Avatar Generation From Sparse Observations |
Sadegh Aliakbarian; Pashmina Cameron; Federica Bogo; Andrew Fitzgibbon; Thomas J. Cashman |
|
256a | GOAL: Generating 4D Whole-Body Motion for Hand-Object Grasping |
Omid Taheri; Vasileios Choutas; Michael J. Black; Dimitrios Tzionas |
|
257a | Capturing and Inferring Dense Full-Body Human-Scene Contact |
Chun-Hao P. Huang; Hongwei Yi; Markus Höschle; Matvey Safroshkin; Tsvetelina Alexiadis; Senya Polikovsky; Daniel Scharstein; Michael J. Black |
|
258a | BodyMap: Learning Full-Body Dense Correspondence Map |
Anastasia Ianina; Nikolaos Sarafianos; Yuanlu Xu; Ignacio Rocco; Tony Tung |
|
259a | ICON: Implicit Clothed Humans Obtained From Normals |
Yuliang Xiu; Jinlong Yang; Dimitrios Tzionas; Michael J. Black |