ECCV2022 Paper List

Paper Authors Abstract Code Citations
Learning Mutual Modulation for Self-supervised Cross-Modal Super-Resolution Xiaoyu Dong, Naoto Yokoya, Longguang Wang, Tatsumi Uezato code -1
Spectrum-Aware and Transferable Architecture Search for Hyperspectral Image Restoration Wei He, Quanming Yao, Naoto Yokoya, Tatsumi Uezato, Hongyan Zhang, Liangpei Zhang code -1
Neural Color Operators for Sequential Image Retouching Yili Wang, Xin Li, Kun Xu, Dongliang He, Qi Zhang, Fu Li, Errui Ding code -1
Optimizing Image Compression via Joint Learning with Denoising Ka Leong Cheng, Yueqi Xie, Qifeng Chen code -1
Restore Globally, Refine Locally: A Mask-Guided Scheme to Accelerate Super-Resolution Networks Xiaotao Hu, Jun Xu, Shuhang Gu, Ming-Ming Cheng, Li Liu code -1
Compiler-Aware Neural Architecture Search for On-Mobile Real-time Super-Resolution Yushu Wu, Yifan Gong, Pu Zhao, Yanyu Li, Zheng Zhan, Wei Niu, Hao Tang, Minghai Qin, Bin Ren, Yanzhi Wang code -1
Modeling Mask Uncertainty in Hyperspectral Image Reconstruction Jiamian Wang, Yulun Zhang, Xin Yuan, Ziyi Meng, Zhiqiang Tao code -1
Perceiving and Modeling Density for Image Dehazing Tian Ye, Yunchen Zhang, Mingchao Jiang, Liang Chen, Yun Liu, Sixiang Chen, Erkang Chen code -1
Stripformer: Strip Transformer for Fast Image Deblurring Fu-Jen Tsai, Yan-Tsung Peng, Yen-Yu Lin, Chung-Chi Tsai, Chia-Wen Lin code -1
Deep Fourier-Based Exposure Correction Network with Spatial-Frequency Interaction Jie Huang, Yajing Liu, Feng Zhao, Keyu Yan, Jinghao Zhang, Yukun Huang, Man Zhou, Zhiwei Xiong code -1
Frequency and Spatial Dual Guidance for Image Dehazing Hu Yu, Naishan Zheng, Man Zhou, Jie Huang, Zeyu Xiao, Feng Zhao code -1
Towards Real-World HDRTV Reconstruction: A Data Synthesis-Based Approach Zhen Cheng, Tao Wang, Yong Li, Fenglong Song, Chang Chen, Zhiwei Xiong code -1
Learning Discriminative Shrinkage Deep Networks for Image Deconvolution Pin-Hung Kuo, Jinshan Pan, Shao-Yi Chien, Ming-Hsuan Yang code -1
KXNet: A Model-Driven Deep Neural Network for Blind Super-Resolution Jiahong Fu, Hong Wang, Qi Xie, Qian Zhao, Deyu Meng, Zongben Xu code -1
ARM: Any-Time Super-Resolution Method Bohong Chen, Mingbao Lin, Kekai Sheng, Mengdan Zhang, Peixian Chen, Ke Li, Liujuan Cao, Rongrong Ji code -1
Attention-Aware Learning for Hyperparameter Prediction in Image Processing Pipelines Haina Qin, Longfei Han, Juan Wang, Congxuan Zhang, Yanwei Li, Bing Li, Weiming Hu code -1
RealFlow: EM-Based Realistic Optical Flow Dataset Generation from Videos Yunhui Han, Kunming Luo, Ao Luo, Jiangyu Liu, Haoqiang Fan, Guiming Luo, Shuaicheng Liu code -1
Memory-Augmented Model-Driven Network for Pansharpening Keyu Yan, Man Zhou, Li Zhang, Chengjun Xie code -1
All You Need Is RAW: Defending Against Adversarial Attacks with Camera Image Pipelines Yuxuan Zhang, Bo Dong, Felix Heide code -1
Ghost-free High Dynamic Range Imaging with Context-Aware Transformer Zhen Liu, Yinglong Wang, Bing Zeng, Shuaicheng Liu code -1
Style-Guided Shadow Removal Jin Wan, Hui Yin, Zhenyao Wu, Xinyi Wu, Yanting Liu, Song Wang code -1
D2C-SR: A Divergence to Convergence Approach for Real-World Image Super-Resolution Youwei Li, Haibin Huang, Lanpeng Jia, Haoqiang Fan, Shuaicheng Liu code -1
GRIT-VLP: Grouped Mini-batch Sampling for Efficient Vision and Language Pre-training Jaeseok Byun, Taebaek Hwang, Jianlong Fu, Taesup Moon code -1
Efficient Video Deblurring Guided by Motion Magnitude Yusheng Wang, Yunfan Lu, Ye Gao, Lin Wang, Zhihang Zhong, Yinqiang Zheng, Atsushi Yamashita code -1
Single Frame Atmospheric Turbulence Mitigation: A Benchmark Study and a New Physics-Inspired Transformer Model Zhiyuan Mao, Ajay Jaiswal, Zhangyang Wang, Stanley H. Chan code -1
Contextformer: A Transformer with Spatio-Channel Attention for Context Modeling in Learned Image Compression Ahmet Burakhan Koyuncu, Han Gao, Atanas Boev, Georgii Gaikov, Elena Alshina, Eckehard G. Steinbach code -1
Image Super-Resolution with Deep Dictionary Shunta Maeda code -1
TempFormer: Temporally Consistent Transformer for Video Denoising Mingyang Song, Yang Zhang, Tunç Ozan Aydin code -1
RAWtoBit: A Fully End-to-end Camera ISP Network Wooseok Jeong, Seung-Won Jung code -1
DRCNet: Dynamic Image Restoration Contrastive Network Fei Li, Lingfeng Shen, Yang Mi, Zhenbo Li code -1
Zero-Shot Learning for Reflection Removal of Single 360-Degree Image Byeong-Ju Han, Jae-Young Sim code -1
Transformer with Implicit Edges for Particle-Based Physics Simulation Yidi Shao, Chen Change Loy, Bo Dai code -1
Rethinking Video Rain Streak Removal: A New Synthesis Model and a Deraining Network with Video Rain Prior Shuai Wang, Lei Zhu, Huazhu Fu, Jing Qin, Carola-Bibiane Schönlieb, Wei Feng, Song Wang code -1
Super-Resolution by Predicting Offsets: An Ultra-Efficient Super-Resolution Network for Rasterized Images Jinjin Gu, Haoming Cai, Chenyu Dong, Ruofan Zhang, Yulun Zhang, Wenming Yang, Chun Yuan code -1
Animation from Blur: Multi-modal Blur Decomposition with Motion Guidance Zhihang Zhong, Xiao Sun, Zhirong Wu, Yinqiang Zheng, Stephen Lin, Imari Sato code -1
AlphaVC: High-Performance and Efficient Learned Video Compression Yibo Shi, Yunying Ge, Jing Wang, Jue Mao code -1
Content-Oriented Learned Image Compression Meng Li, Shangyin Gao, Yihui Feng, Yibo Shi, Jing Wang code -1
RRSR: Reciprocal Reference-Based Image Super-Resolution with Progressive Feature Alignment and Selection Lin Zhang, Xin Li, Dongliang He, Fu Li, Yili Wang, Zhaoxiang Zhang code -1
Contrastive Prototypical Network with Wasserstein Confidence Penalty Haoqing Wang, Zhi-Hong Deng code -1
Learn-to-Decompose: Cascaded Decomposition Network for Cross-Domain Few-Shot Facial Expression Recognition Xinyi Zou, Yan Yan, Jing-Hao Xue, Si Chen, Hanzi Wang code -1
Self-support Few-Shot Semantic Segmentation Qi Fan, Wenjie Pei, Yu-Wing Tai, Chi-Keung Tang code -1
Few-Shot Object Detection with Model Calibration Qi Fan, Chi-Keung Tang, Yu-Wing Tai code -1
Self-Supervision Can Be a Good Few-Shot Learner Yuning Lu, Liangjian Wen, Jianzhuang Liu, Yajing Liu, Xinmei Tian code -1
BEVFormer: Learning Bird's-Eye-View Representation from Multi-camera Images via Spatiotemporal Transformers Zhiqi Li, Wenhai Wang, Hongyang Li, Enze Xie, Chonghao Sima, Tong Lu, Yu Qiao, Jifeng Dai code -1
Category-Level 6D Object Pose and Size Estimation Using Self-supervised Deep Prior Deformation Networks Jiehong Lin, Zewei Wei, Changxing Ding, Kui Jia code -1
Dense Teacher: Dense Pseudo-Labels for Semi-supervised Object Detection Hongyu Zhou, Zheng Ge, Songtao Liu, Weixin Mao, Zeming Li, Haiyan Yu, Jian Sun code -1
Point-to-Box Network for Accurate Object Detection via Single Point Supervision Pengfei Chen, Xuehui Yu, Xumeng Han, Najmul Hassan, Kai Wang, Jiachen Li, Jian Zhao, Humphrey Shi, Zhenjun Han, Qixiang Ye code -1
Domain Adaptive Hand Keypoint and Pixel Localization in the Wild Takehiko Ohkawa, Yu-Jhe Li, Qichen Fu, Ryosuke Furuta, Kris M. Kitani, Yoichi Sato code -1
Towards Data-Efficient Detection Transformers Wen Wang, Jing Zhang, Yang Cao, Yongliang Shen, Dacheng Tao code -1
Open-Vocabulary DETR with Conditional Matching Yuhang Zang, Wei Li, Kaiyang Zhou, Chen Huang, Chen Change Loy code -1
Prediction-Guided Distillation for Dense Object Detection Chenhongyi Yang, Mateusz Ochal, Amos J. Storkey, Elliot J. Crowley code -1
Multimodal Object Detection via Probabilistic Ensembling Yi-Ting Chen, Jinghao Shi, Zelin Ye, Christoph Mertz, Deva Ramanan, Shu Kong code -1
Exploiting Unlabeled Data with Vision and Language Models for Object Detection Shiyu Zhao, Zhixing Zhang, Samuel Schulter, Long Zhao, B. G. Vijay Kumar, Anastasis Stathopoulos, Manmohan Chandraker, Dimitris N. Metaxas code -1
CPO: Change Robust Panorama to Point Cloud Localization Junho Kim, Hojun Jang, Changwoon Choi, Young Min Kim code -1
INT: Towards Infinite-Frames 3D Detection with an Efficient Framework Jianyun Xu, Zhenwei Miao, Da Zhang, Hongyu Pan, Kaixuan Liu, Peihan Hao, Jun Zhu, Zhengyang Sun, Hongmin Li, Xin Zhan code -1
End-to-End Weakly Supervised Object Detection with Sparse Proposal Evolution Mingxiang Liao, Fang Wan, Yuan Yao, Zhenjun Han, Jialing Zou, Yuze Wang, Bailan Feng, Peng Yuan, Qixiang Ye code -1
Calibration-Free Multi-view Crowd Counting Qi Zhang, Antoni B. Chan code -1
Unsupervised Domain Adaptation for Monocular 3D Object Detection via Self-training Zhenyu Li, Zehui Chen, Ang Li, Liangji Fang, Qinhong Jiang, Xianming Liu, Junjun Jiang code -1
SuperLine3D: Self-supervised Line Segmentation and Description for LiDAR Point Cloud Xiangrui Zhao, Sheng Yang, Tianxin Huang, Jun Chen, Teng Ma, Mingyang Li, Yong Liu code -1
Exploring Plain Vision Transformer Backbones for Object Detection Yanghao Li, Hanzi Mao, Ross B. Girshick, Kaiming He code -1
Adversarially-Aware Robust Object Detector Ziyi Dong, Pengxu Wei, Liang Lin code -1
HEAD: HEtero-Assists Distillation for Heterogeneous Object Detectors Luting Wang, Xiaojie Li, Yue Liao, Zeren Jiang, Jianlong Wu, Fei Wang, Chen Qian, Si Liu code -1
You Should Look at All Objects Zhenchao Jin, Dongdong Yu, Luchuan Song, Zehuan Yuan, Lequan Yu code -1
Detecting Twenty-Thousand Classes Using Image-Level Supervision Xingyi Zhou, Rohit Girdhar, Armand Joulin, Philipp Krähenbühl, Ishan Misra code -1
DCL-Net: Deep Correspondence Learning Network for 6D Pose Estimation Hongyang Li, Jiehong Lin, Kui Jia code -1
Monocular 3D Object Detection with Depth from Motion Tai Wang, Jiangmiao Pang, Dahua Lin code -1
DISP6D: Disentangled Implicit Shape and Pose Learning for Scalable 6D Pose Estimation Yilin Wen, Xiangyu Li, Hao Pan, Lei Yang, Zheng Wang, Taku Komura, Wenping Wang code -1
Distilling Object Detectors with Global Knowledge Sanli Tang, Zhongyu Zhang, Zhanzhan Cheng, Jing Lu, Yunlu Xu, Yi Niu, Fan He code -1
Unifying Visual Perception by Dispersible Points Learning Jianming Liang, Guanglu Song, Biao Leng, Yu Liu code -1
PseCo: Pseudo Labeling and Consistency Training for Semi-Supervised Object Detection Gang Li, Xiang Li, Yujie Wang, Yichao Wu, Ding Liang, Shanshan Zhang code -1
Exploring Resolution and Degradation Clues as Self-supervised Signal for Low Quality Object Detection Ziteng Cui, Yingying Zhu, Lin Gu, Guo-Jun Qi, Xiaoxiao Li, Renrui Zhang, Zenghui Zhang, Tatsuya Harada code -1
Robust Category-Level 6D Pose Estimation with Coarse-to-Fine Rendering of Neural Features Wufei Ma, Angtian Wang, Alan L. Yuille, Adam Kortylewski code -1
Translation, Scale and Rotation: Cross-Modal Alignment Meets RGB-Infrared Vehicle Detection Maoxun Yuan, Yinyan Wang, Xingxing Wei code -1
RFLA: Gaussian Receptive Field Based Label Assignment for Tiny Object Detection Chang Xu, Jinwang Wang, Wen Yang, Huai Yu, Lei Yu, Gui-Song Xia code -1
Rethinking IoU-based Optimization for Single-stage 3D Object Detection Hualian Sheng, Sijia Cai, Na Zhao, Bing Deng, Jianqiang Huang, Xian-Sheng Hua, Minjian Zhao, Gim Hee Lee code -1
TD-Road: Top-Down Road Network Extraction with Holistic Graph Construction Yang He, Ravi Garg, Amber Roy Chowdhury code -1
Multi-faceted Distillation of Base-Novel Commonality for Few-Shot Object Detection Shuang Wu, Wenjie Pei, Dianwen Mei, Fanglin Chen, Jiandong Tian, Guangming Lu code -1
PointCLM: A Contrastive Learning-based Framework for Multi-instance Point Cloud Registration Mingzhi Yuan, Zhihao Li, Qiuye Jin, Xinrong Chen, Manning Wang code -1
Weakly Supervised Object Localization via Transformer with Implicit Spatial Calibration Haotian Bai, Ruimao Zhang, Jiong Wang, Xiang Wan code -1
MTTrans: Cross-domain Object Detection with Mean Teacher Transformer Jinze Yu, Jiaming Liu, Xiaobao Wei, Haoyi Zhou, Yohei Nakata, Denis A. Gudovskiy, Tomoyuki Okuno, Jianxin Li, Kurt Keutzer, Shanghang Zhang code -1
Multi-domain Multi-definition Landmark Localization for Small Datasets David Ferman, Gaurav Bharaj code -1
DEVIANT: Depth EquiVarIAnt NeTwork for Monocular 3D Object Detection Abhinav Kumar, Garrick Brazil, Enrique Corona, Armin Parchami, Xiaoming Liu code -1
Label-Guided Auxiliary Training Improves 3D Object Detector Yaomin Huang, Xinmei Liu, Yichen Zhu, Zhiyuan Xu, Chaomin Shen, Zhengping Che, Guixu Zhang, Yaxin Peng, Feifei Feng, Jian Tang code -1
PromptDet: Towards Open-Vocabulary Detection Using Uncurated Images Chengjian Feng, Yujie Zhong, Zequn Jie, Xiangxiang Chu, Haibing Ren, Xiaolin Wei, Weidi Xie, Lin Ma code -1
Densely Constrained Depth Estimator for Monocular 3D Object Detection Yingyan Li, Yuntao Chen, Jiawei He, Zhaoxiang Zhang code -1
Polarimetric Pose Prediction Daoyi Gao, Yitong Li, Patrick Ruhkamp, Iuliia Skobleva, Magdalena Wysocki, Hyun-Jun Jung, Pengyuan Wang, Arturo Guridi, Benjamin Busam code -1
TOCH: Spatio-Temporal Object-to-Hand Correspondence for Motion Refinement Keyang Zhou, Bharat Lal Bhatnagar, Jan Eric Lenssen, Gerard Pons-Moll code -1
LaTeRF: Label and Text Driven Object Radiance Fields Ashkan Mirzaei, Yash Kant, Jonathan Kelly, Igor Gilitschenski code -1
MeshMAE: Masked Autoencoders for 3D Mesh Data Analysis Yaqian Liang, Shanshan Zhao, Baosheng Yu, Jing Zhang, Fazhi He code -1
Unsupervised Deep Multi-shape Matching Dongliang Cao, Florian Bernard code -1
Texturify: Generating Textures on 3D Shape Surfaces Yawar Siddiqui, Justus Thies, Fangchang Ma, Qi Shan, Matthias Nießner, Angela Dai code -1
Autoregressive 3D Shape Generation via Canonical Mapping An-Chieh Cheng, Xueting Li, Sifei Liu, Min Sun, Ming-Hsuan Yang code -1
PointTree: Transformation-Robust Point Cloud Encoder with Relaxed K-D Trees Jun-Kun Chen, Yu-Xiong Wang code -1
UNIF: United Neural Implicit Functions for Clothed Human Reconstruction and Animation Shenhan Qian, Jiale Xu, Ziwei Liu, Liqian Ma, Shenghua Gao code -1
PRIF: Primary Ray-Based Implicit Function Brandon Yushan Feng, Yinda Zhang, Danhang Tang, Ruofei Du, Amitabh Varshney code -1
Point Cloud Domain Adaptation via Masked Local 3D Structure Prediction Hanxue Liang, Hehe Fan, Zhiwen Fan, Yi Wang, Tianlong Chen, Yu Cheng, Zhangyang Wang code -1
CLIP-Actor: Text-Driven Recommendation and Stylization for Animating Human Meshes Kim Youwang, Ji-Yeon Kim, Tae-Hyun Oh code -1
PlaneFormers: From Sparse View Planes to 3D Reconstruction Samir Agarwala, Linyi Jin, Chris Rockwell, David F. Fouhey code -1
Learning Implicit Templates for Point-Based Clothed Human Modeling Siyou Lin, Hongwen Zhang, Zerong Zheng, Ruizhi Shao, Yebin Liu code -1
Exploring the Devil in Graph Spectral Domain for 3D Point Cloud Attacks Qianjiang Hu, Daizong Liu, Wei Hu code -1
Structure-Aware Editable Morphable Model for 3D Facial Detail Animation and Manipulation Jingwang Ling, Zhibo Wang, Ming Lu, Quan Wang, Chen Qian, Feng Xu code -1
MoFaNeRF: Morphable Facial Neural Radiance Field Yiyu Zhuang, Hao Zhu, Xusen Sun, Xun Cao code -1
PointInst3D: Segmenting 3D Instances by Points Tong He, Wei Yin, Chunhua Shen, Anton van den Hengel code -1
Cross-modal 3D Shape Generation and Manipulation Zezhou Cheng, Menglei Chai, Jian Ren, Hsin-Ying Lee, Kyle Olszewski, Zeng Huang, Subhransu Maji, Sergey Tulyakov code -1
Latent Partition Implicit with Surface Codes for 3D Representation Chao Chen, Yu-Shen Liu, Zhizhong Han code -1
Implicit Field Supervision for Robust Non-rigid Shape Matching Ramana Sundararaman, Gautam Pai, Maks Ovsjanikov code -1
Learning Self-prior for Mesh Denoising Using Dual Graph Convolutional Networks Shota Hattori, Tatsuya Yatagawa, Yutaka Ohtake, Hiromasa Suzuki code -1
DiffConv: Analyzing Irregular Point Clouds with an Irregular View Manxi Lin, Aasa Feragen code -1
PD-Flow: A Point Cloud Denoising Framework with Normalizing Flows Aihua Mao, Zihui Du, Yu-Hui Wen, Jun Xuan, Yong-Jin Liu code -1
SeedFormer: Patch Seeds Based Point Cloud Completion with Upsample Transformer Haoran Zhou, Yun Cao, Wenqing Chu, Junwei Zhu, Tong Lu, Ying Tai, Chengjie Wang code -1
DeepMend: Learning Occupancy Functions to Represent Shape for Repair Nikolas Lamb, Sean Banerjee, Natasha Kholgade Banerjee code -1
A Repulsive Force Unit for Garment Collision Handling in Neural Networks Qingyang Tan, Yi Zhou, Tuanfeng Y. Wang, Duygu Ceylan, Xin Sun, Dinesh Manocha code -1
Shape-Pose Disentanglement Using SE(3)-Equivariant Vector Neurons Oren Katzir, Dani Lischinski, Daniel Cohen-Or code -1
3D Equivariant Graph Implicit Functions Yunlu Chen, Basura Fernando, Hakan Bilen, Matthias Nießner, Efstratios Gavves code -1
PatchRD: Detail-Preserving Shape Completion by Learning Patch Retrieval and Deformation Bo Sun, Vladimir G. Kim, Noam Aigerman, Qixing Huang, Siddhartha Chaudhuri code -1
3D Shape Sequence of Human Comparison and Classification Using Current and Varifolds Emery Pierson, Mohamed Daoudi, Sylvain Arguillère code -1
Conditional-Flow NeRF: Accurate 3D Modelling with Reliable Uncertainty Quantification Jianxiong Shen, Antonio Agudo, Francesc Moreno-Noguer, Adria Ruiz code -1
Unsupervised Pose-aware Part Decomposition for Man-Made Articulated Objects Yuki Kawana, Yusuke Mukuta, Tatsuya Harada code -1
MeshUDF: Fast and Differentiable Meshing of Unsigned Distance Field Networks Benoît Guillard, Federico Stella, Pascal Fua code -1
SPE-Net: Boosting Point Cloud Analysis via Rotation Robustness Enhancement Zhaofan Qiu, Yehao Li, Yu Wang, Yingwei Pan, Ting Yao, Tao Mei code -1
The Shape Part Slot Machine: Contact-Based Reasoning for Generating 3D Shapes from Parts Kai Wang, Paul Guerrero, Vladimir G. Kim, Siddhartha Chaudhuri, Minhyuk Sung, Daniel Ritchie code -1
Spatiotemporal Self-attention Modeling with Temporal Patch Shift for Action Recognition Wangmeng Xiang, Chao Li, Biao Wang, Xihan Wei, Xian-Sheng Hua, Lei Zhang code -1
Proposal-Free Temporal Action Detection via Global Segmentation Mask Learning Sauradip Nag, Xiatian Zhu, Yi-Zhe Song, Tao Xiang code -1
Semi-supervised Temporal Action Detection with Proposal-Free Masking Sauradip Nag, Xiatian Zhu, Yi-Zhe Song, Tao Xiang code -1
Zero-Shot Temporal Action Detection via Vision-Language Prompting Sauradip Nag, Xiatian Zhu, Yi-Zhe Song, Tao Xiang code -1
CycDA: Unsupervised Cycle Domain Adaptation to Learn from Image to Video Wei Lin, Anna Kukleva, Kunyang Sun, Horst Possegger, Hilde Kuehne, Horst Bischof code -1
S2N: Suppression-Strengthen Network for Event-Based Recognition Under Variant Illuminations Zengyu Wan, Yang Wang, Ganchao Tan, Yang Cao, Zheng-Jun Zha code -1
CMD: Self-supervised 3D Action Representation Learning with Cross-Modal Mutual Distillation Yunyao Mao, Wengang Zhou, Zhenbo Lu, Jiajun Deng, Houqiang Li code -1
CT2: Colorization Transformer via Color Tokens Shuchen Weng, Jimeng Sun, Yu Li, Si Li, Boxin Shi code -1
Simple Baselines for Image Restoration Liangyu Chen, Xiaojie Chu, Xiangyu Zhang, Jian Sun code -1
Spike Transformer: Monocular Depth Estimation for Spiking Camera Jiyuan Zhang, Lulu Tang, Zhaofei Yu, Jiwen Lu, Tie-Jun Huang code -1
Improving Image Restoration by Revisiting Global Information Aggregation Xiaojie Chu, Liangyu Chen, Chengpeng Chen, Xin Lu code -1
Data Association Between Event Streams and Intensity Frames Under Diverse Baselines Dehao Zhang, Qiankun Ding, Peiqi Duan, Chu Zhou, Boxin Shi code -1
D2HNet: Joint Denoising and Deblurring with Hierarchical Network for Robust Night Image Restoration Yuzhi Zhao, Yongzhe Xu, Qiong Yan, Dingdong Yang, Xuehui Wang, Lai-Man Po code -1
Learning Graph Neural Networks for Image Style Transfer Yongcheng Jing, Yining Mao, Yiding Yang, Yibing Zhan, Mingli Song, Xinchao Wang, Dacheng Tao code -1
DeepPS2: Revisiting Photometric Stereo Using Two Differently Illuminated Images Ashish Tiwari, Shanmuganathan Raman code -1
Instance Contour Adjustment via Structure-Driven CNN Shuchen Weng, Yi Wei, Ming-Ching Chang, Boxin Shi code -1
Synthesizing Light Field Video from Monocular Video Shrisudhan Govindarajan, Prasan A. Shedligeri, Sarah, Kaushik Mitra code -1
Human-Centric Image Cropping with Partition-Aware and Content-Preserving Features Bo Zhang, Li Niu, Xing Zhao, Liqing Zhang code -1
DeMFI: Deep Joint Deblurring and Multi-frame Interpolation with Flow-Guided Attentive Correlation and Recursive Boosting Jihyong Oh, Munchurl Kim code -1
Neural Image Representations for Multi-image Fusion and Layer Separation Seonghyeon Nam, Marcus A. Brubaker, Michael S. Brown code -1
Bringing Rolling Shutter Images Alive with Dual Reversed Distortion Zhihang Zhong, Mingdeng Cao, Xiao Sun, Zhirong Wu, Zhongyi Zhou, Yinqiang Zheng, Stephen Lin, Imari Sato code -1
FILM: Frame Interpolation for Large Motion Fitsum A. Reda, Janne Kontkanen, Eric Tabellion, Deqing Sun, Caroline Pantofaru, Brian Curless code -1
Video Interpolation by Event-Driven Anisotropic Adjustment of Optical Flow Song Wu, Kaichao You, Weihua He, Chen Yang, Yang Tian, Yaoyuan Wang, Ziyang Zhang, Jianxing Liao code -1
EvAC3D: From Event-Based Apparent Contours to 3D Models via Continuous Visual Hulls Ziyun Wang, Kenneth Chaney, Kostas Daniilidis code -1
DCCF: Deep Comprehensible Color Filter Learning Framework for High-Resolution Image Harmonization Ben Xue, Shenghui Ran, Quan Chen, Rongfei Jia, Binqiang Zhao, Xing Tang code -1
SelectionConv: Convolutional Neural Networks for Non-rectilinear Image Data David Hart, Michael Whitney, Bryan S. Morse code -1
Spatial-Separated Curve Rendering Network for Efficient and High-Resolution Image Harmonization Jingtang Liang, Xiaodong Cun, Chi-Man Pun, Jue Wang code -1
BigColor: Colorization Using a Generative Color Prior for Natural Images Geonung Kim, Kyoungkook Kang, Seongtae Kim, Hwayoon Lee, Sehoon Kim, Jonghyun Kim, Seung-Hwan Baek, Sunghyun Cho code -1
CADyQ: Content-Aware Dynamic Quantization for Image Super-Resolution Cheeun Hong, Sungyong Baik, Heewon Kim, Seungjun Nah, Kyoung Mu Lee code -1
Deep Semantic Statistics Matching (D2SM) Denoising Network Kangfu Mei, Vishal M. Patel, Rui Huang code -1
3D Scene Inference from Transient Histograms Sacha Jungerman, Atul Ingle, Yin Li, Mohit Gupta code -1
Neural Space-Filling Curves Hanyu Wang, Kamal Gupta, Larry Davis, Abhinav Shrivastava code -1
Exposure-Aware Dynamic Weighted Learning for Single-Shot HDR Imaging Vien Gia An, Chul Lee code -1
Seeing Through a Black Box: Toward High-Quality Terahertz Imaging via Subspace-and-Attention Guided Restoration Wen-Tai Su, Yi-Chun Hung, Po-Jen Yu, Shang-Hua Yang, Chia-Wen Lin code -1
Tomography of Turbulence Strength Based on Scintillation Imaging Nir Shaul, Yoav Y. Schechner code -1
Realistic Blur Synthesis for Learning Image Deblurring Jaesung Rim, Geonung Kim, Jungeon Kim, Junyong Lee, Seungyong Lee, Sunghyun Cho code -1
Learning Phase Mask for Privacy-Preserving Passive Depth Estimation Zaid Tasneem, Giovanni Milione, Yi-Hsuan Tsai, Xiang Yu, Ashok Veeraraghavan, Manmohan Chandraker, Francesco Pittaluga code -1
LWGNet - Learned Wirtinger Gradients for Fourier Ptychographic Phase Retrieval Atreyee Saha, Salman Siddique Khan, Sagar Sehrawat, Sanjana S. Prabhu, Shanti Bhattacharya, Kaushik Mitra code -1
PANDORA: Polarization-Aided Neural Decomposition of Radiance Akshat Dave, Yongyi Zhao, Ashok Veeraraghavan code -1
HuMMan: Multi-modal 4D Human Dataset for Versatile Sensing and Modeling Zhongang Cai, Daxuan Ren, Ailing Zeng, Zhengyu Lin, Tao Yu, Wenjia Wang, Xiangyu Fan, Yang Gao, Yifan Yu, Liang Pan, Fangzhou Hong, Mingyuan Zhang, Chen Change Loy, Lei Yang, Ziwei Liu code -1
DVS-Voltmeter: Stochastic Process-Based Event Simulator for Dynamic Vision Sensors Songnan Lin, Ye Ma, Zhenhua Guo, Bihan Wen code -1
Benchmarking Omni-Vision Representation Through the Lens of Visual Realms Yuanhan Zhang, Zhenfei Yin, Jing Shao, Ziwei Liu code -1
BEAT: A Large-Scale Semantic and Emotional Multi-modal Dataset for Conversational Gestures Synthesis Haiyang Liu, Zihao Zhu, Naoya Iwamoto, Yichen Peng, Zhengqing Li, You Zhou, Elif Bozkurt, Bo Zheng code -1
Neuromorphic Data Augmentation for Training Spiking Neural Networks Yuhang Li, Youngeun Kim, Hyoungseob Park, Tamar Geller, Priyadarshini Panda code -1
CelebV-HQ: A Large-Scale Video Facial Attributes Dataset Hao Zhu, Wayne Wu, Wentao Zhu, Liming Jiang, Siwei Tang, Li Zhang, Ziwei Liu, Chen Change Loy code -1
MovieCuts: A New Dataset and Benchmark for Cut Type Recognition Alejandro Pardo, Fabian Caba Heilbron, Juan León Alcázar, Ali K. Thabet, Bernard Ghanem code -1
LaMAR: Benchmarking Localization and Mapping for Augmented Reality Paul-Edouard Sarlin, Mihai Dusmanu, Johannes L. Schönberger, Pablo Speciale, Lukas Gruber, Viktor Larsson, Ondrej Miksik, Marc Pollefeys code -1
Unitail: Detecting, Reading, and Matching in Retail Scene Fangyi Chen, Han Zhang, Zaiwang Li, Jiachen Dou, Shentong Mo, Hao Chen, Yongxin Zhang, Uzair Ahmed, Chenchen Zhu, Marios Savvides code -1
Not Just Streaks: Towards Ground Truth for Single Image Deraining Yunhao Ba, Howard Zhang, Ethan Yang, Akira Suzuki, Arnold Pfahnl, Chethan Chinder Chandrappa, Celso M. de Melo, Suya You, Stefano Soatto, Alex Wong, Achuta Kadambi code -1
MHR-Net: Multiple-Hypothesis Reconstruction of Non-Rigid Shapes from 2D Views Haitian Zeng, Xin Yu, Jiaxu Miao, Yi Yang code -1
Depth Map Decomposition for Monocular Depth Estimation Jinyoung Jun, Jaehan Lee, Chul Lee, Chang-Su Kim code -1
Monitored Distillation for Positive Congruent Depth Completion Tian Yu Liu, Parth Agrawal, Allison Chen, Byung-Woo Hong, Alex Wong code -1
Resolution-Free Point Cloud Sampling Network with Data Distillation Tianxin Huang, Jiangning Zhang, Jun Chen, Yuang Liu, Yong Liu code -1
Organic Priors in Non-rigid Structure from Motion Suryansh Kumar, Luc Van Gool code -1
Perspective Flow Aggregation for Data-Limited 6D Object Pose Estimation Yinlin Hu, Pascal Fua, Mathieu Salzmann code -1
DANBO: Disentangled Articulated Neural Body Representations via Graph Neural Networks Shih-Yang Su, Timur M. Bagautdinov, Helge Rhodin code -1
CHORE: Contact, Human and Object Reconstruction from a Single RGB Image Xianghui Xie, Bharat Lal Bhatnagar, Gerard Pons-Moll code -1
Learned Vertex Descent: A New Direction for 3D Human Model Fitting Enric Corona, Gerard Pons-Moll, Guillem Alenyà, Francesc Moreno-Noguer code -1
Self-calibrating Photometric Stereo by Neural Inverse Rendering Junxuan Li, Hongdong Li code -1
3D Clothed Human Reconstruction in the Wild Gyeongsik Moon, Hyeongjin Nam, Takaaki Shiratori, Kyoung Mu Lee code -1
Directed Ray Distance Functions for 3D Scene Reconstruction Nilesh Kulkarni, Justin Johnson, David F. Fouhey code -1
Object Level Depth Reconstruction for Category Level 6D Object Pose Estimation from Monocular RGB Image Zhaoxin Fan, Zhenbo Song, Jian Xu, Zhicheng Wang, Kejian Wu, Hongyan Liu, Jun He code -1
Uncertainty Quantification in Depth Estimation via Constrained Ordinal Regression Dongting Hu, Liuhua Peng, Tingjin Chu, Xiaoxing Zhang, Yinian Mao, Howard D. Bondell, Mingming Gong code -1
CostDCNet: Cost Volume Based Depth Completion for a Single RGB-D Image Jaewon Kam, Jungeon Kim, Soongjin Kim, Jaesik Park, Seungyong Lee code -1
ShAPO: Implicit Representations for Multi-object Shape, Appearance, and Pose Optimization Muhammad Zubair Irshad, Sergey Zakharov, Rares Ambrus, Thomas Kollar, Zsolt Kira, Adrien Gaidon code -1
3D Siamese Transformer Network for Single Object Tracking on Point Clouds Le Hui, Lingpeng Wang, Linghua Tang, Kaihao Lan, Jin Xie, Jian Yang code -1
Object Wake-Up: 3D Object Rigging from a Single Image Ji Yang, Xinxin Zuo, Sen Wang, Zhenbo Yu, Xingyu Li, Bingbing Ni, Minglun Gong, Li Cheng code -1
IntegratedPIFu: Integrated Pixel Aligned Implicit Function for Single-View Human Reconstruction Kennard Yanting Chan, Guosheng Lin, Haiyu Zhao, Weisi Lin code -1
Realistic One-Shot Mesh-Based Head Avatars Taras Khakhulin, Vanessa Sklyarova, Victor Lempitsky, Egor Zakharov code -1
A Kendall Shape Space Approach to 3D Shape Estimation from 2D Landmarks Martha Paskin, Daniel Baum, Mason N. Dean, Christoph von Tycowicz code -1
Neural Light Field Estimation for Street Scenes with Differentiable Virtual Object Insertion Zian Wang, Wenzheng Chen, David Acuna, Jan Kautz, Sanja Fidler code -1
Perspective Phase Angle Model for Polarimetric 3D Reconstruction Guangcheng Chen, Li He, Yisheng Guan, Hong Zhang code -1
DeepShadow: Neural Shape from Shadow Asaf Karnieli, Ohad Fried, Yacov Hel-Or code -1
Camera Auto-calibration from the Steiner Conic of the Fundamental Matrix Yu Liu, Hui Zhang code -1
Super-Resolution 3D Human Shape from a Single Low-Resolution Image Marco Pesavento, Marco Volino, Adrian Hilton code -1
Minimal Neural Atlas: Parameterizing Complex Surfaces with Minimal Charts and Distortion Weng Fei Low, Gim Hee Lee code -1
ExtrudeNet: Unsupervised Inverse Sketch-and-Extrude for Shape Parsing Daxuan Ren, Jianmin Zheng, Jianfei Cai, Jiatong Li, Junzhe Zhang code -1
CATRE: Iterative Point Clouds Alignment for Category-Level Object Pose Refinement Xingyu Liu, Gu Wang, Yi Li, Xiangyang Ji code -1
Optimization over Disentangled Encoding: Unsupervised Cross-Domain Point Cloud Completion via Occlusion Factor Manipulation Jingyu Gong, Fengqi Liu, Jiachen Xu, Min Wang, Xin Tan, Zhizhong Zhang, Ran Yi, Haichuan Song, Yuan Xie, Lizhuang Ma code -1
Unsupervised Learning of 3D Semantic Keypoints with Mutual Reconstruction Haocheng Yuan, Chen Zhao, Shichao Fan, Jiaxi Jiang, Jiaqi Yang code -1
MvDeCor: Multi-view Dense Correspondence Learning for Fine-Grained 3D Segmentation Gopal Sharma, Kangxue Yin, Subhransu Maji, Evangelos Kalogerakis, Or Litany, Sanja Fidler code -1
SUPR: A Sparse Unified Part-Based Human Representation Ahmed A. A. Osman, Timo Bolkart, Dimitrios Tzionas, Michael J. Black code -1
Revisiting Point Cloud Simplification: A Learnable Feature Preserving Approach Rolandos Alexandros Potamias, Giorgos Bouritsas, Stefanos Zafeiriou code -1
Masked Autoencoders for Point Cloud Self-supervised Learning Yatian Pang, Wenxiao Wang, Francis E. H. Tay, Wei Liu, Yonghong Tian, Li Yuan code -1
Intrinsic Neural Fields: Learning Functions on Manifolds Lukas Koestler, Daniel Grittner, Michael Möller, Daniel Cremers, Zorah Lähner code -1
Skeleton-Free Pose Transfer for Stylized 3D Characters Zhouyingcheng Liao, Jimei Yang, Jun Saito, Gerard Pons-Moll, Yang Zhou code -1
Masked Discrimination for Self-supervised Learning on Point Clouds Haotian Liu, Mu Cai, Yong Jae Lee code -1
FBNet: Feedback Network for Point Cloud Completion Xuejun Yan, Hongyu Yan, Jingjing Wang, Hang Du, Zhihong Wu, Di Xie, Shiliang Pu, Li Lu code -1
Meta-sampler: Almost-Universal yet Task-Oriented Sampling for Point Clouds Ta Ying Cheng, Qingyong Hu, Qian Xie, Niki Trigoni, Andrew Markham code -1
A Level Set Theory for Neural Implicit Evolution Under Explicit Flows Ishit Mehta, Manmohan Chandraker, Ravi Ramamoorthi code -1
Efficient Point Cloud Analysis Using Hilbert Curve Wanli Chen, Xinge Zhu, Guojin Chen, Bei Yu code -1
Expanding Language-Image Pretrained Models for General Video Recognition Bolin Ni, Houwen Peng, Minghao Chen, Songyang Zhang, Gaofeng Meng, Jianlong Fu, Shiming Xiang, Haibin Ling code -1
Hunting Group Clues with Transformers for Social Group Activity Recognition Masato Tamura, Rahul Vishwakarma, Ravigopal Vennelakanti code -1
Contrastive Positive Mining for Unsupervised 3D Action Representation Learning Haoyuan Zhang, Yonghong Hou, Wenjing Zhang, Wanqing Li code -1
Target-Absent Human Attention Zhibo Yang, Sounak Mondal, Seoyoung Ahn, Gregory J. Zelinsky, Minh Hoai, Dimitris Samaras code -1
Uncertainty-Based Spatial-Temporal Attention for Online Action Detection Hongji Guo, Zhou Ren, Yi Wu, Gang Hua, Qiang Ji code -1
Iwin: Human-Object Interaction Detection via Transformer with Irregular Windows Danyang Tu, Xiongkuo Min, Huiyu Duan, Guodong Guo, Guangtao Zhai, Wei Shen code -1
Rethinking Zero-shot Action Recognition: Learning from Latent Atomic Actions Yijun Qian, Lijun Yu, Wenhe Liu, Alexander G. Hauptmann code -1
Mining Cross-Person Cues for Body-Part Interactiveness Learning in HOI Detection Xiaoqian Wu, Yong-Lu Li, Xinpeng Liu, Junyi Zhang, Yuzhe Wu, Cewu Lu code -1
Collaborating Domain-Shared and Target-Specific Feature Clustering for Cross-domain 3D Action Recognition Qinying Liu, Zilei Wang code -1
Is Appearance Free Action Recognition Possible? Filip Ilic, Thomas Pock, Richard P. Wildes code -1
Learning Spatial-Preserved Skeleton Representations for Few-Shot Action Recognition Ning Ma, Hongyi Zhang, Xuhui Li, Sheng Zhou, Zhen Zhang, Jun Wen, Haifeng Li, Jingjun Gu, Jiajun Bu code -1
Dual-Evidential Learning for Weakly-supervised Temporal Action Localization Mengyuan Chen, Junyu Gao, Shicai Yang, Changsheng Xu code -1
Global-Local Motion Transformer for Unsupervised Skeleton-Based Action Learning Boeun Kim, Hyung Jin Chang, Jungho Kim, Jin Young Choi code -1
AdaFocusV3: On Unified Spatial-Temporal Dynamic Video Recognition Yulin Wang, Yang Yue, Xinhong Xu, Ali Hassani, Victor Kulikov, Nikita Orlov, Shiji Song, Humphrey Shi, Gao Huang code -1
Panoramic Human Activity Recognition Ruize Han, Haomin Yan, Jiacheng Li, Song Wang, Wei Feng code -1
Delving into Details: Synopsis-to-Detail Networks for Video Recognition Shuxian Liang, Xu Shen, Jianqiang Huang, XianSheng Hua code -1
A Generalized and Robust Framework for Timestamp Supervision in Temporal Action Segmentation Rahul Rahaman, Dipika Singhania, Alexandre H. Thiery, Angela Yao code -1
Few-Shot Action Recognition with Hierarchical Matching and Contrastive Learning Sipeng Zheng, Shizhe Chen, Qin Jin code -1
PrivHAR: Recognizing Human Actions from Privacy-Preserving Lens Carlos Hinojosa, Miguel Marquez, Henry Arguello, Ehsan Adeli, Li FeiFei, Juan Carlos Niebles code -1
Scale-Aware Spatio-Temporal Relation Learning for Video Anomaly Detection Guoqiu Li, Guanxiong Cai, Xingyu Zeng, Rui Zhao code -1
Compound Prototype Matching for Few-Shot Action Recognition Yifei Huang, Lijin Yang, Yoichi Sato code -1
Continual 3D Convolutional Neural Networks for Real-time Processing of Videos Lukas Hedegaard, Alexandros Iosifidis code -1
Dynamic Spatio-Temporal Specialization Learning for Fine-Grained Action Recognition Tianjiao Li, Lin Geng Foo, Qiuhong Ke, Hossein Rahmani, Anran Wang, Jinghua Wang, Jun Liu code -1
Dynamic Local Aggregation Network with Adaptive Clusterer for Anomaly Detection Zhiwei Yang, Peng Wu, Jing Liu, Xiaotao Liu code -1
Action Quality Assessment with Temporal Parsing Transformer Yang Bai, Desen Zhou, Songyang Zhang, Jian Wang, Errui Ding, Yu Guan, Yang Long, Jingdong Wang code -1
Entry-Flipped Transformer for Inference and Prediction of Participant Behavior Bo Hu, TatJen Cham code -1
Pairwise Contrastive Learning Network for Action Quality Assessment Mingzhe Li, Hongbo Zhang, Qing Lei, Zongwen Fan, Jinghua Liu, JiXiang Du code -1
Geometric Features Informed Multi-person Human-Object Interaction Recognition in Videos Tanqiu Qiao, Qianhui Men, Frederick W. B. Li, Yoshiki Kubotani, Shigeo Morishima, Hubert P. H. Shum code -1
ActionFormer: Localizing Moments of Actions with Transformers ChenLin Zhang, Jianxin Wu, Yin Li code -1
SocialVAE: Human Trajectory Prediction Using Timewise Latents Pei Xu, JeanBernard Hayet, Ioannis Karamouzas code -1
Shape Matters: Deformable Patch Attack Zhaoyu Chen, Bo Li, Shuang Wu, Jianghe Xu, Shouhong Ding, Wenqiang Zhang code -1
Frequency Domain Model Augmentation for Adversarial Attack Yuyang Long, Qilong Zhang, Boheng Zeng, Lianli Gao, Xianglong Liu, Jian Zhang, Jingkuan Song code -1
Prior-Guided Adversarial Initialization for Fast Adversarial Training Xiaojun Jia, Yong Zhang, Xingxing Wei, Baoyuan Wu, Ke Ma, Jue Wang, Xiaochun Cao code -1
Enhanced Accuracy and Robustness via Multi-teacher Adversarial Distillation Shiji Zhao, Jie Yu, Zhenlong Sun, Bo Zhang, Xingxing Wei code -1
LGV: Boosting Adversarial Example Transferability from Large Geometric Vicinity Martin Gubri, Maxime Cordy, Mike Papadakis, Yves Le Traon, Koushik Sen code -1
A Large-Scale Multiple-objective Method for Black-box Attack Against Object Detection Siyuan Liang, Longkang Li, Yanbo Fan, Xiaojun Jia, Jingzhi Li, Baoyuan Wu, Xiaochun Cao code -1
GradAuto: Energy-Oriented Attack on Dynamic Neural Networks Jianhong Pan, Qichen Zheng, Zhipeng Fan, Hossein Rahmani, Qiuhong Ke, Jun Liu code -1
A Spectral View of Randomized Smoothing Under Common Corruptions: Benchmarking and Improving Certified Robustness Jiachen Sun, Akshay Mehra, Bhavya Kailkhura, PinYu Chen, Dan Hendrycks, Jihun Hamm, Z. Morley Mao code -1
Improving Adversarial Robustness of 3D Point Cloud Classification Models Guanlin Li, Guowen Xu, Han Qiu, Ruan He, Jiwei Li, Tianwei Zhang code -1
Learning Extremely Lightweight and Robust Model with Differentiable Constraints on Sparsity and Condition Number Xian Wei, Yangyu Xu, Yanhui Huang, Hairong Lv, Hai Lan, Mingsong Chen, Xuan Tang code -1
RIBAC: Towards Robust and Imperceptible Backdoor Attack against Compact DNN Huy Phan, Cong Shi, Yi Xie, Tianfang Zhang, Zhuohang Li, Tianming Zhao, Jian Liu, Yan Wang, Yingying Chen, Bo Yuan code -1
Boosting Transferability of Targeted Adversarial Examples via Hierarchical Generative Networks Xiao Yang, Yinpeng Dong, Tianyu Pang, Hang Su, Jun Zhu code -1
tSF: Transformer-Based Semantic Filter for Few-Shot Learning Jinxiang Lai, Siqian Yang, Wenlong Liu, Yi Zeng, Zhongyi Huang, Wenlong Wu, Jun Liu, BinBin Gao, Chengjie Wang code -1
Adversarial Feature Augmentation for Cross-domain Few-Shot Classification Yanxu Hu, Andy J. Ma code -1
Constructing Balance from Imbalance for Long-Tailed Image Recognition Yue Xu, YongLu Li, Jiefeng Li, Cewu Lu code -1
On Multi-Domain Long-Tailed Recognition, Imbalanced Domain Generalization and Beyond Yuzhe Yang, Hao Wang, Dina Katabi code -1
Few-Shot Video Object Detection Qi Fan, ChiKeung Tang, YuWing Tai code -1
Worst Case Matters for Few-Shot Recognition Minghao Fu, YunHao Cao, Jianxin Wu code -1
Exploring Hierarchical Graph Representation for Large-Scale Zero-Shot Image Classification Kai Yi, Xiaoqian Shen, Yunhao Gou, Mohamed Elhoseiny code -1
Doubly Deformable Aggregation of Covariance Matrices for Few-Shot Segmentation Zhitong Xiong, Haopeng Li, Xiao Xiang Zhu code -1
Dense Cross-Query-and-Support Attention Weighted Mask Aggregation for Few-Shot Segmentation Xinyu Shi, Dong Wei, Yu Zhang, Donghuan Lu, Munan Ning, Jiashun Chen, Kai Ma, Yefeng Zheng code -1
Rethinking Clustering-Based Pseudo-Labeling for Unsupervised Meta-Learning Xingping Dong, Jianbing Shen, Ling Shao code -1
CLASTER: Clustering with Reinforcement Learning for Zero-Shot Action Recognition Shreyank N. Gowda, Laura SevillaLara, Frank Keller, Marcus Rohrbach code -1
Few-Shot Class-Incremental Learning for 3D Point Cloud Objects Townim F. Chowdhury, Ali Cheraghian, Sameera Ramasinghe, Sahar Ahmadi, Morteza Saberi, Shafin Rahman code -1
Meta-Learning with Less Forgetting on Large-Scale Non-Stationary Task Distributions Zhenyi Wang, Li Shen, Le Fang, Qiuling Suo, Donglin Zhan, Tiehang Duan, Mingchen Gao code -1
DnA: Improving Few-Shot Transfer Learning with Low-Rank Decomposition and Alignment Ziyu Jiang, Tianlong Chen, Xuxi Chen, Yu Cheng, Luowei Zhou, Lu Yuan, Ahmed Hassan Awadallah, Zhangyang Wang code -1
Learning Instance and Task-Aware Dynamic Kernels for Few-Shot Learning Rongkai Ma, Pengfei Fang, Gil Avraham, Yan Zuo, Tianyu Zhu, Tom Drummond, Mehrtash Harandi code -1
Open-World Semantic Segmentation via Contrasting and Clustering Vision-Language Embedding Quande Liu, Youpeng Wen, Jianhua Han, Chunjing Xu, Hang Xu, Xiaodan Liang code -1
Few-Shot Classification with Contrastive Learning Zhanyuan Yang, Jinghua Wang, Yingying Zhu code -1
Time-rEversed DiffusioN tEnsor Transformer: A New TENET of Few-Shot Object Detection Shan Zhang, Naila Murray, Lei Wang, Piotr Koniusz code -1
Self-Promoted Supervision for Few-Shot Transformer Bowen Dong, Pan Zhou, Shuicheng Yan, Wangmeng Zuo code -1
Few-Shot Object Counting and Detection Thanh Nguyen, Chau Pham, Khoi Nguyen, Minh Hoai code -1
Rethinking Few-Shot Object Detection on a Multi-Domain Benchmark Kibok Lee, Hao Yang, Satyaki Chakraborty, Zhaowei Cai, Gurumurthy Swaminathan, Avinash Ravichandran, Onkar Dabeer code -1
Cross-Domain Cross-Set Few-Shot Learning via Learning Compact and Aligned Representations Wentao Chen, Zhang Zhang, Wei Wang, Liang Wang, Zilei Wang, Tieniu Tan code -1
Mutually Reinforcing Structure with Proposal Contrastive Consistency for Few-Shot Object Detection TianXue Ma, Mingwei Bi, Jian Zhang, Wang Yuan, Zhizhong Zhang, Yuan Xie, Shouhong Ding, Lizhuang Ma code -1
Dual Contrastive Learning with Anatomical Auxiliary Supervision for Few-Shot Medical Image Segmentation Huisi Wu, Fangyan Xiao, Chongxin Liang code -1
Improving Few-Shot Learning Through Multi-task Representation Learning Theory Quentin Bouniot, Ievgen Redko, Romaric Audigier, Angélique Loesch, Amaury Habrard code -1
Tree Structure-Aware Few-Shot Image Classification via Hierarchical Aggregation Min Zhang, Siteng Huang, Wenbin Li, Donglin Wang code -1
Inductive and Transductive Few-Shot Video Classification via Appearance and Temporal Alignments Khoi D. Nguyen, QuocHuy Tran, Khoi Nguyen, BinhSon Hua, Rang Nguyen code -1
Temporal and Cross-modal Attention for Audio-Visual Zero-Shot Learning OtnielBogdan Mercea, Thomas Hummel, A. Sophia Koepke, Zeynep Akata code -1
HM: Hybrid Masking for Few-Shot Segmentation Seonghyeon Moon, Samuel S. Sohn, Honglu Zhou, Sejong Yoon, Vladimir Pavlovic, Muhammad Haris Khan, Mubbasir Kapadia code -1
TransVLAD: Focusing on Locally Aggregated Descriptors for Few-Shot Learning Haoquan Li, Laoming Zhang, Daoan Zhang, Lang Fu, Peng Yang, Jianguo Zhang code -1
Kernel Relative-prototype Spectral Filtering for Few-Shot Learning Tao Zhang, Wu Huang code -1
"This Is My Unicorn, Fluffy": Personalizing Frozen Vision-Language Representations Niv Cohen, Rinon Gal, Eli A. Meirom, Gal Chechik, Yuval Atzmon code -1
CLOSE: Curriculum Learning on the Sharing Extent Towards Better One-Shot NAS Zixuan Zhou, Xuefei Ning, Yi Cai, Jiashu Han, Yiping Deng, Yuhan Dong, Huazhong Yang, Yu Wang code -1
Streamable Neural Fields Junwoo Cho, Seungtae Nam, Daniel Rho, Jong Hwan Ko, Eunbyung Park code -1
Gradient-Based Uncertainty for Monocular Depth Estimation Julia Hornauer, Vasileios Belagiannis code -1
Online Continual Learning with Contrastive Vision Transformer Zhen Wang, Liu Liu, Yajing Kong, Jiaxian Guo, Dacheng Tao code -1
CPrune: Compiler-Informed Model Pruning for Efficient Target-Aware DNN Execution Taeho Kim, Yongin Kwon, Jemin Lee, Taeho Kim, Sangtae Ha code -1
EAutoDet: Efficient Architecture Search for Object Detection Xiaoxing Wang, Jiale Lin, Juanping Zhao, Xiaokang Yang, Junchi Yan code -1
A Max-Flow Based Approach for Neural Architecture Search Chao Xue, Xiaoxing Wang, Junchi Yan, ChunGuang Li code -1
OccamNets: Mitigating Dataset Bias by Favoring Simpler Hypotheses Robik Shrestha, Kushal Kafle, Christopher Kanan code -1
ERA: Enhanced Rational Activations Martin Trimmel, Mihai Zanfir, Richard I. Hartley, Cristian Sminchisescu code -1
Convolutional Embedding Makes Hierarchical Vision Transformer Stronger Cong Wang, Hongmin Xu, Xiong Zhang, Li Wang, Zhitong Zheng, Haifeng Liu code -1
Learning Depth from Focus in the Wild Changyeon Won, HaeGon Jeon code -1
Learning-Based Point Cloud Registration for 6D Object Pose Estimation in the Real World Zheng Dang, Lizhou Wang, Yu Guo, Mathieu Salzmann code -1
An End-to-End Transformer Model for Crowd Localization Dingkang Liang, Wei Xu, Xiang Bai code -1
Few-Shot Single-View 3D Reconstruction with Memory Prior Contrastive Network Zhen Xing, Yijiang Chen, Zhixin Ling, Xiangdong Zhou, Yu Xiang code -1
DID-M3D: Decoupling Instance Depth for Monocular 3D Object Detection Liang Peng, Xiaopei Wu, Zheng Yang, Haifeng Liu, Deng Cai code -1
Adaptive Co-teaching for Unsupervised Monocular Depth Estimation Weisong Ren, Lijun Wang, Yongri Piao, Miao Zhang, Huchuan Lu, Ting Liu code -1
Fusing Local Similarities for Retrieval-Based 3D Orientation Estimation of Unseen Objects Chen Zhao, Yinlin Hu, Mathieu Salzmann code -1
Lidar Point Cloud Guided Monocular 3D Object Detection Liang Peng, Fei Liu, Zhengxu Yu, Senbo Yan, Dan Deng, Zheng Yang, Haifeng Liu, Deng Cai code -1
Structural Causal 3D Reconstruction Weiyang Liu, Zhen Liu, Liam Paull, Adrian Weller, Bernhard Schölkopf code -1
3D Human Pose Estimation Using Möbius Graph Convolutional Networks Niloofar Azizi, Horst Possegger, Emanuele Rodolà, Horst Bischof code -1
Learning to Train a Point Cloud Reconstruction Network Without Matching Tianxin Huang, Xuemeng Yang, Jiangning Zhang, Jinhao Cui, Hao Zou, Jun Chen, Xiangrui Zhao, Yong Liu code -1
PanoFormer: Panorama Transformer for Indoor 360° Depth Estimation Zhijie Shen, Chunyu Lin, Kang Liao, Lang Nie, Zishuo Zheng, Yao Zhao code -1
Self-supervised Human Mesh Recovery with Cross-Representation Alignment Xuan Gong, Meng Zheng, Benjamin Planche, Srikrishna Karanam, Terrence Chen, David S. Doermann, Ziyan Wu code -1
AlignSDF: Pose-Aligned Signed Distance Fields for Hand-Object Reconstruction Zerui Chen, Yana Hasson, Cordelia Schmid, Ivan Laptev code -1
A Reliable Online Method for Joint Estimation of Focal Length and Camera Rotation Yiming Qian, James H. Elder code -1
PS-NeRF: Neural Inverse Rendering for Multi-view Photometric Stereo Wenqi Yang, Guanying Chen, Chaofeng Chen, Zhenfang Chen, KwanYee K. Wong code -1
Share with Thy Neighbors: Single-View Reconstruction by Cross-Instance Consistency Tom Monnier, Matthew Fisher, Alexei A. Efros, Mathieu Aubry code -1
Towards Comprehensive Representation Enhancement in Semantics-Guided Self-supervised Monocular Depth Estimation Jingyuan Ma, Xiangyu Lei, Nan Liu, Xian Zhao, Shiliang Pu code -1
AvatarCap: Animatable Avatar Conditioned Monocular Human Volumetric Capture Zhe Li, Zerong Zheng, Hongwen Zhang, Chaonan Ji, Yebin Liu code -1
Cross-Attention of Disentangled Modalities for 3D Human Mesh Recovery with Transformers Junhyeong Cho, Kim Youwang, TaeHyun Oh code -1
GeoRefine: Self-supervised Online Depth Refinement for Accurate Dense Mapping Pan Ji, Qingan Yan, Yuxin Ma, Yi Xu code -1
Multi-modal Masked Pre-training for Monocular Panoramic Depth Completion Zhiqiang Yan, Xiang Li, Kun Wang, Zhenyu Zhang, Jun Li, Jian Yang code -1
GitNet: Geometric Prior-Based Transformation for Birds-Eye-View Segmentation Shi Gong, Xiaoqing Ye, Xiao Tan, Jingdong Wang, Errui Ding, Yu Zhou, Xiang Bai code -1
Learning Visibility for Robust Dense Human Body Estimation ChunHan Yao, Jimei Yang, Duygu Ceylan, Yi Zhou, Yang Zhou, MingHsuan Yang code -1
Towards High-Fidelity Single-View Holistic Reconstruction of Indoor Scenes Haolin Liu, Yujian Zheng, Guanying Chen, Shuguang Cui, Xiaoguang Han code -1
CompNVS: Novel View Synthesis with Scene Completion Zuoyue Li, Tianxing Fan, Zhenqiang Li, Zhaopeng Cui, Yoichi Sato, Marc Pollefeys, Martin R. Oswald code -1
SketchSampler: Sketch-Based 3D Reconstruction via View-Dependent Depth Sampling Chenjian Gao, Qian Yu, Lu Sheng, YiZhe Song, Dong Xu code -1
LocalBins: Improving Depth Estimation by Learning Local Distributions Shariq Farooq Bhat, Ibraheem Alhashim, Peter Wonka code -1
2D GANs Meet Unsupervised Single-View 3D Reconstruction Feng Liu, Xiaoming Liu code -1
InfiniteNature-Zero: Learning Perpetual View Generation of Natural Scenes from Single Images Zhengqi Li, Qianqian Wang, Noah Snavely, Angjoo Kanazawa code -1
Semi-supervised Single-View 3D Reconstruction via Prototype Shape Priors Zhen Xing, Hengduo Li, Zuxuan Wu, YuGang Jiang code -1
Bilateral Normal Integration Xu Cao, Hiroaki Santo, Boxin Shi, Fumio Okura, Yasuyuki Matsushita code -1
S2Contact: Graph-Based Network for 3D Hand-Object Contact Estimation with Semi-supervised Learning Tze Ho Elden Tse, Zhongqun Zhang, Kwang In Kim, Ales Leonardis, Feng Zheng, Hyung Jin Chang code -1
SC-wLS: Towards Interpretable Feed-forward Camera Re-localization Xin Wu, Hao Zhao, Shunkai Li, Yingdian Cao, Hongbin Zha code -1
FloatingFusion: Depth from ToF and Image-Stabilized Stereo Cameras Andreas Meuleman, Hakyeong Kim, James Tompkin, Min H. Kim code -1
DELTAR: Depth Estimation from a Light-Weight ToF Sensor and RGB Image Yijin Li, Xinyang Liu, Wenqi Dong, Han Zhou, Hujun Bao, Guofeng Zhang, Yinda Zhang, Zhaopeng Cui code -1
3D Room Layout Estimation from a Cubemap of Panorama Image via Deep Manhattan Hough Transform Yining Zhao, Chao Wen, Zhou Xue, Yue Gao code -1
RBP-Pose: Residual Bounding Box Projection for Category-Level Pose Estimation Ruida Zhang, Yan Di, Zhiqiang Lou, Fabian Manhardt, Federico Tombari, Xiangyang Ji code -1
Monocular 3D Object Reconstruction with GAN Inversion Junzhe Zhang, Daxuan Ren, Zhongang Cai, Chai Kiat Yeo, Bo Dai, Chen Change Loy code -1
Map-Free Visual Relocalization: Metric Pose Relative to a Single Image Eduardo Arnold, Jamie Wynn, Sara Vicente, Guillermo GarciaHernando, Áron Monszpart, Victor Prisacariu, Daniyar Turmukhambetov, Eric Brachmann code -1
Self-distilled Feature Aggregation for Self-supervised Monocular Depth Estimation Zhengming Zhou, Qiulei Dong code -1
Planes vs. Chairs: Category-Guided 3D Shape Learning Without any 3D Cues Zixuan Huang, Stefan Stojanov, Anh Thai, Varun Jampani, James M. Rehg code -1
ECCV Caption: Correcting False Negatives by Collecting Machine-and-Human-verified Image-Caption Associations for MS-COCO Sanghyuk Chun, Wonjae Kim, Song Park, Minsuk Chang, Seong Joon Oh code -1
MOTCOM: The Multi-Object Tracking Dataset Complexity Metric Malte Pedersen, Joakim Bruslund Haurum, Patrick Dendorfer, Thomas B. Moeslund code -1
How to Synthesize a Large-Scale and Trainable Micro-Expression Dataset? Yuchi Liu, Zhongdao Wang, Tom Gedeon, Liang Zheng code -1
A Real World Dataset for Multi-view 3D Reconstruction Rakesh Shrestha, Siqi Hu, Minghao Gou, Ziyuan Liu, Ping Tan code -1
REALY: Rethinking the Evaluation of 3D Face Reconstruction Zenghao Chai, Haoxian Zhang, Jing Ren, Di Kang, Zhengzhuo Xu, Xuefei Zhe, Chun Yuan, Linchao Bao code -1
Capturing, Reconstructing, and Simulating: The UrbanScene3D Dataset Liqiang Lin, Yilin Liu, Yue Hu, Xingguang Yan, Ke Xie, Hui Huang code -1
3D CoMPaT: Composition of Materials on Parts of 3D Things Yuchen Li, Ujjwal Upadhyay, Habib Slim, Ahmed Abdelreheem, Arpit Prajapati, Suhail Pothigara, Peter Wonka, Mohamed Elhoseiny code -1
PartImageNet: A Large, High-Quality Dataset of Parts Ju He, Shuo Yang, Shaokang Yang, Adam Kortylewski, Xiaoding Yuan, Jieneng Chen, Shuai Liu, Cheng Yang, Qihang Yu, Alan L. Yuille code -1
A-OKVQA: A Benchmark for Visual Question Answering Using World Knowledge Dustin Schwenk, Apoorv Khandelwal, Christopher Clark, Kenneth Marino, Roozbeh Mottaghi code -1
OOD-CV: A Benchmark for Robustness to Out-of-Distribution Shifts of Individual Nuisances in Natural Images Bingchen Zhao, Shaozuo Yu, Wufei Ma, Mingxin Yu, Shenxiao Mei, Angtian Wang, Ju He, Alan L. Yuille, Adam Kortylewski code -1
Facial Depth and Normal Estimation Using Single Dual-Pixel Camera Minjun Kang, Jaesung Choe, Hyowon Ha, HaeGon Jeon, Sunghoon Im, In So Kweon, KukJin Yoon code -1
The Anatomy of Video Editing: A Dataset and Benchmark Suite for AI-Assisted Video Editing Dawit Mureja Argaw, Fabian Caba Heilbron, JoonYoung Lee, Markus Woodson, In So Kweon code -1
StyleBabel: Artistic Style Tagging and Captioning Dan Ruta, Andrew Gilbert, Pranav Aggarwal, Naveen Marri, Ajinkya Kale, Jo Briggs, Chris Speed, Hailin Jin, Baldo Faieta, Alex Filipkowski, Zhe Lin, John P. Collomosse code -1
PANDORA: A Panoramic Detection Dataset for Object with Orientation Hang Xu, Qiang Zhao, Yike Ma, Xiaodong Li, Peng Yuan, Bailan Feng, Chenggang Yan, Feng Dai code -1
FS-COCO: Towards Understanding of Freehand Sketches of Common Objects in Context Pinaki Nath Chowdhury, Aneeshan Sain, Ayan Kumar Bhunia, Tao Xiang, Yulia Gryaditskaya, YiZhe Song code -1
Exploring Fine-Grained Audiovisual Categorization with the SSW60 Dataset Grant Van Horn, Rui Qian, Kimberly Wilber, Hartwig Adam, Oisin Mac Aodha, Serge J. Belongie code -1
The Caltech Fish Counting Dataset: A Benchmark for Multiple-Object Tracking and Counting Justin Kay, Peter Kulits, Suzanne Stathatos, Siqi Deng, Erik Young, Sara Beery, Grant Van Horn, Pietro Perona code -1
A Dataset for Interactive Vision-Language Navigation with Unknown Command Feasibility Andrea Burns, Deniz Arsan, Sanjna Agrawal, Ranjitha Kumar, Kate Saenko, Bryan A. Plummer code -1
Dress Code: High-Resolution Multi-category Virtual Try-On Davide Morelli, Matteo Fincato, Marcella Cornia, Federico Landi, Fabio Cesari, Rita Cucchiara code -1
A Data-Centric Approach for Improving Ambiguous Labels with Combined Semi-supervised Classification and Clustering Lars Schmarje, Monty Santarossa, SimonMartin Schröder, Claudius Zelenka, Rainer Kiko, Jenny Stracke, Nina Volkmann, Reinhard Koch code -1
ClearPose: Large-scale Transparent Object Dataset and Benchmark Xiaotong Chen, Huijie Zhang, Zeren Yu, Anthony Opipari, Odest Chadwicke Jenkins code -1
When Deep Classifiers Agree: Analyzing Correlations Between Learning Order and Image Statistics Iuliia Pliushch, Martin Mundt, Nicolas Lupp, Visvanathan Ramesh code -1
AnimeCeleb: Large-Scale Animation CelebHeads Dataset for Head Reenactment Kangyeol Kim, Sunghyun Park, Jaeseong Lee, Sunghyo Chung, Junsoo Lee, Jaegul Choo code -1
MUGEN: A Playground for Video-Audio-Text Multimodal Understanding and GENeration Thomas Hayes, Songyang Zhang, Xi Yin, Guan Pang, Sasha Sheng, Harry Yang, Songwei Ge, Qiyuan Hu, Devi Parikh code -1
A Dense Material Segmentation Dataset for Indoor and Outdoor Scene Parsing Paul Upchurch, Ransen Niu code -1
MimicME: A Large Scale Diverse 4D Database for Facial Expression Analysis Athanasios Papaioannou, Baris Gecer, Shiyang Cheng, Grigorios Chrysos, Jiankang Deng, Eftychia Fotiadou, Christos Kampouris, Dimitrios Kollias, Stylianos Moschoglou, Kritaphat Songsriin, Stylianos Ploumpis, George Trigeorgis, Panagiotis Tzirakis, Evangelos Ververas, Yuxiang Zhou, Allan Ponniah, Anastasios Roussos, Stefanos Zafeiriou code -1
Delving into Universal Lesion Segmentation: Method, Dataset, and Benchmark Yu Qiu, Jing Xu code -1
Large Scale Real-World Multi-person Tracking Bing Shuai, Alessandro Bergamo, Uta Büchler, Andrew G. Berneshawi, Alyssa Boden, Joseph Tighe code -1
D2-TPred: Discontinuous Dependency for Trajectory Prediction Under Traffic Lights Yuzhen Zhang, Wentong Wang, Weizhi Guo, Pei Lv, Mingliang Xu, Wei Chen, Dinesh Manocha code -1
The Missing Link: Finding Label Relations Across Datasets Jasper R. R. Uijlings, Thomas Mensink, Vittorio Ferrari code -1
Learning Omnidirectional Flow in 360° Video via Siamese Representation Keshav Bhandari, Bin Duan, Gaowen Liu, Hugo Latapie, Ziliang Zong, Yan Yan code -1
VizWiz-FewShot: Locating Objects in Images Taken by People with Visual Impairments YuYun Tseng, Alexander Bell, Danna Gurari code -1
TRoVE: Transforming Road Scene Datasets into Photorealistic Virtual Environments Shubham Dokania, Anbumani Subramanian, Manmohan Chandraker, C. V. Jawahar code -1
Trapped in Texture Bias? A Large Scale Comparison of Deep Instance Segmentation Johannes Theodoridis, Jessica Hofmann, Johannes Maucher, Andreas Schilling code -1
Deformable Feature Aggregation for Dynamic Multi-modal 3D Object Detection Zehui Chen, Zhenyu Li, Shiquan Zhang, Liangji Fang, Qinhong Jiang, Feng Zhao code -1
WeLSA: Learning to Predict 6D Pose from Weakly Labeled Data Using Shape Alignment Shishir Reddy Vutukur, Ivan Shugurov, Benjamin Busam, Andreas Hutter, Slobodan Ilic code -1
Graph R-CNN: Towards Accurate 3D Object Detection with Semantic-Decorated Local Graph Honghui Yang, Zili Liu, Xiaopei Wu, Wenxiao Wang, Wei Qian, Xiaofei He, Deng Cai code -1
MPPNet: Multi-frame Feature Intertwining with Proxy Points for 3D Temporal Object Detection Xuesong Chen, Shaoshuai Shi, Benjin Zhu, Ka Chun Cheung, Hang Xu, Hongsheng Li code -1
Long-tail Detection with Effective Class-Margins Jang Hyun Cho, Philipp Krähenbühl code -1
Semi-supervised Monocular 3D Object Detection by Multi-view Consistency Qing Lian, Yanbo Xu, Weilong Yao, Yingcong Chen, Tong Zhang code -1
PTSEFormer: Progressive Temporal-Spatial Enhanced TransFormer Towards Video Object Detection Han Wang, Jun Tang, Xiaodong Liu, Shanyan Guan, Rong Xie, Li Song code -1
AU-Aware 3D Face Reconstruction through Personalized AU-Specific Blendshape Learning Chenyi Kuang, Zijun Cui, Jeffrey O. Kephart, Qiang Ji code -1
BézierPalm: A Free Lunch for Palmprint Recognition Kai Zhao, Lei Shen, Yingyi Zhang, Chuhan Zhou, Tao Wang, Ruixin Zhang, Shouhong Ding, Wei Jia, Wei Shen code -1
Adaptive Transformers for Robust Few-shot Cross-domain Face Anti-spoofing HsinPing Huang, Deqing Sun, Yaojie Liu, WenSheng Chu, Taihong Xiao, Jinwei Yuan, Hartwig Adam, MingHsuan Yang code -1
Face2Faceρ: Real-Time High-Resolution One-Shot Face Reenactment Kewei Yang, Kang Chen, Daoliang Guo, SongHai Zhang, Yuanchen Guo, Weidong Zhang code -1
Towards Racially Unbiased Skin Tone Estimation via Scene Disambiguation Haiwen Feng, Timo Bolkart, Joachim Tesch, Michael J. Black, Victoria Fernández Abrevaya code -1
BoundaryFace: A Mining Framework with Noise Label Self-correction for Face Recognition Shijie Wu, Xun Gong code -1
Pre-training Strategies and Datasets for Facial Representation Learning Adrian Bulat, Shiyang Cheng, Jing Yang, Andrew Garbett, Enrique SánchezLozano, Georgios Tzimiropoulos code -1
Look Both Ways: Self-supervising Driver Gaze Estimation and Road Scene Saliency Isaac Kasahara, Simon Stent, Hyun Soo Park code -1
MFIM: Megapixel Facial Identity Manipulation Sanghyeon Na code -1
3D Face Reconstruction with Dense Landmarks Erroll Wood, Tadas Baltrusaitis, Charlie Hewitt, Matthew Johnson, Jingjing Shen, Nikola Milosavljevic, Daniel Wilde, Stephan J. Garbin, Toby Sharp, Ivan Stojiljkovic, Tom Cashman, Julien P. C. Valentin code -1
Emotion-aware Multi-view Contrastive Learning for Facial Emotion Recognition Dae Ha Kim, Byung Cheol Song code -1
Order Learning Using Partially Ordered Data via Chainization SeonHo Lee, ChangSu Kim code -1
Unsupervised High-Fidelity Facial Texture Generation and Reconstruction Ron Slossberg, Ibrahim Jubran, Ron Kimmel code -1
Multi-domain Learning for Updating Face Anti-spoofing Models Xiao Guo, Yaojie Liu, Anil K. Jain, Xiaoming Liu code -1
Towards Metrical Reconstruction of Human Faces Wojciech Zielonka, Timo Bolkart, Justus Thies code -1
Discover and Mitigate Unknown Biases with Debiasing Alternate Networks Zhiheng Li, Anthony Hoogs, Chenliang Xu code -1
Unsupervised and Semi-supervised Bias Benchmarking in Face Recognition Alexandra Chouldechova, Siqi Deng, Yongxin Wang, Wei Xia, Pietro Perona code -1
Towards Efficient Adversarial Training on Vision Transformers Boxi Wu, Jindong Gu, Zhifeng Li, Deng Cai, Xiaofei He, Wei Liu code -1
MIME: Minority Inclusion for Majority Group Enhancement of AI Performance Pradyumna Chari, Yunhao Ba, Shreeram S. Athreya, Achuta Kadambi code -1
Studying Bias in GANs Through the Lens of Race Vongani H. Maluleke, Neerja Thakkar, Tim Brooks, Ethan Weber, Trevor Darrell, Alexei A. Efros, Angjoo Kanazawa, Devin Guillory code -1
Trust, but Verify: Using Self-supervised Probing to Improve Trustworthiness Ailin Deng, Shen Li, Miao Xiong, Zhirui Chen, Bryan Hooi code -1
Learning to Censor by Noisy Sampling Ayush Chopra, Abhinav Java, Abhishek Singh, Vivek Sharma, Ramesh Raskar code -1
An Invisible Black-Box Backdoor Attack Through Frequency Domain Tong Wang, Yuan Yao, Feng Xu, Shengwei An, Hanghang Tong, Ting Wang code -1
FairGRAPE: Fairness-Aware GRAdient Pruning mEthod for Face Attribute Classification Xiaofeng Lin, Seungbae Kim, Jungseock Joo code -1
Attaining Class-Level Forgetting in Pretrained Model Using Few Samples Pravendra Singh, Pratik Mazumder, Mohammed Asad Karim code -1
Anti-Neuron Watermarking: Protecting Personal Data Against Unauthorized Neural Networks Zihang Zou, Boqing Gong, Liqiang Wang code -1
An Impartial Take to the CNN vs Transformer Robustness Contest Francesco Pinto, Philip H. S. Torr, Puneet K. Dokania code -1
Recover Fair Deep Classification Models via Altering Pre-trained Structure Yanfu Zhang, Shangqian Gao, Heng Huang code -1
Decouple-and-Sample: Protecting Sensitive Information in Task Agnostic Data Release Abhishek Singh, Ethan Garza, Ayush Chopra, Praneeth Vepakomma, Vivek Sharma, Ramesh Raskar code -1
Privacy-Preserving Action Recognition via Motion Difference Quantization Sudhakar Kumawat, Hajime Nagahara code -1
Latent Space Smoothing for Individually Fair Representations Momchil Peychev, Anian Ruoss, Mislav Balunovic, Maximilian Baader, Martin T. Vechev code -1
Parameterized Temperature Scaling for Boosting the Expressive Power in Post-Hoc Uncertainty Calibration Christian Tomani, Daniel Cremers, Florian Buettner code -1
FairStyle: Debiasing StyleGAN2 with Style Channel Manipulations Cemre Karakas, Alara Dirik, Eylul Yalcinkaya, Pinar Yanardag code -1
Distilling the Undistillable: Learning from a Nasty Teacher Surgan Jandial, Yash Khasbage, Arghya Pal, Vineeth N. Balasubramanian, Balaji Krishnamurthy code -1
SOS! Self-supervised Learning over Sets of Handled Objects in Egocentric Action Recognition Victor Escorcia, Ricardo Guerrero, Xiatian Zhu, Brais Martínez code -1
Egocentric Activity Recognition and Localization on a 3D Map Miao Liu, Lingni Ma, Kiran Somasundaram, Yin Li, Kristen Grauman, James M. Rehg, Chao Li code -1
Generative Adversarial Network for Future Hand Segmentation from Egocentric Video Wenqi Jia, Miao Liu, James M. Rehg code -1
My View is the Best View: Procedure Learning from Egocentric Videos Siddhant Bansal, Chetan Arora, C. V. Jawahar code -1
GIMO: Gaze-Informed Human Motion Prediction in Context Yang Zheng, Yanchao Yang, Kaichun Mo, Jiaman Li, Tao Yu, Yebin Liu, C. Karen Liu, Leonidas J. Guibas code -1
Image-Based CLIP-Guided Essence Transfer Hila Chefer, Sagie Benaim, Roni Paiss, Lior Wolf code -1
Detecting and Recovering Sequential DeepFake Manipulation Rui Shao, Tianxing Wu, Ziwei Liu code -1
Self-supervised Sparse Representation for Video Anomaly Detection JhihCiang Wu, HeYen Hsieh, DingJie Chen, ChiouShann Fuh, TyngLuh Liu code -1
Adaptive Image Transformations for Transfer-Based Adversarial Attack Zheng Yuan, Jie Zhang, Shiguang Shan code -1
Generative Multiplane Images: Making a 2D GAN 3D-Aware Xiaoming Zhao, Fangchang Ma, David Güera, Zhile Ren, Alexander G. Schwing, Alex Colburn code -1
AdvDO: Realistic Adversarial Attacks for Trajectory Prediction Yulong Cao, Chaowei Xiao, Anima Anandkumar, Danfei Xu, Marco Pavone code -1
Adversarial Contrastive Learning via Asymmetric InfoNCE Qiying Yu, Jieming Lou, Xianyuan Zhan, Qizhang Li, Wangmeng Zuo, Yang Liu, Jingjing Liu code -1
One Size Does NOT Fit All: Data-Adaptive Adversarial Training Shuo Yang, Chang Xu code -1
UniCR: Universally Approximated Certified Robustness via Randomized Smoothing Hanbin Hong, Binghui Wang, Yuan Hong code -1
Hardly Perceptible Trojan Attack Against Neural Networks with Bit Flips Jiawang Bai, Kuofeng Gao, Dihong Gong, ShuTao Xia, Zhifeng Li, Wei Liu code -1
Robust Network Architecture Search via Feature Distortion Restraining Yaguan Qian, Shenghui Huang, Bin Wang, Xiang Ling, Xiaohui Guan, Zhaoquan Gu, Shaoning Zeng, Wujie Zhou, Haijiang Wang code -1
SecretGen: Privacy Recovery on Pre-trained Models via Distribution Discrimination Zhuowen Yuan, Fan Wu, Yunhui Long, Chaowei Xiao, Bo Li code -1
Triangle Attack: A Query-Efficient Decision-Based Adversarial Attack Xiaosen Wang, Zeliang Zhang, Kangheng Tong, Dihong Gong, Kun He, Zhifeng Li, Wei Liu code -1
Data-Free Backdoor Removal Based on Channel Lipschitzness Runkai Zheng, Rongjun Tang, Jianze Li, Li Liu code -1
Black-Box Dissector: Towards Erasing-Based Hard-Label Model Stealing Attack Yixu Wang, Jie Li, Hong Liu, Yan Wang, Yongjian Wu, Feiyue Huang, Rongrong Ji code -1
Learning Energy-Based Models with Adversarial Training Xuwang Yin, Shiying Li, Gustavo K. Rohde code -1
Adversarial Label Poisoning Attack on Graph Neural Networks via Label Propagation Ganlin Liu, Xiaowei Huang, Xinping Yi code -1
Revisiting Outer Optimization in Adversarial Training Ali Dabouei, Fariborz Taherkhani, Sobhan Soleymani, Nasser M. Nasrabadi code -1
Zero-Shot Attribute Attacks on Fine-Grained Recognition Models Nasim Shafiee, Ehsan Elhamifar code -1
Towards Effective and Robust Neural Trojan Defenses via Input Filtering Kien Do, Haripriya Harikumar, Hung Le, Dung Nguyen, Truyen Tran, Santu Rana, Dang Nguyen, Willy Susilo, Svetha Venkatesh code -1
Scaling Adversarial Training to Large Perturbation Bounds Sravanti Addepalli, Samyak Jain, Gaurang Sriramanan, R. Venkatesh Babu code -1
Exploiting the Local Parabolic Landscapes of Adversarial Losses to Accelerate Black-Box Adversarial Attack Hoang Tran, Dan Lu, Guannan Zhang code -1
Generative Domain Adaptation for Face Anti-Spoofing Qianyu Zhou, KeYue Zhang, Taiping Yao, Ran Yi, Kekai Sheng, Shouhong Ding, Lizhuang Ma code -1
MetaGait: Learning to Learn an Omni Sample Adaptive Representation for Gait Recognition Huanzhang Dou, Pengyi Zhang, Wei Su, Yunlong Yu, Xi Li code -1
GaitEdge: Beyond Plain End-to-End Gait Recognition for Better Practicality Junhao Liang, Chao Fan, Saihui Hou, Chuanfu Shen, Yongzhen Huang, Shiqi Yu code -1
UIA-ViT: Unsupervised Inconsistency-Aware Method Based on Vision Transformer for Face Forgery Detection Wanyi Zhuang, Qi Chu, Zhentao Tan, Qiankun Liu, Haojie Yuan, Changtao Miao, Zixiang Luo, Nenghai Yu code -1
Effective Presentation Attack Detection Driven by Face Related Task Wentian Zhang, Haozhe Liu, Feng Liu, Raghavendra Ramachandra, Christoph Busch code -1
PPT: Token-Pruned Pose Transformer for Monocular and Multi-view Human Pose Estimation Haoyu Ma, Zhe Wang, Yifei Chen, Deying Kong, Liangjian Chen, Xingwei Liu, Xiangyi Yan, Hao Tang, Xiaohui Xie code -1
AvatarPoser: Articulated Full-Body Pose Tracking from Sparse Motion Sensing Jiaxi Jiang, Paul Streli, Huajian Qiu, Andreas Fender, Larissa Laich, Patrick Snape, Christian Holz code -1
P-STMO: Pre-trained Spatial Temporal Many-to-One Model for 3D Human Pose Estimation Wenkang Shan, Zhenhua Liu, Xinfeng Zhang, Shanshe Wang, Siwei Ma, Wen Gao code -1
D&D: Learning Human Dynamics from Dynamic Camera Jiefeng Li, Siyuan Bian, Chao Xu, Gang Liu, Gang Yu, Cewu Lu code -1
Explicit Occlusion Reasoning for Multi-person 3D Human Pose Estimation Qihao Liu, Yi Zhang, Song Bai, Alan L. Yuille code -1
COUCH: Towards Controllable Human-Chair Interactions Xiaohan Zhang, Bharat Lal Bhatnagar, Sebastian Starke, Vladimir Guzov, Gerard PonsMoll code -1
Identity-Aware Hand Mesh Estimation and Personalization from RGB Images Deying Kong, Linguang Zhang, Liangjian Chen, Haoyu Ma, Xiangyi Yan, Shanlin Sun, Xingwei Liu, Kun Han, Xiaohui Xie code -1
C3P: Cross-Domain Pose Prior Propagation for Weakly Supervised 3D Human Pose Estimation Cunlin Wu, Yang Xiao, Boshen Zhang, Mingyang Zhang, Zhiguo Cao, Joey Tianyi Zhou code -1
Pose-NDF: Modeling Human Pose Manifolds with Neural Distance Fields Garvita Tiwari, Dimitrije Antic, Jan Eric Lenssen, Nikolaos Sarafianos, Tony Tung, Gerard PonsMoll code -1
CLIFF: Carrying Location Information in Full Frames into Human Pose and Shape Estimation Zhihao Li, Jianzhuang Liu, Zhensong Zhang, Songcen Xu, Youliang Yan code -1
DeciWatch: A Simple Baseline for 10× Efficient 2D and 3D Pose Estimation Ailing Zeng, Xuan Ju, Lei Yang, Ruiyuan Gao, Xizhou Zhu, Bo Dai, Qiang Xu code -1
SmoothNet: A Plug-and-Play Network for Refining Human Poses in Videos Ailing Zeng, Lei Yang, Xuan Ju, Jiefeng Li, Jianyi Wang, Qiang Xu code -1
PoseTrans: A Simple yet Effective Pose Transformation Augmentation for Human Pose Estimation Wentao Jiang, Sheng Jin, Wentao Liu, Chen Qian, Ping Luo, Si Liu code -1
Multi-Person 3D Pose and Shape Estimation via Inverse Kinematics and Refinement Junuk Cha, Muhammad Saqlain, GeonU Kim, Mingyu Shin, Seungryul Baek code -1
Overlooked Poses Actually Make Sense: Distilling Privileged Knowledge for Human Motion Prediction Xiaoning Sun, Qiongjie Cui, Huaijiang Sun, Bin Li, Weiqing Li, Jianfeng Lu code -1
Structural Triangulation: A Closed-Form Solution to Constrained 3D Human Pose Estimation Zhuo Chen, Xu Zhao, Xiaoyue Wan code -1
Audio-Driven Stylized Gesture Generation with Flow-Based Model Sheng Ye, YuHui Wen, Yanan Sun, Ying He, Ziyang Zhang, Yaoyuan Wang, Weihua He, YongJin Liu code -1
Self-Constrained Inference Optimization on Structural Groups for Human Pose Estimation Zhehan Kan, Shuoshuo Chen, Zeng Li, Zhihai He code -1
A Simple Approach and Benchmark for 21,000-Category Object Detection Yutong Lin, Chen Li, Yue Cao, Zheng Zhang, Jianfeng Wang, Lijuan Wang, Zicheng Liu, Han Hu code -1
Knowledge Condensation Distillation Chenxin Li, Mingbao Lin, Zhiyuan Ding, Nie Lin, Yihong Zhuang, Yue Huang, Xinghao Ding, Liujuan Cao code -1
Reducing Information Loss for Spiking Neural Networks Yufei Guo, Yuanpei Chen, Liwen Zhang, YingLei Wang, Xiaode Liu, Xinyi Tong, Yuanyuan Ou, Xuhui Huang, Zhe Ma code -1
Masked Generative Distillation Zhendong Yang, Zhe Li, Mingqi Shao, Dachuan Shi, Zehuan Yuan, Chun Yuan code -1
Fine-grained Data Distribution Alignment for Post-Training Quantization Yunshan Zhong, Mingbao Lin, Mengzhao Chen, Ke Li, Yunhang Shen, Fei Chao, Yongjian Wu, Rongrong Ji code -1
Learning with Recoverable Forgetting Jingwen Ye, Yifang Fu, Jie Song, Xingyi Yang, Songhua Liu, Xin Jin, Mingli Song, Xinchao Wang code -1
Efficient One Pass Self-distillation with Zipf's Label Smoothing Jiajun Liang, Linze Li, Zhaodong Bing, Borui Zhao, Yao Tang, Bo Lin, Haoqiang Fan code -1
Prune Your Model Before Distill It Jinhyuk Park, Albert No code -1
Deep Partial Updating: Towards Communication Efficient Updating for On-Device Inference Zhongnan Qu, Cong Liu, Lothar Thiele code -1
Patch Similarity Aware Data-Free Quantization for Vision Transformers Zhikai Li, Liping Ma, Mengjuan Chen, Junrui Xiao, Qingyi Gu code -1
L3: Accelerator-Friendly Lossless Image Format for High-Resolution, High-Throughput DNN Training Jonghyun Bae, Woohyeon Baek, Tae Jun Ham, Jae W. Lee code -1
Streaming Multiscale Deep Equilibrium Models Can Ufuk Ertenli, Emre Akbas, Ramazan Gokberk Cinbis code -1
Symmetry Regularization and Saturating Nonlinearity for Robust Quantization Sein Park, Yeongsang Jang, Eunhyeok Park code -1
SP-Net: Slowly Progressing Dynamic Inference Networks Huanyu Wang, Wenhu Zhang, Shihao Su, Hui Wang, Zhenwei Miao, Xin Zhan, Xi Li code -1
Equivariance and Invariance Inductive Bias for Learning from Insufficient Data Tan Wang, Qianru Sun, Sugiri Pranata, Jayashree Karlekar, Hanwang Zhang code -1
Mixed-Precision Neural Network Quantization via Learned Layer-Wise Importance Chen Tang, Kai Ouyang, Zhi Wang, Yifei Zhu, Wen Ji, Yaowei Wang, Wenwu Zhu code -1
Event Neural Networks Matthew Dutson, Yin Li, Mohit Gupta code -1
EdgeViTs: Competing Light-Weight CNNs on Mobile Devices with Vision Transformers Junting Pan, Adrian Bulat, Fuwen Tan, Xiatian Zhu, Lukasz Dudziak, Hongsheng Li, Georgios Tzimiropoulos, Brais Martínez code -1
PalQuant: Accelerating High-Precision Networks on Low-Precision Accelerators Qinghao Hu, Gang Li, Qiman Wu, Jian Cheng code -1
Disentangled Differentiable Network Pruning Shangqian Gao, Feihu Huang, Yanfu Zhang, Heng Huang code -1
IDa-Det: An Information Discrepancy-Aware Distillation for 1-Bit Detectors Sheng Xu, Yanjing Li, Bohan Zeng, Teli Ma, Baochang Zhang, Xianbin Cao, Peng Gao, Jinhu Lü code -1
Learning to Weight Samples for Dynamic Early-Exiting Networks Yizeng Han, Yifan Pu, Zihang Lai, Chaofei Wang, Shiji Song, Junfen Cao, Wenhui Huang, Chao Deng, Gao Huang code -1
AdaBin: Improving Binary Neural Networks with Adaptive Binary Sets Zhijun Tu, Xinghao Chen, Pengju Ren, Yunhe Wang code -1
Adaptive Token Sampling for Efficient Vision Transformers Mohsen Fayyaz, Soroush Abbasi Koohpayegani, Farnoush Rezaei Jafari, Sunando Sengupta, Hamid Reza Vaezi Joze, Eric Sommerlade, Hamed Pirsiavash, Jürgen Gall code -1
Weight Fixing Networks Christopher SubiaWaud, Srinandan Dasmahapatra code -1
Self-slimmed Vision Transformer Zhuofan Zong, Kunchang Li, Guanglu Song, Yali Wang, Yu Qiao, Biao Leng, Yu Liu code -1
Switchable Online Knowledge Distillation Biao Qian, Yang Wang, Hongzhi Yin, Richang Hong, Meng Wang code -1
ℓ∞-Robustness and Beyond: Unleashing Efficient Adversarial Training Hadi M. Dolatabadi, Sarah M. Erfani, Christopher Leckie code -1
Multi-granularity Pruning for Model Acceleration on Mobile Devices Tianli Zhao, Xi Sheryl Zhang, Wentao Zhu, Jiaxing Wang, Sen Yang, Ji Liu, Jian Cheng code -1
Deep Ensemble Learning by Diverse Knowledge Distillation for Fine-Grained Object Classification Naoki Okamoto, Tsubasa Hirakawa, Takayoshi Yamashita, Hironobu Fujiyoshi code -1
Helpful or Harmful: Inter-task Association in Continual Learning Hyundong Jin, Eunwoo Kim code -1
Towards Accurate Binary Neural Networks via Modeling Contextual Dependencies Xingrun Xing, Yangguang Li, Wei Li, Wenrui Ding, Yalong Jiang, Yufeng Wang, Jing Shao, Chunlei Liu, Xianglong Liu code -1
SPIN: An Empirical Evaluation on Sharing Parameters of Isotropic Networks ChienYu Lin, Anish Prabhu, Thomas Merth, Sachin Mehta, Anurag Ranjan, Maxwell Horton, Mohammad Rastegari code -1
Ensemble Knowledge Guided Sub-network Search and Fine-Tuning for Filter Pruning Seunghyun Lee, Byung Cheol Song code -1
Network Binarization via Contrastive Learning Yuzhang Shang, Dan Xu, Ziliang Zong, Liqiang Nie, Yan Yan code -1
Lipschitz Continuity Retained Binary Neural Network Yuzhang Shang, Dan Xu, Bin Duan, Ziliang Zong, Liqiang Nie, Yan Yan code -1
SPViT: Enabling Faster Vision Transformers via Latency-Aware Soft Token Pruning Zhenglun Kong, Peiyan Dong, Xiaolong Ma, Xin Meng, Wei Niu, Mengshu Sun, Xuan Shen, Geng Yuan, Bin Ren, Hao Tang, Minghai Qin, Yanzhi Wang code -1
Soft Masking for Cost-Constrained Channel Pruning Ryan Humble, Maying Shen, Jorge Albericio Latorre, Eric Darve, Jose M. Alvarez code -1
Non-uniform Step Size Quantization for Accurate Post-training Quantization Sangyun Oh, Hyeonuk Sim, Jounghyun Kim, Jongeun Lee code -1
SuperTickets: Drawing Task-Agnostic Lottery Tickets from Supernets via Jointly Architecture Searching and Parameter Pruning Haoran You, Baopu Li, Zhanyi Sun, Xu Ouyang, Yingyan Lin code -1
Meta-GF: Training Dynamic-Depth Neural Networks Harmoniously Yi Sun, Jian Li, Xin Xu code -1
Towards Ultra Low Latency Spiking Neural Networks for Vision and Sequential Tasks Using Temporal Pruning Sayeed Shafayet Chowdhury, Nitin Rathi, Kaushik Roy code -1
Towards Accurate Network Quantization with Equivalent Smooth Regularizer Kirill Solodskikh, Vladimir Chikin, Ruslan Aydarkhanov, Dehua Song, Irina Zhelavskaya, Jiansheng Wei code -1
DFNet: Enhance Absolute Pose Regression with Direct Feature Matching Shuai Chen, Xinghui Li, Zirui Wang, Victor Adrian Prisacariu code -1
Cornerformer: Purifying Instances for Corner-Based Detectors Haoran Wei, Xin Chen, Lingxi Xie, Qi Tian code -1
PillarNet: Real-Time and High-Performance Pillar-Based 3D Object Detection Guangsheng Shi, Ruifeng Li, Chao Ma code -1
Robust Object Detection with Inaccurate Bounding Boxes Chengxin Liu, Kewei Wang, Hao Lu, Zhiguo Cao, Ziming Zhang code -1
Efficient Decoder-Free Object Detection with Transformers Peixian Chen, Mengdan Zhang, Yunhang Shen, Kekai Sheng, Yuting Gao, Xing Sun, Ke Li, Chunhua Shen code -1
Cross-Modality Knowledge Distillation Network for Monocular 3D Object Detection Yu Hong, Hang Dai, Yong Ding code -1
ReAct: Temporal Action Detection with Relational Queries Dingfeng Shi, Yujie Zhong, Qiong Cao, Jing Zhang, Lin Ma, Jia Li, Dacheng Tao code -1
Towards Accurate Active Camera Localization Qihang Fang, Yingda Yin, Qingnan Fan, Fei Xia, Siyan Dong, Sheng Wang, Jue Wang, Leonidas J. Guibas, Baoquan Chen code -1
Camera Pose Auto-encoders for Improving Pose Regression Yoli Shavit, Yosi Keller code -1
Improving the Intra-class Long-Tail in 3D Detection via Rare Example Mining Chiyu Max Jiang, Mahyar Najibi, Charles R. Qi, Yin Zhou, Dragomir Anguelov code -1
Bagging Regional Classification Activation Maps for Weakly Supervised Object Localization Lei Zhu, Qian Chen, Lujia Jin, Yunfei You, Yanye Lu code -1
UC-OWOD: Unknown-Classified Open World Object Detection Zhiheng Wu, Yue Lu, Xingyu Chen, Zhengxing Wu, Liwen Kang, Junzhi Yu code -1
RayTran: 3D Pose Estimation and Shape Reconstruction of Multiple Objects from Videos with Ray-Traced Transformers Michal J. Tyszkiewicz, KevisKokitsi Maninis, Stefan Popov, Vittorio Ferrari code -1
GTCaR: Graph Transformer for Camera Re-localization Xinyi Li, Haibin Ling code -1
3D Object Detection with a Self-supervised Lidar Scene Flow Backbone Emeç Erçelik, Ekim Yurtsever, Mingyu Liu, Zhijie Yang, Hanzhen Zhang, Pinar Topçam, Maximilian Listl, Yilmaz Kaan Çayli, Alois C. Knoll code -1
Open Vocabulary Object Detection with Pseudo Bounding-Box Labels Mingfei Gao, Chen Xing, Juan Carlos Niebles, Junnan Li, Ran Xu, Wenhao Liu, Caiming Xiong code -1
Few-Shot Object Detection by Knowledge Distillation Using Bag-of-Visual-Words Representations Wenjie Pei, Shuang Wu, Dianwen Mei, Fanglin Chen, Jiandong Tian, Guangming Lu code -1
SALISA: Saliency-Based Input Sampling for Efficient Video Object Detection Babak Ehteshami Bejnordi, Amirhossein Habibian, Fatih Porikli, Amir Ghodrati code -1
ECO-TR: Efficient Correspondences Finding via Coarse-to-Fine Refinement Dongli Tan, JiangJiang Liu, Xingyu Chen, Chao Chen, Ruixin Zhang, Yunhang Shen, Shouhong Ding, Rongrong Ji code -1
Vote from the Center: 6 DoF Pose Estimation in RGB-D Images by Radial Keypoint Voting Yangzheng Wu, Mohsen Zand, Ali Etemad, Michael A. Greenspan code -1
Long-Tailed Instance Segmentation Using Gumbel Optimized Loss Konstantinos Panagiotis Alexandridis, Jiankang Deng, Anh Nguyen, Shan Luo code -1
DetMatch: Two Teachers are Better than One for Joint 2D and 3D Semi-Supervised Object Detection Jinhyung Park, Chenfeng Xu, Yiyang Zhou, Masayoshi Tomizuka, Wei Zhan code -1
ObjectBox: From Centers to Boxes for Anchor-Free Object Detection Mohsen Zand, Ali Etemad, Michael A. Greenspan code -1
Is Geometry Enough for Matching in Visual Localization? Qunjie Zhou, Sérgio Agostinho, Aljosa Osep, Laura LealTaixé code -1
SWFormer: Sparse Window Transformer for 3D Object Detection in Point Clouds Pei Sun, Mingxing Tan, Weiyue Wang, Chenxi Liu, Fei Xia, Zhaoqi Leng, Dragomir Anguelov code -1
PCR-CG: Point Cloud Registration via Deep Explicit Color and Geometry Yu Zhang, Junle Yu, Xiaolin Huang, Wenhui Zhou, Ji Hou code -1
GLAMD: Global and Local Attention Mask Distillation for Object Detectors Younho Jang, Wheemyung Shin, Jinbeom Kim, Simon S. Woo, SungHo Bae code -1
FCAF3D: Fully Convolutional Anchor-Free 3D Object Detection Danila Rukhovich, Anna Vorontsova, Anton Konushin code -1
Video Anomaly Detection by Solving Decoupled Spatio-Temporal Jigsaw Puzzles Guodong Wang, Yunhong Wang, Jie Qin, Dongming Zhang, Xiuguo Bao, Di Huang code -1
Class-Agnostic Object Detection with Multi-modal Transformer Muhammad Maaz, Hanoona Abdul Rasheed, Salman Khan, Fahad Shahbaz Khan, Rao Muhammad Anwer, MingHsuan Yang code -1
Enhancing Multi-modal Features Using Local Self-attention for 3D Object Detection Hao Li, Zehan Zhang, Xian Zhao, Yulong Wang, Yuxi Shen, Shiliang Pu, Hui Mao code -1
Object Detection as Probabilistic Set Prediction Georg Hess, Christoffer Petersson, Lennart Svensson code -1
Weakly-Supervised Temporal Action Detection for Fine-Grained Videos with Hierarchical Atomic Actions Zhi Li, Lu He, Huijuan Xu code -1
Neural Correspondence Field for Object Pose Estimation Lin Huang, Tomas Hodan, Lingni Ma, Linguang Zhang, Luan Tran, Christopher D. Twigg, PoChen Wu, Junsong Yuan, Cem Keskin, Robert Wang code -1
On Label Granularity and Object Localization Elijah Cole, Kimberly Wilber, Grant Van Horn, Xuan Yang, Marco Fornoni, Pietro Perona, Serge J. Belongie, Andrew G. Howard, Oisin Mac Aodha code -1
OIMNet++: Prototypical Normalization and Localization-Aware Learning for Person Search Sanghoon Lee, Youngmin Oh, Donghyeon Baek, Junghyup Lee, Bumsub Ham code -1
Out-of-Distribution Identification: Let Detector Tell Which I Am Not Sure Ruoqi Li, Chongyang Zhang, Hao Zhou, Chao Shi, Yan Luo code -1
Learning with Free Object Segments for Long-Tailed Instance Segmentation Cheng Zhang, TaiYu Pan, Tianle Chen, Jike Zhong, Wenjin Fu, WeiLun Chao code -1
Autoregressive Uncertainty Modeling for 3D Bounding Box Prediction Yuxuan Liu, Nikhil Mishra, Maximilian Sieb, Yide Shentu, Pieter Abbeel, Xi Chen code -1
3D Random Occlusion and Multi-layer Projection for Deep Multi-camera Pedestrian Localization Rui Qiu, Ming Xu, Yuyao Yan, Jeremy S. Smith, Xi Yang code -1
A Simple Single-Scale Vision Transformer for Object Detection and Instance Segmentation Wuyang Chen, Xianzhi Du, Fan Yang, Lucas Beyer, Xiaohua Zhai, TsungYi Lin, Huizhong Chen, Jing Li, Xiaodan Song, Zhangyang Wang, Denny Zhou code -1
Simple Open-Vocabulary Object Detection Matthias Minderer, Alexey A. Gritsenko, Austin Stone, Maxim Neumann, Dirk Weissenborn, Alexey Dosovitskiy, Aravindh Mahendran, Anurag Arnab, Mostafa Dehghani, Zhuoran Shen, Xiao Wang, Xiaohua Zhai, Thomas Kipf, Neil Houlsby code -1
UnrealEgo: A New Dataset for Robust Egocentric 3D Human Motion Capture Hiroyasu Akada, Jian Wang, Soshi Shimada, Masaki Takahashi, Christian Theobalt, Vladislav Golyanik code -1
Skeleton-Parted Graph Scattering Networks for 3D Human Motion Prediction Maosen Li, Siheng Chen, Zijing Zhang, Lingxi Xie, Qi Tian, Ya Zhang code -1
Rethinking Keypoint Representations: Modeling Keypoints and Poses as Objects for Multi-person Human Pose Estimation William J. McNally, Kanav Vats, Alexander Wong, John McPhee code -1
VirtualPose: Learning Generalizable 3D Human Pose Models from Virtual Data Jiajun Su, Chunyu Wang, Xiaoxuan Ma, Wenjun Zeng, Yizhou Wang code -1
Poseur: Direct Human Pose Regression with Transformers Weian Mao, Yongtao Ge, Chunhua Shen, Zhi Tian, Xinlong Wang, Zhibin Wang, Anton van den Hengel code -1
SimCC: A Simple Coordinate Classification Perspective for Human Pose Estimation Yanjie Li, Sen Yang, Peidong Liu, Shoukui Zhang, Yunxiao Wang, Zhicheng Wang, Wankou Yang, ShuTao Xia code -1
Regularizing Vector Embedding in Bottom-Up Human Pose Estimation Haixin Wang, Lu Zhou, Yingying Chen, Ming Tang, Jinqiao Wang code -1
A Visual Navigation Perspective for Category-Level Object Pose Estimation Jiaxin Guo, Fangxun Zhong, Rong Xiong, Yunhui Liu, Yue Wang, Yiyi Liao code -1
Faster VoxelPose: Real-time 3D Human Pose Estimation by Orthographic Projection Hang Ye, Wentao Zhu, Chunyu Wang, Rujie Wu, Yizhou Wang code -1
Learning to Fit Morphable Models Vasileios Choutas, Federica Bogo, Jingjing Shen, Julien P. C. Valentin code -1
EgoBody: Human Body Shape and Motion of Interacting People from Head-Mounted Devices Siwei Zhang, Qianli Ma, Yan Zhang, Zhiyin Qian, Taein Kwon, Marc Pollefeys, Federica Bogo, Siyu Tang code -1
Grasp'D: Differentiable Contact-Rich Grasp Synthesis for Multi-Fingered Hands Dylan Turpin, Liquan Wang, Eric Heiden, YunChun Chen, Miles Macklin, Stavros Tsogkas, Sven J. Dickinson, Animesh Garg code -1
AutoAvatar: Autoregressive Neural Fields for Dynamic Avatar Modeling Ziqian Bai, Timur M. Bagautdinov, Javier Romero, Michael Zollhöfer, Ping Tan, Shunsuke Saito code -1
Deep Radial Embedding for Visual Sequence Learning Yuecong Min, Peiqi Jiao, Yanan Li, Xiaotao Wang, Lei Lei, Xiujuan Chai, Xilin Chen code -1
SAGA: Stochastic Whole-Body Grasping with Contact Yan Wu, Jiahao Wang, Yan Zhang, Siwei Zhang, Otmar Hilliges, Fisher Yu, Siyu Tang code -1
Neural Capture of Animatable 3D Human from Monocular Video Gusi Te, Xiu Li, Xiao Li, Jinglu Wang, Wei Hu, Yan Lu code -1
General Object Pose Transformation Network from Unpaired Data Yukun Su, Guosheng Lin, Ruizhou Sun, Qingyao Wu code -1
Compositional Human-Scene Interaction Synthesis with Semantic Control Kaifeng Zhao, Shaofei Wang, Yan Zhang, Thabo Beeler, Siyu Tang code -1
PressureVision: Estimating Hand Pressure from a Single RGB Image Patrick Grady, Chengcheng Tang, Samarth Brahmbhatt, Christopher D. Twigg, Chengde Wan, James Hays, Charles C. Kemp code -1
PoseScript: 3D Human Poses from Natural Language Ginger Delmas, Philippe Weinzaepfel, Thomas Lucas, Francesc MorenoNoguer, Grégory Rogez code -1
DProST: Dynamic Projective Spatial Transformer Network for 6D Pose Estimation Jaewoo Park, Nam Ik Cho code -1
3D Interacting Hand Pose Estimation by Hand De-occlusion and Removal Hao Meng, Sheng Jin, Wentao Liu, Chen Qian, Mengxiang Lin, Wanli Ouyang, Ping Luo code -1
Pose for Everything: Towards Category-Agnostic Pose Estimation Lumin Xu, Sheng Jin, Wang Zeng, Wentao Liu, Chen Qian, Wanli Ouyang, Ping Luo, Xiaogang Wang code -1
PoseGPT: Quantization-Based 3D Human Motion Generation and Forecasting Thomas Lucas, Fabien Baradel, Philippe Weinzaepfel, Grégory Rogez code -1
DH-AUG: DH Forward Kinematics Model Driven Augmentation for 3D Human Pose Estimation Linzhi Huang, Jiahao Liang, Weihong Deng code -1
Estimating Spatially-Varying Lighting in Urban Scenes with Disentangled Representation Jiajun Tang, Yongjie Zhu, Haoyu Wang, Jun Hoong Chan, Si Li, Boxin Shi code -1
Boosting Event Stream Super-Resolution with a Recurrent Neural Network Wenming Weng, Yueyi Zhang, Zhiwei Xiong code -1
Projective Parallel Single-Pixel Imaging to Overcome Global Illumination in 3D Structure Light Scanning Yuxi Li, Huijie Zhao, Hongzhi Jiang, Xudong Li code -1
Semantic-Sparse Colorization Network for Deep Exemplar-Based Colorization Yunpeng Bai, Chao Dong, Zenghao Chai, Andong Wang, Zhengzhuo Xu, Chun Yuan code -1
Practical and Scalable Desktop-Based High-Quality Facial Capture Alexandros Lattas, Yiming Lin, Jayanth Kannan, Ekin Ozturk, Luca Filipi, Giuseppe Claudio Guarnera, Gaurav Chawla, Abhijeet Ghosh code -1
FAST-VQA: Efficient End-to-End Video Quality Assessment with Fragment Sampling Haoning Wu, Chaofeng Chen, Jingwen Hou, Liang Liao, Annan Wang, Wenxiu Sun, Qiong Yan, Weisi Lin code -1
Physically-Based Editing of Indoor Scene Lighting from a Single Image Zhengqin Li, Jia Shi, Sai Bi, Rui Zhu, Kalyan Sunkavalli, Milos Hasan, Zexiang Xu, Ravi Ramamoorthi, Manmohan Chandraker code -1
LEDNet: Joint Low-Light Enhancement and Deblurring in the Dark Shangchen Zhou, Chongyi Li, Chen Change Loy code -1
MPIB: An MPI-Based Bokeh Rendering Framework for Realistic Partial Occlusion Effects Juewen Peng, Jianming Zhang, Xianrui Luo, Hao Lu, Ke Xian, Zhiguo Cao code -1
Real-RawVSR: Real-World Raw Video Super-Resolution with a Benchmark Dataset Huanjing Yue, Zhiming Zhang, JingYu Yang code -1
Transform Your Smartphone into a DSLR Camera: Learning the ISP in the Wild Ardhendu Shekhar Tripathi, Martin Danelljan, Samarth Shukla, Radu Timofte, Luc Van Gool code -1
Learning Deep Non-blind Image Deconvolution Without Ground Truths Yuhui Quan, Zhuojie Chen, Huan Zheng, Hui Ji code -1
NEST: Neural Event Stack for Event-Based Image Enhancement Minggui Teng, Chu Zhou, Hanyue Lou, Boxin Shi code -1
Editable Indoor Lighting Estimation Henrique Weber, Mathieu Garon, JeanFrançois Lalonde code -1
Fast Two-Step Blind Optical Aberration Correction Thomas Eboli, JeanMichel Morel, Gabriele Facciolo code -1
Seeing Far in the Dark with Patterned Flash Zhanghao Sun, Jian Wang, Yicheng Wu, Shree Nayar code -1
PseudoClick: Interactive Image Segmentation with Click Imitation Qin Liu, Meng Zheng, Benjamin Planche, Srikrishna Karanam, Terrence Chen, Marc Niethammer, Ziyan Wu code -1
WaveGAN: Frequency-Aware GAN for High-Fidelity Few-Shot Image Generation Mengping Yang, Zhe Wang, Ziqiu Chi, Wenyi Feng code -1
End-to-End Visual Editing with a Generatively Pre-trained Artist Andrew Brown, ChengYang Fu, Omkar M. Parkhi, Tamara L. Berg, Andrea Vedaldi code -1
High-Fidelity GAN Inversion with Padding Space Qingyan Bai, Yinghao Xu, Jiapeng Zhu, Weihao Xia, Yujiu Yang, Yujun Shen code -1
Designing One Unified Framework for High-Fidelity Face Reenactment and Swapping Chao Xu, Jiangning Zhang, Yue Han, Guanzhong Tian, Xianfang Zeng, Ying Tai, Yabiao Wang, Chengjie Wang, Yong Liu code -1
Sobolev Training for Implicit Neural Representations with Approximated Image Derivatives Wentao Yuan, Qingtian Zhu, Xiangyue Liu, Yikang Ding, Haotian Zhang, Chi Zhang code -1
Make-A-Scene: Scene-Based Text-to-Image Generation with Human Priors Oran Gafni, Adam Polyak, Oron Ashual, Shelly Sheynin, Devi Parikh, Yaniv Taigman code -1
3D-FM GAN: Towards 3D-Controllable Face Manipulation Yuchen Liu, Zhixin Shu, Yijun Li, Zhe Lin, Richard Zhang, SunYuan Kung code -1
Multi-Curve Translator for High-Resolution Photorealistic Image Translation Yuda Song, Hui Qian, Xin Du code -1
Deep Bayesian Video Frame Interpolation Zhiyang Yu, Yu Zhang, Xujie Xiang, Dongqing Zou, Xijun Chen, Jimmy S. Ren code -1
Cross Attention Based Style Distribution for Controllable Person Image Synthesis Xinyue Zhou, Mingyu Yin, Xinyuan Chen, Li Sun, Changxin Gao, Qingli Li code -1
KeypointNeRF: Generalizing Image-Based Volumetric Avatars Using Relative Spatial Encoding of Keypoints Marko Mihajlovic, Aayush Bansal, Michael Zollhöfer, Siyu Tang, Shunsuke Saito code -1
ViewFormer: NeRF-Free Neural Rendering from Few Images Using Transformers Jonás Kulhánek, Erik Derner, Torsten Sattler, Robert Babuska code -1
L-Tracing: Fast Light Visibility Estimation on Neural Surfaces by Sphere Tracing Ziyu Chen, Chenjing Ding, Jianfei Guo, Dongliang Wang, Yikang Li, Xuan Xiao, Wei Wu, Li Song code -1
A Perceptual Quality Metric for Video Frame Interpolation Qiqi Hou, Abhijay Ghildyal, Feng Liu code -1
Adaptive Feature Interpolation for Low-Shot Image Generation Mengyu Dai, Haibin Hang, Xiaoyang Guo code -1
PalGAN: Image Colorization with Palette Generative Adversarial Networks Yi Wang, Menghan Xia, Lu Qi, Jing Shao, Yu Qiao code -1
Fast-Vid2Vid: Spatial-Temporal Compression for Video-to-Video Synthesis Long Zhuo, Guangcong Wang, Shikai Li, Wayne Wu, Ziwei Liu code -1
Learning Prior Feature and Attention Enhanced Image Inpainting Chenjie Cao, Qiaole Dong, Yanwei Fu code -1
Temporal-MPI: Enabling Multi-plane Images for Dynamic Scene Modelling via Temporal Basis Learning Wenpeng Xing, Jie Chen code -1
3D-Aware Semantic-Guided Generative Model for Human Synthesis Jichao Zhang, Enver Sangineto, Hao Tang, Aliaksandr Siarohin, Zhun Zhong, Nicu Sebe, Wei Wang code -1
Temporally Consistent Semantic Video Editing Yiran Xu, Badour AlBahar, JiaBin Huang code -1
Error Compensation Framework for Flow-Guided Video Inpainting Jaeyeon Kang, Seoung Wug Oh, Seon Joo Kim code -1
Scraping Textures from Natural Images for Synthesis and Editing Xueting Li, Xiaolong Wang, MingHsuan Yang, Alexei A. Efros, Sifei Liu code -1
Single Stage Virtual Try-On Via Deformable Attention Flows Shuai Bai, Huiling Zhou, Zhikang Li, Chang Zhou, Hongxia Yang code -1
Improving GANs for Long-Tailed Data Through Group Spectral Regularization Harsh Rangwani, Naman Jaswani, Tejan Karmali, Varun Jampani, R. Venkatesh Babu code -1
Hierarchical Semantic Regularization of Latent Spaces in StyleGANs Tejan Karmali, Rishubh Parihar, Susmit Agrawal, Harsh Rangwani, Varun Jampani, Maneesh Kumar Singh, R. Venkatesh Babu code -1
IntereStyle: Encoding an Interest Region for Robust StyleGAN Inversion Seung Jun Moon, GyeongMoon Park code -1
StyleLight: HDR Panorama Generation for Lighting Estimation and Editing Guangcong Wang, Yinuo Yang, Chen Change Loy, Ziwei Liu code -1
Contrastive Monotonic Pixel-Level Modulation Kun Lu, Rongpeng Li, Honggang Zhang code -1
Learning Cross-Video Neural Representations for High-Quality Frame Interpolation Wentao Shangguan, Yu Sun, Weijie Gan, Ulugbek S. Kamilov code -1
Learning Continuous Implicit Representation for Near-Periodic Patterns Bowei Chen, Tiancheng Zhi, Martial Hebert, Srinivasa G. Narasimhan code -1
End-to-End Graph-Constrained Vectorized Floorplan Generation with Panoptic Refinement Jiachen Liu, Yuan Xue, José Pinto Duarte, Krishnendra Shekhawat, Zihan Zhou, Xiaolei Huang code -1
Few-Shot Image Generation with Mixup-Based Distance Learning Chaerin Kong, Jeesoo Kim, Donghoon Han, Nojun Kwak code -1
A Style-Based GAN Encoder for High Fidelity Reconstruction of Images and Videos Xu Yao, Alasdair Newson, Yann Gousseau, Pierre Hellier code -1
FakeCLR: Exploring Contrastive Learning for Solving Latent Discontinuity in Data-Efficient GANs Ziqiang Li, Chaoyue Wang, Heliang Zheng, Jing Zhang, Bin Li code -1
BlobGAN: Spatially Disentangled Scene Representations Dave Epstein, Taesung Park, Richard Zhang, Eli Shechtman, Alexei A. Efros code -1
Unified Implicit Neural Stylization Zhiwen Fan, Yifan Jiang, Peihao Wang, Xinyu Gong, Dejia Xu, Zhangyang Wang code -1
GAN with Multivariate Disentangling for Controllable Hair Editing Xuyang Guo, Meina Kan, Tianle Chen, Shiguang Shan code -1
Discovering Transferable Forensic Features for CNN-Generated Images Detection Keshigeyan Chandrasegaran, NgocTrung Tran, Alexander Binder, NgaiMan Cheung code -1
Harmonizer: Learning to Perform White-Box Image and Video Harmonization Zhanghan Ke, Chunyi Sun, Lei Zhu, Ke Xu, Rynson W. H. Lau code -1
Text2LIVE: Text-Driven Layered Image and Video Editing Omer BarTal, Dolev OfriAmar, Rafail Fridman, Yoni Kasten, Tali Dekel code -1
Digging into Radiance Grid for Real-Time View Synthesis with Detail Preservation Jian Zhang, Jinchi Huang, Bowen Cai, Huan Fu, Mingming Gong, Chaohui Wang, Jiaming Wang, Hongchen Luo, Rongfei Jia, Binqiang Zhao, Xing Tang code -1
StyleGAN-Human: A Data-Centric Odyssey of Human Generation Jianglin Fu, Shikai Li, Yuming Jiang, KwanYee Lin, Chen Qian, Chen Change Loy, Wayne Wu, Ziwei Liu code -1
ColorFormer: Image Colorization via Color Memory Assisted Hybrid-Attention Transformer Xiaozhong Ji, Boyuan Jiang, Donghao Luo, Guangpin Tao, Wenqing Chu, Zhifeng Xie, Chengjie Wang, Ying Tai code -1
EAGAN: Efficient Two-Stage Evolutionary Architecture Search for GANs Guohao Ying, Xin He, Bin Gao, Bo Han, Xiaowen Chu code -1
Weakly-Supervised Stitching Network for Real-World Panoramic Image Generation DaeYoung Song, Geonsoo Lee, Heekyung Lee, GiMun Um, Donghyeon Cho code -1
DynaST: Dynamic Sparse Transformer for Exemplar-Guided Image Generation Songhua Liu, Jingwen Ye, Sucheng Ren, Xinchao Wang code -1
Multimodal Conditional Image Synthesis with Product-of-Experts GANs Xun Huang, Arun Mallya, TingChun Wang, MingYu Liu code -1
Auto-regressive Image Synthesis with Integrated Quantization Fangneng Zhan, Yingchen Yu, Rongliang Wu, Jiahui Zhang, Kaiwen Cui, Changgong Zhang, Shijian Lu code -1
JoJoGAN: One Shot Face Stylization Min Jin Chong, David A. Forsyth code -1
VecGAN: Image-to-Image Translation with Interpretable Latent Directions Yusuf Dalva, Said Fahri Altindis, Aysegul Dundar code -1
Any-Resolution Training for High-Resolution Image Synthesis Lucy Chai, Michaël Gharbi, Eli Shechtman, Phillip Isola, Richard Zhang code -1
CCPL: Contrastive Coherence Preserving Loss for Versatile Style Transfer Zijie Wu, Zhen Zhu, Junping Du, Xiang Bai code -1
CANF-VC: Conditional Augmented Normalizing Flows for Video Compression YungHan Ho, ChihPeng Chang, PengYu Chen, Alessandro Gnutti, WenHsiao Peng code -1
Bi-level Feature Alignment for Versatile Image Translation and Manipulation Fangneng Zhan, Yingchen Yu, Rongliang Wu, Jiahui Zhang, Kaiwen Cui, Aoran Xiao, Shijian Lu, Chunyan Miao code -1
High-Fidelity Image Inpainting with GAN Inversion Yongsheng Yu, Libo Zhang, Heng Fan, Tiejian Luo code -1
DeltaGAN: Towards Diverse Few-Shot Image Generation with Sample-Specific Delta Yan Hong, Li Niu, Jianfu Zhang, Liqing Zhang code -1
Image Inpainting with Cascaded Modulation GAN and Object-Aware Training Haitian Zheng, Zhe Lin, Jingwan Lu, Scott Cohen, Eli Shechtman, Connelly Barnes, Jianming Zhang, Ning Xu, Sohrab Amirghodsi, Jiebo Luo code -1
StyleFace: Towards Identity-Disentangled Face Generation on Megapixels Yuchen Luo, Junwei Zhu, Keke He, Wenqing Chu, Ying Tai, Chengjie Wang, Junchi Yan code -1
Video Extrapolation in Space and Time Yunzhi Zhang, Jiajun Wu code -1
Contrastive Learning for Diverse Disentangled Foreground Generation Yuheng Li, Yijun Li, Jingwan Lu, Eli Shechtman, Yong Jae Lee, Krishna Kumar Singh code -1
BIPS: Bi-modal Indoor Panorama Synthesis via Residual Depth-Aided Adversarial Learning Changgyoon Oh, Wonjune Cho, Yujeong Chae, Daehee Park, Lin Wang, KukJin Yoon code -1
Augmentation of rPPG Benchmark Datasets: Learning to Remove and Embed rPPG Signals via Double Cycle Consistent Learning from Unpaired Facial Videos ChengJu Hsieh, WeiHao Chung, ChiouTing Hsu code -1
Geometry-Aware Single-Image Full-Body Human Relighting Chaonan Ji, Tao Yu, Kaiwen Guo, Jingxin Liu, Yebin Liu code -1
3D-Aware Indoor Scene Synthesis with Depth Priors Zifan Shi, Yujun Shen, Jiapeng Zhu, DitYan Yeung, Qifeng Chen code -1
Deep Portrait Delighting Joshua Weir, Junhong Zhao, Andrew Chalmers, Taehyun Rhee code -1
Vector Quantized Image-to-Image Translation YuJie Chen, ShinI Cheng, WeiChen Chiu, HungYu Tseng, HsinYing Lee code -1
The Surprisingly Straightforward Scene Text Removal Method with Gated Attention and Region of Interest Generation: A Comprehensive Prominent Model Analysis Hyeonsu Lee, Chankyu Choi code -1
Free-Viewpoint RGB-D Human Performance Capture and Rendering Phong NguyenHa, Nikolaos Sarafianos, Christoph Lassner, Janne Heikkilä, Tony Tung code -1
Multiview Regenerative Morphing with Dual Flows ChihJung Tsai, Cheng Sun, HwannTzong Chen code -1
Hallucinating Pose-Compatible Scenes Tim Brooks, Alexei A. Efros code -1
Motion and Appearance Adaptation for Cross-domain Motion Transfer Borun Xu, Biao Wang, Jinhong Deng, Jiale Tao, Tiezheng Ge, Yuning Jiang, Wen Li, Lixin Duan code -1
Layered Controllable Video Generation Jiahui Huang, Yuhe Jin, Kwang Moo Yi, Leonid Sigal code -1
Custom Structure Preservation in Face Aging Guillermo GomezTrenado, Stéphane Lathuilière, Pablo Mesejo, Oscar Cordón code -1
Spatio-Temporal Deformable Attention Network for Video Deblurring Huicong Zhang, Haozhe Xie, Hongxun Yao code -1
NeuMesh: Learning Disentangled Neural Mesh-Based Implicit Field for Geometry and Texture Editing Bangbang Yang, Chong Bao, Junyi Zeng, Hujun Bao, Yinda Zhang, Zhaopeng Cui, Guofeng Zhang code -1
NeRF for Outdoor Scene Relighting Viktor Rudnev, Mohamed Elgharib, William A. P. Smith, Lingjie Liu, Vladislav Golyanik, Christian Theobalt code -1
CoGS: Controllable Generation and Search from Sketch and Style Cusuh Ham, Gemma Canet Tarres, Tu Bui, James Hays, Zhe Lin, John P. Collomosse code -1
HairNet: Hairstyle Transfer with Pose Changes Peihao Zhu, Rameen Abdal, John Femiani, Peter Wonka code -1
Unbiased Multi-modality Guidance for Image Inpainting Yongsheng Yu, Dawei Du, Libo Zhang, Tiejian Luo code -1
Intelli-Paint: Towards Developing More Human-Intelligible Painting Agents Jaskirat Singh, Cameron Smith, Jose Echevarria, Liang Zheng code -1
Motion Transformer for Unsupervised Image Animation Jiale Tao, Biao Wang, Tiezheng Ge, Yuning Jiang, Wen Li, Lixin Duan code -1
NÜWA: Visual Synthesis Pre-training for Neural visUal World creAtion Chenfei Wu, Jian Liang, Lei Ji, Fan Yang, Yuejian Fang, Daxin Jiang, Nan Duan code -1
EleGANt: Exquisite and Locally Editable GAN for Makeup Transfer Chenyu Yang, Wanrong He, Yingqing Xu, Yang Gao code -1
Dynamic Dual Trainable Bounds for Ultra-low Precision Super-Resolution Networks Yunshan Zhong, Mingbao Lin, Xunchao Li, Ke Li, Yunhang Shen, Fei Chao, Yongjian Wu, Rongrong Ji code -1
OSFormer: One-Stage Camouflaged Instance Segmentation with Transformers Jialun Pei, Tianyang Cheng, DengPing Fan, He Tang, Chuanbo Chen, Luc Van Gool code -1
Highly Accurate Dichotomous Image Segmentation Xuebin Qin, Hang Dai, Xiaobin Hu, DengPing Fan, Ling Shao, Luc Van Gool code -1
Boosting Supervised Dehazing Methods via Bi-level Patch Reweighting Xingyu Jiang, Hongkun Dou, Chengwei Fu, Bingquan Dai, Tianrun Xu, Yue Deng code -1
Flow-Guided Transformer for Video Inpainting Kaidong Zhang, Jingjing Fu, Dong Liu code -1
Shift-Tolerant Perceptual Similarity Metric Abhijay Ghildyal, Feng Liu code -1
Perception-Distortion Balanced ADMM Optimization for Single-Image Super-Resolution Yuehan Zhang, Bo Ji, Jia Hao, Angela Yao code -1
VQFR: Blind Face Restoration with Vector-Quantized Dictionary and Parallel Decoder Yuchao Gu, Xintao Wang, Liangbin Xie, Chao Dong, Gen Li, Ying Shan, MingMing Cheng code -1
Uncertainty Learning in Kernel Estimation for Multi-stage Blind Image Super-Resolution Zhenxuan Fang, Weisheng Dong, Xin Li, Jinjian Wu, Leida Li, Guangming Shi code -1
Learning Spatio-Temporal Downsampling for Effective Video Upscaling Xiaoyu Xiang, Yapeng Tian, Vijay Rengarajan, Lucas D. Young, Bo Zhu, Rakesh Ranjan code -1
Learning Local Implicit Fourier Representation for Image Warping Jaewon Lee, Kwang Pyo Choi, Kyong Hwan Jin code -1
SepLUT: Separable Image-Adaptive Lookup Tables for Real-Time Image Enhancement Canqian Yang, Meiguang Jin, Yi Xu, Rui Zhang, Ying Chen, Huaida Liu code -1
Blind Image Decomposition Junlin Han, Weihao Li, Pengfei Fang, Chunyi Sun, Jie Hong, Mohammad Ali Armin, Lars Petersson, Hongdong Li code -1
MuLUT: Cooperating Multiple Look-Up Tables for Efficient Image Super-Resolution Jiacheng Li, Chang Chen, Zhen Cheng, Zhiwei Xiong code -1
Learning Spatiotemporal Frequency-Transformer for Compressed Video Super-Resolution Zhongwei Qiu, Huan Yang, Jianlong Fu, Dongmei Fu code -1
Spatial-Frequency Domain Information Integration for Pan-Sharpening Man Zhou, Jie Huang, Keyu Yan, Hu Yu, Xueyang Fu, Aiping Liu, Xian Wei, Feng Zhao code -1
Adaptive Patch Exiting for Scalable Single Image Super-Resolution Shizun Wang, Jiaming Liu, Kaixin Chen, Xiaoqi Li, Ming Lu, Yandong Guo code -1
Efficient Meta-Tuning for Content-Aware Neural Video Delivery Xiaoqi Li, Jiaming Liu, Shizun Wang, Cheng Lyu, Ming Lu, Yurong Chen, Anbang Yao, Yandong Guo, Shanghang Zhang code -1
Reference-Based Image Super-Resolution with Deformable Attention Transformer Jiezhang Cao, Jingyun Liang, Kai Zhang, Yawei Li, Yulun Zhang, Wenguan Wang, Luc Van Gool code -1
Local Color Distributions Prior for Image Enhancement Haoyuan Wang, Ke Xu, Rynson W. H. Lau code -1
L-CoDer: Language-Based Colorization with Color-Object Decoupling Transformer Zheng Chang, Shuchen Weng, Yu Li, Si Li, Boxin Shi code -1
From Face to Natural Image: Learning Real Degradation for Blind Image Super-Resolution Xiaoming Li, Chaofeng Chen, Xianhui Lin, Wangmeng Zuo, Lei Zhang code -1
Towards Interpretable Video Super-Resolution via Alternating Optimization Jiezhang Cao, Jingyun Liang, Kai Zhang, Wenguan Wang, Qin Wang, Yulun Zhang, Hao Tang, Luc Van Gool code -1
Event-Based Fusion for Motion Deblurring with Cross-modal Attention Lei Sun, Christos Sakaridis, Jingyun Liang, Qi Jiang, Kailun Yang, Peng Sun, Yaozu Ye, Kaiwei Wang, Luc Van Gool code -1
Fast and High Quality Image Denoising via Malleable Convolution Yifan Jiang, Bartlomiej Wronski, Ben Mildenhall, Jonathan T. Barron, Zhangyang Wang, Tianfan Xue code -1
TAPE: Task-Agnostic Prior Embedding for Image Restoration Lin Liu, Lingxi Xie, Xiaopeng Zhang, Shanxin Yuan, Xiangyu Chen, Wengang Zhou, Houqiang Li, Qi Tian code -1
Uncertainty Inspired Underwater Image Enhancement Zhenqi Fu, Wu Wang, Yue Huang, Xinghao Ding, KaiKuang Ma code -1
Hourglass Attention Network for Image Inpainting Ye Deng, Siqi Hui, Rongye Meng, Sanping Zhou, Jinjun Wang code -1
Unfolded Deep Kernel Estimation for Blind Image Super-Resolution Hongyi Zheng, Hongwei Yong, Lei Zhang code -1
Event-guided Deblurring of Unknown Exposure Time Videos Taewoo Kim, Jeongmin Lee, Lin Wang, KukJin Yoon code -1
ReCoNet: Recurrent Correction Network for Fast and Efficient Multi-modality Image Fusion Zhanbo Huang, Jinyuan Liu, Xin Fan, Risheng Liu, Wei Zhong, Zhongxuan Luo code -1
Content Adaptive Latents and Decoder for Neural Image Compression Guanbo Pan, Guo Lu, Zhihao Hu, Dong Xu code -1
Efficient and Degradation-Adaptive Network for Real-World Image Super-Resolution Jie Liang, Hui Zeng, Lei Zhang code -1
Unidirectional Video Denoising by Mimicking Backward Recurrent Modules with Look-Ahead Forward Ones Junyi Li, Xiaohe Wu, Zhenxing Niu, Wangmeng Zuo code -1
Self-supervised Learning for Real-World Super-Resolution from Dual Zoomed Observations Zhilu Zhang, Ruohao Wang, Hongzhi Zhang, Yunjin Chen, Wangmeng Zuo code -1
Secrets of Event-Based Optical Flow Shintaro Shiba, Yoshimitsu Aoki, Guillermo Gallego code -1
Towards Efficient and Scale-Robust Ultra-High-Definition Image Demoiréing Xin Yu, Peng Dai, Wenbo Li, Lan Ma, Jiajun Shen, Jia Li, Xiaojuan Qi code -1
ERDN: Equivalent Receptive Field Deformable Network for Video Deblurring Bangrui Jiang, Zhihuai Xie, Zhen Xia, Songnan Li, Shan Liu code -1
Rethinking Generic Camera Models for Deep Single Image Camera Calibration to Recover Rotation and Fisheye Distortion Nobuhiko Wakai, Satoshi Sato, Yasunori Ishii, Takayoshi Yamashita code -1
ART-SS: An Adaptive Rejection Technique for Semi-supervised Restoration for Adverse Weather-Affected Images Rajeev Yasarla, Carey E. Priebe, Vishal M. Patel code -1
Fusion from Decomposition: A Self-Supervised Decomposition Approach for Image Fusion Pengwei Liang, Junjun Jiang, Xianming Liu, Jiayi Ma code -1
Learning Degradation Representations for Image Deblurring Dasong Li, Yi Zhang, Ka Chun Cheung, Xiaogang Wang, Hongwei Qin, Hongsheng Li code -1
Watermark Vaccine: Adversarial Attacks to Prevent Watermark Removal Xinwei Liu, Jian Liu, Yang Bai, Jindong Gu, Tao Chen, Xiaojun Jia, Xiaochun Cao code -1
Explaining Deepfake Detection by Analysing Image Matching Shichao Dong, Jin Wang, Jiajun Liang, Haoqiang Fan, Renhe Ji code -1
FrequencyLowCut Pooling - Plug and Play Against Catastrophic Overfitting Julia Grabinski, Steffen Jung, Janis Keuper, Margret Keuper code -1
TAFIM: Targeted Adversarial Attacks Against Facial Image Manipulations Shivangi Aneja, Lev Markhasin, Matthias Nießner code -1
FingerprintNet: Synthesized Fingerprints for Generated Image Detection Yonghyun Jeong, Doyeon Kim, Youngmin Ro, Pyounggeon Kim, Jongwon Choi code -1
Detecting Generated Images by Real Images Bo Liu, Fan Yang, Xiuli Bi, Bin Xiao, Weisheng Li, Xinbo Gao code -1
An Information Theoretic Approach for Attention-Driven Face Forgery Detection Ke Sun, Hong Liu, Taiping Yao, Xiaoshuai Sun, Shen Chen, Shouhong Ding, Rongrong Ji code -1
Exploring Disentangled Content Information for Face Forgery Detection Jiahao Liang, Huafeng Shi, Weihong Deng code -1
RepMix: Representation Mixing for Robust Attribution of Synthesized Images Tu Bui, Ning Yu, John P. Collomosse code -1
Totems: Physical Objects for Verifying Visual Integrity Jingwei Ma, Lucy Chai, Minyoung Huh, Tongzhou Wang, SerNam Lim, Phillip Isola, Antonio Torralba code -1
Dual-Stream Knowledge-Preserving Hashing for Unsupervised Video Retrieval Pandeng Li, Hongtao Xie, Jiannan Ge, Lei Zhang, Shaobo Min, Yongdong Zhang code -1
PASS: Part-Aware Self-Supervised Pre-Training for Person Re-Identification Kuan Zhu, Haiyun Guo, Tianyi Yan, Yousong Zhu, Jinqiao Wang, Ming Tang code -1
Adaptive Cross-domain Learning for Generalizable Person Re-identification Pengyi Zhang, Huanzhang Dou, Yunlong Yu, Xi Li code -1
Multi-query Video Retrieval Zeyu Wang, Yu Wu, Karthik Narasimhan, Olga Russakovsky code -1
Hierarchical Average Precision Training for Pertinent Image Retrieval Elias Ramzi, Nicolas Audebert, Nicolas Thome, Clément Rambour, Xavier Bitot code -1
Learning Semantic Correspondence with Sparse Annotations Shuaiyi Huang, Luyu Yang, Bo He, Songyang Zhang, Xuming He, Abhinav Shrivastava code -1
Dynamically Transformed Instance Normalization Network for Generalizable Person Re-Identification Bingliang Jiao, Lingqiao Liu, Liying Gao, Guosheng Lin, Lu Yang, Shizhou Zhang, Peng Wang, Yanning Zhang code -1
Domain Adaptive Person Search Junjie Li, Yichao Yan, Guanshuo Wang, Fufu Yu, Qiong Jia, Shouhong Ding code -1
TS2-Net: Token Shift and Selection Transformer for Text-Video Retrieval Yuqi Liu, Pengfei Xiong, Luhui Xu, Shengming Cao, Qin Jin code -1
Unstructured Feature Decoupling for Vehicle Re-identification Wen Qian, Hao Luo, Silong Peng, Fan Wang, Chen Chen, Hao Li code -1
Deep Hash Distillation for Image Retrieval Young Kyun Jang, Geonmo Gu, ByungSoo Ko, Isaac Kang, Nam Ik Cho code -1
Mimic Embedding via Adaptive Aggregation: Learning Generalizable Person Re-identification Boqiang Xu, Jian Liang, Lingxiao He, Zhenan Sun code -1
Granularity-Aware Adaptation for Image Retrieval Over Multiple Tasks Jon Almazán, ByungSoo Ko, Geonmo Gu, Diane Larlus, Yannis Kalantidis code -1
Learning Audio-Video Modalities from Image Captions Arsha Nagrani, Paul Hongsuck Seo, Bryan Seybold, Anja Hauth, Santiago Manen, Chen Sun, Cordelia Schmid code -1
RVSL: Robust Vehicle Similarity Learning in Real Hazy Scenes Based on Semi-supervised Learning WeiTing Chen, IHsiang Chen, ChihYuan Yeh, HaoHsiang Yang, HuaEn Chang, JianJiun Ding, SyYen Kuo code -1
Lightweight Attentional Feature Fusion: A New Baseline for Text-to-Video Retrieval Fan Hu, Aozhu Chen, Ziyue Wang, Fangming Zhou, Jianfeng Dong, Xirong Li code -1
Modality Synergy Complement Learning with Cascaded Aggregation for Visible-Infrared Person Re-Identification Yiyuan Zhang, Sanyuan Zhao, Yuhao Kang, Jianbing Shen code -1
Cross-Modality Transformer for Visible-Infrared Person Re-Identification Kongzhu Jiang, Tianzhu Zhang, Xiang Liu, Bingqiao Qian, Yongdong Zhang, Feng Wu code -1
Audio-Visual Mismatch-Aware Video Retrieval via Association and Adjustment Sangmin Lee, Sungjune Park, Yong Man Ro code -1
Connecting Compression Spaces with Transformer for Approximate Nearest Neighbor Search Haokui Zhang, Buzhou Tang, Wenze Hu, Xiaoyu Wang code -1
SEMICON: A Learning-to-Hash Solution for Large-Scale Fine-Grained Image Retrieval Yang Shen, Xuhao Sun, XiuShen Wei, QingYuan Jiang, Jian Yang code -1
CAViT: Contextual Alignment Vision Transformer for Video Object Re-identification Jinlin Wu, Lingxiao He, Wu Liu, Yang Yang, Zhen Lei, Tao Mei, Stan Z. Li code -1
Text-Based Temporal Localization of Novel Events Sudipta Paul, Niluthpol Chowdhury Mithun, Amit K. RoyChowdhury code -1
Reliability-Aware Prediction via Uncertainty Learning for Person Image Retrieval Zhaopeng Dou, Zhongdao Wang, Weihua Chen, Yali Li, Shengjin Wang code -1
Relighting4D: Neural Relightable Human from Videos Zhaoxi Chen, Ziwei Liu code -1
Real-Time Intermediate Flow Estimation for Video Frame Interpolation Zhewei Huang, Tianyuan Zhang, Wen Heng, Boxin Shi, Shuchang Zhou code -1
PixelFolder: An Efficient Progressive Pixel Synthesis Network for Image Generation Jing He, Yiyi Zhou, Qi Zhang, Jun Peng, Yunhang Shen, Xiaoshuai Sun, Chao Chen, Rongrong Ji code -1
StyleSwap: Style-Based Generator Empowers Robust Face Swapping Zhiliang Xu, Hang Zhou, Zhibin Hong, Ziwei Liu, Jiaming Liu, Zhizhi Guo, Junyu Han, Jingtuo Liu, Errui Ding, Jingdong Wang code -1
Paint2Pix: Interactive Painting Based Progressive Image Synthesis and Editing Jaskirat Singh, Liang Zheng, Cameron Smith, Jose Echevarria code -1
FurryGAN: High Quality Foreground-Aware Image Synthesis Jeongmin Bae, Mingi Kwon, Youngjung Uh code -1
SCAM! Transferring Humans Between Images with Semantic Cross Attention Modulation Nicolas Dufour, David Picard, Vicky Kalogeiton code -1
Sem2NeRF: Converting Single-View Semantic Masks to Neural Radiance Fields Yuedong Chen, Qianyi Wu, Chuanxia Zheng, TatJen Cham, Jianfei Cai code -1
Editing Out-of-Domain GAN Inversion via Differential Activations Haorui Song, Yong Du, Tianyi Xiang, Junyu Dong, Jing Qin, Shengfeng He code -1
On the Robustness of Quality Measures for GANs Motasem Alfarra, Juan C. Pérez, Anna Frühstück, Philip H. S. Torr, Peter Wonka, Bernard Ghanem code -1
Sound-Guided Semantic Video Generation Seung Hyun Lee, Gyeongrok Oh, Wonmin Byeon, Chan Young Kim, Won Jeong Ryoo, Sang Ho Yoon, Hyunjun Cho, Jihyun Bae, Jinkyu Kim, Sangpil Kim code -1
Inpainting at Modern Camera Resolution by Guided PatchMatch with Auto-curation Lingzhi Zhang, Connelly Barnes, Kevin Wampler, Sohrab Amirghodsi, Eli Shechtman, Zhe Lin, Jianbo Shi code -1
Controllable Video Generation Through Global and Local Motion Dynamics Aram Davtyan, Paolo Favaro code -1
StyleHEAT: One-Shot High-Resolution Editable Talking Face Generation via Pre-trained StyleGAN Fei Yin, Yong Zhang, Xiaodong Cun, Mingdeng Cao, Yanbo Fan, Xuan Wang, Qingyan Bai, Baoyuan Wu, Jue Wang, Yujiu Yang code -1
Long Video Generation with Time-Agnostic VQGAN and Time-Sensitive Transformer Songwei Ge, Thomas Hayes, Harry Yang, Xi Yin, Guan Pang, David Jacobs, JiaBin Huang, Devi Parikh code -1
Combining Internal and External Constraints for Unrolling Shutter in Videos Eyal Naor, Itai Antebi, Shai Bagon, Michal Irani code -1
WISE: Whitebox Image Stylization by Example-Based Learning Winfried Lötzsch, Max Reimann, Martin Büßemeyer, Amir Semmo, Jürgen Döllner, Matthias Trapp code -1
Neural Radiance Transfer Fields for Relightable Novel-View Synthesis with Global Illumination Linjie Lyu, Ayush Tewari, Thomas Leimkühler, Marc Habermann, Christian Theobalt code -1
Transformers as Meta-learners for Implicit Neural Representations Yinbo Chen, Xiaolong Wang code -1
Style Your Hair: Latent Optimization for Pose-Invariant Hairstyle Transfer via Local-Style-Aware Hair Alignment Taewoo Kim, Chaeyeon Chung, Yoonseo Kim, Sunghyun Park, Kangyeol Kim, Jaegul Choo code -1
High-Resolution Virtual Try-On with Misalignment and Occlusion-Handled Conditions Sangyun Lee, Gyojung Gu, Sunghyun Park, Seunghwan Choi, Jaegul Choo code -1
A Codec Information Assisted Framework for Efficient Compressed Video Super-Resolution Hengsheng Zhang, Xueyi Zou, Jiaming Guo, Youliang Yan, Rong Xie, Li Song code -1
Injecting 3D Perception of Controllable NeRF-GAN into StyleGAN for Editable Portrait Image Synthesis Jeonggi Kwak, Yuanming Li, Dongsik Yoon, Donghyeon Kim, David K. Han, Hanseok Ko code -1
AdaNeRF: Adaptive Sampling for Real-Time Rendering of Neural Radiance Fields Andreas Kurz, Thomas Neff, Zhaoyang Lv, Michael Zollhöfer, Markus Steinberger code -1
Improving the Perceptual Quality of 2D Animation Interpolation Shuhong Chen, Matthias Zwicker code -1
Selective TransHDR: Transformer-Based Selective HDR Imaging Using Ghost Region Mask Jou Won Song, Ye In Park, Kyeongbo Kong, Jaeho Kwak, SukJu Kang code -1
Learning Series-Parallel Lookup Tables for Efficient Image Super-Resolution Cheng Ma, Jingyi Zhang, Jie Zhou, Jiwen Lu code -1
GeoAug: Data Augmentation for Few-Shot NeRF with Geometry Constraints Di Chen, Yu Liu, Lianghua Huang, Bin Wang, Pan Pan code -1
DoodleFormer: Creative Sketch Drawing with Transformers Ankan Kumar Bhunia, Salman Khan, Hisham Cholakkal, Rao Muhammad Anwer, Fahad Shahbaz Khan, Jorma Laaksonen, Michael Felsberg code -1
Implicit Neural Representations for Variable Length Human Motion Generation Pablo Cervantes, Yusuke Sekikawa, Ikuro Sato, Koichi Shinoda code -1
Learning Object Placement via Dual-Path Graph Completion Siyuan Zhou, Liu Liu, Li Niu, Liqing Zhang code -1
Expanded Adaptive Scaling Normalization for End to End Image Compression Chajin Shin, Hyeongmin Lee, Hanbin Son, Sangjin Lee, Dogyoon Lee, Sangyoun Lee code -1
Generator Knows What Discriminator Should Learn in Unconditional GANs Gayoung Lee, Hyunsu Kim, Junho Kim, Seonghyeon Kim, JungWoo Ha, Yunjey Choi code -1
Compositional Visual Generation with Composable Diffusion Models Nan Liu, Shuang Li, Yilun Du, Antonio Torralba, Joshua B. Tenenbaum code -1
ManiFest: Manifold Deformation for Few-Shot Image Translation Fabio Pizzati, JeanFrançois Lalonde, Raoul de Charette code -1
Supervised Attribute Information Removal and Reconstruction for Image Manipulation Nannan Li, Bryan A. Plummer code -1
BLT: Bidirectional Layout Transformer for Controllable Layout Generation Xiang Kong, Lu Jiang, Huiwen Chang, Han Zhang, Yuan Hao, Haifeng Gong, Irfan Essa code -1
Diverse Generation from a Single Video Made Possible Niv Haim, Ben Feinstein, Niv Granot, Assaf Shocher, Shai Bagon, Tali Dekel, Michal Irani code -1
Rayleigh EigenDirections (REDs): Nonlinear GAN Latent Space Traversals for Multidimensional Features Guha Balakrishnan, Raghudeep Gadde, Aleix Martinez, Pietro Perona code -1
Bridging the Domain Gap Towards Generalization in Automatic Colorization Hyejin Lee, Daehee Kim, Daeun Lee, Jinkyu Kim, Jaekoo Lee code -1
Generating Natural Images with Direct Patch Distributions Matching Ariel Elnekave, Yair Weiss code -1
Context-Consistent Semantic Image Editing with Style-Preserved Modulation Wuyang Luo, Su Yang, Hong Wang, Bo Long, Weishan Zhang code -1
Eliminating Gradient Conflict in Reference-based Line-Art Colorization Zekun Li, Zhengyang Geng, Zhao Kang, Wenyu Chen, Yibo Yang code -1
Unsupervised Learning of Efficient Geometry-Aware Neural Articulated Representations Atsuhiro Noguchi, Xiao Sun, Stephen Lin, Tatsuya Harada code -1
JPEG Artifacts Removal via Contrastive Representation Learning Xi Wang, Xueyang Fu, Yurui Zhu, ZhengJun Zha code -1
Unpaired Deep Image Dehazing Using Contrastive Disentanglement Learning Xiang Chen, Zhentao Fan, Pengpeng Li, Longgang Dai, Caihua Kong, Zhuoran Zheng, Yufeng Huang, Yufeng Li code -1
Efficient Long-Range Attention Network for Image Super-Resolution Xindong Zhang, Hui Zeng, Shi Guo, Lei Zhang code -1
FlowFormer: A Transformer Architecture for Optical Flow Zhaoyang Huang, Xiaoyu Shi, Chao Zhang, Qiang Wang, Ka Chun Cheung, Hongwei Qin, Jifeng Dai, Hongsheng Li code -1
Coarse-to-Fine Sparse Transformer for Hyperspectral Image Reconstruction Yuanhao Cai, Jing Lin, Xiaowan Hu, Haoqian Wang, Xin Yuan, Yulun Zhang, Radu Timofte, Luc Van Gool code -1
Learning Shadow Correspondence for Video Shadow Detection Xinpeng Ding, Jingwen Yang, Xiaowei Hu, Xiaomeng Li code -1
Metric Learning Based Interactive Modulation for Real-World Super-Resolution Chong Mou, Yanze Wu, Xintao Wang, Chao Dong, Jian Zhang, Ying Shan code -1
Explicit Model Size Control and Relaxation via Smooth Regularization for Mixed-Precision Quantization Vladimir Chikin, Kirill Solodskikh, Irina Zhelavskaya code -1
BASQ: Branch-wise Activation-clipping Search Quantization for Sub-4-bit Neural Networks HanByul Kim, Eunhyeok Park, Sungjoo Yoo code -1
You Already Have It: A Generator-Free Low-Precision DNN Training Framework Using Stochastic Rounding Geng Yuan, SungEn Chang, Qing Jin, Alec Lu, Yanyu Li, Yushu Wu, Zhenglun Kong, Yanyue Xie, Peiyan Dong, Minghai Qin, Xiaolong Ma, Xulong Tang, Zhenman Fang, Yanzhi Wang code -1
Real Spike: Learning Real-Valued Spikes for Spiking Neural Networks Yufei Guo, Liwen Zhang, Yuanpei Chen, Xinyi Tong, Xiaode Liu, YingLei Wang, Xuhui Huang, Zhe Ma code -1
FedLTN: Federated Learning for Sparse and Personalized Lottery Ticket Networks Vaikkunth Mugunthan, Eric Lin, Vignesh Gokul, Christian Lau, Lalana Kagal, Steven D. Pieper code -1
Theoretical Understanding of the Information Flow on Continual Learning Performance Joshua Andle, Salimeh Yasaei Sekeh code -1
Exploring Lottery Ticket Hypothesis in Spiking Neural Networks Youngeun Kim, Yuhang Li, Hyoungseob Park, Yeshwanth Venkatesha, Ruokai Yin, Priyadarshini Panda code -1
On the Angular Update and Hyperparameter Tuning of a Scale-Invariant Network Juseung Yun, Janghyeon Lee, Hyounguk Shon, Eojindl Yi, Seung Hwan Kim, Junmo Kim code -1
LANA: Latency Aware Network Acceleration Pavlo Molchanov, Jimmy Hall, Hongxu Yin, Jan Kautz, Nicolò Fusi, Arash Vahdat code -1
RDO-Q: Extremely Fine-Grained Channel-Wise Quantization via Rate-Distortion Optimization Zhe Wang, Jie Lin, Xue Geng, Mohamed M. Sabry Aly, Vijay Chandrasekhar code -1
U-Boost NAS: Utilization-Boosted Differentiable Neural Architecture Search Ahmet Caner Yüzügüler, Nikolaos Dimitriadis, Pascal Frossard code -1
PTQ4ViT: Post-training Quantization for Vision Transformers with Twin Uniform Quantization Zhihang Yuan, Chenhao Xue, Yiqi Chen, Qiang Wu, Guangyu Sun code -1
Bitwidth-Adaptive Quantization-Aware Neural Network Training: A Meta-Learning Approach Jiseok Youn, Jaehun Song, HyungSin Kim, Saewoong Bahk code -1
Understanding the Dynamics of DNNs Using Graph Modularity Yao Lu, Wen Yang, Yunzhe Zhang, Zuohui Chen, Jinyin Chen, Qi Xuan, Zhen Wang, Xiaoniu Yang code -1
Latent Discriminant Deterministic Uncertainty Gianni Franchi, Xuanlong Yu, Andrei Bursuc, Emanuel Aldea, Séverine Dubuisson, David Filliat code -1
Making Heads or Tails: Towards Semantically Consistent Visual Counterfactuals Simon Vandenhende, Dhruv Mahajan, Filip Radenovic, Deepti Ghadiyaram code -1
HIVE: Evaluating the Human Interpretability of Visual Explanations Sunnie S. Y. Kim, Nicole Meister, Vikram V. Ramaswamy, Ruth Fong, Olga Russakovsky code -1
BayesCap: Bayesian Identity Cap for Calibrated Uncertainty in Frozen Neural Networks Uddeshya Upadhyay, Shyamgopal Karthik, Yanbei Chen, Massimiliano Mancini, Zeynep Akata code -1
SESS: Saliency Enhancing with Scaling and Sliding Osman Tursun, Simon Denman, Sridha Sridharan, Clinton Fookes code -1
No Token Left Behind: Explainability-Aided Image Classification and Generation Roni Paiss, Hila Chefer, Lior Wolf code -1
Interpretable Image Classification with Differentiable Prototypes Assignment Dawid Rymarczyk, Lukasz Struski, Michal Górszczak, Koryna Lewandowska, Jacek Tabor, Bartosz Zielinski code -1
Contributions of Shape, Texture, and Color in Visual Recognition Yunhao Ge, Yao Xiao, Zhi Xu, Xingrui Wang, Laurent Itti code -1
STEEX: Steering Counterfactual Explanations with Semantics Paul Jacob, Éloi Zablocki, Hédi BenYounes, Mickaël Chen, Patrick Pérez, Matthieu Cord code -1
Are Vision Transformers Robust to Patch Perturbations? Jindong Gu, Volker Tresp, Yao Qin code -1
A Dataset Generation Framework for Evaluating Megapixel Image Classifiers and Their Explanations Gautam Machiraju, Sylvia K. Plevritis, Parag Mallick code -1
Cartoon Explanations of Image Classifiers Stefan Kolek, Duc Anh Nguyen, Ron Levie, Joan Bruna, Gitta Kutyniok code -1
Shap-CAM: Visual Explanations for Convolutional Neural Networks Based on Shapley Value Quan Zheng, ZiWei Wang, Jie Zhou, Jiwen Lu code -1
Privacy-Preserving Face Recognition with Learnable Privacy Budgets in Frequency Domain Jiazhen Ji, Huan Wang, Yuge Huang, Jiaxiang Wu, Xingkun Xu, Shouhong Ding, Shengchuan Zhang, Liujuan Cao, Rongrong Ji code -1
Contrast-Phys: Unsupervised Video-Based Remote Physiological Measurement via Spatiotemporal Contrast Zhaodong Sun, Xiaobai Li code -1
Source-Free Domain Adaptation with Contrastive Domain Alignment and Self-supervised Exploration for Face Anti-spoofing Yuchen Liu, Yabo Chen, Wenrui Dai, Mengran Gou, ChunTing Huang, Hongkai Xiong code -1
On Mitigating Hard Clusters for Face Clustering Yingjie Chen, Huasong Zhong, Chong Chen, Chen Shen, Jianqiang Huang, Tao Wang, Yun Liang, Qianru Sun code -1
OneFace: One Threshold for All Jiaheng Liu, Zhipeng Yu, Haoyu Qin, Yichao Wu, Ding Liang, Gangming Zhao, Ke Xu code -1
Label2Label: A Language Modeling Framework for Multi-attribute Learning Wanhua Li, Zhexuan Cao, Jianjiang Feng, Jie Zhou, Jiwen Lu code -1
AgeTransGAN for Facial Age Transformation with Rectified Performance Metrics GeeSern Hsu, RuiCang Xie, ZhiTing Chen, YuHong Lin code -1
Hierarchical Contrastive Inconsistency Learning for Deepfake Video Detection Zhihao Gu, Taiping Yao, Yang Chen, Shouhong Ding, Lizhuang Ma code -1
Rethinking Robust Representation Learning Under Fine-Grained Noisy Faces Bingqi Ma, Guanglu Song, Boxiao Liu, Yu Liu code -1
Teaching Where to Look: Attention Similarity Knowledge Distillation for Low Resolution Face Recognition Sungho Shin, Joosoon Lee, Junseok Lee, Yeonguk Yu, Kyoobin Lee code -1
Teaching with Soft Label Smoothing for Mitigating Noisy Labels in Facial Expressions Tohar Lukov, Na Zhao, Gim Hee Lee, SerNam Lim code -1
Learning Dynamic Facial Radiance Fields for Few-Shot Talking Head Synthesis Shuai Shen, Wanhua Li, Zheng Zhu, Yueqi Duan, Jie Zhou, Jiwen Lu code -1
CoupleFace: Relation Matters for Face Recognition Distillation Jiaheng Liu, Haoyu Qin, Yichao Wu, Jinyang Guo, Ding Liang, Ke Xu code -1
Controllable and Guided Face Synthesis for Unconstrained Face Recognition Feng Liu, Minchul Kim, Anil K. Jain, Xiaoming Liu code -1
Towards Robust Face Recognition with Comprehensive Search Manyuan Zhang, Guanglu Song, Yu Liu, Hongsheng Li code -1
Towards Unbiased Label Distribution Learning for Facial Pose Estimation Using Anisotropic Spherical Gaussian Zhiwen Cao, Dongfang Liu, Qifan Wang, Yingjie Victor Chen code -1
ByteTrack: Multi-object Tracking by Associating Every Detection Box Yifu Zhang, Peize Sun, Yi Jiang, Dongdong Yu, Fucheng Weng, Zehuan Yuan, Ping Luo, Wenyu Liu, Xinggang Wang code -1
Robust Multi-object Tracking by Marginal Inference Yifu Zhang, Chunyu Wang, Xinggang Wang, Wenjun Zeng, Wenyu Liu code -1
PolarMOT: How Far Can Geometric Relations Take Us in 3D Multi-object Tracking? Aleksandr Kim, Guillem Brasó, Aljosa Osep, Laura LealTaixé code -1
Particle Video Revisited: Tracking Through Occlusions Using Point Trajectories Adam W. Harley, Zhaoyuan Fang, Katerina Fragkiadaki code -1
Tracking Objects as Pixel-Wise Distributions Zelin Zhao, Ze Wu, Yueqing Zhuang, Boxun Li, Jiaya Jia code -1
CMT: Context-Matching-Guided Transformer for 3D Tracking in Point Clouds Zhiyang Guo, Yunyao Mao, Wengang Zhou, Min Wang, Houqiang Li code -1
Towards Generic 3D Tracking in RGBD Videos: Benchmark and Baseline Jinyu Yang, Zhongqun Zhang, Zhe Li, Hyung Jin Chang, Ales Leonardis, Feng Zheng code -1
Hierarchical Latent Structure for Multi-modal Vehicle Trajectory Forecasting Dooseop Choi, KyoungWook Min code -1
AiATrack: Attention in Attention for Transformer Visual Tracking Shenyuan Gao, Chunluan Zhou, Chao Ma, Xinggang Wang, Junsong Yuan code -1
Disentangling Architecture and Training for Optical Flow Deqing Sun, Charles Herrmann, Fitsum A. Reda, Michael Rubinstein, David J. Fleet, William T. Freeman code -1
A Perturbation-Constrained Adversarial Attack for Evaluating the Robustness of Optical Flow Jenny Schmalfuss, Philipp Scholze, Andrés Bruhn code -1
Robust Landmark-Based Stent Tracking in X-ray Fluoroscopy Luojie Huang, Yikang Liu, Li Chen, Eric Z. Chen, Xiao Chen, Shanhui Sun code -1
Social ODE: Multi-agent Trajectory Forecasting with Neural Ordinary Differential Equations Song Wen, Hao Wang, Dimitris N. Metaxas code -1
Social-SSL: Self-supervised Cross-Sequence Representation Learning Based on Transformers for Multi-agent Trajectory Prediction LiWu Tsao, YanKai Wang, HaoSiang Lin, HongHan Shuai, LaiKuan Wong, WenHuang Cheng code -1
Diverse Human Motion Prediction Guided by Multi-level Spatial-Temporal Anchors Sirui Xu, YuXiong Wang, LiangYan Gui code -1
Learning Pedestrian Group Representations for Multi-modal Trajectory Prediction Inhwan Bae, JinHwi Park, HaeGon Jeon code -1
Sequential Multi-view Fusion Network for Fast LiDAR Point Motion Estimation Gang Zhang, Xiaoyan Li, Zhenhua Wang code -1
E-Graph: Minimal Solution for Rigid Rotation with Extensibility Graphs Yanyan Li, Federico Tombari code -1
Point Cloud Compression with Range Image-Based Entropy Model for Autonomous Driving Sukai Wang, Ming Liu code -1
Joint Feature Learning and Relation Modeling for Tracking: A One-Stream Framework Botao Ye, Hong Chang, Bingpeng Ma, Shiguang Shan, Xilin Chen code -1
MotionCLIP: Exposing Human Motion Generation to CLIP Space Guy Tevet, Brian Gordon, Amir Hertz, Amit H. Bermano, Daniel CohenOr code -1
Backbone is All Your Need: A Simplified Architecture for Visual Object Tracking Boyu Chen, Peixia Li, Lei Bai, Lei Qiao, Qiuhong Shen, Bo Li, Weihao Gan, Wei Wu, Wanli Ouyang code -1
Aware of the History: Trajectory Forecasting with the Local Behavior Data Yiqi Zhong, Zhenyang Ni, Siheng Chen, Ulrich Neumann code -1
Optical Flow Training Under Limited Label Budget via Active Learning Shuai Yuan, Xian Sun, Hannah Halin Kim, Shuzhi Yu, Carlo Tomasi code -1
Hierarchical Feature Embedding for Visual Tracking Zhixiong Pi, Weitao Wan, Chong Sun, Changxin Gao, Nong Sang, Chen Li code -1
Tackling Background Distraction in Video Object Segmentation Suhwan Cho, Heansung Lee, Minhyeok Lee, Chaewon Park, Sungjun Jang, Minjung Kim, Sangyoun Lee code -1
Social-Implicit: Rethinking Trajectory Prediction Evaluation and The Effectiveness of Implicit Maximum Likelihood Estimation Abduallah A. Mohamed, Deyao Zhu, Warren Vu, Mohamed Elhoseiny, Christian G. Claudel code -1
TEMOS: Generating Diverse Human Motions from Textual Descriptions Mathis Petrovich, Michael J. Black, Gül Varol code -1
Tracking Every Thing in the Wild Siyuan Li, Martin Danelljan, Henghui Ding, Thomas E. Huang, Fisher Yu code -1
HULC: 3D HUman Motion Capture with Pose Manifold SampLing and Dense Contact Guidance Soshi Shimada, Vladislav Golyanik, Zhi Li, Patrick Pérez, Weipeng Xu, Christian Theobalt code -1
Towards Sequence-Level Training for Visual Tracking Minji Kim, Seungkwan Lee, Jungseul Ok, Bohyung Han, Minsu Cho code -1
Learned Monocular Depth Priors in Visual-Inertial Initialization Yunwen Zhou, Abhishek Kar, Eric Turner, Adarsh Kowdle, Chao X. Guo, Ryan C. DuToit, Konstantine Tsotsos code -1
Robust Visual Tracking by Segmentation Matthieu Paul, Martin Danelljan, Christoph Mayer, Luc Van Gool code -1
MeshLoc: Mesh-Based Visual Localization Vojtech Panek, Zuzana Kukelova, Torsten Sattler code -1
S2F2: Single-Stage Flow Forecasting for Future Multiple Trajectories Prediction YuWen Chen, HsuanKung Yang, ChuChi Chiu, ChunYi Lee code -1
Large-Displacement 3D Object Tracking with Hybrid Non-local Optimization Xuhui Tian, Xinran Lin, Fan Zhong, Xueying Qin code -1
FEAR: Fast, Efficient, Accurate and Robust Visual Tracker Vasyl Borsuk, Roman Vei, Orest Kupyn, Tetiana Martyniuk, Igor Krashenyi, Jiri Matas code -1
PREF: Predictability Regularized Neural Motion Fields Liangchen Song, Xuan Gong, Benjamin Planche, Meng Zheng, David S. Doermann, Junsong Yuan, Terrence Chen, Ziyan Wu code -1
View Vertically: A Hierarchical Network for Trajectory Prediction via Fourier Spectrums Conghao Wong, Beihao Xia, Ziming Hong, Qinmu Peng, Wei Yuan, Qiong Cao, Yibo Yang, Xinge You code -1
HVC-Net: Unifying Homography, Visibility, and Confidence Learning for Planar Object Tracking Haoxian Zhang, Yonggen Ling code -1
RamGAN: Region Attentive Morphing GAN for Region-Level Makeup Transfer Jianfeng Xiang, Junliang Chen, Wenshuang Liu, Xianxu Hou, Linlin Shen code -1
SinNeRF: Training Neural Radiance Fields on Complex Scenes from a Single Image Dejia Xu, Yifan Jiang, Peihao Wang, Zhiwen Fan, Humphrey Shi, Zhangyang Wang code -1
Entropy-Driven Sampling and Training Scheme for Conditional Diffusion Generation Guangcong Zheng, Shengming Li, Hui Wang, Taiping Yao, Yang Chen, Shouhong Ding, Xi Li code -1
Making the Most of Text Semantics to Improve Biomedical Vision-Language Processing Benedikt Boecking, Naoto Usuyama, Shruthi Bannur, Daniel C. Castro, Anton Schwaighofer, Stephanie L. Hyland, Maria Wetscherek, Tristan Naumann, Aditya V. Nori, Javier AlvarezValle, Hoifung Poon, Ozan Oktay code -1
Generative Negative Text Replay for Continual Vision-Language Pretraining Shipeng Yan, Lanqing Hong, Hang Xu, Jianhua Han, Tinne Tuytelaars, Zhenguo Li, Xuming He code -1
Video Graph Transformer for Video Question Answering Junbin Xiao, Pan Zhou, TatSeng Chua, Shuicheng Yan code -1
Trace Controlled Text to Image Generation Kun Yan, Lei Ji, Chenfei Wu, Jianmin Bao, Ming Zhou, Nan Duan, Shuai Ma code -1
Video Question Answering with Iterative Video-Text Co-tokenization A. J. Piergiovanni, Kairo Morton, Weicheng Kuo, Michael S. Ryoo, Anelia Angelova code -1
Rethinking Data Augmentation for Robust Visual Question Answering Long Chen, Yuhang Zheng, Jun Xiao code -1
Explicit Image Caption Editing Zhen Wang, Long Chen, Wenbo Ma, Guangxing Han, Yulei Niu, Jian Shao, Jun Xiao code -1
Can Shuffling Video Benefit Temporal Bias Problem: A Novel Training Framework for Temporal Grounding Jiachang Hao, Haifeng Sun, Pengfei Ren, Jingyu Wang, Qi Qi, Jianxin Liao code -1
Reliable Visual Question Answering: Abstain Rather Than Answer Incorrectly Spencer Whitehead, Suzanne Petryk, Vedaad Shakib, Joseph Gonzalez, Trevor Darrell, Anna Rohrbach, Marcus Rohrbach code -1
GRIT: Faster and Better Image Captioning Transformer Using Dual Visual Features VanQuang Nguyen, Masanori Suganuma, Takayuki Okatani code -1
Selective Query-Guided Debiasing for Video Corpus Moment Retrieval Sunjae Yoon, Ji Woo Hong, Eunseop Yoon, Dahyun Kim, Junyeong Kim, Hee Suk Yoon, Chang D. Yoo code -1
Spatial and Visual Perspective-Taking via View Rotation and Relation Reasoning for Embodied Reference Understanding Cheng Shi, Sibei Yang code -1
Object-Centric Unsupervised Image Captioning Zihang Meng, David Yang, Xuefei Cao, Ashish Shah, SerNam Lim code -1
Contrastive Vision-Language Pre-training with Limited Resources Quan Cui, Boyan Zhou, Yu Guo, Weidong Yin, Hao Wu, Osamu Yoshie, Yubo Chen code -1
Learning Linguistic Association Towards Efficient Text-Video Retrieval Sheng Fang, Shuhui Wang, Junbao Zhuo, Xinzhe Han, Qingming Huang code -1
ASSISTER: Assistive Navigation via Conditional Instruction Generation Zanming Huang, Zhongkai Shangguan, Jimuyang Zhang, Gilad Bar, Matthew Boyd, Eshed OhnBar code -1
X-DETR: A Versatile Architecture for Instance-wise Vision-Language Tasks Zhaowei Cai, Gukyeong Kwon, Avinash Ravichandran, Erhan Bas, Zhuowen Tu, Rahul Bhotika, Stefano Soatto code -1
Learning Disentanglement with Decoupled Labels for Vision-Language Navigation Wenhao Cheng, Xingping Dong, Salman H. Khan, Jianbing Shen code -1
Switch-BERT: Learning to Model Multimodal Interactions by Switching Attention and Input Qingpei Guo, Kaisheng Yao, Wei Chu code -1
Word-Level Fine-Grained Story Visualization Bowen Li code -1
Unifying Event Detection and Captioning as Sequence Generation via Pre-training Qi Zhang, Yuqing Song, Qin Jin code -1
Multimodal Transformer with Variable-Length Memory for Vision-and-Language Navigation Chuang Lin, Yi Jiang, Jianfei Cai, Lizhen Qu, Gholamreza Haffari, Zehuan Yuan code -1
Fine-Grained Visual Entailment Christopher Thomas, Yipeng Zhang, ShihFu Chang code -1
Bottom Up Top Down Detection Transformers for Language Grounding in Images and Point Clouds Ayush Jain, Nikolaos Gkanatsios, Ishita Mediratta, Katerina Fragkiadaki code -1
New Datasets and Models for Contextual Reasoning in Visual Dialog Yifeng Zhang, Ming Jiang, Qi Zhao code -1
VisageSynTalk: Unseen Speaker Video-to-Speech Synthesis via Speech-Visage Feature Selection Joanna Hong, Minsu Kim, Yong Man Ro code -1
Classification-Regression for Chart Comprehension Matan Levy, Rami BenAri, Dani Lischinski code -1
AssistQ: Affordance-Centric Question-Driven Task Completion for Egocentric Assistant Benita Wong, Joya Chen, You Wu, Stan Weixian Lei, Dongxing Mao, Difei Gao, Mike Zheng Shou code -1
FindIt: Generalized Localization with Natural Language Queries Weicheng Kuo, Fred Bertsch, Wei Li, A. J. Piergiovanni, Mohammad Saffar, Anelia Angelova code -1
UniTAB: Unifying Text and Box Outputs for Grounded Vision-Language Modeling Zhengyuan Yang, Zhe Gan, Jianfeng Wang, Xiaowei Hu, Faisal Ahmed, Zicheng Liu, Yumao Lu, Lijuan Wang code -1
Scaling Open-Vocabulary Image Segmentation with Image-Level Labels Golnaz Ghiasi, Xiuye Gu, Yin Cui, TsungYi Lin code -1
The Abduction of Sherlock Holmes: A Dataset for Visual Abductive Reasoning Jack Hessel, Jena D. Hwang, Jae Sung Park, Rowan Zellers, Chandra Bhagavatula, Anna Rohrbach, Kate Saenko, Yejin Choi code -1
Speaker-Adaptive Lip Reading with User-Dependent Padding Minsu Kim, Hyunjun Kim, Yong Man Ro code -1
TISE: Bag of Metrics for Text-to-Image Synthesis Evaluation Tan M. Dinh, Rang Nguyen, BinhSon Hua code -1
SemAug: Semantically Meaningful Image Augmentations for Object Detection Through Language Grounding Morgan Heisler, Amin BanitalebiDehkordi, Yong Zhang code -1
Referring Object Manipulation of Natural Images with Conditional Classifier-Free Guidance Myungsub Choi code -1
NewsStories: Illustrating Articles with Visual Summaries Reuben Tan, Bryan A. Plummer, Kate Saenko, J. P. Lewis, Avneesh Sud, Thomas Leung code -1
Webly Supervised Concept Expansion for General Purpose Vision Models Amita Kamath, Christopher Clark, Tanmay Gupta, Eric Kolve, Derek Hoiem, Aniruddha Kembhavi code -1
FedVLN: Privacy-Preserving Federated Vision-and-Language Navigation Kaiwen Zhou, Xin Eric Wang code -1
CODER: Coupled Diversity-Sensitive Momentum Contrastive Learning for Image-Text Retrieval Haoran Wang, Dongliang He, Wenhao Wu, Boyang Xia, Min Yang, Fu Li, Yunlong Yu, Zhong Ji, Errui Ding, Jingdong Wang code -1
Language-Driven Artistic Style Transfer TsuJui Fu, Xin Eric Wang, William Yang Wang code -1
Single-Stream Multi-level Alignment for Vision-Language Pretraining Zaid Khan, B. G. Vijay Kumar, Xiang Yu, Samuel Schulter, Manmohan Chandraker, Yun Fu code -1
Most and Least Retrievable Images in Visual-Language Query Systems Liuwan Zhu, Rui Ning, Jiang Li, Chunsheng Xin, Hongyi Wu code -1
Sports Video Analysis on Large-Scale Data Dekun Wu, He Zhao, Xingce Bao, Richard P. Wildes code -1
Grounding Visual Representations with Texts for Domain Generalization Seonwoo Min, Nokyung Park, Siwon Kim, Seunghyun Park, Jinkyu Kim code -1
Bridging the Visual Semantic Gap in VLN via Semantically Richer Instructions Joaquín Ossandón, Benjamín Earle, Álvaro Soto code -1
StoryDALL-E: Adapting Pretrained Text-to-Image Transformers for Story Continuation Adyasha Maharana, Darryl Hannan, Mohit Bansal code -1
VQGAN-CLIP: Open Domain Image Generation and Editing with Natural Language Guidance Katherine Crowson, Stella Biderman, Daniel Kornis, Dashiell Stander, Eric Hallahan, Louis Castricato, Edward Raff code -1
Semantic-Aware Implicit Neural Audio-Driven Video Portrait Generation Xian Liu, Yinghao Xu, Qianyi Wu, Hang Zhou, Wayne Wu, Bolei Zhou code -1
End-to-End Active Speaker Detection Juan León Alcázar, Moritz Cordes, Chen Zhao, Bernard Ghanem code -1
Emotion Recognition for Multiple Context Awareness Dingkang Yang, Shuai Huang, Shunli Wang, Yang Liu, Peng Zhai, Liuzhen Su, Mingcheng Li, Lihua Zhang code -1
Adaptive Fine-Grained Sketch-Based Image Retrieval Ayan Kumar Bhunia, Aneeshan Sain, Parth Hiren Shah, Animesh Gupta, Pinaki Nath Chowdhury, Tao Xiang, YiZhe Song code -1
Quantized GAN for Complex Music Generation from Dance Videos Ye Zhu, Kyle Olszewski, Yu Wu, Panos Achlioptas, Menglei Chai, Yan Yan, Sergey Tulyakov code -1
Uncertainty-Aware Multi-modal Learning via Cross-Modal Random Network Prediction Hu Wang, Jianpeng Zhang, Yuanhong Chen, Congbo Ma, Jodie Avery, Louise Hull, Gustavo Carneiro code -1
Localizing Visual Sounds the Easy Way Shentong Mo, Pedro Morgado code -1
Learning Visual Styles from Audio-Visual Associations Tingle Li, Yichen Liu, Andrew Owens, Hang Zhao code -1
Remote Respiration Monitoring of Moving Person Using Radio Signals JaeHo Choi, KiBong Kang, KyungTae Kim code -1
Camera Pose Estimation and Localization with Active Audio Sensing Karren Yang, Michael Firman, Eric Brachmann, Clément Godard code -1
PACS: A Dataset for Physical Audiovisual CommonSense Reasoning Samuel Yu, Peter Wu, Paul Pu Liang, Ruslan Salakhutdinov, LouisPhilippe Morency code -1
VoViT: Low Latency Graph-Based Audio-Visual Voice Separation Transformer Juan F. Montesinos, Venkatesh S. Kadandale, Gloria Haro code -1
Telepresence Video Quality Assessment Zhenqiang Ying, Deepti Ghadiyaram, Alan C. Bovik code -1
MultiMAE: Multi-modal Multi-task Masked Autoencoders Roman Bachmann, David Mizrahi, Andrei Atanov, Amir Zamir code -1
AudioScopeV2: Audio-Visual Attention Architectures for Calibrated Open-Domain On-Screen Sound Separation Efthymios Tzinis, Scott Wisdom, Tal Remez, John R. Hershey code -1
Audio-Visual Segmentation Jinxing Zhou, Jianyuan Wang, Jiayi Zhang, Weixuan Sun, Jing Zhang, Stan Birchfield, Dan Guo, Lingpeng Kong, Meng Wang, Yiran Zhong code -1
Unsupervised Night Image Enhancement: When Layer Decomposition Meets Light-Effects Suppression Yeying Jin, Wenhan Yang, Robby T. Tan code -1
Relationformer: A Unified Framework for Image-to-Graph Generation Suprosanna Shit, Rajat Koner, Bastian Wittmann, Johannes C. Paetzold, Ivan Ezhov, Hongwei Li, Jiazhen Pan, Sahand Sharifzadeh, Georgios Kaissis, Volker Tresp, Bjoern H. Menze code -1
GAMa: Cross-View Video Geo-Localization Shruti Vyas, Chen Chen, Mubarak Shah code -1
Revisiting a kNN-Based Image Classification System with High-Capacity Storage Kengo Nakata, Youyang Ng, Daisuke Miyashita, Asuka Maki, YuChieh Lin, Jun Deguchi code -1
Geometric Representation Learning for Document Image Rectification Hao Feng, Wengang Zhou, Jiajun Deng, Yuechen Wang, Houqiang Li code -1
S2-VER: Semi-supervised Visual Emotion Recognition Guoli Jia, Jufeng Yang code -1
Image Coding for Machines with Omnipotent Feature Learning Ruoyu Feng, Xin Jin, Zongyu Guo, Runsen Feng, Yixin Gao, Tianyu He, Zhizheng Zhang, Simeng Sun, Zhibo Chen code -1
Feature Representation Learning for Unsupervised Cross-Domain Image Retrieval Conghui Hu, Gim Hee Lee code -1
Fashionformer: A Simple, Effective and Unified Baseline for Human Fashion Segmentation and Recognition Shilin Xu, Xiangtai Li, Jingbo Wang, Guangliang Cheng, Yunhai Tong, Dacheng Tao code -1
Semantic-Guided Multi-mask Image Harmonization Xuqian Ren, Yifan Liu code -1
Learning an Isometric Surface Parameterization for Texture Unwrapping Sagnik Das, Ke Ma, Zhixin Shu, Dimitris Samaras code -1
Towards Regression-Free Neural Networks for Diverse Compute Platforms Rahul Duggal, Hao Zhou, Shuo Yang, Jun Fang, Yuanjun Xiong, Wei Xia code -1
Relationship Spatialization for Depth Estimation Xiaoyu Xu, Jiayan Qiu, Xinchao Wang, Zhou Wang code -1
Image2Point: 3D Point-Cloud Understanding with 2D Image Pretrained Models Chenfeng Xu, Shijia Yang, Tomer Galanti, Bichen Wu, Xiangyu Yue, Bohan Zhai, Wei Zhan, Peter Vajda, Kurt Keutzer, Masayoshi Tomizuka code -1
FAR: Fourier Aerial Video Recognition Divya Kothandaraman, Tianrui Guan, Xijun Wang, Shuowen Hu, Ming C. Lin, Dinesh Manocha code -1
Translating a Visual LEGO Manual to a Machine-Executable Plan Ruocheng Wang, Yunzhi Zhang, Jiayuan Mao, ChinYi Cheng, Jiajun Wu code -1
Fabric Material Recovery from Video Using Multi-scale Geometric Auto-Encoder Junbang Liang, Ming C. Lin code -1
MegBA: A GPU-Based Distributed Library for Large-Scale Bundle Adjustment Jie Ren, Wenteng Liang, Ran Yan, Luo Mai, Shiwen Liu, Xiao Liu code -1
The One Where They Reconstructed 3D Humans and Environments in TV Shows Georgios Pavlakos, Ethan Weber, Matthew Tancik, Angjoo Kanazawa code -1
SITTA: Single Image Texture Translation for Data Augmentation Boyi Li, Yin Cui, TsungYi Lin, Serge J. Belongie code -1
Learning from Noisy Labels with Coarse-to-Fine Sample Credibility Modeling Boshen Zhang, Yuxi Li, Yuanpeng Tu, Jinlong Peng, Yabiao Wang, Cunlin Wu, Yang Xiao, Cairong Zhao code -1
PLMCL: Partial-Label Momentum Curriculum Learning for Multi-label Image Classification Rabab Abdelfattah, Xin Zhang, Zhenyao Wu, Xinyi Wu, Xiaofeng Wang, Song Wang code -1
Open-Vocabulary Semantic Segmentation Using Test-Time Distillation Nir Zabari, Yedid Hoshen code -1
SW-VAE: Weakly Supervised Learn Disentangled Representation via Latent Factor Swapping Jiageng Zhu, Hanchen Xie, Wael AbdAlmageed code -1
Learning Multiple Probabilistic Degradation Generators for Unsupervised Real World Image Super Resolution Sangyun Lee, Sewoong Ahn, Kwangjin Yoon code -1
Out-of-Distribution Detection Without Class Labels Niv Cohen, Ron Abutbul, Yedid Hoshen code -1
Unsupervised Domain Adaptive Object Detection with Class Label Shift Weighted Local Features Andong Tan, Niklas Hanselmann, Shuxiao Ding, Federico Tombari, Marius Cordts code -1
OpenCoS: Contrastive Semi-supervised Learning for Handling Open-Set Unlabeled Data Jongjin Park, Sukmin Yun, Jongheon Jeong, Jinwoo Shin code -1
Semi-supervised Domain Adaptation by Similarity Based Pseudo-Label Injection Abhay Rawat, Isha Dua, Saurav Gupta, Rahul Tallamraju code -1
Evaluating Image Super-Resolution Performance on Mobile Devices: An Online Benchmark Xindong Zhang, Hui Zeng, Lei Zhang code -1
Style Adaptive Semantic Image Editing with Transformers Edward Günther, Rui Gong, Luc Van Gool code -1
Third Time's the Charm? Image and Video Editing with StyleGAN3 Yuval Alaluf, Or Patashnik, Zongze Wu, Asif Zamir, Eli Shechtman, Dani Lischinski, Daniel CohenOr code -1
CNSNet: A Cleanness-Navigated-Shadow Network for Shadow Removal Qianhao Yu, Naishan Zheng, Jie Huang, Feng Zhao code -1
Unifying Conditional and Unconditional Semantic Image Synthesis with OCO-GAN Marlène Careil, Stéphane Lathuilière, Camille Couprie, Jakob Verbeek code -1
Efficient Image Super-Resolution Using Vast-Receptive-Field Attention Lin Zhou, Haoming Cai, Jinjin Gu, Zheyuan Li, Yingqi Liu, Xiangyu Chen, Yu Qiao, Chao Dong code -1
Unsupervised Scene Sketch to Photo Synthesis Jiayun Wang, Sangryul Jeon, Stella X. Yu, Xi Zhang, Himanshu Arora, Yu Lou code -1
U-shape Transformer for Underwater Image Enhancement Lintao Peng, Chunli Zhu, Liheng Bian code -1
Hybrid Transformer Based Feature Fusion for Self-Supervised Monocular Depth Estimation Snehal Singh Tomar, Maitreya Suin, A. N. Rajagopalan code -1
Towards Real-World Video Deblurring by Exploring Blur Formation Process Mingdeng Cao, Zhihang Zhong, Yanbo Fan, Jiahao Wang, Yong Zhang, Jue Wang, Yujiu Yang, Yinqiang Zheng code -1
Unified Transformer Network for Multi-Weather Image Restoration Ashutosh Kulkarni, Shruti S. Phutke, Subrahmanyam Murala code -1
DSR: Towards Drone Image Super-Resolution Xiaoyu Lin, Baran Ozaydin, Vidit Vidit, Majed El Helou, Sabine Süsstrunk code -1
CEN-HDR: Computationally Efficient Neural Network for Real-Time High Dynamic Range Imaging Steven Tel, Barthélémy Heyrman, Dominique Ginhac code -1
Image Super-Resolution with Deep Variational Autoencoders Darius Chira, Ilian Haralampiev, Ole Winther, Andrea Dittadi, Valentin Liévin code -1
Light Field Angular Super-Resolution via Dense Correspondence Field Reconstruction Yu Mo, Yingqian Wang, Longguang Wang, JunGang Yang, Wei An code -1
Adaptive Mask-Based Pyramid Network for Realistic Bokeh Rendering Konstantinos Georgiadis, Albert SaàGarriga, Mehmet Kerim Yucel, Anastasios Drosou, Bruno Manganelli code -1
RISPNet: A Network for Reversed Image Signal Processing Xiaoyi Dong, Yu Zhu, Chenghua Li, Peisong Wang, Jian Cheng code -1
CIDBNet: A Consecutively-Interactive Dual-Branch Network for JPEG Compressed Image Super-Resolution Xiaoran Qin, Yu Zhu, Chenghua Li, Peisong Wang, Jian Cheng code -1
XCAT - Lightweight Quantized Single Image Super-Resolution Using Heterogeneous Group Convolutions and Cross Concatenation Mustafa Ayazoglu, Bahri Batuhan Bilecen code -1
Learned Reverse ISP with Soft Supervision Beiji Zou, Yue Zhang code -1
LiteDepth: Digging into Fast and Accurate Depth Estimation on Mobile Devices Zhenyu Li, Zehui Chen, Jialei Xu, Xianming Liu, Junjun Jiang code -1
MSSNet: Multi-Scale-Stage Network for Single Image Deblurring Kiyeon Kim, Seungyong Lee, Sunghyun Cho code -1
RCBSR: Re-parameterization Convolution Block for Super-Resolution Si Gao, Chengjian Zheng, Xiaofeng Zhang, Shaoli Liu, Biao Wu, Kaidi Lu, Diankai Zhang, Ning Wang code -1
Multi-patch Learning: Looking More Pixels in the Training Phase Lei Li, Jingzhu Tang, Ming Chen, Shijie Zhao, Junlin Li, Li Zhang code -1
Fast Nearest Convolution for Real-Time Efficient Image Super-Resolution Ziwei Luo, Youwei Li, Lei Yu, Qi Wu, Zhihong Wen, Haoqiang Fan, Shuaicheng Liu code -1
Real-Time Channel Mixing Net for Mobile Image Super-Resolution Garas Gendy, Nabil Sabor, Jingchao Hou, Guanghui He code -1
Sliding Window Recurrent Network for Efficient Video Super-Resolution Wenyi Lian, Wenjing Lian code -1
EESRNet: A Network for Energy Efficient Super-Resolution Shijie Yue, Chenghua Li, Zhengyang Zhuge, Ruixia Song code -1
Bokeh-Loss GAN: Multi-stage Adversarial Training for Realistic Edge-Aware Bokeh Brian Lee, Fei Lei, Huaijin G. Chen, Alexis Baudron code -1
Residual Feature Distillation Channel Spatial Attention Network for ISP on Smartphone Jiesi Zheng, Zhihao Fan, Xun Wu, Yaqi Wu, Feng Zhang code -1
HST: Hierarchical Swin Transformer for Compressed Image Super-Resolution Bingchen Li, Xin Li, Yiting Lu, Sen Liu, Ruoyu Feng, Zhibo Chen code -1
Swin2SR: SwinV2 Transformer for Compressed Image Super-Resolution and Restoration Marcos V. Conde, UiJin Choi, Maxime Burchi, Radu Timofte code -1
Reversing Image Signal Processors by Reverse Style Transferring Furkan Kinli, Baris Özcan, Furkan Kiraç code -1
Overexposure Mask Fusion: Generalizable Reverse ISP Multi-step Refinement Jinha Kim, Jun Jiang, Jinwei Gu code -1
CAIR: Fast and Lightweight Multi-scale Color Attention Network for Instagram Filter Removal WoonHa Yeo, WangTaek Oh, KyungSu Kang, YoungIl Kim, HanCheol Ryu code -1
MicroISP: Processing 32MP Photos on Mobile Devices with Deep Learning Andrey Ignatov, Anastasia Sycheva, Radu Timofte, Yu Tseng, YuSyuan Xu, PoHsiang Yu, ChengMing Chiang, HsienKai Kuo, MinHung Chen, ChiaMing Cheng, Luc Van Gool code -1
Real-Time Under-Display Cameras Image Restoration and HDR on Mobile Devices Marcos V. Conde, FlorinAlexandru Vasluianu, Sabari Nathan, Radu Timofte code -1
Globally Optimal Event-Based Divergence Estimation for Ventral Landing Sofia McLeod, Gabriele Meoni, Dario Izzo, Anne Mergy, Daqi Liu, Yasir Latif, Ian D. Reid, TatJun Chin code -1
Transfer Learning for On-Orbit Ship Segmentation Vincenzo Fanizza, David Rijlaarsdam, Pablo Tomás Toledano González, José Luis EspinosaAranda code -1
Spacecraft Pose Estimation Based on Unsupervised Domain Adaptation and on a 3D-Guided Loss Combination Juan Ignacio Bravo PérezVillar, Álvaro GarcíaMartín, Jesús Bescós code -1
MaRF: Representing Mars as Neural Radiance Fields Lorenzo Giusti, Josue Garcia, Steven Cozine, Darrick Suen, Christina Nguyen, Ryan Alimo code -1
Asynchronous Kalman Filter for Event-Based Star Tracking Yonhon Ng, Yasir Latif, TatJun Chin, Robert E. Mahony code -1
Using Moffat Profiles to Register Astronomical Images Mason Schuckman, Roy Prouty, David Chapman, Don Engel code -1
Mixed-Domain Training Improves Multi-mission Terrain Segmentation Grace Vincent, Alice Yepremyan, Jingdao Chen, Edwin Goh code -1
CubeSat-CDT: A Cross-Domain Dataset for 6-DoF Trajectory Estimation of a Symmetric Spacecraft Mohamed Adel Musallam, Arunkumar Rathinam, Vincent Gaudillière, Miguel Ortiz del Castillo, Djamila Aouada code -1
Data Lifecycle Management in Evolving Input Distributions for Learning-based Aerospace Applications Somrita Banerjee, Apoorva Sharma, Edward Schmerling, Max Spolaor, Michael Nemerouf, Marco Pavone code -1
Strong Gravitational Lensing Parameter Estimation with Vision Transformer KuanWei Huang, Geoff ChihFan Chen, PoWen Chang, ShengChieh Lin, ChiaJung Hsu, Vishal Thengane, Joshua YaoYu Lin code -1
End-to-end Neural Estimation of Spacecraft Pose with Intermediate Detection of Keypoints Antoine Legrand, Renaud Detry, Christophe De Vleeschouwer code -1
Improving Contrastive Learning on Visually Homogeneous Mars Rover Images Isaac Ronald Ward, Charles Moore, Kai Pak, Jingdao Chen, Edwin Goh code -1
Monocular 6-DoF Pose Estimation for Non-cooperative Spacecrafts Using Riemannian Regression Network Sunhao Chu, Yuxiao Duan, Klaus Schilling, Shufan Wu code -1
HyperNST: Hyper-Networks for Neural Style Transfer Dan Ruta, Andrew Gilbert, Saeid Motiian, Baldo Faieta, Zhe Lin, John P. Collomosse code -1
DEArt: Dataset of European Art Artem Reshetnikov, MariaCristina V. Marinescu, Joaquim Moré López code -1
How Well Do Vision Transformers (VTs) Transfer to the Non-natural Image Domain? An Empirical Study Involving Art Classification Vincent Tonkes, Matthia Sabatelli code -1
On-the-Go Reflectance Transformation Imaging with Ordinary Smartphones Mara Pistellato, Filippo Bergamasco code -1
Is GPT-3 All You Need for Visual Question Answering in Cultural Heritage? Pietro Bongini, Federico Becattini, Alberto Del Bimbo code -1
Automatic Analysis of Human Body Representations in Western Art Shu Zhao, Alkim Almila Akdag Salah, Albert Ali Salah code -1
ArtFacePoints: High-Resolution Facial Landmark Detection in Paintings and Prints Aline Sindel, Andreas Maier, Vincent Christlein code -1
TransPatch: A Transformer-based Generator for Accelerating Transferable Patch Generation in Adversarial Attacks Against Object Detection Models Jinghao Wang, Chenling Cui, Xuejun Wen, Jie Shi code -1
Feature-Level Augmentation to Improve Robustness of Deep Neural Networks to Affine Transformations Adrian Sandru, MarianaIuliana Georgescu, Radu Tudor Ionescu code -1
Benchmarking Robustness Beyond lp Norm Adversaries Akshay Agarwal, Nalini K. Ratha, Mayank Vatsa, Richa Singh code -1
Masked Faces with Faced Masks Jiayi Zhu, Qing Guo, Felix JuefeiXu, Yihao Huang, Yang Liu, Geguang Pu code -1
Adversarially Robust Panoptic Segmentation (ARPaS) Benchmark Laura Alexandra Daza, Jordi PontTuset, Pablo Arbeláez code -1
BadDet: Backdoor Attacks on Object Detection ShihHan Chan, Yinpeng Dong, Jun Zhu, Xiaolu Zhang, Jun Zhou code -1
Universal, Transferable Adversarial Perturbations for Visual Object Trackers Krishna Kanth Nakka, Mathieu Salzmann code -1
Why Is the Video Analytics Accuracy Fluctuating, and What Can We Do About It? Sibendu Paul, Kunal Rao, Giuseppe Coviello, Murugan Sankaradas, Oliver Po, Y. Charlie Hu, Srimat Chakradhar code -1
SkeleVision: Towards Adversarial Resiliency of Person Tracking with Multi-Task Learning Nilaksh Das, ShengYun Peng, Duen Horng Chau code -1
Unrestricted Black-Box Adversarial Attack Using GAN with Limited Queries Dongbin Na, Sangwoo Ji, Jong Kim code -1
Truth-Table Net: A New Convolutional Architecture Encodable by Design into SAT Formulas Adrien Benamira, Thomas Peyrin, Bryan Hooi KuenYew code -1
Attribution-Based Confidence Metric for Detection of Adversarial Attacks on Breast Histopathological Images Steven Lawrence Fernandes, Senka Krivic, Poonam Sharma, Sumit Kumar Jha code -1
Improving Adversarial Robustness by Penalizing Natural Accuracy Kshitij Chandna code -1
4D-StOP: Panoptic Segmentation of 4D LiDAR Using Spatio-Temporal Object Proposal Generation and Aggregation Lars Kreuzberg, Idil Esen Zulfikar, Sabarinath Mahadevan, Francis Engelmann, Bastian Leibe code -1
BlindSpotNet: Seeing Where We Cannot See Taichi Fukuda, Kotaro Hasegawa, Shinya Ishizaki, Shohei Nobuhara, Ko Nishino code -1
Gesture Recognition with Keypoint and Radar Stream Fusion for Automated Vehicles Adrian Holzbock, Nicolai Kern, Christian Waldschmidt, Klaus Dietmayer, Vasileios Belagiannis code -1
An Improved Lightweight Network Based on YOLOv5s for Object Detection in Autonomous Driving Guofa Li, Yingjie Zhang, Delin Ouyang, Xingda Qu code -1
Plausibility Verification for 3D Object Detectors Using Energy-Based Optimization Abhishek Vivekanandan, Niels Maier, J. Marius Zöllner code -1
Lane Change Classification and Prediction with Action Recognition Networks Kai Liang, Jun Wang, Abhir Bhalerao code -1
Joint Prediction of Amodal and Visible Semantic Segmentation for Automated Driving Jasmin Breitenstein, Jonas Löhdefink, Tim Fingscheidt code -1
Human-Vehicle Cooperative Visual Perception for Autonomous Driving Under Complex Traffic Environments Yiyue Zhao, Cailin Lei, Yu Shen, Yuchuan Du, Qijun Chen code -1
MCIP: Multi-Stream Network for Pedestrian Crossing Intention Prediction JeSeok Ham, Kangmin Bae, Jinyoung Moon code -1
SimpleTrack: Understanding and Rethinking 3D Multi-object Tracking Ziqi Pang, Zhichao Li, Naiyan Wang code -1
Ego-Motion Compensation of Range-Beam-Doppler Radar Data for Object Detection Michael Meyer, Marc Unzueta, Georg Kuschk, Sven Tomforde code -1
RPR-Net: A Point Cloud-Based Rotation-Aware Large Scale Place Recognition Network Zhaoxin Fan, Zhenbo Song, Wenping Zhang, Hongyan Liu, Jun He, Xiaoyong Du code -1
Learning 3D Semantics From Pose-Noisy 2D Images with Hierarchical Full Attention Network Yuhang He, Lin Chen, Junkun Xie, Long Chen code -1
SIM2E: Benchmarking the Group Equivariant Capability of Correspondence Matching Algorithms Shuai Su, Zhongkai Zhao, Yixin Fei, Shuda Li, Qijun Chen, Rui Fan code -1
Talisman: Targeted Active Learning for Object Detection with Rare Classes and Slices Using Submodular Mutual Information Suraj Kothawade, Saikat Ghosh, Sumit Shekhar, Yu Xiang, Rishabh K. Iyer code -1
An Efficient Person Clustering Algorithm for Open Checkout-free Groceries Junde Wu, Yu Zhang, Rao Fu, Yuanpei Liu, Jing Gao code -1
POP: Mining POtential Performance of New Fashion Products via Webly Cross-modal Query Expansion Christian Joppi, Geri Skenderi, Marco Cristani code -1
Pose Forecasting in Industrial Human-Robot Collaboration Alessio Sampieri, Guido Maria D'Amely di Melendugno, Andrea Avogaro, Federico Cunico, Francesco Setti, Geri Skenderi, Marco Cristani, Fabio Galasso code -1
Actor-Centered Representations for Action Localization in Streaming Videos Sathyanarayanan N. Aakur, Sudeep Sarkar code -1
Bandwidth-Aware Adaptive Codec for DNN Inference Offloading in IoT Xiufeng Xie, Ning Zhou, Wentao Zhu, Ji Liu code -1
Domain Knowledge-Informed Self-supervised Representations for Workout Form Assessment Paritosh Parmar, Amol Gharat, Helge Rhodin code -1
Responsive Listening Head Generation: A Benchmark Dataset and Baseline Mohan Zhou, Yalong Bai, Wei Zhang, Ting Yao, Tiejun Zhao, Tao Mei code -1
Towards Scale-Aware, Robust, and Generalizable Unsupervised Monocular Depth Estimation by Integrating IMU Motion Dynamics Sen Zhang, Jing Zhang, Dacheng Tao code -1
TIPS: Text-Induced Pose Synthesis Prasun Roy, Subhankar Ghosh, Saumik Bhattacharya, Umapada Pal, Michael Blumenstein code -1
Addressing Heterogeneity in Federated Learning via Distributional Transformation Haolin Yuan, Bo Hui, Yuchen Yang, Philippe Burlina, Neil Zhenqiang Gong, Yinzhi Cao code -1
Where in the World Is This Image? Transformer-Based Geo-localization in the Wild Shraman Pramanick, Ewa Magdalena Nowara, Joshua Gleason, Carlos Domingo Castillo, Rama Chellappa code -1
Colorization for in situ Marine Plankton Images Guannan Guo, Qi Lin, Tao Chen, Zhenghui Feng, Zheng Wang, Jianping Li code -1
Efficient Deep Visual and Inertial Odometry with Adaptive Visual Modality Selection Mingyu Yang, Yu Chen, HunSeok Kim code -1
A Sketch is Worth a Thousand Words: Image Retrieval with Text and Sketch Patsorn Sangkloy, Wittawat Jitkrittum, Diyi Yang, James Hays code -1
A Cloud 3D Dataset and Application-Specific Learned Image Compression in Cloud 3D Tianyi Liu, Sen He, Vinodh Kumaran Jayakumar, Wei Wang code -1
AutoTransition: Learning to Recommend Video Transition Effects Yaojie Shen, Libo Zhang, Kai Xu, Xiaojie Jin code -1
Online Segmentation of LiDAR Sequences: Dataset and Algorithm Romain Loiseau, Mathieu Aubry, Loïc Landrieu code -1
Open-world Semantic Segmentation for LIDAR Point Clouds Jun Cen, Peng Yun, Shiwei Zhang, Junhao Cai, Di Luan, Mingqian Tang, Ming Liu, Michael Yu Wang code -1
KING: Generating Safety-Critical Driving Scenarios for Robust Imitation via Kinematics Gradients Niklas Hanselmann, Katrin Renz, Kashyap Chitta, Apratim Bhattacharyya, Andreas Geiger code -1
Differentiable Raycasting for Self-Supervised Occupancy Forecasting Tarasha Khurana, Peiyun Hu, Achal Dave, Jason Ziglar, David Held, Deva Ramanan code -1
InAction: Interpretable Action Decision Making for Autonomous Driving Taotao Jing, Haifeng Xia, Renran Tian, Haoran Ding, Xiao Luo, Joshua E. Domeyer, Rini Sherony, Zhengming Ding code -1
CramNet: Camera-Radar Fusion with Ray-Constrained Cross-Attention for Robust 3D Object Detection JyhJing Hwang, Henrik Kretzschmar, Joshua Manela, Sean Rafferty, Nicholas ArmstrongCrews, Tiffany Chen, Dragomir Anguelov code -1
CODA: A Real-World Road Corner Case Dataset for Object Detection in Autonomous Driving Kaican Li, Kai Chen, Haoyu Wang, Lanqing Hong, Chaoqiang Ye, Jianhua Han, Yukuai Chen, Wei Zhang, Chunjing Xu, DitYan Yeung, Xiaodan Liang, Zhenguo Li, Hang Xu code -1
Motion Inspired Unsupervised Perception and Prediction in Autonomous Driving Mahyar Najibi, Jingwei Ji, Yin Zhou, Charles R. Qi, Xinchen Yan, Scott Ettinger, Dragomir Anguelov code -1
StretchBEV: Stretching Future Instance Prediction Spatially and Temporally Adil Kaan Akan, Fatma Güney code -1
RCLane: Relay Chain Prediction for Lane Detection Shenghua Xu, Xinyue Cai, Bin Zhao, Li Zhang, Hang Xu, Yanwei Fu, Xiangyang Xue code -1
Drive&Segment: Unsupervised Semantic Segmentation of Urban Scenes via Cross-Modal Distillation Antonín Vobecký, David Hurych, Oriane Siméoni, Spyros Gidaris, Andrei Bursuc, Patrick Pérez, Josef Sivic code -1
CenterFormer: Center-Based Transformer for 3D Object Detection Zixiang Zhou, Xiangchen Zhao, Yu Wang, Panqu Wang, Hassan Foroosh code -1
Physical Attack on Monocular Depth Estimation with Optimal Adversarial Patches Zhiyuan Cheng, James Liang, Hongjun Choi, Guanhong Tao, Zhiwen Cao, Dongfang Liu, Xiangyu Zhang code -1
ST-P3: End-to-End Vision-Based Autonomous Driving via Spatial-Temporal Feature Learning Shengchao Hu, Li Chen, Penghao Wu, Hongyang Li, Junchi Yan, Dacheng Tao code -1
PersFormer: 3D Lane Detection via Perspective Transformer and the OpenLane Benchmark Li Chen, Chonghao Sima, Yang Li, Zehan Zheng, Jiajie Xu, Xiangwei Geng, Hongyang Li, Conghui He, Jianping Shi, Yu Qiao, Junchi Yan code -1
PointFix: Learning to Fix Domain Bias for Robust Online Stereo Adaptation Kwonyoung Kim, Jungin Park, Jiyoung Lee, Dongbo Min, Kwanghoon Sohn code -1
BRNet: Exploring Comprehensive Features for Monocular Depth Estimation Wencheng Han, Junbo Yin, Xiaogang Jin, Xiangdong Dai, Jianbing Shen code -1
SiamDoGe: Domain Generalizable Semantic Segmentation Using Siamese Network Zhenyao Wu, Xinyi Wu, Xiaoping Zhang, Lili Ju, Song Wang code -1
Context-Aware Streaming Perception in Dynamic Environments GurEyal Sela, Ionel Gog, Justin Wong, Kumar Krishna Agrawal, Xiangxi Mo, Sukrit Kalra, Peter Schafhalter, Eric Leong, Xin Wang, Bharathan Balaji, Joseph Gonzalez, Ion Stoica code -1
SpOT: Spatiotemporal Modeling for 3D Object Tracking Colton Stearns, Davis Rempe, Jie Li, Rares Ambrus, Sergey Zakharov, Vitor Guizilini, Yanchao Yang, Leonidas J. Guibas code -1
Multimodal Transformer for Automatic 3D Annotation and Object Detection Chang Liu, Xiaoyan Qian, Binxiao Huang, Xiaojuan Qi, Edmund Y. Lam, SiewChong Tan, Ngai Wong code -1
Dynamic 3D Scene Analysis by Point Cloud Accumulation Shengyu Huang, Zan Gojcic, Jiahui Huang, Andreas Wieser, Konrad Schindler code -1
Homogeneous Multi-modal Feature Fusion and Interaction for 3D Object Detection Xin Li, Botian Shi, Yuenan Hou, Xingjiao Wu, Tianlong Ma, Yikang Li, Liang He code -1
JPerceiver: Joint Perception Network for Depth, Pose and Layout Estimation in Driving Scenes Haimei Zhao, Jing Zhang, Sen Zhang, Dacheng Tao code -1
Semi-supervised 3D Object Detection with Proficient Teachers Junbo Yin, Jin Fang, Dingfu Zhou, Liangjun Zhang, ChengZhong Xu, Jianbing Shen, Wenguan Wang code -1
Point Cloud Compression with Sibling Context and Surface Priors Zhili Chen, Zian Qian, Sukai Wang, Qifeng Chen code -1
Lane Detection Transformer Based on Multi-frame Horizontal and Vertical Attention and Visual Transformer Module Han Zhang, Yunchao Gu, Xinliang Wang, Junjun Pan, Minghui Wang code -1
ProposalContrast: Unsupervised Pre-training for LiDAR-Based 3D Object Detection Junbo Yin, Dingfu Zhou, Liangjun Zhang, Jin Fang, ChengZhong Xu, Jianbing Shen, Wenguan Wang code -1
PreTraM: Self-supervised Pre-training via Connecting Trajectory and Map Chenfeng Xu, Tian Li, Chen Tang, Lingfeng Sun, Kurt Keutzer, Masayoshi Tomizuka, Alireza Fathi, Wei Zhan code -1
Master of All: Simultaneous Generalization of Urban-Scene Segmentation to All Adverse Weather Conditions Nikhil Reddy, Abhinav Singhal, Abhishek Kumar, Mahsa Baktashmotlagh, Chetan Arora code -1
LESS: Label-Efficient Semantic Segmentation for LiDAR Point Clouds Minghua Liu, Yin Zhou, Charles R. Qi, Boqing Gong, Hao Su, Dragomir Anguelov code -1
Visual Cross-View Metric Localization with Dense Uncertainty Estimates Zimin Xia, Olaf Booij, Marco Manfredi, Julian F. P. Kooij code -1
V2X-ViT: Vehicle-to-Everything Cooperative Perception with Vision Transformer Runsheng Xu, Hao Xiang, Zhengzhong Tu, Xin Xia, MingHsuan Yang, Jiaqi Ma code -1
DevNet: Self-supervised Monocular Depth Learning via Density Volume Construction Kaichen Zhou, Lanqing Hong, Changhao Chen, Hang Xu, Chaoqiang Ye, Qingyong Hu, Zhenguo Li code -1
Action-Based Contrastive Learning for Trajectory Prediction Marah Halawa, Olaf Hellwich, Pia Bideau code -1
Radatron: Accurate Detection Using Multi-resolution Cascaded MIMO Radar Sohrab Madani, Jayden Guan, Waleed Ahmed, Saurabh Gupta, Haitham Hassanieh code -1
LiDAR Distillation: Bridging the Beam-Induced Domain Gap for 3D Object Detection Yi Wei, Zibu Wei, Yongming Rao, Jiaxin Li, Jie Zhou, Jiwen Lu code -1
Efficient Point Cloud Segmentation with Geometry-Aware Sparse Networks Maosheng Ye, Rui Wan, Shuangjie Xu, Tongyi Cao, Qifeng Chen code -1
FH-Net: A Fast Hierarchical Network for Scene Flow Estimation on Real-World Point Clouds Lihe Ding, Shaocong Dong, Tingfa Xu, Xinli Xu, Jie Wang, Jianan Li code -1
SpatialDETR: Robust Scalable Transformer-Based 3D Object Detection From Multi-view Camera Images With Global Cross-Sensor Attention Simon Doll, Richard Schulz, Lukas Schneider, Viviane Benzin, Markus Enzweiler, Hendrik P. A. Lensch code -1
Pixel-Wise Energy-Biased Abstention Learning for Anomaly Segmentation on Complex Urban Driving Scenes Yu Tian, Yuyuan Liu, Guansong Pang, Fengbei Liu, Yuanhong Chen, Gustavo Carneiro code -1
Rethinking Closed-Loop Training for Autonomous Driving Chris Zhang, Runsheng Guo, Wenyuan Zeng, Yuwen Xiong, Binbin Dai, Rui Hu, Mengye Ren, Raquel Urtasun code -1
SLiDE: Self-supervised LiDAR De-snowing Through Reconstruction Difficulty Gwangtak Bae, Byungjun Kim, Seongyong Ahn, Jihong Min, Inwook Shim code -1
Generative Meta-Adversarial Network for Unseen Object Navigation Sixian Zhang, Weijie Li, Xinhang Song, Yubing Bai, Shuqiang Jiang code -1
Object Manipulation via Visual Target Localization Kiana Ehsani, Ali Farhadi, Aniruddha Kembhavi, Roozbeh Mottaghi code -1
MoDA: Map Style Transfer for Self-supervised Domain Adaptation of Embodied Agents Eun Sun Lee, Junho Kim, SangWon Park, Young Min Kim code -1
Housekeep: Tidying Virtual Households Using Commonsense Reasoning Yash Kant, Arun Ramachandran, Sriram Yenamandra, Igor Gilitschenski, Dhruv Batra, Andrew Szot, Harsh Agrawal code -1
Domain Randomization-Enhanced Depth Simulation and Restoration for Perceiving and Grasping Specular and Transparent Objects Qiyu Dai, Jiyao Zhang, Qiwei Li, Tianhao Wu, Hao Dong, Ziyuan Liu, Ping Tan, He Wang code -1
Resolving Copycat Problems in Visual Imitation Learning via Residual Action Prediction ChiaChi Chuang, Donglin Yang, Chuan Wen, Yang Gao code -1
OPD: Single-View 3D Openable Part Detection Hanxiao Jiang, Yongsen Mao, Manolis Savva, Angel X. Chang code -1
AirDet: Few-Shot Detection Without Fine-Tuning for Autonomous Exploration Bowen Li, Chen Wang, Pranay Reddy, Seungchan Kim, Sebastian A. Scherer code -1
TransGrasp: Grasp Pose Estimation of a Category of Objects by Transferring Grasps from Only One Labeled Instance Hongtao Wen, Jianhang Yan, Wanli Peng, Yi Sun code -1
StARformer: Transformer with State-Action-Reward Representations for Visual Reinforcement Learning Jinghuan Shang, Kumara Kahatapitiya, Xiang Li, Michael S. Ryoo code -1
TIDEE: Tidying Up Novel Rooms Using Visuo-Semantic Commonsense Priors Gabriel Sarch, Zhaoyuan Fang, Adam W. Harley, Paul Schydlo, Michael J. Tarr, Saurabh Gupta, Katerina Fragkiadaki code -1
Learning Efficient Multi-agent Cooperative Visual Exploration Chao Yu, Xinyi Yang, Jiaxuan Gao, Huazhong Yang, Yu Wang, Yi Wu code -1
Zero-Shot Category-Level Object Pose Estimation Walter Goodwin, Sagar Vaze, Ioannis Havoutis, Ingmar Posner code -1
Sim-to-Real 6D Object Pose Estimation via Iterative Self-training for Robotic Bin Picking Kai Chen, Rui Cao, Stephen James, Yichuan Li, YunHui Liu, Pieter Abbeel, Qi Dou code -1
Active Audio-Visual Separation of Dynamic Sound Sources Sagnik Majumder, Kristen Grauman code -1
DexMV: Imitation Learning for Dexterous Manipulation from Human Videos Yuzhe Qin, YuehHua Wu, Shaowei Liu, Hanwen Jiang, Ruihan Yang, Yang Fu, Xiaolong Wang code -1
Sim-2-Sim Transfer for Vision-and-Language Navigation in Continuous Environments Jacob Krantz, Stefan Lee code -1
Style-Agnostic Reinforcement Learning Juyong Lee, Seokjun Ahn, Jaesik Park code -1
Self-supervised Interactive Object Segmentation Through a Singulation-and-Grasping Approach Houjian Yu, Changhyun Choi code -1
Learning from Unlabeled 3D Environments for Vision-and-Language Navigation Shizhe Chen, PierreLouis Guhur, Makarand Tapaswi, Cordelia Schmid, Ivan Laptev code -1
BodySLAM: Joint Camera Localisation, Mapping, and Human Motion Tracking Dorian Henning, Tristan Laidlow, Stefan Leutenegger code -1
FusionVAE: A Deep Hierarchical Variational Autoencoder for RGB Image Fusion Fabian Duffhauss, Ngo Anh Vien, Hanna Ziesche, Gerhard Neumann code -1
Learning Algebraic Representation for Systematic Generalization in Abstract Reasoning Chi Zhang, Sirui Xie, Baoxiong Jia, Ying Nian Wu, SongChun Zhu, Yixin Zhu code -1
Video Dialog as Conversation About Objects Living in Space-Time HoangAnh Pham, Thao Minh Le, Vuong Le, Tu Minh Phuong, Truyen Tran code -1
Improving Vision Transformers by Revisiting High-Frequency Components Jiawang Bai, Li Yuan, ShuTao Xia, Shuicheng Yan, Zhifeng Li, Wei Liu code -1
Recurrent Bilinear Optimization for Binary Neural Networks Sheng Xu, Yanjing Li, Tiancheng Wang, Teli Ma, Baochang Zhang, Peng Gao, Yu Qiao, Jinhu Lü, Guodong Guo code -1
Neural Architecture Search for Spiking Neural Networks Youngeun Kim, Yuhang Li, Hyoungseob Park, Yeshwanth Venkatesha, Priyadarshini Panda code -1
Where to Focus: Investigating Hierarchical Attention Relationship for Fine-Grained Visual Classification Yang Liu, Lei Zhou, Pengcheng Zhang, Xiao Bai, Lin Gu, Xiaohan Yu, Jun Zhou, Edwin R. Hancock code -1
DaViT: Dual Attention Vision Transformers Mingyu Ding, Bin Xiao, Noel Codella, Ping Luo, Jingdong Wang, Lu Yuan code -1
Optimal Transport for Label-Efficient Visible-Infrared Person Re-Identification Jiangming Wang, Zhizhong Zhang, Mingang Chen, Yi Zhang, Cong Wang, Bin Sheng, Yanyun Qu, Yuan Xie code -1
Locality Guidance for Improving Vision Transformers on Tiny Datasets Kehan Li, Runyi Yu, Zhennan Wang, Li Yuan, Guoli Song, Jie Chen code -1
Neighborhood Collective Estimation for Noisy Label Identification and Correction Jichang Li, Guanbin Li, Feng Liu, Yizhou Yu code -1
Few-Shot Class-Incremental Learning via Entropy-Regularized Data-Free Replay Huan Liu, Li Gu, Zhixiang Chi, Yang Wang, Yuanhao Yu, Jun Chen, Jin Tang code -1
Anti-retroactive Interference for Lifelong Learning Runqi Wang, Yuxiang Bao, Baochang Zhang, Jianzhuang Liu, Wentao Zhu, Guodong Guo code -1
Towards Calibrated Hyper-Sphere Representation via Distribution Overlap Coefficient for Long-Tailed Learning Hualiang Wang, Siming Fu, Xiaoxuan He, Hangxiang Fang, Zuozhu Liu, Haoji Hu code -1
Dynamic Metric Learning with Cross-Level Concept Distillation Wenzhao Zheng, Yuan Huang, Borui Zhang, Jie Zhou, Jiwen Lu code -1
MENet: A Memory-Based Network with Dual-Branch for Efficient Event Stream Processing Linhui Sun, Yifan Zhang, Ke Cheng, Jian Cheng, Hanqing Lu code -1
Out-of-distribution Detection with Boundary Aware Learning Sen Pei, Xin Zhang, Bin Fan, Gaofeng Meng code -1
Learning Hierarchy Aware Features for Reducing Mistake Severity Ashima Garg, Depanshu Sani, Saket Anand code -1
Learning to Detect Every Thing in an Open World Kuniaki Saito, Ping Hu, Trevor Darrell, Kate Saenko code -1
KVT: k-NN Attention for Boosting Vision Transformers Pichao Wang, Xue Wang, Fan Wang, Ming Lin, Shuning Chang, Hao Li, Rong Jin code -1
Registration Based Few-Shot Anomaly Detection Chaoqin Huang, Haoyan Guan, Aofan Jiang, Ya Zhang, Michael W. Spratling, YanFeng Wang code -1
Improving Robustness by Enhancing Weak Subnets Yong Guo, David Stutz, Bernt Schiele code -1
Learning Invariant Visual Representations for Compositional Zero-Shot Learning Tian Zhang, Kongming Liang, Ruoyi Du, Xian Sun, Zhanyu Ma, Jun Guo code -1
Improving Covariance Conditioning of the SVD Meta-layer by Orthogonality Yue Song, Nicu Sebe, Wei Wang code -1
Out-of-Distribution Detection with Semantic Mismatch Under Masking Yijun Yang, Ruiyuan Gao, Qiang Xu code -1
Data-Free Neural Architecture Search via Recursive Label Calibration Zechun Liu, Zhiqiang Shen, Yun Long, Eric P. Xing, KwangTing Cheng, Chas Leichner code -1
Learning from Multiple Annotator Noisy Labels via Sample-Wise Label Fusion Zhengqi Gao, FanKeng Sun, Mingran Yang, Sucheng Ren, Zikai Xiong, Marc Engeler, Antonio Burazer, Linda Wildling, Luca Daniel, Duane S. Boning code -1
Acknowledging the Unknown for Multi-label Learning with Single Positive Labels Donghao Zhou, Pengfei Chen, Qiong Wang, Guangyong Chen, PhengAnn Heng code -1
AutoMix: Unveiling the Power of Mixup for Stronger Classifiers Zicheng Liu, Siyuan Li, Di Wu, Zihan Liu, Zhiyuan Chen, Lirong Wu, Stan Z. Li code -1
MaxViT: Multi-axis Vision Transformer Zhengzhong Tu, Hossein Talebi, Han Zhang, Feng Yang, Peyman Milanfar, Alan C. Bovik, Yinxiao Li code -1
ScalableViT: Rethinking the Context-Oriented Generalization of Vision Transformer Rui Yang, Hailong Ma, Jie Wu, Yansong Tang, Xuefeng Xiao, Min Zheng, Xiu Li code -1
Three Things Everyone Should Know About Vision Transformers Hugo Touvron, Matthieu Cord, Alaaeldin ElNouby, Jakob Verbeek, Hervé Jégou code -1
DeiT III: Revenge of the ViT Hugo Touvron, Matthieu Cord, Hervé Jégou code -1
MixSKD: Self-Knowledge Distillation from Mixup for Image Recognition Chuanguang Yang, Zhulin An, Helong Zhou, Linhang Cai, Xiang Zhi, Jiwen Wu, Yongjun Xu, Qian Zhang code -1
Self-feature Distillation with Uncertainty Modeling for Degraded Image Recognition Zhou Yang, Weisheng Dong, Xin Li, Jinjian Wu, Leida Li, Guangming Shi code -1
Novel Class Discovery Without Forgetting K. J. Joseph, Sujoy Paul, Gaurav Aggarwal, Soma Biswas, Piyush Rai, Kai Han, Vineeth N. Balasubramanian code -1
SAFA: Sample-Adaptive Feature Augmentation for Long-Tailed Image Classification Yan Hong, Jianfu Zhang, Zhongyi Sun, Ke Yan code -1
Negative Samples are at Large: Leveraging Hard-Distance Elastic Loss for Re-identification Hyungtae Lee, Sungmin Eum, Heesung Kwon code -1
Discrete-Constrained Regression for Local Counting Models Haipeng Xiong, Angela Yao code -1
Breadcrumbs: Adversarial Class-Balanced Sampling for Long-Tailed Recognition Bo Liu, Haoxiang Li, Hao Kang, Gang Hua, Nuno Vasconcelos code -1
Chairs Can Be Stood On: Overcoming Object Bias in Human-Object Interaction Detection Guangzhi Wang, Yangyang Guo, Yongkang Wong, Mohan S. Kankanhalli code -1
A Fast Knowledge Distillation Framework for Visual Recognition Zhiqiang Shen, Eric P. Xing code -1
DICE: Leveraging Sparsification for Out-of-Distribution Detection Yiyou Sun, Yixuan Li code -1
Invariant Feature Learning for Generalized Long-Tailed Classification Kaihua Tang, Mingyuan Tao, Jiaxin Qi, Zhenguang Liu, Hanwang Zhang code -1
Sliced Recursive Transformer Zhiqiang Shen, Zechun Liu, Eric P. Xing code -1
Relative Contrastive Loss for Unsupervised Representation Learning Shixiang Tang, Feng Zhu, Lei Bai, Rui Zhao, Wanli Ouyang code -1
Fine-Grained Fashion Representation Learning by Online Deep Clustering Yang Jiao, Ning Xie, Yan Gao, ChienChih Wang, Yi Sun code -1
NashAE: Disentangling Representations Through Adversarial Covariance Minimization Eric C. Yeats, Frank Liu, David Womble, Hai Helen Li code -1
A Gyrovector Space Approach for Symmetric Positive Semi-definite Matrix Learning Xuan Son Nguyen code -1
Learning Visual Representation from Modality-Shared Contrastive Language-Image Pre-training Haoxuan You, Luowei Zhou, Bin Xiao, Noel Codella, Yu Cheng, Ruochen Xu, ShihFu Chang, Lu Yuan code -1
Contrasting Quadratic Assignments for Set-Based Representation Learning Artem Moskalev, Ivan Sosnovik, Volker Fischer, Arnold W. M. Smeulders code -1
Class-Incremental Learning with Cross-Space Clustering and Controlled Transfer Arjun Ashok, K. J. Joseph, Vineeth N. Balasubramanian code -1
Object Discovery and Representation Networks Olivier J. Hénaff, Skanda Koppula, Evan Shelhamer, Daniel Zoran, Andrew Jaegle, Andrew Zisserman, João Carreira, Relja Arandjelovic code -1
Trading Positional Complexity vs Deepness in Coordinate Networks Jianqiao Zheng, Sameera Ramasinghe, Xueqian Li, Simon Lucey code -1
MVDG: A Unified Multi-view Framework for Domain Generalization Jian Zhang, Lei Qi, Yinghuan Shi, Yang Gao code -1
Panoptic Scene Graph Generation Jingkang Yang, Yi Zhe Ang, Zujin Guo, Kaiyang Zhou, Wayne Zhang, Ziwei Liu code -1
Object-Compositional Neural Implicit Surfaces Qianyi Wu, Xian Liu, Yuedong Chen, Kejie Li, Chuanxia Zheng, Jianfei Cai, Jianmin Zheng code -1
RigNet: Repetitive Image Guided Network for Depth Completion Zhiqiang Yan, Kun Wang, Xiang Li, Zhenyu Zhang, Jun Li, Jian Yang code -1
FADE: Fusing the Assets of Decoder and Encoder for Task-Agnostic Upsampling Hao Lu, Wenze Liu, Hongtao Fu, Zhiguo Cao code -1
LiDAL: Inter-frame Uncertainty Based Active Learning for 3D LiDAR Semantic Segmentation Zeyu Hu, Xuyang Bai, Runze Zhang, Xin Wang, Guangyuan Sun, Hongbo Fu, ChiewLan Tai code -1
Hierarchical Memory Learning for Fine-Grained Scene Graph Generation Youming Deng, Yansheng Li, Yongjun Zhang, Xiang Xiang, Jian Wang, Jingdong Chen, Jiayi Ma code -1
DODA: Data-Oriented Sim-to-Real Domain Adaptation for 3D Semantic Segmentation Runyu Ding, Jihan Yang, Li Jiang, Xiaojuan Qi code -1
MTFormer: Multi-task Learning via Transformer and Cross-Task Reasoning Xiaogang Xu, Hengshuang Zhao, Vibhav Vineet, SerNam Lim, Antonio Torralba code -1
MonoPLFlowNet: Permutohedral Lattice FlowNet for Real-Scale 3D Scene Flow Estimation with Monocular Images Runfa Li, Truong Nguyen code -1
TO-Scene: A Large-Scale Dataset for Understanding 3D Tabletop Scenes Mutian Xu, Pei Chen, Haolin Liu, Xiaoguang Han code -1
Is It Necessary to Transfer Temporal Knowledge for Domain Adaptive Video Semantic Segmentation? Xinyi Wu, Zhenyao Wu, Jin Wan, Lili Ju, Song Wang code -1
Meta Spatio-Temporal Debiasing for Video Scene Graph Generation Li Xu, Haoxuan Qu, Jason Kuen, Jiuxiang Gu, Jun Liu code -1
Improving the Reliability for Confidence Estimation Haoxuan Qu, Yanchao Li, Lin Geng Foo, Jason Kuen, Jiuxiang Gu, Jun Liu code -1
Fine-Grained Scene Graph Generation with Data Transfer Ao Zhang, Yuan Yao, Qianyu Chen, Wei Ji, Zhiyuan Liu, Maosong Sun, TatSeng Chua code -1
Pose2Room: Understanding 3D Scenes from Human Activities Yinyu Nie, Angela Dai, Xiaoguang Han, Matthias Nießner code -1
Towards Hard-Positive Query Mining for DETR-Based Human-Object Interaction Detection Xubin Zhong, Changxing Ding, Zijian Li, Shaoli Huang code -1
Discovering Human-Object Interaction Concepts via Self-Compositional Learning Zhi Hou, Baosheng Yu, Dacheng Tao code -1
Primitive-Based Shape Abstraction via Nonparametric Bayesian Inference Yuwei Wu, Weixiao Liu, Sipu Ruan, Gregory S. Chirikjian code -1
Stereo Depth Estimation with Echoes Chenghao Zhang, Kun Tian, Bolin Ni, Gaofeng Meng, Bin Fan, Zhaoxiang Zhang, Chunhong Pan code -1
Inverted Pyramid Multi-task Transformer for Dense Scene Understanding Hanrong Ye, Dan Xu code -1
PETR: Position Embedding Transformation for Multi-view 3D Object Detection Yingfei Liu, Tiancai Wang, Xiangyu Zhang, Jian Sun code -1
S2Net: Stochastic Sequential Pointcloud Forecasting Xinshuo Weng, Junyu Nan, KuanHui Lee, Rowan McAllister, Adrien Gaidon, Nicholas Rhinehart, Kris M. Kitani code -1
RA-Depth: Resolution Adaptive Self-supervised Monocular Depth Estimation Mu He, Le Hui, Yikai Bian, Jian Ren, Jin Xie, Jian Yang code -1
PolyphonicFormer: Unified Query Learning for Depth-Aware Video Panoptic Segmentation Haobo Yuan, Xiangtai Li, Yibo Yang, Guangliang Cheng, Jing Zhang, Yunhai Tong, Lefei Zhang, Dacheng Tao code -1
SQN: Weakly-Supervised Semantic Segmentation of Large-Scale 3D Point Clouds Qingyong Hu, Bo Yang, Guangchi Fang, Yulan Guo, Ales Leonardis, Niki Trigoni, Andrew Markham code -1
PointMixer: MLP-Mixer for Point Cloud Understanding Jaesung Choe, Chunghyun Park, François Rameau, Jaesik Park, In So Kweon code -1
Initialization and Alignment for Adversarial Texture Optimization Xiaoming Zhao, Zhizhen Zhao, Alexander G. Schwing code -1
MOTR: End-to-End Multiple-Object Tracking with Transformer Fangao Zeng, Bin Dong, Yuang Zhang, Tiancai Wang, Xiangyu Zhang, Yichen Wei code -1
GALA: Toward Geometry-and-Lighting-Aware Object Search for Compositing Sijie Zhu, Zhe Lin, Scott Cohen, Jason Kuen, Zhifei Zhang, Chen Chen code -1
LaLaLoc++: Global Floor Plan Comprehension for Layout Localisation in Unvisited Environments Henry HowardJenkins, Victor Adrian Prisacariu code -1
3D-PL: Domain Adaptive Depth Estimation with 3D-Aware Pseudo-Labeling YuTing Yen, ChiaNi Lu, WeiChen Chiu, YiHsuan Tsai code -1
Panoptic-PartFormer: Learning a Unified Model for Panoptic Part Segmentation Xiangtai Li, Shilin Xu, Yibo Yang, Guangliang Cheng, Yunhai Tong, Dacheng Tao code -1
GOCA: Guided Online Cluster Assignment for Self-supervised Video Representation Learning Huseyin Coskun, Alireza Zareian, Joshua L. Moore, Federico Tombari, Chen Wang code -1
Constrained Mean Shift Using Distant yet Related Neighbors for Representation Learning K. L. Navaneet, Soroush Abbasi Koohpayegani, Ajinkya Tejankar, Kossar Pourahmadi, Akshayvarun Subramanya, Hamed Pirsiavash code -1
Revisiting the Critical Factors of Augmentation-Invariant Representation Learning Junqiang Huang, Xiangwen Kong, Xiangyu Zhang code -1
CA-SSL: Class-Agnostic Semi-Supervised Learning for Detection and Segmentation Lu Qi, Jason Kuen, Zhe Lin, Jiuxiang Gu, Fengyun Rao, Dian Li, Weidong Guo, Zhen Wen, MingHsuan Yang, Jiaya Jia code -1
Dual Adaptive Transformations for Weakly Supervised Point Cloud Segmentation Zhonghua Wu, Yicheng Wu, Guosheng Lin, Jianfei Cai, Chen Qian code -1
Semantic-Aware Fine-Grained Correspondence Yingdong Hu, Renhao Wang, Kaifeng Zhang, Yang Gao code -1
Self-Supervised Classification Network Elad Amrani, Leonid Karlinsky, Alexander M. Bronstein code -1
Data Invariants to Understand Unsupervised Out-of-Distribution Detection Lars Doorenbos, Raphael Sznitman, Pablo MárquezNeila code -1
Domain Invariant Masked Autoencoders for Self-supervised Learning from Multi-domains Haiyang Yang, Shixiang Tang, Meilin Chen, Yizhou Wang, Feng Zhu, Lei Bai, Rui Zhao, Wanli Ouyang code -1
Semi-supervised Object Detection via VC Learning Changrui Chen, Kurt Debattista, Jungong Han code -1
Completely Self-supervised Crowd Counting via Distribution Matching Deepak Babu Sam, Abhinav Agarwalla, Jimmy Joseph, Vishwanath A. Sindagi, R. Venkatesh Babu, Vishal M. Patel code -1
Coarse-To-Fine Incremental Few-Shot Learning Xiang Xiang, Yuwen Tan, Qian Wan, Jing Ma, Alan L. Yuille, Gregory D. Hager code -1
Learning Unbiased Transferability for Domain Adaptation by Uncertainty Modeling Jian Hu, Haowen Zhong, Fei Yang, Shaogang Gong, Guile Wu, Junchi Yan code -1
Learn2Augment: Learning to Composite Videos for Data Augmentation in Action Recognition Shreyank N. Gowda, Marcus Rohrbach, Frank Keller, Laura SevillaLara code -1
CYBORGS: Contrastively Bootstrapping Object Representations by Grounding in Segmentation Renhao Wang, Hang Zhao, Yang Gao code -1
PSS: Progressive Sample Selection for Open-World Visual Representation Learning Tianyue Cao, Yongxin Wang, Yifan Xing, Tianjun Xiao, Tong He, Zheng Zhang, Hao Zhou, Joseph Tighe code -1
Improving Self-supervised Lightweight Model Learning via Hard-Aware Metric Distillation Hao Liu, Mang Ye code -1
Object Discovery via Contrastive Learning for Weakly Supervised Object Detection Jinhwan Seo, Wonho Bae, Danica J. Sutherland, Junhyug Noh, Daijin Kim code -1
Stochastic Consensus: Enhancing Semi-Supervised Learning with Consistency of Stochastic Classifiers Hui Tang, Lin Sun, Kui Jia code -1
DiffuseMorph: Unsupervised Deformable Image Registration Using Diffusion Model Boah Kim, Inhwa Han, Jong Chul Ye code -1
Semi-Leak: Membership Inference Attacks Against Semi-supervised Learning Xinlei He, Hongbin Liu, Neil Zhenqiang Gong, Yang Zhang code -1
OpenLDN: Learning to Discover Novel Classes for Open-World Semi-Supervised Learning Mamshad Nayeem Rizve, Navid Kardan, Salman Khan, Fahad Shahbaz Khan, Mubarak Shah code -1
Embedding Contrastive Unsupervised Features to Cluster In- And Out-of-Distribution Noise in Corrupted Image Datasets Paul Albert, Eric Arazo, Noel E. O'Connor, Kevin McGuinness code -1
Unsupervised Few-Shot Image Classification by Learning Features into Clustering Space Shuo Li, Fang Liu, Zehua Hao, Kaibo Zhao, Licheng Jiao code -1
Towards Realistic Semi-supervised Learning Mamshad Nayeem Rizve, Navid Kardan, Mubarak Shah code -1
Masked Siamese Networks for Label-Efficient Learning Mahmoud Assran, Mathilde Caron, Ishan Misra, Piotr Bojanowski, Florian Bordes, Pascal Vincent, Armand Joulin, Mike Rabbat, Nicolas Ballas code -1
Natural Synthetic Anomalies for Self-supervised Anomaly Detection and Localization Hannah M. Schlüter, Jeremy Tan, Benjamin Hou, Bernhard Kainz code -1
Understanding Collapse in Non-contrastive Siamese Representation Learning Alexander C. Li, Alexei A. Efros, Deepak Pathak code -1
Federated Self-supervised Learning for Video Understanding Yasar Abbas Ur Rehman, Yan Gao, Jiajun Shen, Pedro Porto Buarque de Gusmão, Nicholas D. Lane code -1
Towards Efficient and Effective Self-supervised Learning of Visual Representations Sravanti Addepalli, Kaushal Bhogale, Priyam Dey, R. Venkatesh Babu code -1
DSR - A Dual Subspace Re-Projection Network for Surface Anomaly Detection Vitjan Zavrtanik, Matej Kristan, Danijel Skocaj code -1
PseudoAugment: Learning to Use Unlabeled Data for Data Augmentation in Point Clouds Zhaoqi Leng, Shuyang Cheng, Benjamin Caine, Weiyue Wang, Xiao Zhang, Mingxing Tan, Dragomir Anguelov code -1
MVSTER: Epipolar Transformer for Efficient Multi-view Stereo Xiaofeng Wang, Zheng Zhu, Guan Huang, Fangbo Qin, Yun Ye, Yijia He, Xu Chi, Xingang Wang code -1
RelPose: Predicting Probabilistic Relative Rotation for Single Objects in the Wild Jason Y. Zhang, Deva Ramanan, Shubham Tulsiani code -1
R2L: Distilling Neural Radiance Field to Neural Light Field for Efficient Novel View Synthesis Huan Wang, Jian Ren, Zeng Huang, Kyle Olszewski, Menglei Chai, Yun Fu, Sergey Tulyakov code -1
KD-MVS: Knowledge Distillation Based Self-supervised Learning for Multi-view Stereo Yikang Ding, Qingtian Zhu, Xiangyue Liu, Wentao Yuan, Haotian Zhang, Chi Zhang code -1
SALVe: Semantic Alignment Verification for Floorplan Reconstruction from Sparse Panoramas John Lambert, Yuguang Li, Ivaylo Boyadzhiev, Lambert Wixson, Manjunath Narayana, Will Hutchcroft, James Hays, Frank Dellaert, Sing Bing Kang code -1
RC-MVSNet: Unsupervised Multi-View Stereo with Neural Rendering Di Chang, Aljaz Bozic, Tong Zhang, Qingsong Yan, Yingcong Chen, Sabine Süsstrunk, Matthias Nießner code -1
Box2Mask: Weakly Supervised 3D Semantic Instance Segmentation using Bounding Boxes Julian Chibane, Francis Engelmann, Tuan Anh Tran, Gerard PonsMoll code -1
NeILF: Neural Incident Light Field for Physically-based Material Estimation Yao Yao, Jingyang Zhang, Jingbo Liu, Yihang Qu, Tian Fang, David McKinnon, Yanghai Tsin, Long Quan code -1
ARF: Artistic Radiance Fields Kai Zhang, Nicholas I. Kolkin, Sai Bi, Fujun Luan, Zexiang Xu, Eli Shechtman, Noah Snavely code -1
Multiview Stereo with Cascaded Epipolar RAFT Zeyu Ma, Zachary Teed, Jia Deng code -1
Accelerating Score-Based Generative Models with Preconditioned Diffusion Sampling Hengyuan Ma, Li Zhang, Xiatian Zhu, Jianfeng Feng code -1
Learning to Generate Realistic LiDAR Point Clouds Vlas Zyrianov, Xiyue Zhu, Shenlong Wang code -1
RFNet-4D: Joint Object Reconstruction and Flow Estimation from 4D Point Clouds TuanAnh Vu, Duc Thanh Nguyen, BinhSon Hua, QuangHieu Pham, SaiKit Yeung code -1
Diverse Image Inpainting with Normalizing Flow Cairong Wang, Yiming Zhu, Chun Yuan code -1
Improved Masked Image Generation with Token-Critic José Lezama, Huiwen Chang, Lu Jiang, Irfan Essa code -1
TREND: Truncated Generalized Normal Density Estimation of Inception Embeddings for GAN Evaluation Junghyuk Lee, JongSeok Lee code -1
Exploring Gradient-Based Multi-directional Controls in GANs Zikun Chen, Ruowei Jiang, Brendan Duke, Han Zhao, Parham Aarabi code -1
Spatially Invariant Unsupervised 3D Object-Centric Learning and Scene Decomposition Tianyu Wang, Miaomiao Liu, Kee Siong Ng code -1
Neural Scene Decoration from a Single Photograph HongWing Pang, Yingshu Chen, PhuocHieu Le, BinhSon Hua, Duc Thanh Nguyen, SaiKit Yeung code -1
Outpainting by Queries Kai Yao, Penglei Gao, Xi Yang, Jie Sun, Rui Zhang, Kaizhu Huang code -1
Unleashing Transformers: Parallel Token Prediction with Discrete Absorbing Diffusion for Fast High-Resolution Image Generation from Vector-Quantized Codes Sam BondTaylor, Peter Hessey, Hiroshi Sasaki, Toby P. Breckon, Chris G. Willcocks code -1
ChunkyGAN: Real Image Inversion via Segments Adéla Subrtová, David Futschik, Jan Cech, Michal Lukác, Eli Shechtman, Daniel Sýkora code -1
GAN Cocktail: Mixing GANs Without Dataset Access Omri Avrahami, Dani Lischinski, Ohad Fried code -1
Geometry-Guided Progressive NeRF for Generalizable and Efficient Neural Human Rendering Mingfei Chen, Jianfeng Zhang, Xiangyu Xu, Lijuan Liu, Yujun Cai, Jiashi Feng, Shuicheng Yan code -1
Controllable Shadow Generation Using Pixel Height Maps Yichen Sheng, Yifan Liu, Jianming Zhang, Wei Yin, A. Cengiz Öztireli, He Zhang, Zhe Lin, Eli Shechtman, Bedrich Benes code -1
Learning Where to Look - Generative NAS is Surprisingly Efficient Jovita Lukasik, Steffen Jung, Margret Keuper code -1
Subspace Diffusion Generative Models Bowen Jing, Gabriele Corso, Renato Berlinghieri, Tommi S. Jaakkola code -1
DuelGAN: A Duel Between Two Discriminators Stabilizes the GAN Training Jiaheng Wei, Minghao Liu, Jiahao Luo, Andrew Zhu, James Davis, Yang Liu code -1
MINER: Multiscale Implicit Neural Representation Vishwanath Saragadam, Jasper Tan, Guha Balakrishnan, Richard G. Baraniuk, Ashok Veeraraghavan code -1
An Embedded Feature Whitening Approach to Deep Neural Network Optimization Hongwei Yong, Lei Zhang code -1
Q-FW: A Hybrid Classical-Quantum Frank-Wolfe for Quadratic Binary Optimization Alp Yurtsever, Tolga Birdal, Vladislav Golyanik code -1
Self-supervised Learning of Visual Graph Matching Chang Liu, Shaofeng Zhang, Xiaokang Yang, Junchi Yan code -1
Scalable Learning to Optimize: A Learned Optimizer Can Train Big Models Xuxi Chen, Tianlong Chen, Yu Cheng, Weizhu Chen, Ahmed Hassan Awadallah, Zhangyang Wang code -1
QISTA-ImageNet: A Deep Compressive Image Sensing Framework Solving ℓq-Norm Optimization Problem GangXuan Lin, ShihWei Hu, ChunShien Lu code -1
R-DFCIL: Relation-Guided Representation Learning for Data-Free Class Incremental Learning Qiankun Gao, Chen Zhao, Bernard Ghanem, Jian Zhang code -1
Domain Generalization by Mutual-Information Regularization with Pre-trained Models Junbum Cha, Kyungjae Lee, Sungrae Park, Sanghyuk Chun code -1
Predicting Is Not Understanding: Recognizing and Addressing Underspecification in Machine Learning Damien Teney, Maxime Peyrard, Ehsan Abbasnejad code -1
Neural-Sim: Learning to Generate Training Data with NeRF Yunhao Ge, Harkirat S. Behl, Jiashu Xu, Suriya Gunasekar, Neel Joshi, Yale Song, Xin Wang, Laurent Itti, Vibhav Vineet code -1
Bayesian Optimization with Clustering and Rollback for CNN Auto Pruning Hanwei Fan, Jiandong Mu, Wei Zhang code -1
Learned Variational Video Color Propagation Markus Hofinger, Erich Kobler, Alexander Effland, Thomas Pock code -1
Continual Variational Autoencoder Learning via Online Cooperative Memorization Fei Ye, Adrian G. Bors code -1
Learning to Learn with Smooth Regularization Yuanhao Xiong, ChoJui Hsieh code -1
Incremental Task Learning with Incremental Rank Updates Rakib Hyder, Ken Shao, Boyu Hou, Panos P. Markopoulos, Ashley PraterBennette, M. Salman Asif code -1
Batch-Efficient EigenDecomposition for Small and Medium Matrices Yue Song, Nicu Sebe, Wei Wang code -1
Ensemble Learning Priors Driven Deep Unfolding for Scalable Video Snapshot Compressive Imaging Chengshuai Yang, Shiyu Zhang, Xin Yuan code -1
Approximate Discrete Optimal Transport Plan with Auxiliary Measure Method Dongsheng An, Na Lei, Xianfeng Gu code -1
Improving Generalization in Federated Learning by Seeking Flat Minima Debora Caldarola, Barbara Caputo, Marco Ciccone code -1
Semidefinite Relaxations of Truncated Least-Squares in Robust Rotation Search: Tight or Not Liangzu Peng, Mahyar Fazlyab, René Vidal code -1
Transfer Without Forgetting Matteo Boschini, Lorenzo Bonicelli, Angelo Porrello, Giovanni Bellitto, Matteo Pennisi, Simone Palazzo, Concetto Spampinato, Simone Calderara code -1
AdaBest: Minimizing Client Drift in Federated Learning via Adaptive Bias Estimation Farshid Varno, Marzie Saghayi, Laya Rafiee Sevyeri, Sharut Gupta, Stan Matwin, Mohammad Havaei code -1
Tackling Long-Tailed Category Distribution Under Domain Shifts Xiao Gu, Yao Guo, Zeju Li, Jianing Qiu, Qi Dou, Yuxuan Liu, Benny Lo, GuangZhong Yang code -1
Doubly-Fused ViT: Fuse Information from Vision Transformer Doubly with Local Representation Li Gao, Dong Nie, Bo Li, Xiaofeng Ren code -1
Salient Object Detection for Point Clouds Songlin Fan, Wei Gao, Ge Li code -1
Learning Semantic Segmentation from Multiple Datasets with Label Shifts Dongwan Kim, YiHsuan Tsai, Yumin Suh, Masoud Faraki, Sparsh Garg, Manmohan Chandraker, Bohyung Han code -1
Weakly Supervised 3D Scene Segmentation with Region-Level Boundary Awareness and Instance Discrimination Kangcheng Liu, Yuzhi Zhao, Qiang Nie, Zhi Gao, Ben M. Chen code -1
Towards Open-Vocabulary Scene Graph Generation with Prompt-Based Finetuning Tao He, Lianli Gao, Jingkuan Song, YuanFang Li code -1
Variance-Aware Weight Initialization for Point Convolutional Neural Networks Pedro Hermosilla, Michael Schelling, Tobias Ritschel, Timo Ropinski code -1
Break and Make: Interactive Structural Understanding Using LEGO Bricks Aaron Walsman, Muru Zhang, Klemen Kotar, Karthik Desingh, Ali Farhadi, Dieter Fox code -1
Bi-PointFlowNet: Bidirectional Learning for Point Cloud Based Scene Flow Estimation Wencan Cheng, Jong Hwan Ko code -1
3DG-STFM: 3D Geometric Guided Student-Teacher Feature Matching Runyu Mao, Chen Bai, Yatong An, Fengqing Zhu, Cheng Lu code -1
Video Restoration Framework and Its Meta-adaptations to Data-Poor Conditions Prashant W. Patil, Sunil Gupta, Santu Rana, Svetha Venkatesh code -1
MonteBoxFinder: Detecting and Filtering Primitives to Fit a Noisy Point Cloud Michaël Ramamonjisoa, Sinisa Stekovic, Vincent Lepetit code -1
Scene Text Recognition with Permuted Autoregressive Sequence Models Darwin Bautista, Rowel Atienza code -1
When Counting Meets HMER: Counting-Aware Network for Handwritten Mathematical Expression Recognition Bohan Li, Ye Yuan, Dingkang Liang, Xiao Liu, Zhilong Ji, Jinfeng Bai, Wenyu Liu, Xiang Bai code -1
Detecting Tampered Scene Text in the Wild Yuxin Wang, Hongtao Xie, Mengting Xing, Jing Wang, Shenggao Zhu, Yongdong Zhang code -1
Optimal Boxes: Boosting End-to-End Scene Text Recognition by Adjusting Annotated Bounding Boxes via Reinforcement Learning Jingqun Tang, Wenming Qian, Luchuan Song, Xiena Dong, Lan Li, Xiang Bai code -1
GLASS: Global to Local Attention for Scene-Text Spotting Roi Ronen, Shahar Tsiper, Oron Anschel, Inbal Lavi, Amir Markovitz, R. Manmatha code -1
COO: Comic Onomatopoeia Dataset for Recognizing Arbitrary or Truncated Texts Jeonghun Baek, Yusuke Matsui, Kiyoharu Aizawa code -1
Language Matters: A Weakly Supervised Vision-Language Pre-training Approach for Scene Text Detection and Spotting Chuhui Xue, Wenqing Zhang, Yu Hao, Shijian Lu, Philip H. S. Torr, Song Bai code -1
Toward Understanding WordArt: Corner-Guided Transformer for Scene Text Recognition Xudong Xie, Ling Fu, Zhifei Zhang, Zhaowen Wang, Xiang Bai code -1
Levenshtein OCR Cheng Da, Peng Wang, Cong Yao code -1
Multi-granularity Prediction for Scene Text Recognition Peng Wang, Cheng Da, Cong Yao code -1
Dynamic Low-Resolution Distillation for Cost-Efficient End-to-End Text Spotting Ying Chen, Liang Qiao, Zhanzhan Cheng, Shiliang Pu, Yi Niu, Xi Li code -1
Contextual Text Block Detection Towards Scene Text Understanding Chuhui Xue, Jiaxing Huang, Wenqing Zhang, Shijian Lu, Changhu Wang, Song Bai code -1
CoMER: Modeling Coverage for Transformer-Based Handwritten Mathematical Expression Recognition Wenqi Zhao, Liangcai Gao code -1
Don't Forget Me: Accurate Background Recovery for Text Removal via Modeling Local-Global Context Chongyu Liu, Lianwen Jin, Yuliang Liu, Canjie Luo, Bangdong Chen, Fengjun Guo, Kai Ding code -1
TextAdaIN: Paying Attention to Shortcut Learning in Text Recognizers Oren Nuriel, Sharon Fogel, Ron Litman code -1
Multi-modal Text Recognition Networks: Interactive Enhancements Between Visual and Semantic Features Byeonghu Na, Yoonsik Kim, Sungrae Park code -1
SGBANet: Semantic GAN and Balanced Attention Network for Arbitrarily Oriented Scene Text Recognition Dajian Zhong, Shujing Lyu, Palaiahnakote Shivakumara, Bing Yin, Jiajia Wu, Umapada Pal, Yue Lu code -1
Pure Transformer with Integrated Experts for Scene Text Recognition Yew Lee Tan, Adams WaiKin Kong, JungJae Kim code -1
OCR-Free Document Understanding Transformer Geewook Kim, Teakgyu Hong, Moonbin Yim, JeongYeon Nam, Jinyoung Park, Jinyeong Yim, Wonseok Hwang, Sangdoo Yun, Dongyoon Han, Seunghyun Park code -1
CAR: Class-Aware Regularizations for Semantic Segmentation Ye Huang, Di Kang, Liang Chen, Xuefei Zhe, Wenjing Jia, Linchao Bao, Xiangjian He code -1
Style-Hallucinated Dual Consistency Learning for Domain Generalized Semantic Segmentation Yuyang Zhao, Zhun Zhong, Na Zhao, Nicu Sebe, Gim Hee Lee code -1
SeqFormer: Sequential Transformer for Video Instance Segmentation Junfeng Wu, Yi Jiang, Song Bai, Wenqing Zhang, Xiang Bai code -1
Saliency Hierarchy Modeling via Generative Kernels for Salient Object Detection Wenhu Zhang, Liangli Zheng, Huanyu Wang, Xintian Wu, Xi Li code -1
In Defense of Online Models for Video Instance Segmentation Junfeng Wu, Qihao Liu, Yi Jiang, Song Bai, Alan L. Yuille, Xiang Bai code -1
Active Pointly-Supervised Instance Segmentation Chufeng Tang, Lingxi Xie, Gang Zhang, Xiaopeng Zhang, Qi Tian, Xiaolin Hu code -1
A Transformer-Based Decoder for Semantic Segmentation with Multi-level Context Mining Bowen Shi, Dongsheng Jiang, Xiaopeng Zhang, Han Li, Wenrui Dai, Junni Zou, Hongkai Xiong, Qi Tian code -1
XMem: Long-Term Video Object Segmentation with an Atkinson-Shiffrin Memory Model Ho Kei Cheng, Alexander G. Schwing code -1
Self-Distillation for Robust LiDAR Semantic Segmentation in Autonomous Driving Jiale Li, Hang Dai, Yong Ding code -1
2DPASS: 2D Priors Assisted Semantic Segmentation on LiDAR Point Clouds Yan Xu, Jiantao Gao, Chaoda Zheng, Chao Zheng, Ruimao Zhang, Shuguang Cui, Zhen Li code -1
Extract Free Dense Labels from CLIP Chong Zhou, Chen Change Loy, Bo Dai code -1
3D Compositional Zero-Shot Learning with DeCompositional Consensus Muhammad Ferjad Naeem, Evin Pinar Örnek, Yongqin Xian, Luc Van Gool, Federico Tombari code -1
Video Mask Transfiner for High-Quality Video Instance Segmentation Lei Ke, Henghui Ding, Martin Danelljan, YuWing Tai, ChiKeung Tang, Fisher Yu code -1
SimpleRecon: 3D Reconstruction Without 3D Convolutions Mohamed Sayed, John Gibson, Jamie Watson, Victor Prisacariu, Michael Firman, Clément Godard code -1
Structure and Motion from Casual Videos Zhoutong Zhang, Forrester Cole, Zhengqi Li, Michael Rubinstein, Noah Snavely, William T. Freeman code -1
What Matters for 3D Scene Flow Network Guangming Wang, Yunzhe Hu, Zhe Liu, Yiyang Zhou, Masayoshi Tomizuka, Wei Zhan, Hesheng Wang code -1
Correspondence Reweighted Translation Averaging Lalit Manam, Venu Madhav Govindu code -1
Neural Strands: Learning Hair Geometry and Appearance from Multi-view Images Radu Alexandru Rosu, Shunsuke Saito, Ziyan Wang, Chenglei Wu, Sven Behnke, Giljoo Nam code -1
GraphCSPN: Geometry-Aware Depth Completion via Dynamic GCNs Xin Liu, Xiaofei Shao, Bo Wang, Yali Li, Shengjin Wang code -1
Objects Can Move: 3D Change Detection by Geometric Transformation Consistency Aikaterini Adam, Torsten Sattler, Konstantinos Karantzalos, Tomás Pajdla code -1
Language-Grounded Indoor 3D Semantic Segmentation in the Wild Dávid Rozenberszki, Or Litany, Angela Dai code -1
Beyond Periodicity: Towards a Unifying Framework for Activations in Coordinate-MLPs Sameera Ramasinghe, Simon Lucey code -1
Deforming Radiance Fields with Cages Tianhan Xu, Tatsuya Harada code -1
FLEX: Extrinsic Parameters-free Multi-view 3D Human Motion Reconstruction Brian Gordon, Sigal Raab, Guy Azov, Raja Giryes, Daniel CohenOr code -1
MODE: Multi-view Omnidirectional Depth Estimation with 360$^\circ$ Cameras Ming Li, Xueqian Jin, Xuejiao Hu, Jingzhao Dai, Sidan Du, Yang Li code -1
GigaDepth: Learning Depth from Structured Light with Branching Neural Networks Simon Schreiberhuber, JeanBaptiste Weibel, Timothy Patten, Markus Vincze code -1
ActiveNeRF: Learning Where to See with Uncertainty Estimation Xuran Pan, Zihang Lai, Shiji Song, Gao Huang code -1
PoserNet: Refining Relative Camera Poses Exploiting Object Detections Matteo Taiana, Matteo Toso, Stuart James, Alessio Del Bue code -1
Gaussian Activated Neural Radiance Fields for High Fidelity Reconstruction and Pose Estimation ShinFang Chng, Sameera Ramasinghe, Jamie Sherrah, Simon Lucey code -1
Unbiased Gradient Estimation for Differentiable Surface Splatting via Poisson Sampling Jan U. Müller, Michael Weinmann, Reinhard Klein code -1
Towards Learning Neural Representations from Shadows Kushagra Tiwary, Tzofi Klinghoffer, Ramesh Raskar code -1
Class-Incremental Novel Class Discovery Subhankar Roy, Mingxuan Liu, Zhun Zhong, Nicu Sebe, Elisa Ricci code -1
Unknown-Oriented Learning for Open Set Domain Adaptation Jie Liu, Xiaoqing Guo, Yixuan Yuan code -1
Prototype-Guided Continual Adaptation for Class-Incremental Unsupervised Domain Adaptation Hongbin Lin, Yifan Zhang, Zhen Qiu, Shuaicheng Niu, Chuang Gan, Yanxia Liu, Mingkui Tan code -1
DecoupleNet: Decoupled Network for Domain Adaptive Semantic Segmentation Xin Lai, Zhuotao Tian, Xiaogang Xu, YingCong Chen, Shu Liu, Hengshuang Zhao, Liwei Wang, Jiaya Jia code -1
Class-Agnostic Object Counting Robust to Intraclass Diversity Shenjian Gong, Shanshan Zhang, Jian Yang, Dengxin Dai, Bernt Schiele code -1
Burn After Reading: Online Adaptation for Cross-domain Streaming Data Luyu Yang, Mingfei Gao, Zeyuan Chen, Ran Xu, Abhinav Shrivastava, Chetan Ramaiah code -1
Mind the Gap in Distilling StyleGANs Guodong Xu, Yuenan Hou, Ziwei Liu, Chen Change Loy code -1
Improving Test-Time Adaptation Via Shift-Agnostic Weight Regularization and Nearest Source Prototypes Sungha Choi, Seunghan Yang, Seokeon Choi, Sungrack Yun code -1
Learning Instance-Specific Adaptation for Cross-Domain Segmentation Yuliang Zou, Zizhao Zhang, ChunLiang Li, Han Zhang, Tomas Pfister, JiaBin Huang code -1
RegionCL: Exploring Contrastive Region Pairs for Self-supervised Representation Learning Yufei Xu, Qiming Zhang, Jing Zhang, Dacheng Tao code -1
Long-Tailed Class Incremental Learning Xialei Liu, YuSong Hu, XuSheng Cao, Andrew D. Bagdanov, Ke Li, MingMing Cheng code -1
DLCFT: Deep Linear Continual Fine-Tuning for General Incremental Learning Hyounguk Shon, Janghyeon Lee, Seung Hwan Kim, Junmo Kim code -1
Adversarial Partial Domain Adaptation by Cycle Inconsistency KunYu Lin, Jiaming Zhou, Yukun Qiu, WeiShi Zheng code -1
Combating Label Distribution Shift for Active Domain Adaptation Sehyun Hwang, Sohyun Lee, Sungyeon Kim, Jungseul Ok, Suha Kwak code -1
GIPSO: Geometrically Informed Propagation for Online Adaptation in 3D LiDAR Segmentation Cristiano Saltori, Evgeny Krivosheev, Stéphane Lathuilière, Nicu Sebe, Fabio Galasso, Giuseppe Fiameni, Elisa Ricci, Fabio Poiesi code -1
CoSMix: Compositional Semantic Mix for Domain Adaptation in 3D LiDAR Segmentation Cristiano Saltori, Fabio Galasso, Giuseppe Fiameni, Nicu Sebe, Elisa Ricci, Fabio Poiesi code -1
A Unified Framework for Domain Adaptive Pose Estimation Donghyun Kim, Kaihong Wang, Kate Saenko, Margrit Betke, Stan Sclaroff code -1
A Broad Study of Pre-training for Domain Generalization and Adaptation Donghyun Kim, Kaihong Wang, Stan Sclaroff, Kate Saenko code -1
Prior Knowledge Guided Unsupervised Domain Adaptation Tao Sun, Cheng Lu, Haibin Ling code -1
GCISG: Guided Causal Invariant Learning for Improved Syn-to-Real Generalization Gilhyun Nam, Gyeongjae Choi, Kyungmin Lee code -1
AcroFOD: An Adaptive Method for Cross-Domain Few-Shot Object Detection Yipeng Gao, Lingxiao Yang, Yunmu Huang, Song Xie, Shiyong Li, WeiShi Zheng code -1
Unsupervised Domain Adaptation for One-Stage Object Detector Using Offsets to Bounding Box Jayeon Yoo, Inseop Chung, Nojun Kwak code -1
Visual Prompt Tuning Menglin Jia, Luming Tang, BorChun Chen, Claire Cardie, Serge J. Belongie, Bharath Hariharan, SerNam Lim code -1
Quasi-Balanced Self-Training on Noise-Aware Synthesis of Object Point Clouds for Closing Domain Gap Yongwei Chen, Zihao Wang, Longkun Zou, Ke Chen, Kui Jia code -1
Cross-domain Ensemble Distillation for Domain Generalization Kyungmoon Lee, Sungyeon Kim, Suha Kwak code -1
Centrality and Consistency: Two-Stage Clean Samples Identification for Learning with Instance-Dependent Noisy Labels Ganlong Zhao, Guanbin Li, Yipeng Qin, Feng Liu, Yizhou Yu code -1
Hyperspherical Learning in Multi-Label Classification Bo Ke, Yunquan Zhu, Mengtian Li, Xiujun Shu, Ruizhi Qiao, Bo Ren code -1
When Active Learning Meets Implicit Semantic Data Augmentation Zhuangzhuang Chen, Jin Zhang, Pan Wang, Jie Chen, Jianqiang Li code -1
VL-LTR: Learning Class-wise Visual-Linguistic Representation for Long-Tailed Visual Recognition Changyao Tian, Wenhai Wang, Xizhou Zhu, Jifeng Dai, Yu Qiao code -1
Class Is Invariant to Context and Vice Versa: On Learning Invariance for Out-Of-Distribution Generalization Jiaxin Qi, Kaihua Tang, Qianru Sun, XianSheng Hua, Hanwang Zhang code -1
Hierarchical Semi-supervised Contrastive Learning for Contamination-Resistant Anomaly Detection Gaoang Wang, Yibing Zhan, Xinchao Wang, Mingli Song, Klara Nahrstedt code -1
Tracking by Associating Clips Sanghyun Woo, Kwanyong Park, Seoung Wug Oh, In So Kweon, JoonYoung Lee code -1
RealPatch: A Statistical Matching Framework for Model Patching with Real Samples Sara Romiti, Christopher Inskip, Viktoriia Sharmanska, Novi Quadrianto code -1
Background-Insensitive Scene Text Recognition with Text Semantic Segmentation Liang Zhao, Zhenyao Wu, Xinyi Wu, Greg Wilsbacher, Song Wang code -1
Semantic Novelty Detection via Relational Reasoning Francesco Cappio Borlino, Silvia Bucci, Tatiana Tommasi code -1
Improving Closed and Open-Vocabulary Attribute Prediction Using Transformers Khoi Pham, Kushal Kafle, Zhe Lin, Zhihong Ding, Scott Cohen, Quan Tran, Abhinav Shrivastava code -1
Training Vision Transformers with only 2040 Images YunHao Cao, Hao Yu, Jianxin Wu code -1
Bridging Images and Videos: A Simple Learning Framework for Large Vocabulary Video Object Detection Sanghyun Woo, Kwanyong Park, Seoung Wug Oh, In So Kweon, JoonYoung Lee code -1
TDAM: Top-Down Attention Module for Contextually Guided Feature Selection in CNNs Shantanu Jaiswal, Basura Fernando, Cheston Tan code -1
Automatic Check-Out via Prototype-Based Classifier Learning from Single-Product Exemplars Hao Chen, XiuShen Wei, Faen Zhang, Yang Shen, Hui Xu, Liang Xiao code -1
Overcoming Shortcut Learning in a Target Domain by Generalizing Basic Visual Factors from a Source Domain Piyapat Saranrittichai, Chaithanya Kumar Mummadi, Claudia Blaiotta, Mauricio Munoz, Volker Fischer code -1
Photo-realistic Neural Domain Randomization Sergey Zakharov, Rares Ambrus, Vitor Guizilini, Wadim Kehl, Adrien Gaidon code -1
Wave-ViT: Unifying Wavelet and Transformers for Visual Representation Learning Ting Yao, Yingwei Pan, Yehao Li, ChongWah Ngo, Tao Mei code -1
Tailoring Self-Supervision for Supervised Learning WonJun Moon, JiHwan Kim, JaePil Heo code -1
Difficulty-Aware Simulator for Open Set Recognition WonJun Moon, Jun Ho Park, Hyun Seok Seong, CheolHo Cho, JaePil Heo code -1
Few-Shot Class-Incremental Learning from an Open-Set Perspective Can Peng, Kun Zhao, Tianren Wang, Meng Li, Brian C. Lovell code -1
FOSTER: Feature Boosting and Compression for Class-Incremental Learning FuYun Wang, DaWei Zhou, HanJia Ye, DeChuan Zhan code -1
Visual Knowledge Tracing Neehar Kondapaneni, Pietro Perona, Oisin Mac Aodha code -1
S3C: Self-Supervised Stochastic Classifiers for Few-Shot Class-Incremental Learning Jayateja Kalla, Soma Biswas code -1
Improving Fine-Grained Visual Recognition in Low Data Regimes via Self-boosting Attention Mechanism Yangyang Shu, Baosheng Yu, Haiming Xu, Lingqiao Liu code -1
VSA: Learning Varied-Size Window Attention in Vision Transformers Qiming Zhang, Yufei Xu, Jing Zhang, Dacheng Tao code -1
Unbiased Manifold Augmentation for Coarse Class Subdivision Baoming Yan, Ke Gao, Bo Gao, Lin Wang, Jiang Yang, Xiaobo Li code -1
DenseHybrid: Hybrid Anomaly Detection for Dense Open-Set Recognition Matej Grcic, Petra Bevandic, Sinisa Segvic code -1
Rethinking Confidence Calibration for Failure Prediction Fei Zhu, Zhen Cheng, XuYao Zhang, ChengLin Liu code -1
Uncertainty-Guided Source-Free Domain Adaptation Subhankar Roy, Martin Trapp, Andrea Pilzer, Juho Kannala, Nicu Sebe, Elisa Ricci, Arno Solin code -1
Should All Proposals Be Treated Equally in Object Detection? Yunsheng Li, Yinpeng Chen, Xiyang Dai, Dongdong Chen, Mengchen Liu, Pei Yu, Ying Jin, Lu Yuan, Zicheng Liu, Nuno Vasconcelos code -1
ViP: Unified Certified Detection and Recovery for Patch Attack with Vision Transformers Junbo Li, Huan Zhang, Cihang Xie code -1
incDFM: Incremental Deep Feature Modeling for Continual Novelty Detection Amanda Rios, Nilesh A. Ahuja, Ibrahima J. Ndiour, Ergin Utku Genc, Laurent Itti, Omesh Tickoo code -1
IGFormer: Interaction Graph Transformer for Skeleton-Based Human Interaction Recognition Yunsheng Pang, Qiuhong Ke, Hossein Rahmani, James Bailey, Jun Liu code -1
PRIME: A Few Primitives Can Boost Robustness to Common Corruptions Apostolos Modas, Rahul Rade, Guillermo OrtizJiménez, SeyedMohsen MoosaviDezfooli, Pascal Frossard code -1
Rotation Regularization Without Rotation Takumi Kobayashi code -1
Towards Accurate Open-Set Recognition via Background-Class Regularization Wonwoo Cho, Jaegul Choo code -1
In Defense of Image Pre-Training for Spatiotemporal Recognition Xianhang Li, Huiyu Wang, Chen Wei, Jieru Mei, Alan L. Yuille, Yuyin Zhou, Cihang Xie code -1
Augmenting Deep Classifiers with Polynomial Neural Networks Grigorios G. Chrysos, Markos Georgopoulos, Jiankang Deng, Jean Kossaifi, Yannis Panagakis, Anima Anandkumar code -1
Learning with Noisy Labels by Efficient Transition Matrix Estimation to Combat Label Miscorrection Seong Min Kye, Kwanghee Choi, Joonyoung Yi, Buru Chang code -1
Online Task-free Continual Learning with Dynamic Sparse Distributed Memory Julien Pourcel, NgocSon Vu, Robert M. French code -1
Contrastive Deep Supervision Linfeng Zhang, Xin Chen, Junbo Zhang, Runpei Dong, Kaisheng Ma code -1
Discriminability-Transferability Trade-Off: An Information-Theoretic Perspective Quan Cui, Bingchen Zhao, ZhaoMin Chen, Borui Zhao, Renjie Song, Boyan Zhou, Jiajun Liang, Osamu Yoshie code -1
LocVTP: Video-Text Pre-training for Temporal Localization Meng Cao, Tianyu Yang, Junwu Weng, Can Zhang, Jue Wang, Yuexian Zou code -1
Few-Shot End-to-End Object Detection via Constantly Concentrated Encoding Across Heads Jiawei Ma, Guangxing Han, Shiyuan Huang, Yuncong Yang, ShihFu Chang code -1
Implicit Neural Representations for Image Compression Yannick Strümpler, Janis Postels, Ren Yang, Luc Van Gool, Federico Tombari code -1
LiP-Flow: Learning Inference-Time Priors for Codec Avatars via Normalizing Flows in Latent Space Emre Aksan, Shugao Ma, Akin Caliskan, Stanislav Pidhorskyi, Alexander Richard, ShihEn Wei, Jason M. Saragih, Otmar Hilliges code -1
Learning to Drive by Watching YouTube Videos: Action-Conditioned Contrastive Policy Pretraining Qihang Zhang, Zhenghao Peng, Bolei Zhou code -1
Learning Ego 3D Representation as Ray Tracing Jiachen Lu, Zheyuan Zhou, Xiatian Zhu, Hang Xu, Li Zhang code -1
Static and Dynamic Concepts for Self-supervised Video Representation Learning Rui Qian, Shuangrui Ding, Xian Liu, Dahua Lin code -1
SphereFed: Hyperspherical Federated Learning Xin Dong, Sai Qian Zhang, Ang Li, H. T. Kung code -1
Hierarchically Self-supervised Transformer for Human Skeleton Representation Learning Yuxiao Chen, Long Zhao, Jianbo Yuan, Yu Tian, Zhaoyang Xia, Shijie Geng, Ligong Han, Dimitris N. Metaxas code -1
Posterior Refinement on Metric Matrix Improves Generalization Bound in Metric Learning Mingda Wang, Canqian Yang, Yi Xu code -1
Balancing Stability and Plasticity Through Advanced Null Space in Continual Learning Yajing Kong, Liu Liu, Zhen Wang, Dacheng Tao code -1
DisCo: Remedying Self-supervised Learning on Lightweight Models with Distilled Contrastive Learning Yuting Gao, JiaXin Zhuang, Shaohui Lin, Hao Cheng, Xing Sun, Ke Li, Chunhua Shen code -1
CoSCL: Cooperation of Small Continual Learners is Stronger Than a Big One Liyuan Wang, Xingxing Zhang, Qian Li, Jun Zhu, Yi Zhong code -1
Manifold Adversarial Learning for Cross-domain 3D Shape Representation Hao Huang, Cheng Chen, Yi Fang code -1
Fast-MoCo: Boost Momentum-Based Contrastive Learning with Combinatorial Patches Yuanzheng Ci, Chen Lin, Lei Bai, Wanli Ouyang code -1
LoRD: Local 4D Implicit Representation for High-Fidelity Dynamic Human Modeling Boyan Jiang, Xinlin Ren, Mingsong Dou, Xiangyang Xue, Yanwei Fu, Yinda Zhang code -1
On the Versatile Uses of Partial Distance Correlation in Deep Learning Xingjian Zhen, Zihang Meng, Rudrasis Chakraborty, Vikas Singh code -1
Self-Regulated Feature Learning via Teacher-free Feature Distillation Lujun Li code -1
Balancing Between Forgetting and Acquisition in Incremental Subpopulation Learning Mingfu Liang, Jiahuan Zhou, Wei Wei, Ying Wu code -1
Counterfactual Intervention Feature Transfer for Visible-Infrared Person Re-identification Xulin Li, Yan Lu, Bin Liu, Yating Liu, Guojun Yin, Qi Chu, Jinyang Huang, Feng Zhu, Rui Zhao, Nenghai Yu code -1
DAS: Densely-Anchored Sampling for Deep Metric Learning Lizhao Liu, Shangxin Huang, Zhuangwei Zhuang, Ran Yang, Mingkui Tan, Yaowei Wang code -1
Learn from All: Erasing Attention Consistency for Noisy Label Facial Expression Recognition Yuhang Zhang, Chengrui Wang, Xu Ling, Weihong Deng code -1
A Non-isotropic Probabilistic Take on Proxy-based Deep Metric Learning Michael Kirchhof, Karsten Roth, Zeynep Akata, Enkelejda Kasneci code -1
TokenMix: Rethinking Image Mixing for Data Augmentation in Vision Transformers Jihao Liu, Boxiao Liu, Hang Zhou, Hongsheng Li, Yu Liu code -1
UFO: Unified Feature Optimization Teng Xi, Yifan Sun, Deli Yu, Bi Li, Nan Peng, Gang Zhang, Xinyu Zhang, Zhigang Wang, Jinwen Chen, Jian Wang, Lufei Liu, Haocheng Feng, Junyu Han, Jingtuo Liu, Errui Ding, Jingdong Wang code -1
Sound Localization by Self-supervised Time Delay Estimation Ziyang Chen, David F. Fouhey, Andrew Owens code -1
X-Learner: Learning Cross Sources and Tasks for Universal Visual Representation Yinan He, Gengshi Huang, Siyu Chen, Jianing Teng, Kun Wang, Zhenfei Yin, Lu Sheng, Ziwei Liu, Yu Qiao, Jing Shao code -1
SLIP: Self-supervision Meets Language-Image Pre-training Norman Mu, Alexander Kirillov, David A. Wagner, Saining Xie code -1
Discovering Deformable Keypoint Pyramids Jianing Qian, Anastasios Panagopoulos, Dinesh Jayaraman code -1
Neural Video Compression Using GANs for Detail Synthesis and Propagation Fabian Mentzer, Eirikur Agustsson, Johannes Ballé, David Minnen, Nick Johnston, George Toderici code -1
A Contrastive Objective for Learning Disentangled Representations Jonathan Kahana, Yedid Hoshen code -1
PT4AL: Using Self-supervised Pretext Tasks for Active Learning John Seon Keun Yi, Minseok Seo, Jongchan Park, DongGeol Choi code -1
ParC-Net: Position Aware Circular Convolution with Merits from ConvNets and Transformer Haokui Zhang, Wenze Hu, Xiaoyu Wang code -1
DualPrompt: Complementary Prompting for Rehearsal-Free Continual Learning Zifeng Wang, Zizhao Zhang, Sayna Ebrahimi, Ruoxi Sun, Han Zhang, ChenYu Lee, Xiaoqi Ren, Guolong Su, Vincent Perot, Jennifer G. Dy, Tomas Pfister code -1
Unifying Visual Contrastive Learning for Object Recognition from a Graph Perspective Shixiang Tang, Feng Zhu, Lei Bai, Rui Zhao, Chenyu Wang, Wanli Ouyang code -1
Decoupled Contrastive Learning ChunHsiao Yeh, ChengYao Hong, YenChi Hsu, TyngLuh Liu, Yubei Chen, Yann LeCun code -1
Joint Learning of Localized Representations from Medical Images and Reports Philip Müller, Georgios Kaissis, Congyu Zou, Daniel Rueckert code -1
The Challenges of Continuous Self-Supervised Learning Senthil Purushwalkam, Pedro Morgado, Abhinav Gupta code -1
Conditional Stroke Recovery for Fine-Grained Sketch-Based Image Retrieval Zhixin Ling, Zhen Xing, Jian Zhou, Xiangdong Zhou code -1
Identifying Hard Noise in Long-Tailed Sample Distribution Xuanyu Yi, Kaihua Tang, XianSheng Hua, JooHwee Lim, Hanwang Zhang code -1
Interpretable Open-Set Domain Adaptation via Angular Margin Separation Xinhao Li, Jingjing Li, Zhekai Du, Lei Zhu, Wen Li code -1
TACS: Taxonomy Adaptive Cross-Domain Semantic Segmentation Rui Gong, Martin Danelljan, Dengxin Dai, Danda Pani Paudel, Ajad Chhatkuli, Fisher Yu, Luc Van Gool code -1
Prototypical Contrast Adaptation for Domain Adaptive Semantic Segmentation Zhengkai Jiang, Yuxi Li, Ceyuan Yang, Peng Gao, Yabiao Wang, Ying Tai, Chengjie Wang code -1
RBC: Rectifying the Biased Context in Continual Semantic Segmentation Hanbin Zhao, Fengyu Yang, Xinghe Fu, Xi Li code -1
Factorizing Knowledge in Neural Networks Xingyi Yang, Jingwen Ye, Xinchao Wang code -1
Contrastive Vicinal Space for Unsupervised Domain Adaptation Jaemin Na, Dongyoon Han, Hyung Jin Chang, Wonjun Hwang code -1
Cross-Modal Knowledge Transfer Without Task-Relevant Source Data Sk Miraj Ahmed, Suhas Lohit, KuanChuan Peng, Michael Jones, Amit K. RoyChowdhury code -1
Online Domain Adaptation for Semantic Segmentation in Ever-Changing Conditions Theodoros Panagiotakopoulos, Pier Luigi Dovesi, Linus HärenstamNielsen, Matteo Poggi code -1
Source-Free Video Domain Adaptation by Learning Temporal Consistency for Action Recognition Yuecong Xu, Jianfei Yang, Haozhi Cao, Keyu Wu, Min Wu, Zhenghua Chen code -1
BMD: A General Class-Balanced Multicentric Dynamic Prototype Strategy for Source-Free Domain Adaptation Sanqing Qu, Guang Chen, Jing Zhang, Zhijun Li, Wei He, Dacheng Tao code -1
Generalized Brain Image Synthesis with Transferable Convolutional Sparse Coding Networks Yawen Huang, Feng Zheng, Xu Sun, Yuexiang Li, Ling Shao, Yefeng Zheng code -1
Incomplete Multi-view Domain Adaptation via Channel Enhancement and Knowledge Transfer Haifeng Xia, Pu Wang, Zhengming Ding code -1
DistPro: Searching a Fast Knowledge Distillation Process via Meta Optimization Xueqing Deng, Dawei Sun, Shawn D. Newsam, Peng Wang code -1
ML-BPM: Multi-teacher Learning with Bidirectional Photometric Mixing for Open Compound Domain Adaptation in Semantic Segmentation Fei Pan, Sungsu Hur, Seokju Lee, Junsik Kim, In So Kweon code -1
PACTran: PAC-Bayesian Metrics for Estimating the Transferability of Pretrained Models to Classification Tasks Nan Ding, Xi Chen, Tomer Levinboim, Soravit Changpinyo, Radu Soricut code -1
Personalized Education: Blind Knowledge Distillation Xiang Deng, Jian Zheng, Zhongfei Zhang code -1
Not All Models Are Equal: Predicting Model Transferability in a Self-challenging Fisher Space Wenqi Shao, Xun Zhao, Yixiao Ge, Zhaoyang Zhang, Lei Yang, Xiaogang Wang, Ying Shan, Ping Luo code -1
How Stable Are Transferability Metrics Evaluations? Andrea Agostinelli, Michal Pándy, Jasper R. R. Uijlings, Thomas Mensink, Vittorio Ferrari code -1
Attention Diversification for Domain Generalization Rang Meng, Xianfeng Li, Weijie Chen, Shicai Yang, Jie Song, Xinchao Wang, Lei Zhang, Mingli Song, Di Xie, Shiliang Pu code -1
ESS: Learning Event-Based Semantic Segmentation from Still Images Zhaoning Sun, Nico Messikommer, Daniel Gehrig, Davide Scaramuzza code -1
An Efficient Spatio-Temporal Pyramid Transformer for Action Detection Yuetian Weng, Zizheng Pan, Mingfei Han, Xiaojun Chang, Bohan Zhuang code -1
Human Trajectory Prediction via Neural Social Physics Jiangbei Yue, Dinesh Manocha, He Wang code -1
Towards Open Set Video Anomaly Detection Yuansheng Zhu, Wentao Bao, Qi Yu code -1
EclipSE: Efficient Long-Range Video Retrieval Using Sight and Sound YanBo Lin, Jie Lei, Mohit Bansal, Gedas Bertasius code -1
Joint-Modal Label Denoising for Weakly-Supervised Audio-Visual Video Parsing Haoyue Cheng, Zhaoyang Liu, Hang Zhou, Chen Qian, Wayne Wu, Limin Wang code -1
Less Than Few: Self-shot Video Instance Segmentation Pengwan Yang, Yuki M. Asano, Pascal Mettes, Cees G. M. Snoek code -1
Adaptive Face Forgery Detection in Cross Domain Luchuan Song, Zheng Fang, Xiaodan Li, Xiaoyi Dong, Zhenchao Jin, Yuefeng Chen, Siwei Lyu code -1
Real-Time Online Video Detection with Temporal Smoothing Transformers Yue Zhao, Philipp Krähenbühl code -1
TallFormer: Temporal Action Localization with a Long-Memory Transformer Feng Cheng, Gedas Bertasius code -1
Mining Relations Among Cross-Frame Affinities for Video Semantic Segmentation Guolei Sun, Yun Liu, Hao Tang, Ajad Chhatkuli, Le Zhang, Luc Van Gool code -1
TL;DW? Summarizing Instructional Videos with Task Relevance and Cross-Modal Saliency Medhini Narasimhan, Arsha Nagrani, Chen Sun, Michael Rubinstein, Trevor Darrell, Anna Rohrbach, Cordelia Schmid code -1
Rethinking Learning Approaches for Long-Term Action Anticipation Megha Nawhal, Akash Abdu Jyothi, Greg Mori code -1
DualFormer: Local-Global Stratified Transformer for Efficient Video Recognition Yuxuan Liang, Pan Zhou, Roger Zimmermann, Shuicheng Yan code -1
Hierarchical Feature Alignment Network for Unsupervised Video Object Segmentation Gensheng Pei, Fumin Shen, Yazhou Yao, GuoSen Xie, Zhenmin Tang, Jinhui Tang code -1
PAC-Net: Highlight Your Video via History Preference Modeling Hang Wang, Penghao Zhou, Chong Zhou, Zhao Zhang, Xing Sun code -1
How Severe Is Benchmark-Sensitivity in Video Self-supervised Learning? Fida Mohammad Thoker, Hazel Doughty, Piyush Bagad, Cees G. M. Snoek code -1
A Sliding Window Scheme for Online Temporal Action Localization Young Hwi Kim, Hyolim Kang, Seon Joo Kim code -1
ERA: Expert Retrieval and Assembly for Early Action Prediction Lin Geng Foo, Tianjiao Li, Hossein Rahmani, Qiuhong Ke, Jun Liu code -1
Dual Perspective Network for Audio-Visual Event Localization Varshanth R. Rao, Md Ibrahim Khalil, Haoda Li, Peng Dai, Juwei Lu code -1
NSNet: Non-saliency Suppression Sampler for Efficient Video Recognition Boyang Xia, Wenhao Wu, Haoran Wang, Rui Su, Dongliang He, Haosen Yang, Xiaoran Fan, Wanli Ouyang code -1
Video Activity Localisation with Uncertainties in Temporal Boundary Jiabo Huang, Hailin Jin, Shaogang Gong, Yang Liu code -1
Temporal Saliency Query Network for Efficient Video Recognition Boyang Xia, Zhihao Wang, Wenhao Wu, Haoran Wang, Jungong Han code -1
Reversed Image Signal Processing and RAW Reconstruction. AIM 2022 Challenge Report Marcos V. Conde, Radu Timofte, Yibin Huang, Jingyang Peng, Chang Chen, Cheng Li, Eduardo PérezPellitero, Fenglong Song, Furui Bai, Shuai Liu, Chaoyu Feng, Xiaotao Wang, Lei Lei, Yu Zhu, Chenghua Li, Yingying Jiang, Yong A, Peisong Wang, Cong Leng, Jian Cheng, Xiaoyu Liu, Zhicun Yin, Zhilu Zhang, Junyi Li, Ming Liu, Wangmeng Zuo, Jun Jiang, Jinha Kim, Yue Zhang, Beiji Zou, Zhikai Zong, Xiaoxiao Liu, Juan MarínVega, Michael Sloth, Peter SchneiderKamp, Richard Röttger, Furkan Kinli, Baris Özcan, Furkan Kiraç, Li Leyi, S. M. Nadim Uddin, Dipon Kumar Ghosh, Yong Ju Jung code -1
AIM 2022 Challenge on Instagram Filter Removal: Methods and Results Furkan Kinli, Sami Mentes, Baris Özcan, Furkan Kiraç, Radu Timofte, Yi Zuo, Zitao Wang, Xiaowen Zhang, Yu Zhu, Chenghua Li, Cong Leng, Jian Cheng, Shuai Liu, Chaoyu Feng, Furui Bai, Xiaotao Wang, Lei Lei, Tianzhi Ma, Zihan Gao, Wenxin He, WoonHa Yeo, WangTaek Oh, YoungIl Kim, HanCheol Ryu, Gang He, Shaoyi Long, S. M. A. Sharif, Rizwan Ali Naqvi, Sungjun Kim, Guisik Kim, Seohyeon Lee, Sabari Nathan, Priya Kansal code -1
Learned Smartphone ISP on Mobile GPUs with Deep Learning, Mobile AI & AIM 2022 Challenge: Report Andrey Ignatov, Radu Timofte, Shuai Liu, Chaoyu Feng, Furui Bai, Xiaotao Wang, Lei Lei, Ziyao Yi, Yan Xiang, Zibin Liu, Shaoqing Li, Keming Shi, Dehui Kong, Ke Xu, Minsu Kwon, Yaqi Wu, Jiesi Zheng, Zhihao Fan, Xun Wu, Feng Zhang, Albert No, Minhyeok Cho, Zewen Chen, Xiaze Zhang, Ran Li, Juan Wang, Zhiming Wang, Marcos V. Conde, UiJin Choi, Georgy Perevozchikov, Egor I. Ershov, Zheng Hui, Mengchuan Dong, Xin Lou, Wei Zhou, Cong Pang, Haina Qin, Mingxuan Cai code -1
Efficient Single-Image Depth Estimation on Mobile Devices, Mobile AI & AIM 2022 Challenge: Report Andrey Ignatov, Grigory Malivenko, Radu Timofte, Lukasz Treszczotko, Xin Chang, Piotr Ksiazek, Michal Lopuszynski, Maciej Pioro, Rafal Rudnicki, Maciej Smyl, Yujie Ma, Zhenyu Li, Zehui Chen, Jialei Xu, Xianming Liu, Junjun Jiang, XueChao Shi, Difan Xu, Yanan Li, Xiaotao Wang, Lei Lei, Ziyu Zhang, Yicheng Wang, Zilong Huang, Guozhong Luo, Gang Yu, Bin Fu, Jiaqi Li, Yiran Wang, Zihao Huang, Zhiguo Cao, Marcos V. Conde, Denis Sapozhnikov, Byeong Hyun Lee, Dongwon Park, Seongmin Hong, Joonhee Lee, Seunggyu Lee, Se Young Chun code -1
Efficient and Accurate Quantized Image Super-Resolution on Mobile NPUs, Mobile AI & AIM 2022 Challenge: Report Andrey Ignatov, Radu Timofte, Maurizio Denna, Abdel Younes, Ganzorig Gankhuyag, Jingang Huh, Myeong Kyun Kim, Kihwan Yoon, HyeonCheol Moon, Seungho Lee, Yoonsik Choe, Jinwoo Jeong, Sungjei Kim, Maciej Smyl, Tomasz Latkowski, Pawel Kubik, Michal Sokolski, Yujie Ma, Jiahao Chao, Zhou Zhou, Hongfan Gao, Zhengfeng Yang, Zhenbing Zeng, Zhengyang Zhuge, Chenghua Li, Dan Zhu, Mengdi Sun, Ran Duan, Yan Gao, Lingshun Kong, Long Sun, Xiang Li, Xingdong Zhang, Jiawei Zhang, Yaqi Wu, Jinshan Pan, Gaocheng Yu, Jin Zhang, Feng Zhang, Zhe Ma, Hongbin Wang, Hojin Cho, Steve Kim, Huaen Li, Yanbo Ma, Ziwei Luo, Youwei Li, Lei Yu, Zhihong Wen, Qi Wu, Haoqiang Fan, Shuaicheng Liu, Lize Zhang, Zhikai Zong, Jeremy Kwon, Junxi Zhang, Mengyuan Li, Nianxiang Fu, Guanchen Ding, Han Zhu, Zhenzhong Chen, Gen Li, Yuanfan Zhang, Lei Sun, Dafeng Zhang, Neo Yang, Fitz Liu, Jerry Zhao, Mustafa Ayazoglu, Bahri Batuhan Bilecen, Shota Hirose, Kasidis Arunruangsirilert, Luo Ao, Ho Chun Leung, Andrew Wei, Jie Liu, Qiang Liu, Dahai Yu, Ao Li, Lei Luo, Ce Zhu, Seongmin Hong, Dongwon Park, Joonhee Lee, Byeong Hyun Lee, Seunggyu Lee, Se Young Chun, Ruiyuan He, Xuhao Jiang, Haihang Ruan, Xinjian Zhang, Jing Liu, Garas Gendy, Nabil Sabor, Jingchao Hou, Guanghui He code -1
Power Efficient Video Super-Resolution on Mobile NPUs with Deep Learning, Mobile AI & AIM 2022 Challenge: Report Andrey Ignatov, Radu Timofte, ChengMing Chiang, HsienKai Kuo, YuSyuan Xu, ManYu Lee, Allen Lu, ChiaMing Cheng, ChihCheng Chen, JiaYing Yong, HongHan Shuai, WenHuang Cheng, Zhuang Jia, Tianyu Xu, Yijian Zhang, Long Bao, Heng Sun, Diankai Zhang, Si Gao, Shaoli Liu, Biao Wu, Xiaofeng Zhang, Chengjian Zheng, Kaidi Lu, Ning Wang, Xiao Sun, Haodong Wu, Xuncheng Liu, Weizhan Zhang, Caixia Yan, Haipeng Du, Qinghua Zheng, Qi Wang, Wangdu Chen, Ran Duan, Mengdi Sun, Dan Zhu, Guannan Chen, Hojin Cho, Steve Kim, Shijie Yue, Chenghua Li, Zhengyang Zhuge, Wei Chen, Wenxu Wang, Yufeng Zhou, Xiaochen Cai, Hengxing Cai, Kele Xu, Li Liu, Zehua Cheng, Wenyi Lian, Wenjing Lian code -1
Realistic Bokeh Effect Rendering on Mobile GPUs, Mobile AI & AIM 2022 Challenge: Report Andrey Ignatov, Radu Timofte, Jin Zhang, Feng Zhang, Gaocheng Yu, Zhe Ma, Hongbin Wang, Minsu Kwon, Haotian Qian, Wentao Tong, Pan Mu, Ziping Wang, Guangjing Yan, Brian Lee, Lei Fei, Huaijin Chen, Hyebin Cho, Byeongjun Kwon, Munchurl Kim, Mingyang Qian, Huixin Ma, Yanan Li, Xiaotao Wang, Lei Lei code -1
AIM 2022 Challenge on Super-Resolution of Compressed Image and Video: Dataset, Methods and Results Ren Yang, Radu Timofte, Xin Li, Qi Zhang, Lin Zhang, Fanglong Liu, Dongliang He, Fu Li, He Zheng, Weihang Yuan, Pavel Ostyakov, Dmitry Vyal, Magauiya Zhussip, Xueyi Zou, Youliang Yan, Lei Li, Jingzhu Tang, Ming Chen, Shijie Zhao, Yu Zhu, Xiaoran Qin, Chenghua Li, Cong Leng, Jian Cheng, Claudio Rota, Marco Buzzelli, Simone Bianco, Raimondo Schettini, Dafeng Zhang, Feiyu Huang, Shizhuo Liu, Xiaobing Wang, Zhezhu Jin, Bingchen Li, Xin Li, Mingxi Li, Ding Liu, Wenbin Zou, Peijie Dong, Tian Ye, Yunchen Zhang, Ming Tan, Xin Niu, Mustafa Ayazoglu, Marcos Conde, UiJin Choi, Zhuang Jia, Tianyu Xu, Yijian Zhang, Mao Ye, Dengyan Luo, Xiaofeng Pan, Liuhan Peng code -1
Swin-Unet: Unet-Like Pure Transformer for Medical Image Segmentation Hu Cao, Yueyue Wang, Joy Chen, Dongsheng Jiang, Xiaopeng Zhang, Qi Tian, Manning Wang code -1
Self-attention Capsule Network for Tissue Classification in Case of Challenging Medical Image Statistics Assaf Hoogi, Brian Wilcox, Yachee Gupta, Daniel L. Rubin code -1
ReLaX: Retinal Layer Attribution for Guided Explanations of Automated Optical Coherence Tomography Classification Evan Wen, ReBecca Sorenson, Max Ehrlich code -1
Neural Registration and Segmentation of White Matter Tracts in Multi-modal Brain MRI Noa Barzilay, Ilya Nelkenbaum, Eli Konen, Nahum Kiryati, Arnaldo Mayer code -1
Complementary Phase Encoding for Pair-Wise Neural Deblurring of Accelerated Brain MRI Gali Hod, Michael Green, Mark Waserman, Eli Konen, Shai Shrot, Ilya Nelkenbaum, Nahum Kiryati, Arnaldo Mayer code -1
Frequency Dropout: Feature-Level Regularization via Randomized Filtering Mobarakol Islam, Ben Glocker code -1
PVBM: A Python Vasculature Biomarker Toolbox Based on Retinal Blood Vessel Segmentation Jonathan Fhima, Jan Van Eijgen, Ingeborg Stalmans, Yevgeniy Men, Moti Freiman, Joachim A. Behar code -1
Simultaneous Detection and Classification of Partially and Weakly Supervised Cells Alona Golts, Ido Livneh, Yaniv Zohar, Aaron Ciechanover, Michael Elad code -1
Deep-ASPECTS: A Segmentation-Assisted Model for Stroke Severity Measurement Ujjwal Upadhyay, Mukul Ranjan, Satish Golla, Swetha Tanamala, Preetham Sreenivas, Sasank Chilamkurthy, Jeyaraj Pandian, Jason Tarpley code -1
ExSwin-Unet: An Unbalanced Weighted Unet with Shifted Window and External Attentions for Fetal Brain MRI Image Segmentation Yufei Wen, Chongxin Liang, Jingyin Lin, Huisi Wu, Jing Qin code -1
Contour Dice Loss for Structures with Fuzzy and Complex Boundaries in Fetal MRI Bella SpecktorFadida, Bossmat Yehuda, Daphna LinkSourani, Liat BenSira, Dafna BenBashat, Leo Joskowicz code -1
Multi-scale Multi-task Distillation for Incremental 3D Medical Image Segmentation Mu Tian, Qinzhu Yang, Yi Gao code -1
A Data-Efficient Deep Learning Framework for Segmentation and Classification of Histopathology Images Pranav Singh, Jacopo Cirrone code -1
Bounded Future MS-TCN++ for Surgical Gesture Recognition Adam Goldbraikh, Netanell Avisdris, Carla M. Pugh, Shlomi Laufer code -1
Anatomy-Aware Contrastive Representation Learning for Fetal Ultrasound Zeyu Fu, Jianbo Jiao, Robail Yasrab, Lior Drukker, Aris T. Papageorghiou, J. Alison Noble code -1
Joint Calibrationless Reconstruction and Segmentation of Parallel MRI Aniket Pramanik, Mathews Jacob code -1
Patient-Level Microsatellite Stability Assessment from Whole Slide Images by Combining Momentum Contrast Learning and Group Patch Embeddings Daniel Shats, Hadar Hezi, Guy Shani, Yosef E. Maruvka, Moti Freiman code -1
Segmenting Glandular Biopsy Images Using the Separate Merged Objects Algorithm David Sabban, Ilan Shimshoni code -1
qDWI-Morph: Motion-Compensated Quantitative Diffusion-Weighted MRI Analysis for Fetal Lung Maturity Assessment Yael ZaffraniReznikov, Onur Afacan, Sila Kurugol, Simon K. Warfield, Moti Freiman code -1
Estimating Withdrawal Time in Colonoscopies Liran Katzir, Danny Veikherman, Valentin Dashinsky, Roman Goldenberg, Ilan Shimshoni, Nadav Rabani, Regev Cohen, Ori Kelner, Ehud Rivlin, Daniel Freedman code -1
Beyond Local Processing: Adapting CNNs for CT Reconstruction Bassel Hamoud, Yuval Bahat, Tomer Michaeli code -1
CL-GAN: Contrastive Learning-Based Generative Adversarial Network for Modality Transfer with Limited Paired Data Hajar Emami, Ming Dong, Carri GlideHurst code -1
IMPaSh: A Novel Domain-Shift Resistant Representation for Colorectal Cancer Tissue Classification Trinh Thi Le Vuong, Quoc Dang Vu, Mostafa Jahanifar, Simon Graham, Jin Tae Kwak, Nasir M. Rajpoot code -1
Surgical Workflow Recognition: From Analysis of Challenges to Architectural Study Tobias Czempiel, Aidean Sharghi, Magdalini Paschali, Nassir Navab, Omid Mohareri code -1
RVENet: A Large Echocardiographic Dataset for the Deep Learning-Based Assessment of Right Ventricular Function Bálint Magyar, Márton Tokodi, András Soós, Máté Tolvaj, Bálint Károly Lakatos, Alexandra Fábián, Elena Surkova, Béla Merkely, Attila Kovács, András Horváth code -1
Initialization and Alignment for Adversarial Texture Optimization Xiaoming Zhao, Zhizhen Zhao, Alexander G. Schwing code -1
SIGNet: Intrinsic Image Decomposition by a Semantic and Invariant Gradient Driven Network for Indoor Scenes Partha Das, Sezer Karaoglu, Arjan Gijsenij, Theo Gevers code -1
Implicit Map Augmentation for Relocalization Yuxin Hou, Tianwei Shen, TsunYi Yang, Daniel DeTone, Hyo Jin Kim, Chris Sweeney, Richard A. Newcombe code -1
Social Processes: Self-supervised Meta-learning Over Conversational Groups for Forecasting Nonverbal Social Cues Chirag Raman, Hayley Hung, Marco Loog code -1
Photo-Realistic 360° Head Avatars in the Wild Stanislaw Szymanowicz, Virginia Estellers, Tadas Baltrusaitis, Matthew Johnson code -1
AvatarGen: A 3D Generative Model for Animatable Human Avatars Jianfeng Zhang, Zihang Jiang, Dingdong Yang, Hongyi Xu, Yichun Shi, Guoxian Song, Zhongcong Xu, Xinchao Wang, Jiashi Feng code -1
INGeo: Accelerating Instant Neural Scene Reconstruction with Noisy Geometry Priors Chaojian Li, Bichen Wu, Albert Pumarola, Peizhao Zhang, Yingyan Lin, Peter Vajda code -1
Number-Adaptive Prototype Learning for 3D Point Cloud Semantic Segmentation Yangheng Zhao, Jun Wang, Xiaolong Li, Yue Hu, Ce Zhang, Yanfeng Wang, Siheng Chen code -1
Self-supervised 3D Human Pose Estimation in Static Video via Neural Rendering Luca Schmidtke, Benjamin Hou, Athanasios Vlontzos, Bernhard Kainz code -1
Racial Bias in the Beautyverse: Evaluation of Augmented-Reality Beauty Filters Piera Riccio, Nuria Oliver code -1
LWA-HAND: Lightweight Attention Hand for Interacting Hand Reconstruction Xinhan Di, Pengqian Yu code -1
Neural Mesh-Based Graphics Shubhendu Jena, Franck Multon, Adnane Boukhayma code -1
One-Shot Learning for Human Affordance Detection Abel PachecoOrtega, Walterio W. MayolCuevas code -1
Fast Two-View Motion Segmentation Using Christoffel Polynomials Bengisu Özbay, Octavia I. Camps, Mario Sznaier code -1
UCTNet: Uncertainty-Aware Cross-Modal Transformer Network for Indoor RGB-D Semantic Segmentation Xiaowen Ying, Mooi Choo Chuah code -1
Bi-directional Contrastive Learning for Domain Adaptive Semantic Segmentation Geon Lee, Chanho Eom, Wonkyung Lee, Hyekang Park, Bumsub Ham code -1
Learning Regional Purity for Instance Segmentation on 3D Point Clouds Shichao Dong, Guosheng Lin, TzuYi Hung code -1
Cross-Domain Few-Shot Semantic Segmentation Shuo Lei, Xuchao Zhang, Jianfeng He, Fanglan Chen, Bowen Du, ChangTien Lu code -1
Generative Subgraph Contrast for Self-Supervised Graph Representation Learning Yuehui Han, Le Hui, Haobo Jiang, Jianjun Qian, Jin Xie code -1
SdAE: Self-distillated Masked Autoencoder Yabo Chen, Yuchen Liu, Dongsheng Jiang, Xiaopeng Zhang, Wenrui Dai, Hongkai Xiong, Qi Tian code -1
Demystifying Unsupervised Semantic Correspondence Estimation Mehmet Aygün, Oisin Mac Aodha code -1
Open-Set Semi-Supervised Object Detection YenCheng Liu, ChihYao Ma, Xiaoliang Dai, Junjiao Tian, Peter Vajda, Zijian He, Zsolt Kira code -1
Vibration-Based Uncertainty Estimation for Learning from Limited Supervision Hengtong Hu, Lingxi Xie, Xinyue Huo, Richang Hong, Qi Tian code -1
Concurrent Subsidiary Supervision for Unsupervised Source-Free Domain Adaptation Jogendra Nath Kundu, Suvaansh Bhambri, Akshay R. Kulkarni, Hiran Sarkar, Varun Jampani, R. Venkatesh Babu code -1
Weakly Supervised Object Localization Through Inter-class Feature Similarity and Intra-class Appearance Consistency Jun Wei, Sheng Wang, S. Kevin Zhou, Shuguang Cui, Zhen Li code -1
Active Learning Strategies for Weakly-Supervised Object Detection Huy V. Vo, Oriane Siméoni, Spyros Gidaris, Andrei Bursuc, Patrick Pérez, Jean Ponce code -1
mc-BEiT: Multi-choice Discretization for Image BERT Pre-training Xiaotong Li, Yixiao Ge, Kun Yi, Zixuan Hu, Ying Shan, LingYu Duan code -1
Bootstrapped Masked Autoencoders for Vision BERT Pretraining Xiaoyi Dong, Jianmin Bao, Ting Zhang, Dongdong Chen, Weiming Zhang, Lu Yuan, Dong Chen, Fang Wen, Nenghai Yu code -1
Unsupervised Visual Representation Learning by Synchronous Momentum Grouping Bo Pang, Yifan Zhang, Yaoyi Li, Jia Cai, Cewu Lu code -1
Improving Few-Shot Part Segmentation Using Coarse Supervision Oindrila Saha, Zezhou Cheng, Subhransu Maji code -1
What to Hide from Your Students: Attention-Guided Masked Image Modeling Ioannis Kakogeorgiou, Spyros Gidaris, Bill Psomas, Yannis Avrithis, Andrei Bursuc, Konstantinos Karantzalos, Nikos Komodakis code -1
Pointly-Supervised Panoptic Segmentation Junsong Fan, Zhaoxiang Zhang, Tieniu Tan code -1
MVP: Multimodality-Guided Visual Pre-training Longhui Wei, Lingxi Xie, Wengang Zhou, Houqiang Li, Qi Tian code -1
Locally Varying Distance Transform for Unsupervised Visual Anomaly Detection WenYan Lin, Zhonghang Liu, Siying Liu code -1
HRDA: Context-Aware High-Resolution Domain-Adaptive Semantic Segmentation Lukas Hoyer, Dengxin Dai, Luc Van Gool code -1
SPot-the-Difference Self-supervised Pre-training for Anomaly Detection and Segmentation Yang Zou, Jongheon Jeong, Latha Pemula, Dongqing Zhang, Onkar Dabeer code -1
Dual-Domain Self-supervised Learning and Model Adaption for Deep Compressive Imaging Yuhui Quan, Xinran Qin, Tongyao Pang, Hui Ji code -1
Unsupervised Selective Labeling for More Effective Semi-supervised Learning Xudong Wang, Long Lian, Stella X. Yu code -1
Max Pooling with Vision Transformers Reconciles Class and Shape in Weakly Supervised Semantic Segmentation Simone Rossetti, Damiano Zappia, Marta Sanzari, Marco Schaerf, Fiora Pirri code -1
Dense Siamese Network for Dense Unsupervised Learning Wenwei Zhang, Jiangmiao Pang, Kai Chen, Chen Change Loy code -1
Multi-granularity Distillation Scheme Towards Lightweight Semi-supervised Semantic Segmentation Jie Qin, Jie Wu, Ming Li, Xuefeng Xiao, Min Zheng, Xingang Wang code -1
CP2: Copy-Paste Contrastive Pretraining for Semantic Segmentation Feng Wang, Huiyu Wang, Chen Wei, Alan L. Yuille, Wei Shen code -1
Self-Filtering: A Noise-Aware Sample Selection for Label Noise with Confidence Penalization Qi Wei, Haoliang Sun, Xiankai Lu, Yilong Yin code -1
RDA: Reciprocal Distribution Alignment for Robust Semi-supervised Learning Yue Duan, Lei Qi, Lei Wang, Luping Zhou, Yinghuan Shi code -1
MemSAC: Memory Augmented Sample Consistency for Large Scale Domain Adaptation Tarun Kalluri, Astuti Sharma, Manmohan Chandraker code -1
United Defocus Blur Detection and Deblurring via Adversarial Promoting Learning Wenda Zhao, Fei Wei, You He, Huchuan Lu code -1
Synergistic Self-supervised and Quantization Learning YunHao Cao, Peiqin Sun, Yechang Huang, Jianxin Wu, Shuchang Zhou code -1
Semi-supervised Vision Transformers Zejia Weng, Xitong Yang, Ang Li, Zuxuan Wu, YuGang Jiang code -1
Domain Adaptive Video Segmentation via Temporal Pseudo Supervision Yun Xing, Dayan Guan, Jiaxing Huang, Shijian Lu code -1
Diverse Learner: Exploring Diverse Supervision for Semi-supervised Object Detection Linfeng Li, Minyue Jiang, Yue Yu, Wei Zhang, Xiangru Lin, Yingying Li, Xiao Tan, Jingdong Wang, Errui Ding code -1
A Closer Look at Invariances in Self-supervised Pre-training for 3D Vision Lanxiao Li, Michael Heizmann code -1
ConMatch: Semi-supervised Learning with Confidence-Guided Consistency Regularization Jiwon Kim, Youngjo Min, Daehwan Kim, Gyuseong Lee, Junyoung Seo, Kwangrok Ryoo, Seungryong Kim code -1
FedX: Unsupervised Federated Learning with Cross Knowledge Distillation Sungwon Han, Sungwon Park, Fangzhao Wu, Sundong Kim, Chuhan Wu, Xing Xie, Meeyoung Cha code -1
W2N: Switching from Weak Supervision to Noisy Supervision for Object Detection Zitong Huang, Yiping Bao, Bowen Dong, Erjin Zhou, Wangmeng Zuo code -1
Decoupled Adversarial Contrastive Learning for Self-supervised Adversarial Robustness Chaoning Zhang, Kang Zhang, Chenshuang Zhang, Axi Niu, Jiu Feng, Chang D. Yoo, In So Kweon code -1
ARAH: Animatable Volume Rendering of Articulated Human SDFs Shaofei Wang, Katja Schwarz, Andreas Geiger, Siyu Tang code -1
ASpanFormer: Detector-Free Image Matching with Adaptive Span Transformer Hongkai Chen, Zixin Luo, Lei Zhou, Yurun Tian, Mingmin Zhen, Tian Fang, David McKinnon, Yanghai Tsin, Long Quan code -1
NDF: Neural Deformable Fields for Dynamic Human Modelling Ruiqi Zhang, Jie Chen code -1
Neural Density-Distance Fields Itsuki Ueda, Yoshihiro Fukuhara, Hirokatsu Kataoka, Hiroaki Aizawa, Hidehiko Shishido, Itaru Kitahara code -1
NeXT: Towards High Quality Neural Radiance Fields via Multi-skip Transformer Yunxiao Wang, Yanjie Li, Peidong Liu, Tao Dai, ShuTao Xia code -1
Learning Online Multi-sensor Depth Fusion Erik Sandström, Martin R. Oswald, Suryansh Kumar, Silvan Weder, Fisher Yu, Cristian Sminchisescu, Luc Van Gool code -1
BungeeNeRF: Progressive Neural Radiance Field for Extreme Multi-scale Scene Rendering Yuanbo Xiangli, Linning Xu, Xingang Pan, Nanxuan Zhao, Anyi Rao, Christian Theobalt, Bo Dai, Dahua Lin code -1
Decomposing the Tangent of Occluding Boundaries According to Curvatures and Torsions Huizong Yang, Anthony J. Yezzi code -1
NeuRIS: Neural Reconstruction of Indoor Scenes Using Normal Priors Jiepeng Wang, Peng Wang, Xiaoxiao Long, Christian Theobalt, Taku Komura, Lingjie Liu, Wenping Wang code -1
Generalizable Patch-Based Neural Rendering Mohammed Suhail, Carlos Esteves, Leonid Sigal, Ameesh Makadia code -1
Improving RGB-D Point Cloud Registration by Learning Multi-scale Local Linear Transformation Ziming Wang, Xiaoliang Huo, Zhenghao Chen, Jing Zhang, Lu Sheng, Dong Xu code -1
Real-Time Neural Character Rendering with Pose-Guided Multiplane Images Hao Ouyang, Bo Zhang, Pan Zhang, Hao Yang, Jiaolong Yang, Dong Chen, Qifeng Chen, Fang Wen code -1
SparseNeuS: Fast Generalizable Neural Surface Reconstruction from Sparse Views Xiaoxiao Long, Cheng Lin, Peng Wang, Taku Komura, Wenping Wang code -1
Disentangling Object Motion and Occlusion for Unsupervised Multi-frame Monocular Depth Ziyue Feng, Liang Yang, Longlong Jing, Haiyan Wang, Yingli Tian, Bing Li code -1
Depth Field Networks For Generalizable Multi-view Scene Representation Vitor Guizilini, Igor Vasiljevic, Jiading Fang, Rareș Ambruș, Greg Shakhnarovich, Matthew R. Walter, Adrien Gaidon code -1
Context-Enhanced Stereo Transformer Weiyu Guo, Zhaoshuo Li, Yongkui Yang, Zheng Wang, Russell H. Taylor, Mathias Unberath, Alan L. Yuille, Yingwei Li code -1
PCW-Net: Pyramid Combination and Warping Cost Volume for Stereo Matching Zhelun Shen, Yuchao Dai, Xibin Song, Zhibo Rao, Dingfu Zhou, Liangjun Zhang code -1
Gen6D: Generalizable Model-Free 6-DoF Object Pose Estimation from RGB Images Yuan Liu, Yilin Wen, Sida Peng, Cheng Lin, Xiaoxiao Long, Taku Komura, Wenping Wang code -1
Latency-Aware Collaborative Perception Zixing Lei, Shunli Ren, Yue Hu, Wenjun Zhang, Siheng Chen code -1
TensoRF: Tensorial Radiance Fields Anpei Chen, Zexiang Xu, Andreas Geiger, Jingyi Yu, Hao Su code -1
NeFSAC: Neurally Filtered Minimal Samples Luca Cavalli, Marc Pollefeys, Daniel Barath code -1
SNeS: Learning Probably Symmetric Neural Surfaces from Incomplete Data Eldar Insafutdinov, Dylan Campbell, João F. Henriques, Andrea Vedaldi code -1
HDR-Plenoxels: Self-Calibrating High Dynamic Range Radiance Fields Kim JunSeong, Kim YuJi, Moon YeBin, TaeHyun Oh code -1
NeuMan: Neural Human Radiance Field from a Single Video Wei Jiang, Kwang Moo Yi, Golnoosh Samei, Oncel Tuzel, Anurag Ranjan code -1
TAVA: Template-free Animatable Volumetric Actors Ruilong Li, Julian Tanke, Minh Vo, Michael Zollhöfer, Jürgen Gall, Angjoo Kanazawa, Christoph Lassner code -1
EASNet: Searching Elastic and Accurate Network Architecture for Stereo Matching Qiang Wang, Shaohuai Shi, Kaiyong Zhao, Xiaowen Chu code -1
Relative Pose from SIFT Features Daniel Barath, Zuzana Kukelova code -1
Selection and Cross Similarity for Event-Image Deep Stereo Hoonhee Cho, KukJin Yoon code -1
D3Net: A Unified Speaker-Listener Architecture for 3D Dense Captioning and Visual Grounding Dave Zhenyu Chen, Qirui Wu, Matthias Nießner, Angel X. Chang code -1
CIRCLE: Convolutional Implicit Reconstruction and Completion for Large-Scale Indoor Scene Haoxiang Chen, Jiahui Huang, TaiJiang Mu, ShiMin Hu code -1
ParticleSfM: Exploiting Dense Point Trajectories for Localizing Moving Cameras in the Wild Wang Zhao, Shaohui Liu, Hengkai Guo, Wenping Wang, YongJin Liu code -1
4DContrast: Contrastive Learning with Dynamic Correspondences for 3D Scene Understanding Yujin Chen, Matthias Nießner, Angela Dai code -1
Few 'Zero Level Set'-Shot Learning of Shape Signed Distance Functions in Feature Space Amine Ouasfi, Adnane Boukhayma code -1
Solution Space Analysis of Essential Matrix Based on Algebraic Error Minimization Gaku Nakano code -1
Approximate Differentiable Rendering with Algebraic Surfaces Leonid Keselman, Martial Hebert code -1
CoVisPose: Co-visibility Pose Transformer for Wide-Baseline Relative Pose Estimation in 360° Indoor Panoramas Will Hutchcroft, Yuguang Li, Ivaylo Boyadzhiev, Zhiqiang Wan, Haiyan Wang, Sing Bing Kang code -1
Affine Correspondences Between Multi-camera Systems for 6DOF Relative Pose Estimation Banglei Guan, Ji Zhao code -1
GraphFit: Learning Multi-scale Graph-Convolutional Representation for Point Cloud Normal Estimation Keqiang Li, Mingyang Zhao, Huaiyu Wu, DongMing Yan, Zhen Shen, FeiYue Wang, Gang Xiong code -1
IS-MVSNet: Importance Sampling-Based MVSNet Likang Wang, Yue Gong, Xinjun Ma, Qirui Wang, Kaixuan Zhou, Lei Chen code -1
Point Scene Understanding via Disentangled Instance Mesh Reconstruction Jiaxiang Tang, Xiaokang Chen, Jingbo Wang, Gang Zeng code -1
DiffuStereo: High Quality Human Reconstruction via Diffusion-Based Stereo Using Sparse Cameras Ruizhi Shao, Zerong Zheng, Hongwen Zhang, Jingxiang Sun, Yebin Liu code -1
Space-Partitioning RANSAC Daniel Barath, Gábor Valasek code -1
Box-Supervised Instance Segmentation with Level Set Evolution Wentong Li, Wenyu Liu, Jianke Zhu, Miaomiao Cui, XianSheng Hua, Lei Zhang code -1
Point Primitive Transformer for Long-Term 4D Point Cloud Video Understanding Hao Wen, Yunze Liu, Jingwei Huang, Bo Duan, Li Yi code -1
Adaptive Agent Transformer for Few-Shot Segmentation Yuan Wang, Rui Sun, Zhe Zhang, Tianzhu Zhang code -1
Waymo Open Dataset: Panoramic Video Panoptic Segmentation Jieru Mei, Alex Zihao Zhu, Xinchen Yan, Hang Yan, Siyuan Qiao, LiangChieh Chen, Henrik Kretzschmar code -1
TransFGU: A Top-Down Approach to Fine-Grained Unsupervised Semantic Segmentation Zhaoyuan Yin, Pichao Wang, Fan Wang, Xianzhe Xu, Hanling Zhang, Hao Li, Rong Jin code -1
AdaAfford: Learning to Adapt Manipulation Affordance for 3D Articulated Objects via Few-Shot Interactions Yian Wang, Ruihai Wu, Kaichun Mo, Jiaqi Ke, Qingnan Fan, Leonidas J. Guibas, Hao Dong code -1
Cost Aggregation with 4D Convolutional Swin Transformer for Few-Shot Segmentation Sunghwan Hong, Seokju Cho, Jisu Nam, Stephen Lin, Seungryong Kim code -1
Fine-Grained Egocentric Hand-Object Segmentation: Dataset, Model, and Applications Lingzhi Zhang, Shenghao Zhou, Simon Stent, Jianbo Shi code -1
Perceptual Artifacts Localization for Inpainting Lingzhi Zhang, Yuqian Zhou, Connelly Barnes, Sohrab Amirghodsi, Zhe Lin, Eli Shechtman, Jianbo Shi code -1
2D Amodal Instance Segmentation Guided by 3D Shape Prior Zhixuan Li, Weining Ye, Tingting Jiang, Tiejun Huang code -1
Data Efficient 3D Learner via Knowledge Transferred from 2D Model PingChung Yu, Cheng Sun, Min Sun code -1
Adaptive Spatial-BCE Loss for Weakly Supervised Semantic Segmentation Tong Wu, Guangyu Gao, Junshi Huang, Xiaolin Wei, Xiaoming Wei, Chi Harold Liu code -1
Dense Gaussian Processes for Few-Shot Segmentation Joakim Johnander, Johan Edstedt, Michael Felsberg, Fahad Shahbaz Khan, Martin Danelljan code -1
3D Instances as 1D Kernels Yizheng Wu, Min Shi, Shuaiyuan Du, Hao Lu, Zhiguo Cao, Weicai Zhong code -1
TransMatting: Enhancing Transparent Objects Matting with Transformers Huanqia Cai, Fanglei Xue, Lele Xu, Lili Guo code -1
MVSalNet: Multi-view Augmentation for RGB-D Salient Object Detection Jiayuan Zhou, Lijun Wang, Huchuan Lu, Kaining Huang, Xinchu Shi, Bocong Liu code -1
k-means Mask Transformer Qihang Yu, Huiyu Wang, Siyuan Qiao, Maxwell D. Collins, Yukun Zhu, Hartwig Adam, Alan L. Yuille, LiangChieh Chen code -1
SegPGD: An Effective and Efficient Adversarial Attack for Evaluating and Boosting Segmentation Robustness Jindong Gu, Hengshuang Zhao, Volker Tresp, Philip H. S. Torr code -1
Adversarial Erasing Framework via Triplet with Gated Pyramid Pooling Layer for Weakly Supervised Semantic Segmentation SungHoon Yoon, Hyeokjun Kweon, Jegyeong Cho, Shinjeong Kim, KukJin Yoon code -1
Continual Semantic Segmentation via Structure Preserving and Projected Feature Alignment Zihan Lin, Zilei Wang, Yixin Zhang code -1
Interclass Prototype Relation for Few-Shot Segmentation Atsuro Okazawa code -1
Slim Scissors: Segmenting Thin Object from Synthetic Background Kunyang Han, Jun Hao Liew, Jiashi Feng, Huawei Tian, Yao Zhao, Yunchao Wei code -1
Abstracting Sketches Through Simple Primitives Stephan Alaniz, Massimiliano Mancini, Anjan Dutta, Diego Marcos, Zeynep Akata code -1
Multi-scale and Cross-scale Contrastive Learning for Semantic Segmentation Theodoros Pissas, Claudio S. Ravasio, Lyndon Da Cruz, Christos Bergeles code -1
One-Trimap Video Matting Hongje Seong, Seoung Wug Oh, Brian Price, Euntai Kim, JoonYoung Lee code -1
D²ADA: Dynamic Density-Aware Active Domain Adaptation for Semantic Segmentation TsungHan Wu, YiSyuan Liou, ShaoJi Yuan, HsinYing Lee, TungI Chen, KuanChih Huang, Winston H. Hsu code -1
Learning Quality-aware Dynamic Memory for Video Object Segmentation Yong Liu, Ran Yu, Fei Yin, Xinyuan Zhao, Wei Zhao, Weihao Xia, Yujiu Yang code -1
Learning Implicit Feature Alignment Function for Semantic Segmentation Hanzhe Hu, Yinbo Chen, Jiarui Xu, Shubhankar Borse, Hong Cai, Fatih Porikli, Xiaolong Wang code -1
Quantum Motion Segmentation Federica Arrigoni, Willi Menapace, Marcel Seelbach Benkner, Elisa Ricci, Vladislav Golyanik code -1
Instance as Identity: A Generic Online Paradigm for Video Instance Segmentation Feng Zhu, Zongxin Yang, Xin Yu, Yi Yang, Yunchao Wei code -1
Laplacian Mesh Transformer: Dual Attention and Topology Aware Network for 3D Mesh Classification and Segmentation XiaoJuan Li, Jie Yang, FangLue Zhang code -1
Geodesic-Former: A Geodesic-Guided Few-Shot 3D Point Cloud Instance Segmenter Tuan Ngo, Khoi Nguyen code -1
Union-Set Multi-source Model Adaptation for Semantic Segmentation Zongyao Li, Ren Togo, Takahiro Ogawa, Miki Haseyama code -1
Point MixSwap: Attentional Point Cloud Mixing via Swapping Matched Structural Divisions Ardian Umam, ChengKun Yang, YungYu Chuang, JenHui Chuang, YenYu Lin code -1
BATMAN: Bilateral Attention Transformer in Motion-Appearance Neighboring Space for Video Object Segmentation Ye Yu, Jialing Yuan, Gaurav Mittal, Fuxin Li, Mei Chen code -1
SPSN: Superpixel Prototype Sampling Network for RGB-D Salient Object Detection Minhyeok Lee, Chaewon Park, Suhwan Cho, Sangyoun Lee code -1
Global Spectral Filter Memory Network for Video Object Segmentation Yong Liu, Ran Yu, Jiahao Wang, Xinyuan Zhao, Yitong Wang, Yansong Tang, Yujiu Yang code -1
Video Instance Segmentation via Multi-Scale Spatio-Temporal Split Attention Transformer Omkar Thawakar, Sanath Narayan, Jiale Cao, Hisham Cholakkal, Rao Muhammad Anwer, Muhammad Haris Khan, Salman Khan, Michael Felsberg, Fahad Shahbaz Khan code -1
RankSeg: Adaptive Pixel Classification with Image Category Ranking for Segmentation Haodi He, Yuhui Yuan, Xiangyu Yue, Han Hu code -1
Learning Topological Interactions for Multi-Class Medical Image Segmentation Saumya Gupta, Xiaoling Hu, James Kaan, Michael Jin, Mutshipay Mpoy, Katherine Chung, Gagandeep Singh, Mary M. Saltz, Tahsin M. Kurç, Joel H. Saltz, Apostolos Tassiopoulos, Prateek Prasanna, Chao Chen code -1
Unsupervised Segmentation in Real-World Images via Spelke Object Inference Honglin Chen, Rahul Venkatesh, Yoni Friedman, Jiajun Wu, Joshua B. Tenenbaum, Daniel L. K. Yamins, Daniel M. Bear code -1
A Simple Baseline for Open-Vocabulary Semantic Segmentation with Pre-trained Vision-Language Model Mengde Xu, Zheng Zhang, Fangyun Wei, Yutong Lin, Yue Cao, Han Hu, Xiang Bai code -1
Efficient One-Stage Video Object Detection by Exploiting Temporal Consistency Guanxiong Sun, Yang Hua, Guosheng Hu, Neil Robertson code -1
Leveraging Action Affinity and Continuity for Semi-supervised Temporal Action Segmentation Guodong Ding, Angela Yao code -1
Spotting Temporally Precise, Fine-Grained Events in Video James Hong, Haotian Zhang, Michaël Gharbi, Matthew Fisher, Kayvon Fatahalian code -1
Unified Fully and Timestamp Supervised Temporal Action Segmentation via Sequence to Sequence Translation Nadine Behrmann, S. Alireza Golestaneh, Zico Kolter, Jürgen Gall, Mehdi Noroozi code -1
Efficient Video Transformers with Spatial-Temporal Token Selection Junke Wang, Xitong Yang, Hengduo Li, Li Liu, Zuxuan Wu, YuGang Jiang code -1
Long Movie Clip Classification with State-Space Video Models Md Mohaiminul Islam, Gedas Bertasius code -1
Prompting Visual-Language Models for Efficient Video Understanding Chen Ju, Tengda Han, Kunhao Zheng, Ya Zhang, Weidi Xie code -1
Asymmetric Relation Consistency Reasoning for Video Relation Grounding Huan Li, Ping Wei, Jiapeng Li, Zeyu Ma, Jiahui Shang, Nanning Zheng code -1
Self-supervised Social Relation Representation for Human Group Detection Jiacheng Li, Ruize Han, Haomin Yan, Zekun Qian, Wei Feng, Song Wang code -1
K-centered Patch Sampling for Efficient Video Recognition Seong Hyeon Park, Jihoon Tack, Byeongho Heo, JungWoo Ha, Jinwoo Shin code -1
A Deep Moving-Camera Background Model Guy Erez, Ron Shapira Weber, Oren Freifeld code -1
GraphVid: It only Takes a Few Nodes to Understand a Video Eitan Kosman, Dotan Di Castro code -1
Delta Distillation for Efficient Video Processing Amirhossein Habibian, Haitam Ben Yahia, Davide Abati, Efstratios Gavves, Fatih Porikli code -1
MorphMLP: An Efficient MLP-Like Backbone for Spatial-Temporal Representation Learning David Junhao Zhang, Kunchang Li, Yali Wang, Yunpeng Chen, Shashwat Chandra, Yu Qiao, Luoqi Liu, Mike Zheng Shou code -1
COMPOSER: Compositional Reasoning of Group Activity in Videos with Keypoint-Only Modality Honglu Zhou, Asim Kadav, Aviv Shamsian, Shijie Geng, Farley Lai, Long Zhao, Ting Liu, Mubbasir Kapadia, Hans Peter Graf code -1
E-NeRV: Expedite Neural Video Representation with Disentangled Spatial-Temporal Context Zizhang Li, Mengmeng Wang, Huaijin Pi, Kechun Xu, Jianbiao Mei, Yong Liu code -1
TDViT: Temporal Dilated Video Transformer for Dense Video Tasks Guanxiong Sun, Yang Hua, Guosheng Hu, Neil Robertson code -1
Semi-supervised Learning of Optical Flow by Flow Supervisor Woobin Im, Sebin Lee, SungEui Yoon code -1
Flow Graph to Video Grounding for Weakly-Supervised Multi-step Localization Nikita Dvornik, Isma Hadji, Hai Pham, Dhaivat Bhatt, Brais Martínez, Afsaneh Fazly, Allan D. Jepson code -1
Deep 360° Optical Flow Estimation Based on Multi-projection Fusion Yiheng Li, Connelly Barnes, Kun Huang, FangLue Zhang code -1
MaCLR: Motion-Aware Contrastive Learning of Representations for Videos Fanyi Xiao, Joseph Tighe, Davide Modolo code -1
Learning Long-Term Spatial-Temporal Graphs for Active Speaker Detection Kyle Min, Sourya Roy, Subarna Tripathi, Tanaya Guha, Somdeb Majumdar code -1
Frozen CLIP Models are Efficient Video Learners Ziyi Lin, Shijie Geng, Renrui Zhang, Peng Gao, Gerard de Melo, Xiaogang Wang, Jifeng Dai, Yu Qiao, Hongsheng Li code -1
PIP: Physical Interaction Prediction via Mental Simulation with Span Selection Jiafei Duan, Samson Yu, Soujanya Poria, Bihan Wen, Cheston Tan code -1
Panoramic Vision Transformer for Saliency Detection in 360° Videos Heeseung Yun, Sehun Lee, Gunhee Kim code -1
Bayesian Tracking of Video Graphs Using Joint Kalman Smoothing and Registration Aditi Basu Bal, Ramy Mounir, Sathyanarayanan N. Aakur, Sudeep Sarkar, Anuj Srivastava code -1
Motion Sensitive Contrastive Learning for Self-supervised Video Representation Jingcheng Ni, Nan Zhou, Jie Qin, Qian Wu, Junqi Liu, Boxun Li, Di Huang code -1
Dynamic Temporal Filtering in Video Models Fuchen Long, Zhaofan Qiu, Yingwei Pan, Ting Yao, ChongWah Ngo, Tao Mei code -1
Tip-Adapter: Training-Free Adaption of CLIP for Few-Shot Classification Renrui Zhang, Wei Zhang, Rongyao Fang, Peng Gao, Kunchang Li, Jifeng Dai, Yu Qiao, Hongsheng Li code -1
Temporal Lift Pooling for Continuous Sign Language Recognition Lianyu Hu, Liqing Gao, Zekang Liu, Wei Feng code -1
MORE: Multi-Order RElation Mining for Dense Captioning in 3D Scenes Yang Jiao, Shaoxiang Chen, Zequn Jie, Jingjing Chen, Lin Ma, YuGang Jiang code -1
SiRi: A Simple Selective Retraining Mechanism for Transformer-Based Visual Grounding Mengxue Qu, Yu Wu, Wu Liu, Qiqi Gong, Xiaodan Liang, Olga Russakovsky, Yao Zhao, Yunchao Wei code -1
Cross-Modal Prototype Driven Network for Radiology Report Generation Jun Wang, Abhir Bhalerao, Yulan He code -1
TM2T: Stochastic and Tokenized Modeling for the Reciprocal Generation of 3D Human Motions and Texts Chuan Guo, Xinxin Zuo, Sen Wang, Li Cheng code -1
SeqTR: A Simple Yet Universal Network for Visual Grounding Chaoyang Zhu, Yiyi Zhou, Yunhang Shen, Gen Luo, Xingjia Pan, Mingbao Lin, Chao Chen, Liujuan Cao, Xiaoshuai Sun, Rongrong Ji code -1
VTC: Improving Video-Text Retrieval with User Comments Laura Hanu, James Thewlis, Yuki M. Asano, Christian Rupprecht code -1
FashionViL: Fashion-Focused Vision-and-Language Representation Learning Xiao Han, Licheng Yu, Xiatian Zhu, Li Zhang, YiZhe Song, Tao Xiang code -1
Weakly Supervised Grounding for VQA in Vision-Language Transformers Aisha Urooj Khan, Hilde Kuehne, Chuang Gan, Niels da Vitoria Lobo, Mubarak Shah code -1
Automatic Dense Annotation of Large-Vocabulary Sign Language Videos Liliane Momeni, Hannah Bull, K. R. Prajwal, Samuel Albanie, Gül Varol, Andrew Zisserman code -1
MILES: Visual BERT Pre-training with Injected Language Semantics for Video-Text Retrieval Yuying Ge, Yixiao Ge, Xihui Liu, Jinpeng Wang, Jianping Wu, Ying Shan, Xiaohu Qie, Ping Luo code -1
GEB+: A Benchmark for Generic Event Boundary Captioning, Grounding and Retrieval Yuxuan Wang, Difei Gao, Licheng Yu, Weixian Lei, Matt Feiszli, Mike Zheng Shou code -1
A Simple and Robust Correlation Filtering Method for Text-Based Person Search Wei Suo, Mengyang Sun, Kai Niu, Yiqi Gao, Peng Wang, Yanning Zhang, Qi Wu code -1
Towards Self-Supervised and Weight-preserving Neural Architecture Search Zhuowei Li, Yibo Gao, Zhenzhou Zha, Zhiqiang Hu, Qing Xia, Shaoting Zhang, Dimitris N. Metaxas code -1
MoQuad: Motion-focused Quadruple Construction for Video Contrastive Learning Yuan Liu, Jiacheng Chen, Hao Wu code -1
On the Effectiveness of ViT Features as Local Semantic Descriptors Shir Amir, Yossi Gandelsman, Shai Bagon, Tali Dekel code -1
Anomaly Detection Requires Better Representations Tal Reiss, Niv Cohen, Eliahu Horwitz, Ron Abutbul, Yedid Hoshen code -1
Leveraging Self-Supervised Training for Unintentional Action Recognition Enea Duka, Anna Kukleva, Bernt Schiele code -1
A Study on Self-Supervised Object Detection Pretraining Trung Dang, Simon Kornblith, Huy Thong Nguyen, Peter Chin, Maryam Khademi code -1
Internet Curiosity: Directed Unsupervised Learning on Uncurated Internet Data Alexander C. Li, Ellis Brown, Alexei A. Efros, Deepak Pathak code -1
Towards Autonomous Grading in the Real World Yakov Miron, Yuval Goldfracht, Dotan Di Castro code -1
Bootstrapping Autonomous Lane Changes with Self-supervised Augmented Runs Xiang Xiang code -1
Artifact-Based Domain Generalization of Skin Lesion Models Alceu Bissoto, Catarina Barata, Eduardo Valle, Sandra Avila code -1
An Evaluation of Self-supervised Pre-training for Skin-Lesion Analysis Levy G. Chaves, Alceu Bissoto, Eduardo Valle, Sandra Avila code -1
Skin_Hair Dataset: Setting the Benchmark for Effective Hair Inpainting Methods for Improving the Image Quality of Dermoscopic Images Joanna JaworekKorjakowska, Anna Wójcicka, Dariusz Kucharski, Andrzej Brodzicki, Connah Kendrick, Bill Cassidy, Moi Hoon Yap code -1
FairDisCo: Fairer AI in Dermatology via Disentanglement Contrastive Learning Siyi Du, Ben Hers, Nourhan Bayasi, Ghassan Hamarneh, Rafeef Garbi code -1
CIRCLe: Color Invariant Representation Learning for Unbiased Classification of Skin Lesions Arezou Pakzad, Kumar Abhishek, Ghassan Hamarneh code -1
Distinctive Image Captioning via CLIP Guided Group Optimization Youyuan Zhang, Jiuniu Wang, Hao Wu, Wenjia Xu code -1
OCR-IDL: OCR Annotations for Industry Document Library Dataset Ali Furkan Biten, Rubèn Tito, Lluís Gómez, Ernest Valveny, Dimosthenis Karatzas code -1
Self-paced Learning to Improve Text Row Detection in Historical Documents with Missing Labels Mihaela Gaman, Lida Ghadamiyan, Radu Tudor Ionescu, Marius Popescu code -1
On Calibration of Scene-Text Recognition Models Ron Slossberg, Oron Anschel, Amir Markovitz, Ron Litman, Aviad Aberdam, Shahar Tsiper, Shai Mazor, Jon Wu, R. Manmatha code -1
End-to-End Document Recognition and Understanding with Dessurt Brian L. Davis, Bryan S. Morse, Brian L. Price, Chris Tensmeyer, Curtis Wigington, Vlad I. Morariu code -1
Task Grouping for Multilingual Text Recognition Jing Huang, Kevin J. Liang, Rama Kovvuri, Tal Hassner code -1
Incorporating Self-attention Mechanism and Multi-task Learning into Scene Text Detection Ning Ding, Liangrui Peng, Changsong Liu, Yuqi Zhang, Ruixue Zhang, Jie Li code -1
Doc2Graph: A Task Agnostic Document Understanding Framework Based on Graph Neural Networks Andrea Gemelli, Sanket Biswas, Enrico Civitelli, Josep Lladós, Simone Marinai code -1
MUST-VQA: MUltilingual Scene-Text VQA Emanuele Vivoli, Ali Furkan Biten, Andrés Mafla, Dimosthenis Karatzas, Lluís Gómez code -1
Out-of-Vocabulary Challenge Report Sergi GarciaBordils, Andrés Mafla, Ali Furkan Biten, Oren Nuriel, Aviad Aberdam, Shai Mazor, Ron Litman, Dimosthenis Karatzas code -1
Towards Structured Noise Models for Unsupervised Denoising Benjamin Salmon, Alexander Krull code -1
Comparison of Semi-supervised Learning Methods for High Content Screening Quality Control Umar Masud, Ethan Cohen, Ihab Bendidi, Guillaume Bollot, Auguste Genovesio code -1
Discriminative Attribution from Paired Images Nils Eckstein, Habib Bukhari, Alexander S. Bates, Gregory S. X. E. Jefferis, Jan Funke code -1
Learning with Minimal Effort: Leveraging in Silico Labeling for Cell and Nucleus Segmentation Thomas Bonte, Maxence Philbert, Emeline Coleno, Edouard Bertrand, Arthur Imbert, Thomas Walter code -1
Towards Better Guided Attention and Human Knowledge Insertion in Deep Convolutional Neural Networks Ankit Gupta, IdaMaria Sintorn code -1
Characterization of AI Model Configurations for Model Reuse Peter Bajcsy, Michael Majurski, Thomas E. Cleveland IV, Manuel J. Carrasco, Walid Keyrouz code -1
Empirical Evaluation of Deep Learning Approaches for Landmark Detection in Fish Bioimages Navdeep Kumar, Claudia Di Biagio, Zachary Dellacqua, Ratish Raman, Arianna Martini, Clara Boglione, Marc Muller, Pierre Geurts, Raphaël Marée code -1
PointFISH: Learning Point Cloud Representations for RNA Localization Patterns Arthur Imbert, Florian Müller, Thomas Walter code -1
N2V2 - Fixing Noise2Void Checkerboard Artifacts with Modified Sampling Strategies and a Tweaked Network Architecture Eva Höck, TimOliver Buchholz, Anselm Brachmann, Florian Jug, Alexander Freytag code -1
Object Detection in Aerial Images with Uncertainty-Aware Graph Network Jongha Kim, Jinheon Baek, Sung Ju Hwang code -1
STC: Spatio-Temporal Contrastive Learning for Video Instance Segmentation Zhengkai Jiang, Zhangxuan Gu, Jinlong Peng, Hang Zhou, Liang Liu, Yabiao Wang, Ying Tai, Chengjie Wang, Liqing Zhang code -1
Mitigating Representation Bias in Action Recognition: Algorithms and Benchmarks Haodong Duan, Yue Zhao, Kai Chen, Yuanjun Xiong, Dahua Lin code -1
SegTAD: Precise Temporal Action Detection via Semantic Segmentation Chen Zhao, Merey Ramazanova, Mengmeng Xu, Bernard Ghanem code -1
Text-Driven Stylization of Video Objects Sebastian Loeschcke, Serge J. Belongie, Sagie Benaim code -1
MND: A New Dataset and Benchmark of Movie Scenes Classified by Their Narrative Function Chang Liu, Armin Shmilovici, Mark Last code -1
Are All Combinations Equal? Combining Textual and Visual Features with Multiple Space Learning for Text-Based Video Retrieval Damianos Galanopoulos, Vasileios Mezaris code -1
Scene-Adaptive Temporal Stabilisation for Video Colourisation Using Deep Video Priors Marc Górriz Blanch, Noel E. O'Connor, Marta Mrak code -1
Movie Lens: Discovering and Characterizing Editing Patterns in the Analysis of Short Movie Sequences Bartolomeo Vacchetti, Tania Cerquitelli code -1
SKDCGN: Source-free Knowledge Distillation of Counterfactual Generative Networks Using cGANs Sameer Ambekar, Matteo Tafuro, Ankit Ankit, Diego van der Mast, Mark Alence, Christos Athanasiadis code -1
C-3PO: Towards Rotation Equivariant Feature Detection and Description Piyush Bagad, Floor Eijkelboom, Mark Fokkema, Danilo de Goede, Paul Hilders, Miltiadis Kofinas code -1
Towards Flexible Inductive Bias via Progressive Reparameterization Scheduling Yunsung Lee, Gyuseong Lee, Kwangrok Ryoo, Hyojun Go, Jihye Park, Seungryong Kim code -1
Zero-Shot Image Enhancement with Renovated Laplacian Pyramid Shunsuke Takao code -1
Beyond a Video Frame Interpolator: A Space Decoupled Learning Approach to Continuous Image Transition Tao Yang, Peiran Ren, Xuansong Xie, XianSheng Hua, Lei Zhang code -1
Diversified Dynamic Routing for Vision Tasks Botos Csaba, Adel Bibi, Yanwei Li, Philip H. S. Torr, SerNam Lim code -1
MIPI 2022 Challenge on RGB+ToF Depth Completion: Dataset and Report Wenxiu Sun, Qingpeng Zhu, Chongyi Li, Ruicheng Feng, Shangchen Zhou, Jun Jiang, Qingyu Yang, Chen Change Loy, Jinwei Gu, Dewang Hou, Kai Zhao, Liying Lu, Yu Li, Huaijia Lin, Ruizheng Wu, Jiangbo Lu, Jiaya Jia, Qiang Liu, Haosong Yue, Danyang Cao, Lehang Yu, Jiaxuan Quan, Jixiang Liang, Yufei Wang, Yuchao Dai, Peng Yang, Hu Yan, Houbiao Liu, Siyuan Su, Xuanhe Li, Rui Ren, Yunlong Liu, Yufan Zhu, Dong Lao, Alex Wong, Katie Chang code -1
MIPI 2022 Challenge on Quad-Bayer Re-mosaic: Dataset and Report Qingyu Yang, Guang Yang, Jun Jiang, Chongyi Li, Ruicheng Feng, Shangchen Zhou, Wenxiu Sun, Qingpeng Zhu, Chen Change Loy, Jinwei Gu, Zhen Wang, Daoyu Li, Yuzhe Zhang, Lintao Peng, Xuyang Chang, Yinuo Zhang, Yaqi Wu, Xun Wu, Zhihao Fan, Chengjie Xia, Feng Zhang, Haijin Zeng, Kai Feng, Yongqiang Zhao, Hiêp Quang Luong, Jan Aelterman, Anh Minh Truong, Wilfried Philips, Xiaohong Liu, Jun Jia, Hanchi Sun, Guangtao Zhai, Longan Xiao, Qihang Xu, Ting Jiang, Qi Wu, Chengzhi Jiang, Mingyan Han, Xinpeng Li, Wenjie Lin, Youwei Li, Haoqiang Fan, Shuaicheng Liu, Rongyuan Wu, Lingchen Sun, Qiaosi Yi code -1
MIPI 2022 Challenge on RGBW Sensor Re-mosaic: Dataset and Report Qingyu Yang, Guang Yang, Jun Jiang, Chongyi Li, Ruicheng Feng, Shangchen Zhou, Wenxiu Sun, Qingpeng Zhu, Chen Change Loy, Jinwei Gu, Lingchen Sun, Rongyuan Wu, Qiaosi Yi, Rongjian Xu, Xiaohui Liu, Zhilu Zhang, Xiaohe Wu, Ruohao Wang, Junyi Li, Wangmeng Zuo, Faming Fang code -1
MIPI 2022 Challenge on RGBW Sensor Fusion: Dataset and Report Qingyu Yang, Guang Yang, Jun Jiang, Chongyi Li, Ruicheng Feng, Shangchen Zhou, Wenxiu Sun, Qingpeng Zhu, Chen Change Loy, Jinwei Gu, Zhen Wang, Daoyu Li, Yuzhe Zhang, Lintao Peng, Xuyang Chang, Yinuo Zhang, Liheng Bian, Bing Li, Jie Huang, Mingde Yao, Ruikang Xu, Feng Zhao, Xiaohui Liu, Rongjian Xu, Zhilu Zhang, Xiaohe Wu, Ruohao Wang, Junyi Li, Wangmeng Zuo, Zhuang Jia, DongJae Lee, Ting Jiang, Qi Wu, Chengzhi Jiang, Mingyan Han, Xinpeng Li, Wenjie Lin, Youwei Li, Haoqiang Fan, Shuaicheng Liu code -1
MIPI 2022 Challenge on Under-Display Camera Image Restoration: Methods and Results Ruicheng Feng, Chongyi Li, Shangchen Zhou, Wenxiu Sun, Qingpeng Zhu, Jun Jiang, Qingyu Yang, Chen Change Loy, Jinwei Gu, Yurui Zhu, Xi Wang, Xueyang Fu, Xiaowei Hu, Jinfan Hu, Xina Liu, Xiangyu Chen, Chao Dong, Dafeng Zhang, Feiyu Huang, Shizhuo Liu, Xiaobing Wang, Zhezhu Jin, Xuhao Jiang, Guangqi Shao, Xiaotao Wang, Lei Lei, Zhao Zhang, Suiyi Zhao, Huan Zheng, Yangcheng Gao, Yanyan Wei, Jiahuan Ren, Tao Huang, Zhenxuan Fang, Mengluan Huang, Junwei Xu, Yong Zhang, Yuechi Yang, Qidi Shu, Zhiwen Yang, Shaocong Li, Mingde Yao, Ruikang Xu, Yuanshen Guan, Jie Huang, Zhiwei Xiong, Hangyan Zhu, Ming Liu, Shaohui Liu, Wangmeng Zuo, Zhuang Jia, Binbin Song, Ziqi Song, Guiting Mao, Ben Hou, Zhimou Liu, Yi Ke, Dengpei Ouyang, Dekui Han, Jinghao Zhang, Qi Zhu, Naishan Zheng, Feng Zhao, Wu Jin, Marcos V. Conde, Sabari Nathan, Radu Timofte, Tianyi Xu, Jun Xu, P. S. Hrishikesh, Densen Puthussery, C. V. Jiji, Biao Jiang, Yuhan Ding, WanZhang Li, Xiaoyue Feng, Sijing Chen, Tianheng Zhong, Jiyang Lu, Hongming Chen, Zhentao Fan, Xiang Chen code -1
Continuous Spectral Reconstruction from RGB Images via Implicit Neural Representation Ruikang Xu, Mingde Yao, Chang Chen, Lizhi Wang, Zhiwei Xiong code -1
Event-Based Image Deblurring with Dynamic Motion Awareness Patricia Vitoria, Stamatios Georgoulis, Stepan Tulyakov, Alfredo Bochicchio, Julius Erbach, Yuanyou Li code -1
UDC-UNet: Under-Display Camera Image Restoration via U-shape Dynamic Network Xina Liu, Jinfan Hu, Xiangyu Chen, Chao Dong code -1
Enhanced Coarse-to-Fine Network for Image Restoration from Under-Display Cameras Yurui Zhu, Xi Wang, Xueyang Fu, Xiaowei Hu code -1
Learning to Joint Remosaic and Denoise in Quad Bayer CFA via Universal Multi-scale Channel Attention Network Xun Wu, Zhihao Fan, Jiesi Zheng, Yaqi Wu, Feng Zhang code -1
Learning an Efficient Multimodal Depth Completion Model Dewang Hou, Yuanyuan Du, Kai Zhao, Yang Zhao code -1
Learning Rich Information for Quad Bayer Remosaicing and Denoising Jun Jia, Hanchi Sun, Xiaohong Liu, Longan Xiao, Qihang Xu, Guangtao Zhai code -1
Depth Completion Using Laplacian Pyramid-Based Depth Residuals Haosong Yue, Qiang Liu, Zhong Liu, Jing Zhang, Xingming Wu code -1
PSUMNet: Unified Modality Part Streams Are All You Need for Efficient Pose-Based Action Recognition Neel Trivedi, Ravi Kiran Sarvadevabhatla code -1
YOLO5Face: Why Reinventing a Face Detector Delong Qi, Weijun Tan, Qi Yao, Jingfeng Liu code -1
Counterfactual Fairness for Facial Expression Recognition Jiaee Cheong, Sinan Kalkan, Hatice Gunes code -1
Improved Cross-Dataset Facial Expression Recognition by Handling Data Imbalance and Feature Confusion Manogna Sreenivas, Sawa Takamuku, Soma Biswas, Aditya Chepuri, Balasubramanian Vengatesan, Naotake Natori code -1
Video-Based Gait Analysis for Spinal Deformity Himanshu Kumar Suman, Tanmay Tulsidas Verlekar code -1
TSCom-Net: Coarse-to-Fine 3D Textured Shape Completion Network Ahmet Serdar Karadeniz, Sk Aziz Ali, Anis Kacem, Elona Dupont, Djamila Aouada code -1
Deep Learning-Based Assessment of Facial Periodic Affect in Work-Like Settings Siyang Song, Yiming Luo, Vincenzo Ronca, Gianluca Borghini, Hesam Sagha, Vera Barbara Rick, Alexander Mertens, Hatice Gunes code -1
Supervision by Landmarks: An Enhanced Facial De-occlusion Network for VR-Based Applications Surabhi Gupta, Sai Sagar Jinka, Avinash Sharma, Anoop M. Namboodiri code -1
Consistency-Based Self-supervised Learning for Temporal Anomaly Localization Aniello Panariello, Angelo Porrello, Simone Calderara, Rita Cucchiara code -1
Perspective Reconstruction of Human Faces by Joint Mesh and Landmark Regression Jia Guo, Jinke Yu, Alexandros Lattas, Jiankang Deng code -1
Pixel2ISDF: Implicit Signed Distance Fields Based Human Body Model from Multi-view and Multi-pose Images Jianchuan Chen, Wentao Yi, Tiantian Wang, Xing Li, Liqian Ma, Yangyu Fan, Huchuan Lu code -1
UnconFuse: Avatar Reconstruction from Unconstrained Images Han Huang, Liliang Chen, Xihao Wang code -1
HiFace: Hybrid Task Learning for Face Reconstruction from Single Image Wei Xu, Zhihong Fu, Zhixing Chen, Qili Deng, Mingtao Fu, Xijin Zhang, Yuan Gao, Daniel K. Du, Min Zheng code -1
Multi-view Canonical Pose 3D Human Body Reconstruction Based on Volumetric TSDF Xi Li code -1
End to End Face Reconstruction via Differentiable PnP Yiren Lu, Huawei Wei code -1
One Ontology to Rule Them All: Corner Case Scenarios for Autonomous Driving Daniel Bogdoll, Stefani Guneshka, J. Marius Zöllner code -1
Parametric and Multivariate Uncertainty Calibration for Regression and Object Detection Fabian Küppers, Jonas Schneider, Anselm Haselhoff code -1
Reliable Multimodal Trajectory Prediction via Error Aligned Uncertainty Optimization Neslihan Kose, Ranganath Krishnan, Akash Dhamasia, Omesh Tickoo, Michael Paulitsch code -1
PAI3D: Painting Adaptive Instance-Prior for 3D Object Detection Hao Liu, Zhuoran Xu, Dan Wang, Baofeng Zhang, Guan Wang, Bo Dong, Xin Wen, Xinyu Xu code -1
Validation of Pedestrian Detectors by Classification of Visual Detection Impairing Factors Korbinian Hagn, Oliver Grau code -1
Probing Contextual Diversity for Dense Out-of-Distribution Detection Silvio Galesso, María Alejandra Bravo, Mehdi Naouar, Thomas Brox code -1
Adversarial Vulnerability of Temporal Feature Networks for Object Detection Svetlana Pavlitskaya, Nikolai Polley, Michael Weber, J. Marius Zöllner code -1
Towards Improved Intermediate Layer Variational Inference for Uncertainty Estimation Ahmed Hammam, Frank Bonarens, Seyed Eghbal Ghobadi, Christoph Stiller code -1
Explainable Sparse Attention for Memory-Based Trajectory Predictors Francesco Marchetti, Federico Becattini, Lorenzo Seidenari, Alberto Del Bimbo code -1
Cycle-Consistent World Models for Domain Independent Latent Imagination Sidney Bender, Tim Joseph, J. Marius Zöllner code -1
Strengthening Skeletal Action Recognizers via Leveraging Temporal Patterns Zhenyue Qin, Pan Ji, Dongwoo Kim, Yang Liu, Saeed Anwar, Tom Gedeon code -1
Which Expert Knows Best? Modulating Soft Learning with Online Batch Confidence for Domain Adaptive Person Re-Identification Andrea Zunino, Christopher Murray, Richard Blythman, Vittorio Murino code -1
Cross-Modality Attention and Multimodal Fusion Transformer for Pedestrian Detection WeiYu Lee, Ljubomir Jovanov, Wilfried Philips code -1
See Finer, See More: Implicit Modality Alignment for Text-Based Person Retrieval Xiujun Shu, Wei Wen, Haoqian Wu, Keyu Chen, Yiran Song, Ruizhi Qiao, Bo Ren, Xiao Wang code -1
Look at Adjacent Frames: Video Anomaly Detection Without Offline Training Yuqi Ouyang, Guodong Shen, Victor Sanchez code -1
SOMPT22: A Surveillance Oriented Multi-pedestrian Tracking Dataset Fatih Emre Simsek, Cevahir Cigla, Koray Kayabol code -1
Detection of Fights in Videos: A Comparison Study of Anomaly Detection and Action Recognition Weijun Tan, Jingfeng Liu code -1
Privacy-Preserving Person Detection Using Low-Resolution Infrared Cameras Thomas Dubail, Fidel Alejandro Guerrero Peña, Heitor Rapela Medeiros, Masih Aminbeidokhti, Eric Granger, Marco Pedersoli code -1
Gait Recognition from Occluded Sequences in Surveillance Sites Dhritimaan Das, Ayush Agarwal, Pratik Chattopadhyay code -1
Visible-Infrared Person Re-Identification Using Privileged Intermediate Information Mahdi Alehdaghi, Arthur Josi, Rafael M. O. Cruz, Eric Granger code -1
Video in 10 Bits: Few-Bit VideoQA for Efficiency and Privacy Shiyuan Huang, Robinson Piramuthu, ShihFu Chang, Gunnar A. Sigurdsson code -1
ChaLearn LAP Seasons in Drift Challenge: Dataset, Design and Results Anders Skaarup Johansen, Júlio C. S. Jacques Júnior, Kamal Nasrollahi, Sergio Escalera, Thomas B. Moeslund code -1
YORO - Lightweight End to End Visual Grounding ChihHui Ho, Srikar Appalaraju, Bhavan Jasani, R. Manmatha, Nuno Vasconcelos code -1
Localization Uncertainty Estimation for Anchor-Free Object Detection Youngwan Lee, JoongWon Hwang, HyungIl Kim, Kimin Yun, Yongjin Kwon, Yuseok Bae, Sung Ju Hwang code -1
Variational Depth Networks: Uncertainty-Aware Monocular Self-supervised Depth Estimation Georgi Dikov, Joris van Vugt code -1
Unsupervised Joint Image Transfer and Uncertainty Quantification Using Patch Invariant Networks Christoph Angermann, Markus Haltmeier, Ahsan Raza Siyal code -1
Uncertainty Quantification Using Query-Based Object Detectors Meet P. Vadera, Colin Samplawski, Benjamin M. Marlin code -1
CenDerNet: Center and Curvature Representations for Render-and-Compare 6D Pose Estimation Peter De Roovere, Rembert Daems, Jonathan Croenen, Taoufik Bourgana, Joris de Hoog, Francis Wyffels code -1
Trans6D: Transformer-Based 6D Object Pose Estimation and Refinement Zhongqun Zhang, Wei Chen, Linfang Zheng, Ales Leonardis, Hyung Jin Chang code -1
Learning to Estimate Multi-view Pose from Object Silhouettes Yoni Kasten, True Price, David Geraghty, JanMichael Frahm code -1
TransNet: Category-Level Transparent Object Pose Estimation Huijie Zhang, Anthony Opipari, Xiaotong Chen, Jiyue Zhu, Zeren Yu, Odest Chadwicke Jenkins code -1
Fuse and Attend: Generalized Embedding Learning for Art and Sketches Ujjal Kr Dutta code -1
3D Shape Reconstruction from Free-Hand Sketches Jiayun Wang, Jierui Lin, Qian Yu, Runtao Liu, Yubei Chen, Stella X. Yu code -1
Abstract Images Have Different Levels of Retrievability Per Reverse Image Search Engine Shawn M. Jones, Diane Oyen code -1
ECCV 2022 Sign Spotting Challenge: Dataset, Design and Results Manuel VázquezEnríquez, José Luis AlbaCastro, Laura Docío Fernández, Júlio C. S. Jacques Júnior, Sergio Escalera code -1
Hierarchical I3D for Sign Spotting Ryan Wong, Necati Cihan Camgöz, Richard Bowden code -1
Multi-modal Sign Language Spotting by Multi/One-Shot Learning Landong Liu, Wengang Zhou, Weichao Zhao, Hezhen Hu, Houqiang Li code -1
Sign Spotting via Multi-modal Fusion and Testing Time Transferring Hongyu Fu, Chen Liu, Xingqun Qi, Beibei Lin, Lincheng Li, Li Zhang, Xin Yu code -1
Domain-Conditioned Normalization for Test-Time Domain Generalization Yuxuan Jiang, Yanfeng Wang, Ruipeng Zhang, Qinwei Xu, Ya Zhang, Xin Chen, Qi Tian code -1
Unleashing the Potential of Adaptation Models via Go-getting Domain Labels Xin Jin, Tianyu He, Xu Shen, Songhua Wu, Tongliang Liu, Jingwen Ye, Xinchao Wang, Jianqiang Huang, Zhibo Chen, XianSheng Hua code -1
ModSelect: Automatic Modality Selection for Synthetic-to-Real Domain Generalization Zdravko Marinov, Alina Roitberg, David Schneider, Rainer Stiefelhagen code -1
Consistency Regularization for Domain Adaptation Kian Boon Koh, Basura Fernando code -1
CAT: Controllable Attribute Translation for Fair Facial Attribute Classification Jiazhi Li, Wael AbdAlmageed code -1
Weakly Supervised Invariant Representation Learning via Disentangling Known and Unknown Nuisance Factors Jiageng Zhu, Hanchen Xie, Wael AbdAlmageed code -1
Learning Visual Explanations for DCNN-Based Image Classifiers Using an Attention Mechanism Ioanna Gkartzonika, Nikolaos Gkalelis, Vasileios Mezaris code -1
Self-supervised Orientation-Guided Deep Network for Segmentation of Carbon Nanotubes in SEM Imagery Nguyen P. Nguyen, Ramakrishna Surya, Matthew R. Maschmann, Prasad Calyam, Kannappan Palaniappan, Filiz Bunyak code -1
The Tenth Visual Object Tracking VOT2022 Challenge Results Matej Kristan, Ales Leonardis, Jirí Matas, Michael Felsberg, Roman P. Pflugfelder, JoniKristian Kämäräinen, Hyung Jin Chang, Martin Danelljan, Luka Cehovin Zajc, Alan Lukezic, Ondrej Drbohlav, Johanna Björklund, Yushan Zhang, Zhongqun Zhang, Song Yan, Wenyan Yang, Dingding Cai, Christoph Mayer, Gustavo Fernández, Kang Ben, Goutam Bhat, Hong Chang, Guangqi Chen, Jiaye Chen, Shengyong Chen, Xilin Chen, Xin Chen, Xiuyi Chen, Yiwei Chen, YuHsi Chen, Zhixing Chen, Yangming Cheng, Angelo Ciaramella, Yutao Cui, Benjamin Dzubur, Mohana Murali Dasari, Qili Deng, Debajyoti Dhar, Shangzhe Di, Emanuel Di Nardo, Daniel K. Du, Matteo Dunnhofer, Heng Fan, ZhenHua Feng, Zhihong Fu, Shang Gao, Rama Krishna Gorthi, Eric Granger, Q. H. Gu, Himanshu Gupta, Jianfeng He, Keji He, Yan Huang, Deepak Jangid, Rongrong Ji, Cheng Jiang, Yingjie Jiang, Felix Järemo Lawin, Ze Kang, Madhu Kiran, Josef Kittler, Simiao Lai, Xiangyuan Lan, Dongwook Lee, Hyunjeong Lee, Seohyung Lee, Hui Li, Ming Li, Wangkai Li, Xi Li, Xianxian Li, Xiao Li, Zhe Li, Liting Lin, Haibin Ling, Bo Liu, Chang Liu, Si Liu, Huchuan Lu, Rafael M. O. Cruz, Bingpeng Ma, Chao Ma, Jie Ma, Yinchao Ma, Niki Martinel, Alireza Memarmoghadam, Christian Micheloni, Payman Moallem, Le Thanh NguyenMeidine, Siyang Pan, ChangBeom Park, Danda Pani Paudel, Matthieu Paul, Houwen Peng, Andreas Robinson, Litu Rout, Shiguang Shan, Kristian Simonato, Tianhui Song, Xiaoning Song, Chao Sun, Jingna Sun, Zhangyong Tang, Radu Timofte, ChiYi Tsai, Luc Van Gool, Om Prakash Verma, Dong Wang, Fei Wang, Liang Wang, Liangliang Wang, Lijun Wang, Limin Wang, Qiang Wang, Gangshan Wu, Jinlin Wu, Xiaojun Wu, Fei Xie, Tianyang Xu, Wei Xu, Yong Xu, Yuanyou Xu, Wanli Xue, Zizheng Xun, Bin Yan, Dawei Yang, Jinyu Yang, Wankou Yang, Xiaoyun Yang, Yi Yang, Yichun Yang, Zongxin Yang, Botao Ye, Fisher Yu, Hongyuan Yu, Jiaqian Yu, Qianjin Yu, Weichen Yu, Kang Ze, Jiang Zhai, Chengwei Zhang, Chunhu Zhang, Kaihua Zhang, Tianzhu Zhang, Wenkang Zhang, Zhibin Zhang, Zhipeng Zhang, Jie Zhao, ShaoChuan Zhao, Feng Zheng, Haixia Zheng, Min Zheng, Bineng Zhong, Jiawen Zhu, Xuefeng Zhu, Yueting Zhuang code -1
Efficient Visual Tracking via Hierarchical Cross-Attention Transformer Xin Chen, Ben Kang, Dong Wang, Dongdong Li, Huchuan Lu code -1
Learning Dual-Fused Modality-Aware Representations for RGBD Tracking Shang Gao, Jinyu Yang, Zhe Li, Feng Zheng, Ales Leonardis, Jingkuan Song code -1
EdgeNeXt: Efficiently Amalgamated CNN-Transformer Architecture for Mobile Vision Applications Muhammad Maaz, Abdelrahman Shaker, Hisham Cholakkal, Salman H. Khan, Syed Waqas Zamir, Rao Muhammad Anwer, Fahad Shahbaz Khan code -1
Continual Inference: A Library for Efficient Online Inference with Deep Neural Networks in PyTorch Lukas Hedegaard, Alexandros Iosifidis code -1
Hydra Attention: Efficient Attention with Many Heads Daniel Bolya, ChengYang Fu, Xiaoliang Dai, Peizhao Zhang, Judy Hoffman code -1
BiTAT: Neural Network Binarization with Task-Dependent Aggregated Transformation Geon Park, Jaehong Yoon, Haiyang Zhang, Xing Zhang, Sung Ju Hwang, Yonina C. Eldar code -1
Power Awareness in Low Precision Neural Networks Nurit SpingarnEliezer, Ron Banner, Hilla BenYaacov, Elad Hoffer, Tomer Michaeli code -1
Augmenting Legacy Networks for Flexible Inference Jason Clemons, Iuri Frosio, Maying Shen, Jose M. Alvarez, Stephen W. Keckler code -1
Deep Neural Network Compression for Image Inpainting Soyeong Kim, DoYeon Kim, Jaekyun Moon code -1
QFT: Post-training Quantization via Fast Joint Finetuning of All Degrees of Freedom Alexander Finkelstein, Ella Fuchs, Idan Tal, Mark Grobman, Niv Vosco, Eldad Meller code -1
Searching for N:M Fine-grained Sparsity of Weights and Activations in Neural Networks Ruth AkivaHochman, Shahaf E. Finder, Javier S. Turek, Eran Treister code -1
Image Illumination Enhancement for Construction Worker Pose Estimation in Low-light Conditions Xinyu Chen, Yantao Yu code -1
Towards an Error-free Deep Occupancy Detector for Smart Camera Parking System TungLam Duong, VanDuc Le, TienCuong Bui, HaiThien To code -1
CrackSeg9k: A Collection and Benchmark for Crack Segmentation Datasets and Frameworks Shreyas Kulkarni, Shreyas Singh, Dhananjay Balakrishnan, Siddharth Sharma, Saipraneeth Devunuri, Sai Chowdeswara Rao Korlapati code -1
PriSeg: IFC-Supported Primitive Instance Geometry Segmentation with Unsupervised Clustering Zhiqi Hu, Ioannis K. Brilakis code -1
Depth Contrast: Self-supervised Pretraining on 3DPM Images for Mining Material Classification Prakash Chandra Chhipa, Richa Upadhyay, Rajkumar Saini, Lars Lindqvist, Richard Nordenskjöld, Seiichi Uchida, Marcus Liwicki code -1
Facilitating Construction Scene Understanding Knowledge Sharing and Reuse via Lifelong Site Object Detection Ruoxin Xiong, Yuansheng Zhu, Yanyu Wang, Pengkun Liu, Pingbo Tang code -1
Model-Assisted Labeling via Explainability for Visual Inspection of Civil Infrastructures Klára Janousková, Mattia Rigotti, Ioana Giurgiu, Cristiano Malossi code -1
A Hyperspectral and RGB Dataset for Building Façade Segmentation Nariman Habili, Ernest Kwan, Weihao Li, Christfried Webers, Jeremy Oorloff, Mohammad Ali Armin, Lars Petersson code -1
Improving Object Detection in VHR Aerial Orthomosaics Tanguy Ophoff, Kristof Van Beeck, Toon Goedemé code -1
Active Learning for Imbalanced Civil Infrastructure Data Thomas Frick, Diego Antognini, Mattia Rigotti, Ioana Giurgiu, Benjamin F. Grewe, Cristiano Malossi code -1
UAV-Based Visual Remote Sensing for Automated Building Inspection Kushagra Srivastava, Dhruv Patel, Aditya Kumar Jha, Mohhit Kumar Jha, Jaskirat Singh, Ravi Kiran Sarvadevabhatla, Pradeep Kumar Ramancharla, Harikumar Kandath, K. Madhava Krishna code -1
ConSLAM: Periodically Collected Real-World Construction Dataset for SLAM and Progress Monitoring Maciej Trzeciak, Kacper Pluta, Yasmin Fathy, Lucio Alcalde, Stanley Chee, Antony Bromley, Ioannis K. Brilakis, Pierre Alliez code -1
NeuralSI: Structural Parameter Identification in Nonlinear Dynamical Systems Xuyang Li, Hamed Bolandi, Talal Salem, Nizar Lajnef, Vishnu Naresh Boddeti code -1
A Geometric-Relational Deep Learning Framework for BIM Object Classification Hairong Luo, Ge Gao, Han Huang, Ziyi Ke, Cheng Peng, Ming Gu code -1
Generating Construction Safety Observations via CLIP-Based Image-Language Embedding Wei Lun Tsai, Jacob J. Lin, ShangHsien Hsieh code -1
Harmonization of Diffusion MRI Data Obtained with Multiple Head Coils Using Hybrid CNNs Leon Weninger, Sandro Romanzetti, Julia Ebert, Kathrin Reetz, Dorit Merhof code -1
CCRL: Contrastive Cell Representation Learning Ramin Nakhli, Amirali Darbandsari, Hossein Farahani, Ali Bashashati code -1
Automatic Grading of Cervical Biopsies by Combining Full and Self-supervision Mélanie Lubrano, Tristan Lazard, Guillaume Balezo, Yaëlle BellahsenHarrar, Cécile Badoual, Sylvain Berlemont, Thomas Walter code -1
When CNN Meet with ViT: Towards Semi-supervised Learning for Multi-class Medical Image Semantic Segmentation Ziyang Wang, Tianze Li, JianQing Zheng, Baoru Huang code -1
Using Whole Slide Image Representations from Self-supervised Contrastive Learning for Melanoma Concordance Regression Sean Grullon, Vaughn Spurrier, Jiayi Zhao, Corey Chivers, Yang Jiang, Kiran Motaparthi, Jason B. Lee, Michael J. Bonham, Julianna D. Ianni code -1
Explainable Model for Localization of Spiculation in Lung Nodules Mirtha Lucas, Miguel Lerma, Jacob Furst, Daniela Raicu code -1
Self-supervised Pretraining for 2D Medical Image Segmentation András Kalapos, Bálint GyiresTóth code -1
CMC_v2: Towards More Accurate COVID-19 Detection with Discriminative Video Priors Junlin Hou, Jilan Xu, Nan Zhang, Yi Wang, Yuejie Zhang, Xiaobo Zhang, Rui Feng code -1
COVID Detection and Severity Prediction with 3D-ConvNeXt and Custom Pretrainings Daniel Kienzle, Julian Lorenz, Robin Schön, Katja Ludwig, Rainer Lienhart code -1
Two-Stage COVID19 Classification Using BERT Features Weijun Tan, Qi Yao, Jingfeng Liu code -1
PVT-COV19D: COVID-19 Detection Through Medical Image Classification Based on Pyramid Vision Transformer Lilang Zheng, Jiaxuan Fang, Xiaorun Tang, Hanzhang Li, Jiaxin Fan, Tianyi Wang, Rui Zhou, Zhaoyan Yan code -1
Boosting COVID-19 Severity Detection with Infection-Aware Contrastive Mixup Classification Junlin Hou, Jilan Xu, Nan Zhang, Yuejie Zhang, Xiaobo Zhang, Rui Feng code -1
Variability Matters: Evaluating Inter-Rater Variability in Histopathology for Robust Cell Detection Cholmin Kang, Chunggi Lee, Heon Song, Minuk Ma, Sérgio Pereira code -1
FUSION: Fully Unsupervised Test-Time Stain Adaptation via Fused Normalization Statistics Nilanjan Chattopadhyay, Shiv Gehlot, Nitin Singhal code -1
Relieving Pixel-Wise Labeling Effort for Pathology Image Segmentation with Self-training Romain Mormont, Mehdi Testouri, Raphaël Marée, Pierre Geurts code -1
CNR-IEMN-CD and CNR-IEMN-CSD Approaches for Covid-19 Detection and Covid-19 Severity Detection from 3D CT-scans Fares Bougourzi, Cosimo Distante, Fadi Dornaika, Abdelmalik TalebAhmed code -1
Representation Learning with Information Theory to Detect COVID-19 and Its Severity Abel Díaz Berenguer, Tanmoy Mukherjee, Yifei Da, Matías Nicolás Bossa, Maryna Kvasnytsia, Jef Vandemeulebroucke, Nikos Deligiannis, Hichem Sahli code -1
Spatial-Slice Feature Learning Using Visual Transformer and Essential Slices Selection Module for COVID-19 Detection of CT Scans in the Wild ChihChung Hsu, ChiHan Tsai, GuanLin Chen, SinDi Ma, ShenChieh Tai code -1
Multi-scale Attention-Based Multiple Instance Learning for Classification of Multi-gigapixel Histology Images Made Satria Wibawa, KwokWai Lo, Lawrence Young, Nasir M. Rajpoot code -1
A Deep Wavelet Network for High-Resolution Microscopy Hyperspectral Image Reconstruction Qian Wang, Zhao Chen code -1
Using a 3D ResNet for Detecting the Presence and Severity of COVID-19 from CT Scans Robert Turnbull code -1
AI-MIA: COVID-19 Detection and Severity Analysis Through Medical Imaging Dimitrios Kollias, Anastasios Arsenos, Stefanos D. Kollias code -1
Medical Image Segmentation: A Review of Modern Architectures Natalia Salpea, Paraskevi K. Tzouveli, Dimitrios Kollias code -1
Medical Image Super Resolution by Preserving Interpretable and Disentangled Features Dwarikanath Mahapatra, Behzad Bozorgtabar, Mauricio Reyes code -1
Multi-label Attention Map Assisted Deep Feature Learning for Medical Image Classification Dwarikanath Mahapatra, Mauricio Reyes code -1
Unsupervised Domain Adaptation Using Feature Disentanglement and GCNs for Medical Image Classification Dwarikanath Mahapatra, Steven Korevaar, Behzad Bozorgtabar, Ruwan B. Tennakoon code -1