Learning Mutual Modulation for Self-supervised Cross-Modal Super-Resolution |
Xiaoyu Dong, Naoto Yokoya, Longguang Wang, Tatsumi Uezato |
|
code |
-1 |
Spectrum-Aware and Transferable Architecture Search for Hyperspectral Image Restoration |
Wei He, Quanming Yao, Naoto Yokoya, Tatsumi Uezato, Hongyan Zhang, Liangpei Zhang |
|
code |
-1 |
Neural Color Operators for Sequential Image Retouching |
Yili Wang, Xin Li, Kun Xu, Dongliang He, Qi Zhang, Fu Li, Errui Ding |
|
code |
-1 |
Optimizing Image Compression via Joint Learning with Denoising |
Ka Leong Cheng, Yueqi Xie, Qifeng Chen |
|
code |
-1 |
Restore Globally, Refine Locally: A Mask-Guided Scheme to Accelerate Super-Resolution Networks |
Xiaotao Hu, Jun Xu, Shuhang Gu, MingMing Cheng, Li Liu |
|
code |
-1 |
Compiler-Aware Neural Architecture Search for On-Mobile Real-time Super-Resolution |
Yushu Wu, Yifan Gong, Pu Zhao, Yanyu Li, Zheng Zhan, Wei Niu, Hao Tang, Minghai Qin, Bin Ren, Yanzhi Wang |
|
code |
-1 |
Modeling Mask Uncertainty in Hyperspectral Image Reconstruction |
Jiamian Wang, Yulun Zhang, Xin Yuan, Ziyi Meng, Zhiqiang Tao |
|
code |
-1 |
Perceiving and Modeling Density for Image Dehazing |
Tian Ye, Yunchen Zhang, Mingchao Jiang, Liang Chen, Yun Liu, Sixiang Chen, Erkang Chen |
|
code |
-1 |
Stripformer: Strip Transformer for Fast Image Deblurring |
FuJen Tsai, YanTsung Peng, YenYu Lin, ChungChi Tsai, ChiaWen Lin |
|
code |
-1 |
Deep Fourier-Based Exposure Correction Network with Spatial-Frequency Interaction |
Jie Huang, Yajing Liu, Feng Zhao, Keyu Yan, Jinghao Zhang, Yukun Huang, Man Zhou, Zhiwei Xiong |
|
code |
-1 |
Frequency and Spatial Dual Guidance for Image Dehazing |
Hu Yu, Naishan Zheng, Man Zhou, Jie Huang, Zeyu Xiao, Feng Zhao |
|
code |
-1 |
Towards Real-World HDRTV Reconstruction: A Data Synthesis-Based Approach |
Zhen Cheng, Tao Wang, Yong Li, Fenglong Song, Chang Chen, Zhiwei Xiong |
|
code |
-1 |
Learning Discriminative Shrinkage Deep Networks for Image Deconvolution |
PinHung Kuo, Jinshan Pan, ShaoYi Chien, MingHsuan Yang |
|
code |
-1 |
KXNet: A Model-Driven Deep Neural Network for Blind Super-Resolution |
Jiahong Fu, Hong Wang, Qi Xie, Qian Zhao, Deyu Meng, Zongben Xu |
|
code |
-1 |
ARM: Any-Time Super-Resolution Method |
Bohong Chen, Mingbao Lin, Kekai Sheng, Mengdan Zhang, Peixian Chen, Ke Li, Liujuan Cao, Rongrong Ji |
|
code |
-1 |
Attention-Aware Learning for Hyperparameter Prediction in Image Processing Pipelines |
Haina Qin, Longfei Han, Juan Wang, Congxuan Zhang, Yanwei Li, Bing Li, Weiming Hu |
|
code |
-1 |
RealFlow: EM-Based Realistic Optical Flow Dataset Generation from Videos |
Yunhui Han, Kunming Luo, Ao Luo, Jiangyu Liu, Haoqiang Fan, Guiming Luo, Shuaicheng Liu |
|
code |
-1 |
Memory-Augmented Model-Driven Network for Pansharpening |
Keyu Yan, Man Zhou, Li Zhang, Chengjun Xie |
|
code |
-1 |
All You Need Is RAW: Defending Against Adversarial Attacks with Camera Image Pipelines |
Yuxuan Zhang, Bo Dong, Felix Heide |
|
code |
-1 |
Ghost-free High Dynamic Range Imaging with Context-Aware Transformer |
Zhen Liu, Yinglong Wang, Bing Zeng, Shuaicheng Liu |
|
code |
-1 |
Style-Guided Shadow Removal |
Jin Wan, Hui Yin, Zhenyao Wu, Xinyi Wu, Yanting Liu, Song Wang |
|
code |
-1 |
D2C-SR: A Divergence to Convergence Approach for Real-World Image Super-Resolution |
Youwei Li, Haibin Huang, Lanpeng Jia, Haoqiang Fan, Shuaicheng Liu |
|
code |
-1 |
GRIT-VLP: Grouped Mini-batch Sampling for Efficient Vision and Language Pre-training |
Jaeseok Byun, Taebaek Hwang, Jianlong Fu, Taesup Moon |
|
code |
-1 |
Efficient Video Deblurring Guided by Motion Magnitude |
Yusheng Wang, Yunfan Lu, Ye Gao, Lin Wang, Zhihang Zhong, Yinqiang Zheng, Atsushi Yamashita |
|
code |
-1 |
Single Frame Atmospheric Turbulence Mitigation: A Benchmark Study and a New Physics-Inspired Transformer Model |
Zhiyuan Mao, Ajay Jaiswal, Zhangyang Wang, Stanley H. Chan |
|
code |
-1 |
Contextformer: A Transformer with Spatio-Channel Attention for Context Modeling in Learned Image Compression |
Ahmet Burakhan Koyuncu, Han Gao, Atanas Boev, Georgii Gaikov, Elena Alshina, Eckehard G. Steinbach |
|
code |
-1 |
Image Super-Resolution with Deep Dictionary |
Shunta Maeda |
|
code |
-1 |
TempFormer: Temporally Consistent Transformer for Video Denoising |
Mingyang Song, Yang Zhang, Tunç Ozan Aydin |
|
code |
-1 |
RAWtoBit: A Fully End-to-end Camera ISP Network |
Wooseok Jeong, SeungWon Jung |
|
code |
-1 |
DRCNet: Dynamic Image Restoration Contrastive Network |
Fei Li, Lingfeng Shen, Yang Mi, Zhenbo Li |
|
code |
-1 |
Zero-Shot Learning for Reflection Removal of Single 360-Degree Image |
ByeongJu Han, JaeYoung Sim |
|
code |
-1 |
Transformer with Implicit Edges for Particle-Based Physics Simulation |
Yidi Shao, Chen Change Loy, Bo Dai |
|
code |
-1 |
Rethinking Video Rain Streak Removal: A New Synthesis Model and a Deraining Network with Video Rain Prior |
Shuai Wang, Lei Zhu, Huazhu Fu, Jing Qin, CarolaBibiane Schönlieb, Wei Feng, Song Wang |
|
code |
-1 |
Super-Resolution by Predicting Offsets: An Ultra-Efficient Super-Resolution Network for Rasterized Images |
Jinjin Gu, Haoming Cai, Chenyu Dong, Ruofan Zhang, Yulun Zhang, Wenming Yang, Chun Yuan |
|
code |
-1 |
Animation from Blur: Multi-modal Blur Decomposition with Motion Guidance |
Zhihang Zhong, Xiao Sun, Zhirong Wu, Yinqiang Zheng, Stephen Lin, Imari Sato |
|
code |
-1 |
AlphaVC: High-Performance and Efficient Learned Video Compression |
Yibo Shi, Yunying Ge, Jing Wang, Jue Mao |
|
code |
-1 |
Content-Oriented Learned Image Compression |
Meng Li, Shangyin Gao, Yihui Feng, Yibo Shi, Jing Wang |
|
code |
-1 |
RRSR: Reciprocal Reference-Based Image Super-Resolution with Progressive Feature Alignment and Selection |
Lin Zhang, Xin Li, Dongliang He, Fu Li, Yili Wang, Zhaoxiang Zhang |
|
code |
-1 |
Contrastive Prototypical Network with Wasserstein Confidence Penalty |
Haoqing Wang, ZhiHong Deng |
|
code |
-1 |
Learn-to-Decompose: Cascaded Decomposition Network for Cross-Domain Few-Shot Facial Expression Recognition |
Xinyi Zou, Yan Yan, JingHao Xue, Si Chen, Hanzi Wang |
|
code |
-1 |
Self-support Few-Shot Semantic Segmentation |
Qi Fan, Wenjie Pei, YuWing Tai, ChiKeung Tang |
|
code |
-1 |
Few-Shot Object Detection with Model Calibration |
Qi Fan, ChiKeung Tang, YuWing Tai |
|
code |
-1 |
Self-Supervision Can Be a Good Few-Shot Learner |
Yuning Lu, Liangjian Wen, Jianzhuang Liu, Yajing Liu, Xinmei Tian |
|
code |
-1 |
BEVFormer: Learning Bird's-Eye-View Representation from Multi-camera Images via Spatiotemporal Transformers |
Zhiqi Li, Wenhai Wang, Hongyang Li, Enze Xie, Chonghao Sima, Tong Lu, Yu Qiao, Jifeng Dai |
|
code |
-1 |
Category-Level 6D Object Pose and Size Estimation Using Self-supervised Deep Prior Deformation Networks |
Jiehong Lin, Zewei Wei, Changxing Ding, Kui Jia |
|
code |
-1 |
Dense Teacher: Dense Pseudo-Labels for Semi-supervised Object Detection |
Hongyu Zhou, Zheng Ge, Songtao Liu, Weixin Mao, Zeming Li, Haiyan Yu, Jian Sun |
|
code |
-1 |
Point-to-Box Network for Accurate Object Detection via Single Point Supervision |
Pengfei Chen, Xuehui Yu, Xumeng Han, Najmul Hassan, Kai Wang, Jiachen Li, Jian Zhao, Humphrey Shi, Zhenjun Han, Qixiang Ye |
|
code |
-1 |
Domain Adaptive Hand Keypoint and Pixel Localization in the Wild |
Takehiko Ohkawa, YuJhe Li, Qichen Fu, Ryosuke Furuta, Kris M. Kitani, Yoichi Sato |
|
code |
-1 |
Towards Data-Efficient Detection Transformers |
Wen Wang, Jing Zhang, Yang Cao, Yongliang Shen, Dacheng Tao |
|
code |
-1 |
Open-Vocabulary DETR with Conditional Matching |
Yuhang Zang, Wei Li, Kaiyang Zhou, Chen Huang, Chen Change Loy |
|
code |
-1 |
Prediction-Guided Distillation for Dense Object Detection |
Chenhongyi Yang, Mateusz Ochal, Amos J. Storkey, Elliot J. Crowley |
|
code |
-1 |
Multimodal Object Detection via Probabilistic Ensembling |
YiTing Chen, Jinghao Shi, Zelin Ye, Christoph Mertz, Deva Ramanan, Shu Kong |
|
code |
-1 |
Exploiting Unlabeled Data with Vision and Language Models for Object Detection |
Shiyu Zhao, Zhixing Zhang, Samuel Schulter, Long Zhao, B. G. Vijay Kumar, Anastasis Stathopoulos, Manmohan Chandraker, Dimitris N. Metaxas |
|
code |
-1 |
CPO: Change Robust Panorama to Point Cloud Localization |
Junho Kim, Hojun Jang, Changwoon Choi, Young Min Kim |
|
code |
-1 |
INT: Towards Infinite-Frames 3D Detection with an Efficient Framework |
Jianyun Xu, Zhenwei Miao, Da Zhang, Hongyu Pan, Kaixuan Liu, Peihan Hao, Jun Zhu, Zhengyang Sun, Hongmin Li, Xin Zhan |
|
code |
-1 |
End-to-End Weakly Supervised Object Detection with Sparse Proposal Evolution |
Mingxiang Liao, Fang Wan, Yuan Yao, Zhenjun Han, Jialing Zou, Yuze Wang, Bailan Feng, Peng Yuan, Qixiang Ye |
|
code |
-1 |
Calibration-Free Multi-view Crowd Counting |
Qi Zhang, Antoni B. Chan |
|
code |
-1 |
Unsupervised Domain Adaptation for Monocular 3D Object Detection via Self-training |
Zhenyu Li, Zehui Chen, Ang Li, Liangji Fang, Qinhong Jiang, Xianming Liu, Junjun Jiang |
|
code |
-1 |
SuperLine3D: Self-supervised Line Segmentation and Description for LiDAR Point Cloud |
Xiangrui Zhao, Sheng Yang, Tianxin Huang, Jun Chen, Teng Ma, Mingyang Li, Yong Liu |
|
code |
-1 |
Exploring Plain Vision Transformer Backbones for Object Detection |
Yanghao Li, Hanzi Mao, Ross B. Girshick, Kaiming He |
|
code |
-1 |
Adversarially-Aware Robust Object Detector |
Ziyi Dong, Pengxu Wei, Liang Lin |
|
code |
-1 |
HEAD: HEtero-Assists Distillation for Heterogeneous Object Detectors |
Luting Wang, Xiaojie Li, Yue Liao, Zeren Jiang, Jianlong Wu, Fei Wang, Chen Qian, Si Liu |
|
code |
-1 |
You Should Look at All Objects |
Zhenchao Jin, Dongdong Yu, Luchuan Song, Zehuan Yuan, Lequan Yu |
|
code |
-1 |
Detecting Twenty-Thousand Classes Using Image-Level Supervision |
Xingyi Zhou, Rohit Girdhar, Armand Joulin, Philipp Krähenbühl, Ishan Misra |
|
code |
-1 |
DCL-Net: Deep Correspondence Learning Network for 6D Pose Estimation |
Hongyang Li, Jiehong Lin, Kui Jia |
|
code |
-1 |
Monocular 3D Object Detection with Depth from Motion |
Tai Wang, Jiangmiao Pang, Dahua Lin |
|
code |
-1 |
DISP6D: Disentangled Implicit Shape and Pose Learning for Scalable 6D Pose Estimation |
Yilin Wen, Xiangyu Li, Hao Pan, Lei Yang, Zheng Wang, Taku Komura, Wenping Wang |
|
code |
-1 |
Distilling Object Detectors with Global Knowledge |
Sanli Tang, Zhongyu Zhang, Zhanzhan Cheng, Jing Lu, Yunlu Xu, Yi Niu, Fan He |
|
code |
-1 |
Unifying Visual Perception by Dispersible Points Learning |
Jianming Liang, Guanglu Song, Biao Leng, Yu Liu |
|
code |
-1 |
PseCo: Pseudo Labeling and Consistency Training for Semi-Supervised Object Detection |
Gang Li, Xiang Li, Yujie Wang, Yichao Wu, Ding Liang, Shanshan Zhang |
|
code |
-1 |
Exploring Resolution and Degradation Clues as Self-supervised Signal for Low Quality Object Detection |
Ziteng Cui, Yingying Zhu, Lin Gu, GuoJun Qi, Xiaoxiao Li, Renrui Zhang, Zenghui Zhang, Tatsuya Harada |
|
code |
-1 |
Robust Category-Level 6D Pose Estimation with Coarse-to-Fine Rendering of Neural Features |
Wufei Ma, Angtian Wang, Alan L. Yuille, Adam Kortylewski |
|
code |
-1 |
Translation, Scale and Rotation: Cross-Modal Alignment Meets RGB-Infrared Vehicle Detection |
Maoxun Yuan, Yinyan Wang, Xingxing Wei |
|
code |
-1 |
RFLA: Gaussian Receptive Field Based Label Assignment for Tiny Object Detection |
Chang Xu, Jinwang Wang, Wen Yang, Huai Yu, Lei Yu, GuiSong Xia |
|
code |
-1 |
Rethinking IoU-based Optimization for Single-stage 3D Object Detection |
Hualian Sheng, Sijia Cai, Na Zhao, Bing Deng, Jianqiang Huang, XianSheng Hua, Minjian Zhao, Gim Hee Lee |
|
code |
-1 |
TD-Road: Top-Down Road Network Extraction with Holistic Graph Construction |
Yang He, Ravi Garg, Amber Roy Chowdhury |
|
code |
-1 |
Multi-faceted Distillation of Base-Novel Commonality for Few-Shot Object Detection |
Shuang Wu, Wenjie Pei, Dianwen Mei, Fanglin Chen, Jiandong Tian, Guangming Lu |
|
code |
-1 |
PointCLM: A Contrastive Learning-based Framework for Multi-instance Point Cloud Registration |
Mingzhi Yuan, Zhihao Li, Qiuye Jin, Xinrong Chen, Manning Wang |
|
code |
-1 |
Weakly Supervised Object Localization via Transformer with Implicit Spatial Calibration |
Haotian Bai, Ruimao Zhang, Jiong Wang, Xiang Wan |
|
code |
-1 |
MTTrans: Cross-domain Object Detection with Mean Teacher Transformer |
Jinze Yu, Jiaming Liu, Xiaobao Wei, Haoyi Zhou, Yohei Nakata, Denis A. Gudovskiy, Tomoyuki Okuno, Jianxin Li, Kurt Keutzer, Shanghang Zhang |
|
code |
-1 |
Multi-domain Multi-definition Landmark Localization for Small Datasets |
David Ferman, Gaurav Bharaj |
|
code |
-1 |
DEVIANT: Depth EquiVarIAnt NeTwork for Monocular 3D Object Detection |
Abhinav Kumar, Garrick Brazil, Enrique Corona, Armin Parchami, Xiaoming Liu |
|
code |
-1 |
Label-Guided Auxiliary Training Improves 3D Object Detector |
Yaomin Huang, Xinmei Liu, Yichen Zhu, Zhiyuan Xu, Chaomin Shen, Zhengping Che, Guixu Zhang, Yaxin Peng, Feifei Feng, Jian Tang |
|
code |
-1 |
PromptDet: Towards Open-Vocabulary Detection Using Uncurated Images |
Chengjian Feng, Yujie Zhong, Zequn Jie, Xiangxiang Chu, Haibing Ren, Xiaolin Wei, Weidi Xie, Lin Ma |
|
code |
-1 |
Densely Constrained Depth Estimator for Monocular 3D Object Detection |
Yingyan Li, Yuntao Chen, Jiawei He, Zhaoxiang Zhang |
|
code |
-1 |
Polarimetric Pose Prediction |
Daoyi Gao, Yitong Li, Patrick Ruhkamp, Iuliia Skobleva, Magdalena Wysocki, HyunJun Jung, Pengyuan Wang, Arturo Guridi, Benjamin Busam |
|
code |
-1 |
TOCH: Spatio-Temporal Object-to-Hand Correspondence for Motion Refinement |
Keyang Zhou, Bharat Lal Bhatnagar, Jan Eric Lenssen, Gerard PonsMoll |
|
code |
-1 |
LaTeRF: Label and Text Driven Object Radiance Fields |
Ashkan Mirzaei, Yash Kant, Jonathan Kelly, Igor Gilitschenski |
|
code |
-1 |
MeshMAE: Masked Autoencoders for 3D Mesh Data Analysis |
Yaqian Liang, Shanshan Zhao, Baosheng Yu, Jing Zhang, Fazhi He |
|
code |
-1 |
Unsupervised Deep Multi-shape Matching |
Dongliang Cao, Florian Bernard |
|
code |
-1 |
Texturify: Generating Textures on 3D Shape Surfaces |
Yawar Siddiqui, Justus Thies, Fangchang Ma, Qi Shan, Matthias Nießner, Angela Dai |
|
code |
-1 |
Autoregressive 3D Shape Generation via Canonical Mapping |
AnChieh Cheng, Xueting Li, Sifei Liu, Min Sun, MingHsuan Yang |
|
code |
-1 |
PointTree: Transformation-Robust Point Cloud Encoder with Relaxed K-D Trees |
JunKun Chen, YuXiong Wang |
|
code |
-1 |
UNIF: United Neural Implicit Functions for Clothed Human Reconstruction and Animation |
Shenhan Qian, Jiale Xu, Ziwei Liu, Liqian Ma, Shenghua Gao |
|
code |
-1 |
PRIF: Primary Ray-Based Implicit Function |
Brandon Yushan Feng, Yinda Zhang, Danhang Tang, Ruofei Du, Amitabh Varshney |
|
code |
-1 |
Point Cloud Domain Adaptation via Masked Local 3D Structure Prediction |
Hanxue Liang, Hehe Fan, Zhiwen Fan, Yi Wang, Tianlong Chen, Yu Cheng, Zhangyang Wang |
|
code |
-1 |
CLIP-Actor: Text-Driven Recommendation and Stylization for Animating Human Meshes |
Kim Youwang, JiYeon Kim, TaeHyun Oh |
|
code |
-1 |
PlaneFormers: From Sparse View Planes to 3D Reconstruction |
Samir Agarwala, Linyi Jin, Chris Rockwell, David F. Fouhey |
|
code |
-1 |
Learning Implicit Templates for Point-Based Clothed Human Modeling |
Siyou Lin, Hongwen Zhang, Zerong Zheng, Ruizhi Shao, Yebin Liu |
|
code |
-1 |
Exploring the Devil in Graph Spectral Domain for 3D Point Cloud Attacks |
Qianjiang Hu, Daizong Liu, Wei Hu |
|
code |
-1 |
Structure-Aware Editable Morphable Model for 3D Facial Detail Animation and Manipulation |
Jingwang Ling, Zhibo Wang, Ming Lu, Quan Wang, Chen Qian, Feng Xu |
|
code |
-1 |
MoFaNeRF: Morphable Facial Neural Radiance Field |
Yiyu Zhuang, Hao Zhu, Xusen Sun, Xun Cao |
|
code |
-1 |
PointInst3D: Segmenting 3D Instances by Points |
Tong He, Wei Yin, Chunhua Shen, Anton van den Hengel |
|
code |
-1 |
Cross-modal 3D Shape Generation and Manipulation |
Zezhou Cheng, Menglei Chai, Jian Ren, HsinYing Lee, Kyle Olszewski, Zeng Huang, Subhransu Maji, Sergey Tulyakov |
|
code |
-1 |
Latent Partition Implicit with Surface Codes for 3D Representation |
Chao Chen, YuShen Liu, Zhizhong Han |
|
code |
-1 |
Implicit Field Supervision for Robust Non-rigid Shape Matching |
Ramana Sundararaman, Gautam Pai, Maks Ovsjanikov |
|
code |
-1 |
Learning Self-prior for Mesh Denoising Using Dual Graph Convolutional Networks |
Shota Hattori, Tatsuya Yatagawa, Yutaka Ohtake, Hiromasa Suzuki |
|
code |
-1 |
DiffConv: Analyzing Irregular Point Clouds with an Irregular View |
Manxi Lin, Aasa Feragen |
|
code |
-1 |
PD-Flow: A Point Cloud Denoising Framework with Normalizing Flows |
Aihua Mao, Zihui Du, YuHui Wen, Jun Xuan, YongJin Liu |
|
code |
-1 |
SeedFormer: Patch Seeds Based Point Cloud Completion with Upsample Transformer |
Haoran Zhou, Yun Cao, Wenqing Chu, Junwei Zhu, Tong Lu, Ying Tai, Chengjie Wang |
|
code |
-1 |
DeepMend: Learning Occupancy Functions to Represent Shape for Repair |
Nikolas Lamb, Sean Banerjee, Natasha Kholgade Banerjee |
|
code |
-1 |
A Repulsive Force Unit for Garment Collision Handling in Neural Networks |
Qingyang Tan, Yi Zhou, Tuanfeng Y. Wang, Duygu Ceylan, Xin Sun, Dinesh Manocha |
|
code |
-1 |
Shape-Pose Disentanglement Using SE(3)-Equivariant Vector Neurons |
Oren Katzir, Dani Lischinski, Daniel CohenOr |
|
code |
-1 |
3D Equivariant Graph Implicit Functions |
Yunlu Chen, Basura Fernando, Hakan Bilen, Matthias Nießner, Efstratios Gavves |
|
code |
-1 |
PatchRD: Detail-Preserving Shape Completion by Learning Patch Retrieval and Deformation |
Bo Sun, Vladimir G. Kim, Noam Aigerman, Qixing Huang, Siddhartha Chaudhuri |
|
code |
-1 |
3D Shape Sequence of Human Comparison and Classification Using Current and Varifolds |
Emery Pierson, Mohamed Daoudi, Sylvain Arguillère |
|
code |
-1 |
Conditional-Flow NeRF: Accurate 3D Modelling with Reliable Uncertainty Quantification |
Jianxiong Shen, Antonio Agudo, Francesc MorenoNoguer, Adria Ruiz |
|
code |
-1 |
Unsupervised Pose-aware Part Decomposition for Man-Made Articulated Objects |
Yuki Kawana, Yusuke Mukuta, Tatsuya Harada |
|
code |
-1 |
MeshUDF: Fast and Differentiable Meshing of Unsigned Distance Field Networks |
Benoît Guillard, Federico Stella, Pascal Fua |
|
code |
-1 |
SPE-Net: Boosting Point Cloud Analysis via Rotation Robustness Enhancement |
Zhaofan Qiu, Yehao Li, Yu Wang, Yingwei Pan, Ting Yao, Tao Mei |
|
code |
-1 |
The Shape Part Slot Machine: Contact-Based Reasoning for Generating 3D Shapes from Parts |
Kai Wang, Paul Guerrero, Vladimir G. Kim, Siddhartha Chaudhuri, Minhyuk Sung, Daniel Ritchie |
|
code |
-1 |
Spatiotemporal Self-attention Modeling with Temporal Patch Shift for Action Recognition |
Wangmeng Xiang, Chao Li, Biao Wang, Xihan Wei, XianSheng Hua, Lei Zhang |
|
code |
-1 |
Proposal-Free Temporal Action Detection via Global Segmentation Mask Learning |
Sauradip Nag, Xiatian Zhu, YiZhe Song, Tao Xiang |
|
code |
-1 |
Semi-supervised Temporal Action Detection with Proposal-Free Masking |
Sauradip Nag, Xiatian Zhu, YiZhe Song, Tao Xiang |
|
code |
-1 |
Zero-Shot Temporal Action Detection via Vision-Language Prompting |
Sauradip Nag, Xiatian Zhu, YiZhe Song, Tao Xiang |
|
code |
-1 |
CycDA: Unsupervised Cycle Domain Adaptation to Learn from Image to Video |
Wei Lin, Anna Kukleva, Kunyang Sun, Horst Possegger, Hilde Kuehne, Horst Bischof |
|
code |
-1 |
S2N: Suppression-Strengthen Network for Event-Based Recognition Under Variant Illuminations |
Zengyu Wan, Yang Wang, Ganchao Tan, Yang Cao, ZhengJun Zha |
|
code |
-1 |
CMD: Self-supervised 3D Action Representation Learning with Cross-Modal Mutual Distillation |
Yunyao Mao, Wengang Zhou, Zhenbo Lu, Jiajun Deng, Houqiang Li |
|
code |
-1 |
CT2: Colorization Transformer via Color Tokens |
Shuchen Weng, Jimeng Sun, Yu Li, Si Li, Boxin Shi |
|
code |
-1 |
Simple Baselines for Image Restoration |
Liangyu Chen, Xiaojie Chu, Xiangyu Zhang, Jian Sun |
|
code |
-1 |
Spike Transformer: Monocular Depth Estimation for Spiking Camera |
Jiyuan Zhang, Lulu Tang, Zhaofei Yu, Jiwen Lu, TieJun Huang |
|
code |
-1 |
Improving Image Restoration by Revisiting Global Information Aggregation |
Xiaojie Chu, Liangyu Chen, Chengpeng Chen, Xin Lu |
|
code |
-1 |
Data Association Between Event Streams and Intensity Frames Under Diverse Baselines |
Dehao Zhang, Qiankun Ding, Peiqi Duan, Chu Zhou, Boxin Shi |
|
code |
-1 |
D2HNet: Joint Denoising and Deblurring with Hierarchical Network for Robust Night Image Restoration |
Yuzhi Zhao, Yongzhe Xu, Qiong Yan, Dingdong Yang, Xuehui Wang, LaiMan Po |
|
code |
-1 |
Learning Graph Neural Networks for Image Style Transfer |
Yongcheng Jing, Yining Mao, Yiding Yang, Yibing Zhan, Mingli Song, Xinchao Wang, Dacheng Tao |
|
code |
-1 |
DeepPS2: Revisiting Photometric Stereo Using Two Differently Illuminated Images |
Ashish Tiwari, Shanmuganathan Raman |
|
code |
-1 |
Instance Contour Adjustment via Structure-Driven CNN |
Shuchen Weng, Yi Wei, MingChing Chang, Boxin Shi |
|
code |
-1 |
Synthesizing Light Field Video from Monocular Video |
Shrisudhan Govindarajan, Prasan A. Shedligeri, Sarah, Kaushik Mitra |
|
code |
-1 |
Human-Centric Image Cropping with Partition-Aware and Content-Preserving Features |
Bo Zhang, Li Niu, Xing Zhao, Liqing Zhang |
|
code |
-1 |
DeMFI: Deep Joint Deblurring and Multi-frame Interpolation with Flow-Guided Attentive Correlation and Recursive Boosting |
Jihyong Oh, Munchurl Kim |
|
code |
-1 |
Neural Image Representations for Multi-image Fusion and Layer Separation |
Seonghyeon Nam, Marcus A. Brubaker, Michael S. Brown |
|
code |
-1 |
Bringing Rolling Shutter Images Alive with Dual Reversed Distortion |
Zhihang Zhong, Mingdeng Cao, Xiao Sun, Zhirong Wu, Zhongyi Zhou, Yinqiang Zheng, Stephen Lin, Imari Sato |
|
code |
-1 |
FILM: Frame Interpolation for Large Motion |
Fitsum A. Reda, Janne Kontkanen, Eric Tabellion, Deqing Sun, Caroline Pantofaru, Brian Curless |
|
code |
-1 |
Video Interpolation by Event-Driven Anisotropic Adjustment of Optical Flow |
Song Wu, Kaichao You, Weihua He, Chen Yang, Yang Tian, Yaoyuan Wang, Ziyang Zhang, Jianxing Liao |
|
code |
-1 |
EvAC3D: From Event-Based Apparent Contours to 3D Models via Continuous Visual Hulls |
Ziyun Wang, Kenneth Chaney, Kostas Daniilidis |
|
code |
-1 |
DCCF: Deep Comprehensible Color Filter Learning Framework for High-Resolution Image Harmonization |
Ben Xue, Shenghui Ran, Quan Chen, Rongfei Jia, Binqiang Zhao, Xing Tang |
|
code |
-1 |
SelectionConv: Convolutional Neural Networks for Non-rectilinear Image Data |
David Hart, Michael Whitney, Bryan S. Morse |
|
code |
-1 |
Spatial-Separated Curve Rendering Network for Efficient and High-Resolution Image Harmonization |
Jingtang Liang, Xiaodong Cun, ChiMan Pun, Jue Wang |
|
code |
-1 |
BigColor: Colorization Using a Generative Color Prior for Natural Images |
Geonung Kim, Kyoungkook Kang, Seongtae Kim, Hwayoon Lee, Sehoon Kim, Jonghyun Kim, SeungHwan Baek, Sunghyun Cho |
|
code |
-1 |
CADyQ: Content-Aware Dynamic Quantization for Image Super-Resolution |
Cheeun Hong, Sungyong Baik, Heewon Kim, Seungjun Nah, Kyoung Mu Lee |
|
code |
-1 |
Deep Semantic Statistics Matching (D2SM) Denoising Network |
Kangfu Mei, Vishal M. Patel, Rui Huang |
|
code |
-1 |
3D Scene Inference from Transient Histograms |
Sacha Jungerman, Atul Ingle, Yin Li, Mohit Gupta |
|
code |
-1 |
Neural Space-Filling Curves |
Hanyu Wang, Kamal Gupta, Larry Davis, Abhinav Shrivastava |
|
code |
-1 |
Exposure-Aware Dynamic Weighted Learning for Single-Shot HDR Imaging |
Vien Gia An, Chul Lee |
|
code |
-1 |
Seeing Through a Black Box: Toward High-Quality Terahertz Imaging via Subspace-and-Attention Guided Restoration |
WenTai Su, YiChun Hung, PoJen Yu, ShangHua Yang, ChiaWen Lin |
|
code |
-1 |
Tomography of Turbulence Strength Based on Scintillation Imaging |
Nir Shaul, Yoav Y. Schechner |
|
code |
-1 |
Realistic Blur Synthesis for Learning Image Deblurring |
Jaesung Rim, Geonung Kim, Jungeon Kim, Junyong Lee, Seungyong Lee, Sunghyun Cho |
|
code |
-1 |
Learning Phase Mask for Privacy-Preserving Passive Depth Estimation |
Zaid Tasneem, Giovanni Milione, YiHsuan Tsai, Xiang Yu, Ashok Veeraraghavan, Manmohan Chandraker, Francesco Pittaluga |
|
code |
-1 |
LWGNet - Learned Wirtinger Gradients for Fourier Ptychographic Phase Retrieval |
Atreyee Saha, Salman Siddique Khan, Sagar Sehrawat, Sanjana S. Prabhu, Shanti Bhattacharya, Kaushik Mitra |
|
code |
-1 |
PANDORA: Polarization-Aided Neural Decomposition of Radiance |
Akshat Dave, Yongyi Zhao, Ashok Veeraraghavan |
|
code |
-1 |
HuMMan: Multi-modal 4D Human Dataset for Versatile Sensing and Modeling |
Zhongang Cai, Daxuan Ren, Ailing Zeng, Zhengyu Lin, Tao Yu, Wenjia Wang, Xiangyu Fan, Yang Gao, Yifan Yu, Liang Pan, Fangzhou Hong, Mingyuan Zhang, Chen Change Loy, Lei Yang, Ziwei Liu |
|
code |
-1 |
DVS-Voltmeter: Stochastic Process-Based Event Simulator for Dynamic Vision Sensors |
Songnan Lin, Ye Ma, Zhenhua Guo, Bihan Wen |
|
code |
-1 |
Benchmarking Omni-Vision Representation Through the Lens of Visual Realms |
Yuanhan Zhang, Zhenfei Yin, Jing Shao, Ziwei Liu |
|
code |
-1 |
BEAT: A Large-Scale Semantic and Emotional Multi-modal Dataset for Conversational Gestures Synthesis |
Haiyang Liu, Zihao Zhu, Naoya Iwamoto, Yichen Peng, Zhengqing Li, You Zhou, Elif Bozkurt, Bo Zheng |
|
code |
-1 |
Neuromorphic Data Augmentation for Training Spiking Neural Networks |
Yuhang Li, Youngeun Kim, Hyoungseob Park, Tamar Geller, Priyadarshini Panda |
|
code |
-1 |
CelebV-HQ: A Large-Scale Video Facial Attributes Dataset |
Hao Zhu, Wayne Wu, Wentao Zhu, Liming Jiang, Siwei Tang, Li Zhang, Ziwei Liu, Chen Change Loy |
|
code |
-1 |
MovieCuts: A New Dataset and Benchmark for Cut Type Recognition |
Alejandro Pardo, Fabian Caba Heilbron, Juan León Alcázar, Ali K. Thabet, Bernard Ghanem |
|
code |
-1 |
LaMAR: Benchmarking Localization and Mapping for Augmented Reality |
PaulEdouard Sarlin, Mihai Dusmanu, Johannes L. Schönberger, Pablo Speciale, Lukas Gruber, Viktor Larsson, Ondrej Miksik, Marc Pollefeys |
|
code |
-1 |
Unitail: Detecting, Reading, and Matching in Retail Scene |
Fangyi Chen, Han Zhang, Zaiwang Li, Jiachen Dou, Shentong Mo, Hao Chen, Yongxin Zhang, Uzair Ahmed, Chenchen Zhu, Marios Savvides |
|
code |
-1 |
Not Just Streaks: Towards Ground Truth for Single Image Deraining |
Yunhao Ba, Howard Zhang, Ethan Yang, Akira Suzuki, Arnold Pfahnl, Chethan Chinder Chandrappa, Celso M. de Melo, Suya You, Stefano Soatto, Alex Wong, Achuta Kadambi |
|
code |
-1 |
MHR-Net: Multiple-Hypothesis Reconstruction of Non-Rigid Shapes from 2D Views |
Haitian Zeng, Xin Yu, Jiaxu Miao, Yi Yang |
|
code |
-1 |
Depth Map Decomposition for Monocular Depth Estimation |
Jinyoung Jun, Jaehan Lee, Chul Lee, ChangSu Kim |
|
code |
-1 |
Monitored Distillation for Positive Congruent Depth Completion |
Tian Yu Liu, Parth Agrawal, Allison Chen, ByungWoo Hong, Alex Wong |
|
code |
-1 |
Resolution-Free Point Cloud Sampling Network with Data Distillation |
Tianxin Huang, Jiangning Zhang, Jun Chen, Yuang Liu, Yong Liu |
|
code |
-1 |
Organic Priors in Non-rigid Structure from Motion |
Suryansh Kumar, Luc Van Gool |
|
code |
-1 |
Perspective Flow Aggregation for Data-Limited 6D Object Pose Estimation |
Yinlin Hu, Pascal Fua, Mathieu Salzmann |
|
code |
-1 |
DANBO: Disentangled Articulated Neural Body Representations via Graph Neural Networks |
ShihYang Su, Timur M. Bagautdinov, Helge Rhodin |
|
code |
-1 |
CHORE: Contact, Human and Object Reconstruction from a Single RGB Image |
Xianghui Xie, Bharat Lal Bhatnagar, Gerard PonsMoll |
|
code |
-1 |
Learned Vertex Descent: A New Direction for 3D Human Model Fitting |
Enric Corona, Gerard PonsMoll, Guillem Alenyà, Francesc MorenoNoguer |
|
code |
-1 |
Self-calibrating Photometric Stereo by Neural Inverse Rendering |
Junxuan Li, Hongdong Li |
|
code |
-1 |
3D Clothed Human Reconstruction in the Wild |
Gyeongsik Moon, Hyeongjin Nam, Takaaki Shiratori, Kyoung Mu Lee |
|
code |
-1 |
Directed Ray Distance Functions for 3D Scene Reconstruction |
Nilesh Kulkarni, Justin Johnson, David F. Fouhey |
|
code |
-1 |
Object Level Depth Reconstruction for Category Level 6D Object Pose Estimation from Monocular RGB Image |
Zhaoxin Fan, Zhenbo Song, Jian Xu, Zhicheng Wang, Kejian Wu, Hongyan Liu, Jun He |
|
code |
-1 |
Uncertainty Quantification in Depth Estimation via Constrained Ordinal Regression |
Dongting Hu, Liuhua Peng, Tingjin Chu, Xiaoxing Zhang, Yinian Mao, Howard D. Bondell, Mingming Gong |
|
code |
-1 |
CostDCNet: Cost Volume Based Depth Completion for a Single RGB-D Image |
Jaewon Kam, Jungeon Kim, Soongjin Kim, Jaesik Park, Seungyong Lee |
|
code |
-1 |
ShAPO: Implicit Representations for Multi-object Shape, Appearance, and Pose Optimization |
Muhammad Zubair Irshad, Sergey Zakharov, Rares Ambrus, Thomas Kollar, Zsolt Kira, Adrien Gaidon |
|
code |
-1 |
3D Siamese Transformer Network for Single Object Tracking on Point Clouds |
Le Hui, Lingpeng Wang, Linghua Tang, Kaihao Lan, Jin Xie, Jian Yang |
|
code |
-1 |
Object Wake-Up: 3D Object Rigging from a Single Image |
Ji Yang, Xinxin Zuo, Sen Wang, Zhenbo Yu, Xingyu Li, Bingbing Ni, Minglun Gong, Li Cheng |
|
code |
-1 |
IntegratedPIFu: Integrated Pixel Aligned Implicit Function for Single-View Human Reconstruction |
Kennard Yanting Chan, Guosheng Lin, Haiyu Zhao, Weisi Lin |
|
code |
-1 |
Realistic One-Shot Mesh-Based Head Avatars |
Taras Khakhulin, Vanessa Sklyarova, Victor Lempitsky, Egor Zakharov |
|
code |
-1 |
A Kendall Shape Space Approach to 3D Shape Estimation from 2D Landmarks |
Martha Paskin, Daniel Baum, Mason N. Dean, Christoph von Tycowicz |
|
code |
-1 |
Neural Light Field Estimation for Street Scenes with Differentiable Virtual Object Insertion |
Zian Wang, Wenzheng Chen, David Acuna, Jan Kautz, Sanja Fidler |
|
code |
-1 |
Perspective Phase Angle Model for Polarimetric 3D Reconstruction |
Guangcheng Chen, Li He, Yisheng Guan, Hong Zhang |
|
code |
-1 |
DeepShadow: Neural Shape from Shadow |
Asaf Karnieli, Ohad Fried, Yacov HelOr |
|
code |
-1 |
Camera Auto-calibration from the Steiner Conic of the Fundamental Matrix |
Yu Liu, Hui Zhang |
|
code |
-1 |
Super-Resolution 3D Human Shape from a Single Low-Resolution Image |
Marco Pesavento, Marco Volino, Adrian Hilton |
|
code |
-1 |
Minimal Neural Atlas: Parameterizing Complex Surfaces with Minimal Charts and Distortion |
Weng Fei Low, Gim Hee Lee |
|
code |
-1 |
ExtrudeNet: Unsupervised Inverse Sketch-and-Extrude for Shape Parsing |
Daxuan Ren, Jianmin Zheng, Jianfei Cai, Jiatong Li, Junzhe Zhang |
|
code |
-1 |
CATRE: Iterative Point Clouds Alignment for Category-Level Object Pose Refinement |
Xingyu Liu, Gu Wang, Yi Li, Xiangyang Ji |
|
code |
-1 |
Optimization over Disentangled Encoding: Unsupervised Cross-Domain Point Cloud Completion via Occlusion Factor Manipulation |
Jingyu Gong, Fengqi Liu, Jiachen Xu, Min Wang, Xin Tan, Zhizhong Zhang, Ran Yi, Haichuan Song, Yuan Xie, Lizhuang Ma |
|
code |
-1 |
Unsupervised Learning of 3D Semantic Keypoints with Mutual Reconstruction |
Haocheng Yuan, Chen Zhao, Shichao Fan, Jiaxi Jiang, Jiaqi Yang |
|
code |
-1 |
MvDeCor: Multi-view Dense Correspondence Learning for Fine-Grained 3D Segmentation |
Gopal Sharma, Kangxue Yin, Subhransu Maji, Evangelos Kalogerakis, Or Litany, Sanja Fidler |
|
code |
-1 |
SUPR: A Sparse Unified Part-Based Human Representation |
Ahmed A. A. Osman, Timo Bolkart, Dimitrios Tzionas, Michael J. Black |
|
code |
-1 |
Revisiting Point Cloud Simplification: A Learnable Feature Preserving Approach |
Rolandos Alexandros Potamias, Giorgos Bouritsas, Stefanos Zafeiriou |
|
code |
-1 |
Masked Autoencoders for Point Cloud Self-supervised Learning |
Yatian Pang, Wenxiao Wang, Francis E. H. Tay, Wei Liu, Yonghong Tian, Li Yuan |
|
code |
-1 |
Intrinsic Neural Fields: Learning Functions on Manifolds |
Lukas Koestler, Daniel Grittner, Michael Möller, Daniel Cremers, Zorah Lähner |
|
code |
-1 |
Skeleton-Free Pose Transfer for Stylized 3D Characters |
Zhouyingcheng Liao, Jimei Yang, Jun Saito, Gerard PonsMoll, Yang Zhou |
|
code |
-1 |
Masked Discrimination for Self-supervised Learning on Point Clouds |
Haotian Liu, Mu Cai, Yong Jae Lee |
|
code |
-1 |
FBNet: Feedback Network for Point Cloud Completion |
Xuejun Yan, Hongyu Yan, Jingjing Wang, Hang Du, Zhihong Wu, Di Xie, Shiliang Pu, Li Lu |
|
code |
-1 |
Meta-sampler: Almost-Universal yet Task-Oriented Sampling for Point Clouds |
Ta Ying Cheng, Qingyong Hu, Qian Xie, Niki Trigoni, Andrew Markham |
|
code |
-1 |
A Level Set Theory for Neural Implicit Evolution Under Explicit Flows |
Ishit Mehta, Manmohan Chandraker, Ravi Ramamoorthi |
|
code |
-1 |
Efficient Point Cloud Analysis Using Hilbert Curve |
Wanli Chen, Xinge Zhu, Guojin Chen, Bei Yu |
|
code |
-1 |
Expanding Language-Image Pretrained Models for General Video Recognition |
Bolin Ni, Houwen Peng, Minghao Chen, Songyang Zhang, Gaofeng Meng, Jianlong Fu, Shiming Xiang, Haibin Ling |
|
code |
-1 |
Hunting Group Clues with Transformers for Social Group Activity Recognition |
Masato Tamura, Rahul Vishwakarma, Ravigopal Vennelakanti |
|
code |
-1 |
Contrastive Positive Mining for Unsupervised 3D Action Representation Learning |
Haoyuan Zhang, Yonghong Hou, Wenjing Zhang, Wanqing Li |
|
code |
-1 |
Target-Absent Human Attention |
Zhibo Yang, Sounak Mondal, Seoyoung Ahn, Gregory J. Zelinsky, Minh Hoai, Dimitris Samaras |
|
code |
-1 |
Uncertainty-Based Spatial-Temporal Attention for Online Action Detection |
Hongji Guo, Zhou Ren, Yi Wu, Gang Hua, Qiang Ji |
|
code |
-1 |
Iwin: Human-Object Interaction Detection via Transformer with Irregular Windows |
Danyang Tu, Xiongkuo Min, Huiyu Duan, Guodong Guo, Guangtao Zhai, Wei Shen |
|
code |
-1 |
Rethinking Zero-shot Action Recognition: Learning from Latent Atomic Actions |
Yijun Qian, Lijun Yu, Wenhe Liu, Alexander G. Hauptmann |
|
code |
-1 |
Mining Cross-Person Cues for Body-Part Interactiveness Learning in HOI Detection |
Xiaoqian Wu, YongLu Li, Xinpeng Liu, Junyi Zhang, Yuzhe Wu, Cewu Lu |
|
code |
-1 |
Collaborating Domain-Shared and Target-Specific Feature Clustering for Cross-domain 3D Action Recognition |
Qinying Liu, Zilei Wang |
|
code |
-1 |
Is Appearance Free Action Recognition Possible? |
Filip Ilic, Thomas Pock, Richard P. Wildes |
|
code |
-1 |
Learning Spatial-Preserved Skeleton Representations for Few-Shot Action Recognition |
Ning Ma, Hongyi Zhang, Xuhui Li, Sheng Zhou, Zhen Zhang, Jun Wen, Haifeng Li, Jingjun Gu, Jiajun Bu |
|
code |
-1 |
Dual-Evidential Learning for Weakly-supervised Temporal Action Localization |
Mengyuan Chen, Junyu Gao, Shicai Yang, Changsheng Xu |
|
code |
-1 |
Global-Local Motion Transformer for Unsupervised Skeleton-Based Action Learning |
Boeun Kim, Hyung Jin Chang, Jungho Kim, Jin Young Choi |
|
code |
-1 |
AdaFocusV3: On Unified Spatial-Temporal Dynamic Video Recognition |
Yulin Wang, Yang Yue, Xinhong Xu, Ali Hassani, Victor Kulikov, Nikita Orlov, Shiji Song, Humphrey Shi, Gao Huang |
|
code |
-1 |
Panoramic Human Activity Recognition |
Ruize Han, Haomin Yan, Jiacheng Li, Song Wang, Wei Feng |
|
code |
-1 |
Delving into Details: Synopsis-to-Detail Networks for Video Recognition |
Shuxian Liang, Xu Shen, Jianqiang Huang, XianSheng Hua |
|
code |
-1 |
A Generalized and Robust Framework for Timestamp Supervision in Temporal Action Segmentation |
Rahul Rahaman, Dipika Singhania, Alexandre H. Thiery, Angela Yao |
|
code |
-1 |
Few-Shot Action Recognition with Hierarchical Matching and Contrastive Learning |
Sipeng Zheng, Shizhe Chen, Qin Jin |
|
code |
-1 |
PrivHAR: Recognizing Human Actions from Privacy-Preserving Lens |
Carlos Hinojosa, Miguel Marquez, Henry Arguello, Ehsan Adeli, Li FeiFei, Juan Carlos Niebles |
|
code |
-1 |
Scale-Aware Spatio-Temporal Relation Learning for Video Anomaly Detection |
Guoqiu Li, Guanxiong Cai, Xingyu Zeng, Rui Zhao |
|
code |
-1 |
Compound Prototype Matching for Few-Shot Action Recognition |
Yifei Huang, Lijin Yang, Yoichi Sato |
|
code |
-1 |
Continual 3D Convolutional Neural Networks for Real-time Processing of Videos |
Lukas Hedegaard, Alexandros Iosifidis |
|
code |
-1 |
Dynamic Spatio-Temporal Specialization Learning for Fine-Grained Action Recognition |
Tianjiao Li, Lin Geng Foo, Qiuhong Ke, Hossein Rahmani, Anran Wang, Jinghua Wang, Jun Liu |
|
code |
-1 |
Dynamic Local Aggregation Network with Adaptive Clusterer for Anomaly Detection |
Zhiwei Yang, Peng Wu, Jing Liu, Xiaotao Liu |
|
code |
-1 |
Action Quality Assessment with Temporal Parsing Transformer |
Yang Bai, Desen Zhou, Songyang Zhang, Jian Wang, Errui Ding, Yu Guan, Yang Long, Jingdong Wang |
|
code |
-1 |
Entry-Flipped Transformer for Inference and Prediction of Participant Behavior |
Bo Hu, TatJen Cham |
|
code |
-1 |
Pairwise Contrastive Learning Network for Action Quality Assessment |
Mingzhe Li, Hongbo Zhang, Qing Lei, Zongwen Fan, Jinghua Liu, JiXiang Du |
|
code |
-1 |
Geometric Features Informed Multi-person Human-Object Interaction Recognition in Videos |
Tanqiu Qiao, Qianhui Men, Frederick W. B. Li, Yoshiki Kubotani, Shigeo Morishima, Hubert P. H. Shum |
|
code |
-1 |
ActionFormer: Localizing Moments of Actions with Transformers |
ChenLin Zhang, Jianxin Wu, Yin Li |
|
code |
-1 |
SocialVAE: Human Trajectory Prediction Using Timewise Latents |
Pei Xu, JeanBernard Hayet, Ioannis Karamouzas |
|
code |
-1 |
Shape Matters: Deformable Patch Attack |
Zhaoyu Chen, Bo Li, Shuang Wu, Jianghe Xu, Shouhong Ding, Wenqiang Zhang |
|
code |
-1 |
Frequency Domain Model Augmentation for Adversarial Attack |
Yuyang Long, Qilong Zhang, Boheng Zeng, Lianli Gao, Xianglong Liu, Jian Zhang, Jingkuan Song |
|
code |
-1 |
Prior-Guided Adversarial Initialization for Fast Adversarial Training |
Xiaojun Jia, Yong Zhang, Xingxing Wei, Baoyuan Wu, Ke Ma, Jue Wang, Xiaochun Cao |
|
code |
-1 |
Enhanced Accuracy and Robustness via Multi-teacher Adversarial Distillation |
Shiji Zhao, Jie Yu, Zhenlong Sun, Bo Zhang, Xingxing Wei |
|
code |
-1 |
LGV: Boosting Adversarial Example Transferability from Large Geometric Vicinity |
Martin Gubri, Maxime Cordy, Mike Papadakis, Yves Le Traon, Koushik Sen |
|
code |
-1 |
A Large-Scale Multiple-objective Method for Black-box Attack Against Object Detection |
Siyuan Liang, Longkang Li, Yanbo Fan, Xiaojun Jia, Jingzhi Li, Baoyuan Wu, Xiaochun Cao |
|
code |
-1 |
GradAuto: Energy-Oriented Attack on Dynamic Neural Networks |
Jianhong Pan, Qichen Zheng, Zhipeng Fan, Hossein Rahmani, Qiuhong Ke, Jun Liu |
|
code |
-1 |
A Spectral View of Randomized Smoothing Under Common Corruptions: Benchmarking and Improving Certified Robustness |
Jiachen Sun, Akshay Mehra, Bhavya Kailkhura, PinYu Chen, Dan Hendrycks, Jihun Hamm, Z. Morley Mao |
|
code |
-1 |
Improving Adversarial Robustness of 3D Point Cloud Classification Models |
Guanlin Li, Guowen Xu, Han Qiu, Ruan He, Jiwei Li, Tianwei Zhang |
|
code |
-1 |
Learning Extremely Lightweight and Robust Model with Differentiable Constraints on Sparsity and Condition Number |
Xian Wei, Yangyu Xu, Yanhui Huang, Hairong Lv, Hai Lan, Mingsong Chen, Xuan Tang |
|
code |
-1 |
RIBAC: Towards Robust and Imperceptible Backdoor Attack against Compact DNN |
Huy Phan, Cong Shi, Yi Xie, Tianfang Zhang, Zhuohang Li, Tianming Zhao, Jian Liu, Yan Wang, Yingying Chen, Bo Yuan |
|
code |
-1 |
Boosting Transferability of Targeted Adversarial Examples via Hierarchical Generative Networks |
Xiao Yang, Yinpeng Dong, Tianyu Pang, Hang Su, Jun Zhu |
|
code |
-1 |
tSF: Transformer-Based Semantic Filter for Few-Shot Learning |
Jinxiang Lai, Siqian Yang, Wenlong Liu, Yi Zeng, Zhongyi Huang, Wenlong Wu, Jun Liu, BinBin Gao, Chengjie Wang |
|
code |
-1 |
Adversarial Feature Augmentation for Cross-domain Few-Shot Classification |
Yanxu Hu, Andy J. Ma |
|
code |
-1 |
Constructing Balance from Imbalance for Long-Tailed Image Recognition |
Yue Xu, YongLu Li, Jiefeng Li, Cewu Lu |
|
code |
-1 |
On Multi-Domain Long-Tailed Recognition, Imbalanced Domain Generalization and Beyond |
Yuzhe Yang, Hao Wang, Dina Katabi |
|
code |
-1 |
Few-Shot Video Object Detection |
Qi Fan, ChiKeung Tang, YuWing Tai |
|
code |
-1 |
Worst Case Matters for Few-Shot Recognition |
Minghao Fu, YunHao Cao, Jianxin Wu |
|
code |
-1 |
Exploring Hierarchical Graph Representation for Large-Scale Zero-Shot Image Classification |
Kai Yi, Xiaoqian Shen, Yunhao Gou, Mohamed Elhoseiny |
|
code |
-1 |
Doubly Deformable Aggregation of Covariance Matrices for Few-Shot Segmentation |
Zhitong Xiong, Haopeng Li, Xiao Xiang Zhu |
|
code |
-1 |
Dense Cross-Query-and-Support Attention Weighted Mask Aggregation for Few-Shot Segmentation |
Xinyu Shi, Dong Wei, Yu Zhang, Donghuan Lu, Munan Ning, Jiashun Chen, Kai Ma, Yefeng Zheng |
|
code |
-1 |
Rethinking Clustering-Based Pseudo-Labeling for Unsupervised Meta-Learning |
Xingping Dong, Jianbing Shen, Ling Shao |
|
code |
-1 |
CLASTER: Clustering with Reinforcement Learning for Zero-Shot Action Recognition |
Shreyank N. Gowda, Laura SevillaLara, Frank Keller, Marcus Rohrbach |
|
code |
-1 |
Few-Shot Class-Incremental Learning for 3D Point Cloud Objects |
Townim F. Chowdhury, Ali Cheraghian, Sameera Ramasinghe, Sahar Ahmadi, Morteza Saberi, Shafin Rahman |
|
code |
-1 |
Meta-Learning with Less Forgetting on Large-Scale Non-Stationary Task Distributions |
Zhenyi Wang, Li Shen, Le Fang, Qiuling Suo, Donglin Zhan, Tiehang Duan, Mingchen Gao |
|
code |
-1 |
DnA: Improving Few-Shot Transfer Learning with Low-Rank Decomposition and Alignment |
Ziyu Jiang, Tianlong Chen, Xuxi Chen, Yu Cheng, Luowei Zhou, Lu Yuan, Ahmed Hassan Awadallah, Zhangyang Wang |
|
code |
-1 |
Learning Instance and Task-Aware Dynamic Kernels for Few-Shot Learning |
Rongkai Ma, Pengfei Fang, Gil Avraham, Yan Zuo, Tianyu Zhu, Tom Drummond, Mehrtash Harandi |
|
code |
-1 |
Open-World Semantic Segmentation via Contrasting and Clustering Vision-Language Embedding |
Quande Liu, Youpeng Wen, Jianhua Han, Chunjing Xu, Hang Xu, Xiaodan Liang |
|
code |
-1 |
Few-Shot Classification with Contrastive Learning |
Zhanyuan Yang, Jinghua Wang, Yingying Zhu |
|
code |
-1 |
Time-rEversed DiffusioN tEnsor Transformer: A New TENET of Few-Shot Object Detection |
Shan Zhang, Naila Murray, Lei Wang, Piotr Koniusz |
|
code |
-1 |
Self-Promoted Supervision for Few-Shot Transformer |
Bowen Dong, Pan Zhou, Shuicheng Yan, Wangmeng Zuo |
|
code |
-1 |
Few-Shot Object Counting and Detection |
Thanh Nguyen, Chau Pham, Khoi Nguyen, Minh Hoai |
|
code |
-1 |
Rethinking Few-Shot Object Detection on a Multi-Domain Benchmark |
Kibok Lee, Hao Yang, Satyaki Chakraborty, Zhaowei Cai, Gurumurthy Swaminathan, Avinash Ravichandran, Onkar Dabeer |
|
code |
-1 |
Cross-Domain Cross-Set Few-Shot Learning via Learning Compact and Aligned Representations |
Wentao Chen, Zhang Zhang, Wei Wang, Liang Wang, Zilei Wang, Tieniu Tan |
|
code |
-1 |
Mutually Reinforcing Structure with Proposal Contrastive Consistency for Few-Shot Object Detection |
TianXue Ma, Mingwei Bi, Jian Zhang, Wang Yuan, Zhizhong Zhang, Yuan Xie, Shouhong Ding, Lizhuang Ma |
|
code |
-1 |
Dual Contrastive Learning with Anatomical Auxiliary Supervision for Few-Shot Medical Image Segmentation |
Huisi Wu, Fangyan Xiao, Chongxin Liang |
|
code |
-1 |
Improving Few-Shot Learning Through Multi-task Representation Learning Theory |
Quentin Bouniot, Ievgen Redko, Romaric Audigier, Angélique Loesch, Amaury Habrard |
|
code |
-1 |
Tree Structure-Aware Few-Shot Image Classification via Hierarchical Aggregation |
Min Zhang, Siteng Huang, Wenbin Li, Donglin Wang |
|
code |
-1 |
Inductive and Transductive Few-Shot Video Classification via Appearance and Temporal Alignments |
Khoi D. Nguyen, QuocHuy Tran, Khoi Nguyen, BinhSon Hua, Rang Nguyen |
|
code |
-1 |
Temporal and Cross-modal Attention for Audio-Visual Zero-Shot Learning |
OtnielBogdan Mercea, Thomas Hummel, A. Sophia Koepke, Zeynep Akata |
|
code |
-1 |
HM: Hybrid Masking for Few-Shot Segmentation |
Seonghyeon Moon, Samuel S. Sohn, Honglu Zhou, Sejong Yoon, Vladimir Pavlovic, Muhammad Haris Khan, Mubbasir Kapadia |
|
code |
-1 |
TransVLAD: Focusing on Locally Aggregated Descriptors for Few-Shot Learning |
Haoquan Li, Laoming Zhang, Daoan Zhang, Lang Fu, Peng Yang, Jianguo Zhang |
|
code |
-1 |
Kernel Relative-prototype Spectral Filtering for Few-Shot Learning |
Tao Zhang, Wu Huang |
|
code |
-1 |
"This Is My Unicorn, Fluffy": Personalizing Frozen Vision-Language Representations |
Niv Cohen, Rinon Gal, Eli A. Meirom, Gal Chechik, Yuval Atzmon |
|
code |
-1 |
CLOSE: Curriculum Learning on the Sharing Extent Towards Better One-Shot NAS |
Zixuan Zhou, Xuefei Ning, Yi Cai, Jiashu Han, Yiping Deng, Yuhan Dong, Huazhong Yang, Yu Wang |
|
code |
-1 |
Streamable Neural Fields |
Junwoo Cho, Seungtae Nam, Daniel Rho, Jong Hwan Ko, Eunbyung Park |
|
code |
-1 |
Gradient-Based Uncertainty for Monocular Depth Estimation |
Julia Hornauer, Vasileios Belagiannis |
|
code |
-1 |
Online Continual Learning with Contrastive Vision Transformer |
Zhen Wang, Liu Liu, Yajing Kong, Jiaxian Guo, Dacheng Tao |
|
code |
-1 |
CPrune: Compiler-Informed Model Pruning for Efficient Target-Aware DNN Execution |
Taeho Kim, Yongin Kwon, Jemin Lee, Taeho Kim, Sangtae Ha |
|
code |
-1 |
EAutoDet: Efficient Architecture Search for Object Detection |
Xiaoxing Wang, Jiale Lin, Juanping Zhao, Xiaokang Yang, Junchi Yan |
|
code |
-1 |
A Max-Flow Based Approach for Neural Architecture Search |
Chao Xue, Xiaoxing Wang, Junchi Yan, ChunGuang Li |
|
code |
-1 |
OccamNets: Mitigating Dataset Bias by Favoring Simpler Hypotheses |
Robik Shrestha, Kushal Kafle, Christopher Kanan |
|
code |
-1 |
ERA: Enhanced Rational Activations |
Martin Trimmel, Mihai Zanfir, Richard I. Hartley, Cristian Sminchisescu |
|
code |
-1 |
Convolutional Embedding Makes Hierarchical Vision Transformer Stronger |
Cong Wang, Hongmin Xu, Xiong Zhang, Li Wang, Zhitong Zheng, Haifeng Liu |
|
code |
-1 |
Learning Depth from Focus in the Wild |
Changyeon Won, HaeGon Jeon |
|
code |
-1 |
Learning-Based Point Cloud Registration for 6D Object Pose Estimation in the Real World |
Zheng Dang, Lizhou Wang, Yu Guo, Mathieu Salzmann |
|
code |
-1 |
An End-to-End Transformer Model for Crowd Localization |
Dingkang Liang, Wei Xu, Xiang Bai |
|
code |
-1 |
Few-Shot Single-View 3D Reconstruction with Memory Prior Contrastive Network |
Zhen Xing, Yijiang Chen, Zhixin Ling, Xiangdong Zhou, Yu Xiang |
|
code |
-1 |
DID-M3D: Decoupling Instance Depth for Monocular 3D Object Detection |
Liang Peng, Xiaopei Wu, Zheng Yang, Haifeng Liu, Deng Cai |
|
code |
-1 |
Adaptive Co-teaching for Unsupervised Monocular Depth Estimation |
Weisong Ren, Lijun Wang, Yongri Piao, Miao Zhang, Huchuan Lu, Ting Liu |
|
code |
-1 |
Fusing Local Similarities for Retrieval-Based 3D Orientation Estimation of Unseen Objects |
Chen Zhao, Yinlin Hu, Mathieu Salzmann |
|
code |
-1 |
Lidar Point Cloud Guided Monocular 3D Object Detection |
Liang Peng, Fei Liu, Zhengxu Yu, Senbo Yan, Dan Deng, Zheng Yang, Haifeng Liu, Deng Cai |
|
code |
-1 |
Structural Causal 3D Reconstruction |
Weiyang Liu, Zhen Liu, Liam Paull, Adrian Weller, Bernhard Schölkopf |
|
code |
-1 |
3D Human Pose Estimation Using Möbius Graph Convolutional Networks |
Niloofar Azizi, Horst Possegger, Emanuele Rodolà, Horst Bischof |
|
code |
-1 |
Learning to Train a Point Cloud Reconstruction Network Without Matching |
Tianxin Huang, Xuemeng Yang, Jiangning Zhang, Jinhao Cui, Hao Zou, Jun Chen, Xiangrui Zhao, Yong Liu |
|
code |
-1 |
PanoFormer: Panorama Transformer for Indoor 360$^{\circ }$ Depth Estimation |
Zhijie Shen, Chunyu Lin, Kang Liao, Lang Nie, Zishuo Zheng, Yao Zhao |
|
code |
-1 |
Self-supervised Human Mesh Recovery with Cross-Representation Alignment |
Xuan Gong, Meng Zheng, Benjamin Planche, Srikrishna Karanam, Terrence Chen, David S. Doermann, Ziyan Wu |
|
code |
-1 |
AlignSDF: Pose-Aligned Signed Distance Fields for Hand-Object Reconstruction |
Zerui Chen, Yana Hasson, Cordelia Schmid, Ivan Laptev |
|
code |
-1 |
A Reliable Online Method for Joint Estimation of Focal Length and Camera Rotation |
Yiming Qian, James H. Elder |
|
code |
-1 |
PS-NeRF: Neural Inverse Rendering for Multi-view Photometric Stereo |
Wenqi Yang, Guanying Chen, Chaofeng Chen, Zhenfang Chen, KwanYee K. Wong |
|
code |
-1 |
Share with Thy Neighbors: Single-View Reconstruction by Cross-Instance Consistency |
Tom Monnier, Matthew Fisher, Alexei A. Efros, Mathieu Aubry |
|
code |
-1 |
Towards Comprehensive Representation Enhancement in Semantics-Guided Self-supervised Monocular Depth Estimation |
Jingyuan Ma, Xiangyu Lei, Nan Liu, Xian Zhao, Shiliang Pu |
|
code |
-1 |
AvatarCap: Animatable Avatar Conditioned Monocular Human Volumetric Capture |
Zhe Li, Zerong Zheng, Hongwen Zhang, Chaonan Ji, Yebin Liu |
|
code |
-1 |
Cross-Attention of Disentangled Modalities for 3D Human Mesh Recovery with Transformers |
Junhyeong Cho, Kim Youwang, TaeHyun Oh |
|
code |
-1 |
GeoRefine: Self-supervised Online Depth Refinement for Accurate Dense Mapping |
Pan Ji, Qingan Yan, Yuxin Ma, Yi Xu |
|
code |
-1 |
Multi-modal Masked Pre-training for Monocular Panoramic Depth Completion |
Zhiqiang Yan, Xiang Li, Kun Wang, Zhenyu Zhang, Jun Li, Jian Yang |
|
code |
-1 |
GitNet: Geometric Prior-Based Transformation for Birds-Eye-View Segmentation |
Shi Gong, Xiaoqing Ye, Xiao Tan, Jingdong Wang, Errui Ding, Yu Zhou, Xiang Bai |
|
code |
-1 |
Learning Visibility for Robust Dense Human Body Estimation |
ChunHan Yao, Jimei Yang, Duygu Ceylan, Yi Zhou, Yang Zhou, MingHsuan Yang |
|
code |
-1 |
Towards High-Fidelity Single-View Holistic Reconstruction of Indoor Scenes |
Haolin Liu, Yujian Zheng, Guanying Chen, Shuguang Cui, Xiaoguang Han |
|
code |
-1 |
CompNVS: Novel View Synthesis with Scene Completion |
Zuoyue Li, Tianxing Fan, Zhenqiang Li, Zhaopeng Cui, Yoichi Sato, Marc Pollefeys, Martin R. Oswald |
|
code |
-1 |
SketchSampler: Sketch-Based 3D Reconstruction via View-Dependent Depth Sampling |
Chenjian Gao, Qian Yu, Lu Sheng, YiZhe Song, Dong Xu |
|
code |
-1 |
LocalBins: Improving Depth Estimation by Learning Local Distributions |
Shariq Farooq Bhat, Ibraheem Alhashim, Peter Wonka |
|
code |
-1 |
2D GANs Meet Unsupervised Single-View 3D Reconstruction |
Feng Liu, Xiaoming Liu |
|
code |
-1 |
InfiniteNature-Zero: Learning Perpetual View Generation of Natural Scenes from Single Images |
Zhengqi Li, Qianqian Wang, Noah Snavely, Angjoo Kanazawa |
|
code |
-1 |
Semi-supervised Single-View 3D Reconstruction via Prototype Shape Priors |
Zhen Xing, Hengduo Li, Zuxuan Wu, YuGang Jiang |
|
code |
-1 |
Bilateral Normal Integration |
Xu Cao, Hiroaki Santo, Boxin Shi, Fumio Okura, Yasuyuki Matsushita |
|
code |
-1 |
S2Contact: Graph-Based Network for 3D Hand-Object Contact Estimation with Semi-supervised Learning |
Tze Ho Elden Tse, Zhongqun Zhang, Kwang In Kim, Ales Leonardis, Feng Zheng, Hyung Jin Chang |
|
code |
-1 |
SC-wLS: Towards Interpretable Feed-forward Camera Re-localization |
Xin Wu, Hao Zhao, Shunkai Li, Yingdian Cao, Hongbin Zha |
|
code |
-1 |
FloatingFusion: Depth from ToF and Image-Stabilized Stereo Cameras |
Andreas Meuleman, Hakyeong Kim, James Tompkin, Min H. Kim |
|
code |
-1 |
DELTAR: Depth Estimation from a Light-Weight ToF Sensor and RGB Image |
Yijin Li, Xinyang Liu, Wenqi Dong, Han Zhou, Hujun Bao, Guofeng Zhang, Yinda Zhang, Zhaopeng Cui |
|
code |
-1 |
3D Room Layout Estimation from a Cubemap of Panorama Image via Deep Manhattan Hough Transform |
Yining Zhao, Chao Wen, Zhou Xue, Yue Gao |
|
code |
-1 |
RBP-Pose: Residual Bounding Box Projection for Category-Level Pose Estimation |
Ruida Zhang, Yan Di, Zhiqiang Lou, Fabian Manhardt, Federico Tombari, Xiangyang Ji |
|
code |
-1 |
Monocular 3D Object Reconstruction with GAN Inversion |
Junzhe Zhang, Daxuan Ren, Zhongang Cai, Chai Kiat Yeo, Bo Dai, Chen Change Loy |
|
code |
-1 |
Map-Free Visual Relocalization: Metric Pose Relative to a Single Image |
Eduardo Arnold, Jamie Wynn, Sara Vicente, Guillermo GarciaHernando, Áron Monszpart, Victor Prisacariu, Daniyar Turmukhambetov, Eric Brachmann |
|
code |
-1 |
Self-distilled Feature Aggregation for Self-supervised Monocular Depth Estimation |
Zhengming Zhou, Qiulei Dong |
|
code |
-1 |
Planes vs. Chairs: Category-Guided 3D Shape Learning Without any 3D Cues |
Zixuan Huang, Stefan Stojanov, Anh Thai, Varun Jampani, James M. Rehg |
|
code |
-1 |
ECCV Caption: Correcting False Negatives by Collecting Machine-and-Human-verified Image-Caption Associations for MS-COCO |
Sanghyuk Chun, Wonjae Kim, Song Park, Minsuk Chang, Seong Joon Oh |
|
code |
-1 |
MOTCOM: The Multi-Object Tracking Dataset Complexity Metric |
Malte Pedersen, Joakim Bruslund Haurum, Patrick Dendorfer, Thomas B. Moeslund |
|
code |
-1 |
How to Synthesize a Large-Scale and Trainable Micro-Expression Dataset? |
Yuchi Liu, Zhongdao Wang, Tom Gedeon, Liang Zheng |
|
code |
-1 |
A Real World Dataset for Multi-view 3D Reconstruction |
Rakesh Shrestha, Siqi Hu, Minghao Gou, Ziyuan Liu, Ping Tan |
|
code |
-1 |
REALY: Rethinking the Evaluation of 3D Face Reconstruction |
Zenghao Chai, Haoxian Zhang, Jing Ren, Di Kang, Zhengzhuo Xu, Xuefei Zhe, Chun Yuan, Linchao Bao |
|
code |
-1 |
Capturing, Reconstructing, and Simulating: The UrbanScene3D Dataset |
Liqiang Lin, Yilin Liu, Yue Hu, Xingguang Yan, Ke Xie, Hui Huang |
|
code |
-1 |
3D CoMPaT: Composition of Materials on Parts of 3D Things |
Yuchen Li, Ujjwal Upadhyay, Habib Slim, Ahmed Abdelreheem, Arpit Prajapati, Suhail Pothigara, Peter Wonka, Mohamed Elhoseiny |
|
code |
-1 |
PartImageNet: A Large, High-Quality Dataset of Parts |
Ju He, Shuo Yang, Shaokang Yang, Adam Kortylewski, Xiaoding Yuan, Jieneng Chen, Shuai Liu, Cheng Yang, Qihang Yu, Alan L. Yuille |
|
code |
-1 |
A-OKVQA: A Benchmark for Visual Question Answering Using World Knowledge |
Dustin Schwenk, Apoorv Khandelwal, Christopher Clark, Kenneth Marino, Roozbeh Mottaghi |
|
code |
-1 |
OOD-CV: A Benchmark for Robustness to Out-of-Distribution Shifts of Individual Nuisances in Natural Images |
Bingchen Zhao, Shaozuo Yu, Wufei Ma, Mingxin Yu, Shenxiao Mei, Angtian Wang, Ju He, Alan L. Yuille, Adam Kortylewski |
|
code |
-1 |
Facial Depth and Normal Estimation Using Single Dual-Pixel Camera |
Minjun Kang, Jaesung Choe, Hyowon Ha, HaeGon Jeon, Sunghoon Im, In So Kweon, KukJin Yoon |
|
code |
-1 |
The Anatomy of Video Editing: A Dataset and Benchmark Suite for AI-Assisted Video Editing |
Dawit Mureja Argaw, Fabian Caba Heilbron, JoonYoung Lee, Markus Woodson, In So Kweon |
|
code |
-1 |
StyleBabel: Artistic Style Tagging and Captioning |
Dan Ruta, Andrew Gilbert, Pranav Aggarwal, Naveen Marri, Ajinkya Kale, Jo Briggs, Chris Speed, Hailin Jin, Baldo Faieta, Alex Filipkowski, Zhe Lin, John P. Collomosse |
|
code |
-1 |
PANDORA: A Panoramic Detection Dataset for Object with Orientation |
Hang Xu, Qiang Zhao, Yike Ma, Xiaodong Li, Peng Yuan, Bailan Feng, Chenggang Yan, Feng Dai |
|
code |
-1 |
FS-COCO: Towards Understanding of Freehand Sketches of Common Objects in Context |
Pinaki Nath Chowdhury, Aneeshan Sain, Ayan Kumar Bhunia, Tao Xiang, Yulia Gryaditskaya, YiZhe Song |
|
code |
-1 |
Exploring Fine-Grained Audiovisual Categorization with the SSW60 Dataset |
Grant Van Horn, Rui Qian, Kimberly Wilber, Hartwig Adam, Oisin Mac Aodha, Serge J. Belongie |
|
code |
-1 |
The Caltech Fish Counting Dataset: A Benchmark for Multiple-Object Tracking and Counting |
Justin Kay, Peter Kulits, Suzanne Stathatos, Siqi Deng, Erik Young, Sara Beery, Grant Van Horn, Pietro Perona |
|
code |
-1 |
A Dataset for Interactive Vision-Language Navigation with Unknown Command Feasibility |
Andrea Burns, Deniz Arsan, Sanjna Agrawal, Ranjitha Kumar, Kate Saenko, Bryan A. Plummer |
|
code |
-1 |
Dress Code: High-Resolution Multi-category Virtual Try-On |
Davide Morelli, Matteo Fincato, Marcella Cornia, Federico Landi, Fabio Cesari, Rita Cucchiara |
|
code |
-1 |
A Data-Centric Approach for Improving Ambiguous Labels with Combined Semi-supervised Classification and Clustering |
Lars Schmarje, Monty Santarossa, SimonMartin Schröder, Claudius Zelenka, Rainer Kiko, Jenny Stracke, Nina Volkmann, Reinhard Koch |
|
code |
-1 |
ClearPose: Large-scale Transparent Object Dataset and Benchmark |
Xiaotong Chen, Huijie Zhang, Zeren Yu, Anthony Opipari, Odest Chadwicke Jenkins |
|
code |
-1 |
When Deep Classifiers Agree: Analyzing Correlations Between Learning Order and Image Statistics |
Iuliia Pliushch, Martin Mundt, Nicolas Lupp, Visvanathan Ramesh |
|
code |
-1 |
AnimeCeleb: Large-Scale Animation CelebHeads Dataset for Head Reenactment |
Kangyeol Kim, Sunghyun Park, Jaeseong Lee, Sunghyo Chung, Junsoo Lee, Jaegul Choo |
|
code |
-1 |
MUGEN: A Playground for Video-Audio-Text Multimodal Understanding and GENeration |
Thomas Hayes, Songyang Zhang, Xi Yin, Guan Pang, Sasha Sheng, Harry Yang, Songwei Ge, Qiyuan Hu, Devi Parikh |
|
code |
-1 |
A Dense Material Segmentation Dataset for Indoor and Outdoor Scene Parsing |
Paul Upchurch, Ransen Niu |
|
code |
-1 |
MimicME: A Large Scale Diverse 4D Database for Facial Expression Analysis |
Athanasios Papaioannou, Baris Gecer, Shiyang Cheng, Grigorios Chrysos, Jiankang Deng, Eftychia Fotiadou, Christos Kampouris, Dimitrios Kollias, Stylianos Moschoglou, Kritaphat Songsriin, Stylianos Ploumpis, George Trigeorgis, Panagiotis Tzirakis, Evangelos Ververas, Yuxiang Zhou, Allan Ponniah, Anastasios Roussos, Stefanos Zafeiriou |
|
code |
-1 |
Delving into Universal Lesion Segmentation: Method, Dataset, and Benchmark |
Yu Qiu, Jing Xu |
|
code |
-1 |
Large Scale Real-World Multi-person Tracking |
Bing Shuai, Alessandro Bergamo, Uta Büchler, Andrew G. Berneshawi, Alyssa Boden, Joseph Tighe |
|
code |
-1 |
D2-TPred: Discontinuous Dependency for Trajectory Prediction Under Traffic Lights |
Yuzhen Zhang, Wentong Wang, Weizhi Guo, Pei Lv, Mingliang Xu, Wei Chen, Dinesh Manocha |
|
code |
-1 |
The Missing Link: Finding Label Relations Across Datasets |
Jasper R. R. Uijlings, Thomas Mensink, Vittorio Ferrari |
|
code |
-1 |
Learning Omnidirectional Flow in 360$^\circ $ Video via Siamese Representation |
Keshav Bhandari, Bin Duan, Gaowen Liu, Hugo Latapie, Ziliang Zong, Yan Yan |
|
code |
-1 |
VizWiz-FewShot: Locating Objects in Images Taken by People with Visual Impairments |
YuYun Tseng, Alexander Bell, Danna Gurari |
|
code |
-1 |
TRoVE: Transforming Road Scene Datasets into Photorealistic Virtual Environments |
Shubham Dokania, Anbumani Subramanian, Manmohan Chandraker, C. V. Jawahar |
|
code |
-1 |
Trapped in Texture Bias? A Large Scale Comparison of Deep Instance Segmentation |
Johannes Theodoridis, Jessica Hofmann, Johannes Maucher, Andreas Schilling |
|
code |
-1 |
Deformable Feature Aggregation for Dynamic Multi-modal 3D Object Detection |
Zehui Chen, Zhenyu Li, Shiquan Zhang, Liangji Fang, Qinhong Jiang, Feng Zhao |
|
code |
-1 |
WeLSA: Learning to Predict 6D Pose from Weakly Labeled Data Using Shape Alignment |
Shishir Reddy Vutukur, Ivan Shugurov, Benjamin Busam, Andreas Hutter, Slobodan Ilic |
|
code |
-1 |
Graph R-CNN: Towards Accurate 3D Object Detection with Semantic-Decorated Local Graph |
Honghui Yang, Zili Liu, Xiaopei Wu, Wenxiao Wang, Wei Qian, Xiaofei He, Deng Cai |
|
code |
-1 |
MPPNet: Multi-frame Feature Intertwining with Proxy Points for 3D Temporal Object Detection |
Xuesong Chen, Shaoshuai Shi, Benjin Zhu, Ka Chun Cheung, Hang Xu, Hongsheng Li |
|
code |
-1 |
Long-tail Detection with Effective Class-Margins |
Jang Hyun Cho, Philipp Krähenbühl |
|
code |
-1 |
Semi-supervised Monocular 3D Object Detection by Multi-view Consistency |
Qing Lian, Yanbo Xu, Weilong Yao, Yingcong Chen, Tong Zhang |
|
code |
-1 |
PTSEFormer: Progressive Temporal-Spatial Enhanced TransFormer Towards Video Object Detection |
Han Wang, Jun Tang, Xiaodong Liu, Shanyan Guan, Rong Xie, Li Song |
|
code |
-1 |
AU-Aware 3D Face Reconstruction through Personalized AU-Specific Blendshape Learning |
Chenyi Kuang, Zijun Cui, Jeffrey O. Kephart, Qiang Ji |
|
code |
-1 |
BézierPalm: A Free Lunch for Palmprint Recognition |
Kai Zhao, Lei Shen, Yingyi Zhang, Chuhan Zhou, Tao Wang, Ruixin Zhang, Shouhong Ding, Wei Jia, Wei Shen |
|
code |
-1 |
Adaptive Transformers for Robust Few-shot Cross-domain Face Anti-spoofing |
HsinPing Huang, Deqing Sun, Yaojie Liu, WenSheng Chu, Taihong Xiao, Jinwei Yuan, Hartwig Adam, MingHsuan Yang |
|
code |
-1 |
Face2Faceρ: Real-Time High-Resolution One-Shot Face Reenactment |
Kewei Yang, Kang Chen, Daoliang Guo, SongHai Zhang, Yuanchen Guo, Weidong Zhang |
|
code |
-1 |
Towards Racially Unbiased Skin Tone Estimation via Scene Disambiguation |
Haiwen Feng, Timo Bolkart, Joachim Tesch, Michael J. Black, Victoria Fernández Abrevaya |
|
code |
-1 |
BoundaryFace: A Mining Framework with Noise Label Self-correction for Face Recognition |
Shijie Wu, Xun Gong |
|
code |
-1 |
Pre-training Strategies and Datasets for Facial Representation Learning |
Adrian Bulat, Shiyang Cheng, Jing Yang, Andrew Garbett, Enrique SánchezLozano, Georgios Tzimiropoulos |
|
code |
-1 |
Look Both Ways: Self-supervising Driver Gaze Estimation and Road Scene Saliency |
Isaac Kasahara, Simon Stent, Hyun Soo Park |
|
code |
-1 |
MFIM: Megapixel Facial Identity Manipulation |
Sanghyeon Na |
|
code |
-1 |
3D Face Reconstruction with Dense Landmarks |
Erroll Wood, Tadas Baltrusaitis, Charlie Hewitt, Matthew Johnson, Jingjing Shen, Nikola Milosavljevic, Daniel Wilde, Stephan J. Garbin, Toby Sharp, Ivan Stojiljkovic, Tom Cashman, Julien P. C. Valentin |
|
code |
-1 |
Emotion-aware Multi-view Contrastive Learning for Facial Emotion Recognition |
Dae Ha Kim, Byung Cheol Song |
|
code |
-1 |
Order Learning Using Partially Ordered Data via Chainization |
SeonHo Lee, ChangSu Kim |
|
code |
-1 |
Unsupervised High-Fidelity Facial Texture Generation and Reconstruction |
Ron Slossberg, Ibrahim Jubran, Ron Kimmel |
|
code |
-1 |
Multi-domain Learning for Updating Face Anti-spoofing Models |
Xiao Guo, Yaojie Liu, Anil K. Jain, Xiaoming Liu |
|
code |
-1 |
Towards Metrical Reconstruction of Human Faces |
Wojciech Zielonka, Timo Bolkart, Justus Thies |
|
code |
-1 |
Discover and Mitigate Unknown Biases with Debiasing Alternate Networks |
Zhiheng Li, Anthony Hoogs, Chenliang Xu |
|
code |
-1 |
Unsupervised and Semi-supervised Bias Benchmarking in Face Recognition |
Alexandra Chouldechova, Siqi Deng, Yongxin Wang, Wei Xia, Pietro Perona |
|
code |
-1 |
Towards Efficient Adversarial Training on Vision Transformers |
Boxi Wu, Jindong Gu, Zhifeng Li, Deng Cai, Xiaofei He, Wei Liu |
|
code |
-1 |
MIME: Minority Inclusion for Majority Group Enhancement of AI Performance |
Pradyumna Chari, Yunhao Ba, Shreeram S. Athreya, Achuta Kadambi |
|
code |
-1 |
Studying Bias in GANs Through the Lens of Race |
Vongani H. Maluleke, Neerja Thakkar, Tim Brooks, Ethan Weber, Trevor Darrell, Alexei A. Efros, Angjoo Kanazawa, Devin Guillory |
|
code |
-1 |
Trust, but Verify: Using Self-supervised Probing to Improve Trustworthiness |
Ailin Deng, Shen Li, Miao Xiong, Zhirui Chen, Bryan Hooi |
|
code |
-1 |
Learning to Censor by Noisy Sampling |
Ayush Chopra, Abhinav Java, Abhishek Singh, Vivek Sharma, Ramesh Raskar |
|
code |
-1 |
An Invisible Black-Box Backdoor Attack Through Frequency Domain |
Tong Wang, Yuan Yao, Feng Xu, Shengwei An, Hanghang Tong, Ting Wang |
|
code |
-1 |
FairGRAPE: Fairness-Aware GRAdient Pruning mEthod for Face Attribute Classification |
Xiaofeng Lin, Seungbae Kim, Jungseock Joo |
|
code |
-1 |
Attaining Class-Level Forgetting in Pretrained Model Using Few Samples |
Pravendra Singh, Pratik Mazumder, Mohammed Asad Karim |
|
code |
-1 |
Anti-Neuron Watermarking: Protecting Personal Data Against Unauthorized Neural Networks |
Zihang Zou, Boqing Gong, Liqiang Wang |
|
code |
-1 |
An Impartial Take to the CNN vs Transformer Robustness Contest |
Francesco Pinto, Philip H. S. Torr, Puneet K. Dokania |
|
code |
-1 |
Recover Fair Deep Classification Models via Altering Pre-trained Structure |
Yanfu Zhang, Shangqian Gao, Heng Huang |
|
code |
-1 |
Decouple-and-Sample: Protecting Sensitive Information in Task Agnostic Data Release |
Abhishek Singh, Ethan Garza, Ayush Chopra, Praneeth Vepakomma, Vivek Sharma, Ramesh Raskar |
|
code |
-1 |
Privacy-Preserving Action Recognition via Motion Difference Quantization |
Sudhakar Kumawat, Hajime Nagahara |
|
code |
-1 |
Latent Space Smoothing for Individually Fair Representations |
Momchil Peychev, Anian Ruoss, Mislav Balunovic, Maximilian Baader, Martin T. Vechev |
|
code |
-1 |
Parameterized Temperature Scaling for Boosting the Expressive Power in Post-Hoc Uncertainty Calibration |
Christian Tomani, Daniel Cremers, Florian Buettner |
|
code |
-1 |
FairStyle: Debiasing StyleGAN2 with Style Channel Manipulations |
Cemre Karakas, Alara Dirik, Eylul Yalcinkaya, Pinar Yanardag |
|
code |
-1 |
Distilling the Undistillable: Learning from a Nasty Teacher |
Surgan Jandial, Yash Khasbage, Arghya Pal, Vineeth N. Balasubramanian, Balaji Krishnamurthy |
|
code |
-1 |
SOS! Self-supervised Learning over Sets of Handled Objects in Egocentric Action Recognition |
Victor Escorcia, Ricardo Guerrero, Xiatian Zhu, Brais Martínez |
|
code |
-1 |
Egocentric Activity Recognition and Localization on a 3D Map |
Miao Liu, Lingni Ma, Kiran Somasundaram, Yin Li, Kristen Grauman, James M. Rehg, Chao Li |
|
code |
-1 |
Generative Adversarial Network for Future Hand Segmentation from Egocentric Video |
Wenqi Jia, Miao Liu, James M. Rehg |
|
code |
-1 |
My View is the Best View: Procedure Learning from Egocentric Videos |
Siddhant Bansal, Chetan Arora, C. V. Jawahar |
|
code |
-1 |
GIMO: Gaze-Informed Human Motion Prediction in Context |
Yang Zheng, Yanchao Yang, Kaichun Mo, Jiaman Li, Tao Yu, Yebin Liu, C. Karen Liu, Leonidas J. Guibas |
|
code |
-1 |
Image-Based CLIP-Guided Essence Transfer |
Hila Chefer, Sagie Benaim, Roni Paiss, Lior Wolf |
|
code |
-1 |
Detecting and Recovering Sequential DeepFake Manipulation |
Rui Shao, Tianxing Wu, Ziwei Liu |
|
code |
-1 |
Self-supervised Sparse Representation for Video Anomaly Detection |
JhihCiang Wu, HeYen Hsieh, DingJie Chen, ChiouShann Fuh, TyngLuh Liu |
|
code |
-1 |
Adaptive Image Transformations for Transfer-Based Adversarial Attack |
Zheng Yuan, Jie Zhang, Shiguang Shan |
|
code |
-1 |
Generative Multiplane Images: Making a 2D GAN 3D-Aware |
Xiaoming Zhao, Fangchang Ma, David Güera, Zhile Ren, Alexander G. Schwing, Alex Colburn |
|
code |
-1 |
AdvDO: Realistic Adversarial Attacks for Trajectory Prediction |
Yulong Cao, Chaowei Xiao, Anima Anandkumar, Danfei Xu, Marco Pavone |
|
code |
-1 |
Adversarial Contrastive Learning via Asymmetric InfoNCE |
Qiying Yu, Jieming Lou, Xianyuan Zhan, Qizhang Li, Wangmeng Zuo, Yang Liu, Jingjing Liu |
|
code |
-1 |
One Size Does NOT Fit All: Data-Adaptive Adversarial Training |
Shuo Yang, Chang Xu |
|
code |
-1 |
UniCR: Universally Approximated Certified Robustness via Randomized Smoothing |
Hanbin Hong, Binghui Wang, Yuan Hong |
|
code |
-1 |
Hardly Perceptible Trojan Attack Against Neural Networks with Bit Flips |
Jiawang Bai, Kuofeng Gao, Dihong Gong, ShuTao Xia, Zhifeng Li, Wei Liu |
|
code |
-1 |
Robust Network Architecture Search via Feature Distortion Restraining |
Yaguan Qian, Shenghui Huang, Bin Wang, Xiang Ling, Xiaohui Guan, Zhaoquan Gu, Shaoning Zeng, Wujie Zhou, Haijiang Wang |
|
code |
-1 |
SecretGen: Privacy Recovery on Pre-trained Models via Distribution Discrimination |
Zhuowen Yuan, Fan Wu, Yunhui Long, Chaowei Xiao, Bo Li |
|
code |
-1 |
Triangle Attack: A Query-Efficient Decision-Based Adversarial Attack |
Xiaosen Wang, Zeliang Zhang, Kangheng Tong, Dihong Gong, Kun He, Zhifeng Li, Wei Liu |
|
code |
-1 |
Data-Free Backdoor Removal Based on Channel Lipschitzness |
Runkai Zheng, Rongjun Tang, Jianze Li, Li Liu |
|
code |
-1 |
Black-Box Dissector: Towards Erasing-Based Hard-Label Model Stealing Attack |
Yixu Wang, Jie Li, Hong Liu, Yan Wang, Yongjian Wu, Feiyue Huang, Rongrong Ji |
|
code |
-1 |
Learning Energy-Based Models with Adversarial Training |
Xuwang Yin, Shiying Li, Gustavo K. Rohde |
|
code |
-1 |
Adversarial Label Poisoning Attack on Graph Neural Networks via Label Propagation |
Ganlin Liu, Xiaowei Huang, Xinping Yi |
|
code |
-1 |
Revisiting Outer Optimization in Adversarial Training |
Ali Dabouei, Fariborz Taherkhani, Sobhan Soleymani, Nasser M. Nasrabadi |
|
code |
-1 |
Zero-Shot Attribute Attacks on Fine-Grained Recognition Models |
Nasim Shafiee, Ehsan Elhamifar |
|
code |
-1 |
Towards Effective and Robust Neural Trojan Defenses via Input Filtering |
Kien Do, Haripriya Harikumar, Hung Le, Dung Nguyen, Truyen Tran, Santu Rana, Dang Nguyen, Willy Susilo, Svetha Venkatesh |
|
code |
-1 |
Scaling Adversarial Training to Large Perturbation Bounds |
Sravanti Addepalli, Samyak Jain, Gaurang Sriramanan, R. Venkatesh Babu |
|
code |
-1 |
Exploiting the Local Parabolic Landscapes of Adversarial Losses to Accelerate Black-Box Adversarial Attack |
Hoang Tran, Dan Lu, Guannan Zhang |
|
code |
-1 |
Generative Domain Adaptation for Face Anti-Spoofing |
Qianyu Zhou, KeYue Zhang, Taiping Yao, Ran Yi, Kekai Sheng, Shouhong Ding, Lizhuang Ma |
|
code |
-1 |
MetaGait: Learning to Learn an Omni Sample Adaptive Representation for Gait Recognition |
Huanzhang Dou, Pengyi Zhang, Wei Su, Yunlong Yu, Xi Li |
|
code |
-1 |
GaitEdge: Beyond Plain End-to-End Gait Recognition for Better Practicality |
Junhao Liang, Chao Fan, Saihui Hou, Chuanfu Shen, Yongzhen Huang, Shiqi Yu |
|
code |
-1 |
UIA-ViT: Unsupervised Inconsistency-Aware Method Based on Vision Transformer for Face Forgery Detection |
Wanyi Zhuang, Qi Chu, Zhentao Tan, Qiankun Liu, Haojie Yuan, Changtao Miao, Zixiang Luo, Nenghai Yu |
|
code |
-1 |
Effective Presentation Attack Detection Driven by Face Related Task |
Wentian Zhang, Haozhe Liu, Feng Liu, Raghavendra Ramachandra, Christoph Busch |
|
code |
-1 |
PPT: Token-Pruned Pose Transformer for Monocular and Multi-view Human Pose Estimation |
Haoyu Ma, Zhe Wang, Yifei Chen, Deying Kong, Liangjian Chen, Xingwei Liu, Xiangyi Yan, Hao Tang, Xiaohui Xie |
|
code |
-1 |
AvatarPoser: Articulated Full-Body Pose Tracking from Sparse Motion Sensing |
Jiaxi Jiang, Paul Streli, Huajian Qiu, Andreas Fender, Larissa Laich, Patrick Snape, Christian Holz |
|
code |
-1 |
P-STMO: Pre-trained Spatial Temporal Many-to-One Model for 3D Human Pose Estimation |
Wenkang Shan, Zhenhua Liu, Xinfeng Zhang, Shanshe Wang, Siwei Ma, Wen Gao |
|
code |
-1 |
D &D: Learning Human Dynamics from Dynamic Camera |
Jiefeng Li, Siyuan Bian, Chao Xu, Gang Liu, Gang Yu, Cewu Lu |
|
code |
-1 |
Explicit Occlusion Reasoning for Multi-person 3D Human Pose Estimation |
Qihao Liu, Yi Zhang, Song Bai, Alan L. Yuille |
|
code |
-1 |
COUCH: Towards Controllable Human-Chair Interactions |
Xiaohan Zhang, Bharat Lal Bhatnagar, Sebastian Starke, Vladimir Guzov, Gerard PonsMoll |
|
code |
-1 |
Identity-Aware Hand Mesh Estimation and Personalization from RGB Images |
Deying Kong, Linguang Zhang, Liangjian Chen, Haoyu Ma, Xiangyi Yan, Shanlin Sun, Xingwei Liu, Kun Han, Xiaohui Xie |
|
code |
-1 |
C3P: Cross-Domain Pose Prior Propagation for Weakly Supervised 3D Human Pose Estimation |
Cunlin Wu, Yang Xiao, Boshen Zhang, Mingyang Zhang, Zhiguo Cao, Joey Tianyi Zhou |
|
code |
-1 |
Pose-NDF: Modeling Human Pose Manifolds with Neural Distance Fields |
Garvita Tiwari, Dimitrije Antic, Jan Eric Lenssen, Nikolaos Sarafianos, Tony Tung, Gerard PonsMoll |
|
code |
-1 |
CLIFF: Carrying Location Information in Full Frames into Human Pose and Shape Estimation |
Zhihao Li, Jianzhuang Liu, Zhensong Zhang, Songcen Xu, Youliang Yan |
|
code |
-1 |
DeciWatch: A Simple Baseline for 10˟ Efficient 2D and 3D Pose Estimation |
Ailing Zeng, Xuan Ju, Lei Yang, Ruiyuan Gao, Xizhou Zhu, Bo Dai, Qiang Xu |
|
code |
-1 |
SmoothNet: A Plug-and-Play Network for Refining Human Poses in Videos |
Ailing Zeng, Lei Yang, Xuan Ju, Jiefeng Li, Jianyi Wang, Qiang Xu |
|
code |
-1 |
PoseTrans: A Simple yet Effective Pose Transformation Augmentation for Human Pose Estimation |
Wentao Jiang, Sheng Jin, Wentao Liu, Chen Qian, Ping Luo, Si Liu |
|
code |
-1 |
Multi-Person 3D Pose and Shape Estimation via Inverse Kinematics and Refinement |
Junuk Cha, Muhammad Saqlain, GeonU Kim, Mingyu Shin, Seungryul Baek |
|
code |
-1 |
Overlooked Poses Actually Make Sense: Distilling Privileged Knowledge for Human Motion Prediction |
Xiaoning Sun, Qiongjie Cui, Huaijiang Sun, Bin Li, Weiqing Li, Jianfeng Lu |
|
code |
-1 |
Structural Triangulation: A Closed-Form Solution to Constrained 3D Human Pose Estimation |
Zhuo Chen, Xu Zhao, Xiaoyue Wan |
|
code |
-1 |
Audio-Driven Stylized Gesture Generation with Flow-Based Model |
Sheng Ye, YuHui Wen, Yanan Sun, Ying He, Ziyang Zhang, Yaoyuan Wang, Weihua He, YongJin Liu |
|
code |
-1 |
Self-Constrained Inference Optimization on Structural Groups for Human Pose Estimation |
Zhehan Kan, Shuoshuo Chen, Zeng Li, Zhihai He |
|
code |
-1 |
A Simple Approach and Benchmark for 21, 000-Category Object Detection |
Yutong Lin, Chen Li, Yue Cao, Zheng Zhang, Jianfeng Wang, Lijuan Wang, Zicheng Liu, Han Hu |
|
code |
-1 |
Knowledge Condensation Distillation |
Chenxin Li, Mingbao Lin, Zhiyuan Ding, Nie Lin, Yihong Zhuang, Yue Huang, Xinghao Ding, Liujuan Cao |
|
code |
-1 |
Reducing Information Loss for Spiking Neural Networks |
Yufei Guo, Yuanpei Chen, Liwen Zhang, YingLei Wang, Xiaode Liu, Xinyi Tong, Yuanyuan Ou, Xuhui Huang, Zhe Ma |
|
code |
-1 |
Masked Generative Distillation |
Zhendong Yang, Zhe Li, Mingqi Shao, Dachuan Shi, Zehuan Yuan, Chun Yuan |
|
code |
-1 |
Fine-grained Data Distribution Alignment for Post-Training Quantization |
Yunshan Zhong, Mingbao Lin, Mengzhao Chen, Ke Li, Yunhang Shen, Fei Chao, Yongjian Wu, Rongrong Ji |
|
code |
-1 |
Learning with Recoverable Forgetting |
Jingwen Ye, Yifang Fu, Jie Song, Xingyi Yang, Songhua Liu, Xin Jin, Mingli Song, Xinchao Wang |
|
code |
-1 |
Efficient One Pass Self-distillation with Zipf's Label Smoothing |
Jiajun Liang, Linze Li, Zhaodong Bing, Borui Zhao, Yao Tang, Bo Lin, Haoqiang Fan |
|
code |
-1 |
Prune Your Model Before Distill It |
Jinhyuk Park, Albert No |
|
code |
-1 |
Deep Partial Updating: Towards Communication Efficient Updating for On-Device Inference |
Zhongnan Qu, Cong Liu, Lothar Thiele |
|
code |
-1 |
Patch Similarity Aware Data-Free Quantization for Vision Transformers |
Zhikai Li, Liping Ma, Mengjuan Chen, Junrui Xiao, Qingyi Gu |
|
code |
-1 |
L3: Accelerator-Friendly Lossless Image Format for High-Resolution, High-Throughput DNN Training |
Jonghyun Bae, Woohyeon Baek, Tae Jun Ham, Jae W. Lee |
|
code |
-1 |
Streaming Multiscale Deep Equilibrium Models |
Can Ufuk Ertenli, Emre Akbas, Ramazan Gokberk Cinbis |
|
code |
-1 |
Symmetry Regularization and Saturating Nonlinearity for Robust Quantization |
Sein Park, Yeongsang Jang, Eunhyeok Park |
|
code |
-1 |
SP-Net: Slowly Progressing Dynamic Inference Networks |
Huanyu Wang, Wenhu Zhang, Shihao Su, Hui Wang, Zhenwei Miao, Xin Zhan, Xi Li |
|
code |
-1 |
Equivariance and Invariance Inductive Bias for Learning from Insufficient Data |
Tan Wang, Qianru Sun, Sugiri Pranata, Jayashree Karlekar, Hanwang Zhang |
|
code |
-1 |
Mixed-Precision Neural Network Quantization via Learned Layer-Wise Importance |
Chen Tang, Kai Ouyang, Zhi Wang, Yifei Zhu, Wen Ji, Yaowei Wang, Wenwu Zhu |
|
code |
-1 |
Event Neural Networks |
Matthew Dutson, Yin Li, Mohit Gupta |
|
code |
-1 |
EdgeViTs: Competing Light-Weight CNNs on Mobile Devices with Vision Transformers |
Junting Pan, Adrian Bulat, Fuwen Tan, Xiatian Zhu, Lukasz Dudziak, Hongsheng Li, Georgios Tzimiropoulos, Brais Martínez |
|
code |
-1 |
PalQuant: Accelerating High-Precision Networks on Low-Precision Accelerators |
Qinghao Hu, Gang Li, Qiman Wu, Jian Cheng |
|
code |
-1 |
Disentangled Differentiable Network Pruning |
Shangqian Gao, Feihu Huang, Yanfu Zhang, Heng Huang |
|
code |
-1 |
IDa-Det: An Information Discrepancy-Aware Distillation for 1-Bit Detectors |
Sheng Xu, Yanjing Li, Bohan Zeng, Teli Ma, Baochang Zhang, Xianbin Cao, Peng Gao, Jinhu Lü |
|
code |
-1 |
Learning to Weight Samples for Dynamic Early-Exiting Networks |
Yizeng Han, Yifan Pu, Zihang Lai, Chaofei Wang, Shiji Song, Junfen Cao, Wenhui Huang, Chao Deng, Gao Huang |
|
code |
-1 |
AdaBin: Improving Binary Neural Networks with Adaptive Binary Sets |
Zhijun Tu, Xinghao Chen, Pengju Ren, Yunhe Wang |
|
code |
-1 |
Adaptive Token Sampling for Efficient Vision Transformers |
Mohsen Fayyaz, Soroush Abbasi Koohpayegani, Farnoush Rezaei Jafari, Sunando Sengupta, Hamid Reza Vaezi Joze, Eric Sommerlade, Hamed Pirsiavash, Jürgen Gall |
|
code |
-1 |
Weight Fixing Networks |
Christopher SubiaWaud, Srinandan Dasmahapatra |
|
code |
-1 |
Self-slimmed Vision Transformer |
Zhuofan Zong, Kunchang Li, Guanglu Song, Yali Wang, Yu Qiao, Biao Leng, Yu Liu |
|
code |
-1 |
Switchable Online Knowledge Distillation |
Biao Qian, Yang Wang, Hongzhi Yin, Richang Hong, Meng Wang |
|
code |
-1 |
ℓ ∞-Robustness and Beyond: Unleashing Efficient Adversarial Training |
Hadi M. Dolatabadi, Sarah M. Erfani, Christopher Leckie |
|
code |
-1 |
Multi-granularity Pruning for Model Acceleration on Mobile Devices |
Tianli Zhao, Xi Sheryl Zhang, Wentao Zhu, Jiaxing Wang, Sen Yang, Ji Liu, Jian Cheng |
|
code |
-1 |
Deep Ensemble Learning by Diverse Knowledge Distillation for Fine-Grained Object Classification |
Naoki Okamoto, Tsubasa Hirakawa, Takayoshi Yamashita, Hironobu Fujiyoshi |
|
code |
-1 |
Helpful or Harmful: Inter-task Association in Continual Learning |
Hyundong Jin, Eunwoo Kim |
|
code |
-1 |
Towards Accurate Binary Neural Networks via Modeling Contextual Dependencies |
Xingrun Xing, Yangguang Li, Wei Li, Wenrui Ding, Yalong Jiang, Yufeng Wang, Jing Shao, Chunlei Liu, Xianglong Liu |
|
code |
-1 |
SPIN: An Empirical Evaluation on Sharing Parameters of Isotropic Networks |
ChienYu Lin, Anish Prabhu, Thomas Merth, Sachin Mehta, Anurag Ranjan, Maxwell Horton, Mohammad Rastegari |
|
code |
-1 |
Ensemble Knowledge Guided Sub-network Search and Fine-Tuning for Filter Pruning |
Seunghyun Lee, Byung Cheol Song |
|
code |
-1 |
Network Binarization via Contrastive Learning |
Yuzhang Shang, Dan Xu, Ziliang Zong, Liqiang Nie, Yan Yan |
|
code |
-1 |
Lipschitz Continuity Retained Binary Neural Network |
Yuzhang Shang, Dan Xu, Bin Duan, Ziliang Zong, Liqiang Nie, Yan Yan |
|
code |
-1 |
SPViT: Enabling Faster Vision Transformers via Latency-Aware Soft Token Pruning |
Zhenglun Kong, Peiyan Dong, Xiaolong Ma, Xin Meng, Wei Niu, Mengshu Sun, Xuan Shen, Geng Yuan, Bin Ren, Hao Tang, Minghai Qin, Yanzhi Wang |
|
code |
-1 |
Soft Masking for Cost-Constrained Channel Pruning |
Ryan Humble, Maying Shen, Jorge Albericio Latorre, Eric Darve, Jose M. Alvarez |
|
code |
-1 |
Non-uniform Step Size Quantization for Accurate Post-training Quantization |
Sangyun Oh, Hyeonuk Sim, Jounghyun Kim, Jongeun Lee |
|
code |
-1 |
SuperTickets: Drawing Task-Agnostic Lottery Tickets from Supernets via Jointly Architecture Searching and Parameter Pruning |
Haoran You, Baopu Li, Zhanyi Sun, Xu Ouyang, Yingyan Lin |
|
code |
-1 |
Meta-GF: Training Dynamic-Depth Neural Networks Harmoniously |
Yi Sun, Jian Li, Xin Xu |
|
code |
-1 |
Towards Ultra Low Latency Spiking Neural Networks for Vision and Sequential Tasks Using Temporal Pruning |
Sayeed Shafayet Chowdhury, Nitin Rathi, Kaushik Roy |
|
code |
-1 |
Towards Accurate Network Quantization with Equivalent Smooth Regularizer |
Kirill Solodskikh, Vladimir Chikin, Ruslan Aydarkhanov, Dehua Song, Irina Zhelavskaya, Jiansheng Wei |
|
code |
-1 |
DFNet: Enhance Absolute Pose Regression with Direct Feature Matching |
Shuai Chen, Xinghui Li, Zirui Wang, Victor Adrian Prisacariu |
|
code |
-1 |
Cornerformer: Purifying Instances for Corner-Based Detectors |
Haoran Wei, Xin Chen, Lingxi Xie, Qi Tian |
|
code |
-1 |
PillarNet: Real-Time and High-Performance Pillar-Based 3D Object Detection |
Guangsheng Shi, Ruifeng Li, Chao Ma |
|
code |
-1 |
Robust Object Detection with Inaccurate Bounding Boxes |
Chengxin Liu, Kewei Wang, Hao Lu, Zhiguo Cao, Ziming Zhang |
|
code |
-1 |
Efficient Decoder-Free Object Detection with Transformers |
Peixian Chen, Mengdan Zhang, Yunhang Shen, Kekai Sheng, Yuting Gao, Xing Sun, Ke Li, Chunhua Shen |
|
code |
-1 |
Cross-Modality Knowledge Distillation Network for Monocular 3D Object Detection |
Yu Hong, Hang Dai, Yong Ding |
|
code |
-1 |
ReAct: Temporal Action Detection with Relational Queries |
Dingfeng Shi, Yujie Zhong, Qiong Cao, Jing Zhang, Lin Ma, Jia Li, Dacheng Tao |
|
code |
-1 |
Towards Accurate Active Camera Localization |
Qihang Fang, Yingda Yin, Qingnan Fan, Fei Xia, Siyan Dong, Sheng Wang, Jue Wang, Leonidas J. Guibas, Baoquan Chen |
|
code |
-1 |
Camera Pose Auto-encoders for Improving Pose Regression |
Yoli Shavit, Yosi Keller |
|
code |
-1 |
Improving the Intra-class Long-Tail in 3D Detection via Rare Example Mining |
Chiyu Max Jiang, Mahyar Najibi, Charles R. Qi, Yin Zhou, Dragomir Anguelov |
|
code |
-1 |
Bagging Regional Classification Activation Maps for Weakly Supervised Object Localization |
Lei Zhu, Qian Chen, Lujia Jin, Yunfei You, Yanye Lu |
|
code |
-1 |
UC-OWOD: Unknown-Classified Open World Object Detection |
Zhiheng Wu, Yue Lu, Xingyu Chen, Zhengxing Wu, Liwen Kang, Junzhi Yu |
|
code |
-1 |
RayTran: 3D Pose Estimation and Shape Reconstruction of Multiple Objects from Videos with Ray-Traced Transformers |
Michal J. Tyszkiewicz, KevisKokitsi Maninis, Stefan Popov, Vittorio Ferrari |
|
code |
-1 |
GTCaR: Graph Transformer for Camera Re-localization |
Xinyi Li, Haibin Ling |
|
code |
-1 |
3D Object Detection with a Self-supervised Lidar Scene Flow Backbone |
Emeç Erçelik, Ekim Yurtsever, Mingyu Liu, Zhijie Yang, Hanzhen Zhang, Pinar Topçam, Maximilian Listl, Yilmaz Kaan Çayli, Alois C. Knoll |
|
code |
-1 |
Open Vocabulary Object Detection with Pseudo Bounding-Box Labels |
Mingfei Gao, Chen Xing, Juan Carlos Niebles, Junnan Li, Ran Xu, Wenhao Liu, Caiming Xiong |
|
code |
-1 |
Few-Shot Object Detection by Knowledge Distillation Using Bag-of-Visual-Words Representations |
Wenjie Pei, Shuang Wu, Dianwen Mei, Fanglin Chen, Jiandong Tian, Guangming Lu |
|
code |
-1 |
SALISA: Saliency-Based Input Sampling for Efficient Video Object Detection |
Babak Ehteshami Bejnordi, Amirhossein Habibian, Fatih Porikli, Amir Ghodrati |
|
code |
-1 |
ECO-TR: Efficient Correspondences Finding via Coarse-to-Fine Refinement |
Dongli Tan, JiangJiang Liu, Xingyu Chen, Chao Chen, Ruixin Zhang, Yunhang Shen, Shouhong Ding, Rongrong Ji |
|
code |
-1 |
Vote from the Center: 6 DoF Pose Estimation in RGB-D Images by Radial Keypoint Voting |
Yangzheng Wu, Mohsen Zand, Ali Etemad, Michael A. Greenspan |
|
code |
-1 |
Long-Tailed Instance Segmentation Using Gumbel Optimized Loss |
Konstantinos Panagiotis Alexandridis, Jiankang Deng, Anh Nguyen, Shan Luo |
|
code |
-1 |
DetMatch: Two Teachers are Better than One for Joint 2D and 3D Semi-Supervised Object Detection |
Jinhyung Park, Chenfeng Xu, Yiyang Zhou, Masayoshi Tomizuka, Wei Zhan |
|
code |
-1 |
ObjectBox: From Centers to Boxes for Anchor-Free Object Detection |
Mohsen Zand, Ali Etemad, Michael A. Greenspan |
|
code |
-1 |
Is Geometry Enough for Matching in Visual Localization? |
Qunjie Zhou, Sérgio Agostinho, Aljosa Osep, Laura LealTaixé |
|
code |
-1 |
SWFormer: Sparse Window Transformer for 3D Object Detection in Point Clouds |
Pei Sun, Mingxing Tan, Weiyue Wang, Chenxi Liu, Fei Xia, Zhaoqi Leng, Dragomir Anguelov |
|
code |
-1 |
PCR-CG: Point Cloud Registration via Deep Explicit Color and Geometry |
Yu Zhang, Junle Yu, Xiaolin Huang, Wenhui Zhou, Ji Hou |
|
code |
-1 |
GLAMD: Global and Local Attention Mask Distillation for Object Detectors |
Younho Jang, Wheemyung Shin, Jinbeom Kim, Simon S. Woo, SungHo Bae |
|
code |
-1 |
FCAF3D: Fully Convolutional Anchor-Free 3D Object Detection |
Danila Rukhovich, Anna Vorontsova, Anton Konushin |
|
code |
-1 |
Video Anomaly Detection by Solving Decoupled Spatio-Temporal Jigsaw Puzzles |
Guodong Wang, Yunhong Wang, Jie Qin, Dongming Zhang, Xiuguo Bao, Di Huang |
|
code |
-1 |
Class-Agnostic Object Detection with Multi-modal Transformer |
Muhammad Maaz, Hanoona Abdul Rasheed, Salman Khan, Fahad Shahbaz Khan, Rao Muhammad Anwer, MingHsuan Yang |
|
code |
-1 |
Enhancing Multi-modal Features Using Local Self-attention for 3D Object Detection |
Hao Li, Zehan Zhang, Xian Zhao, Yulong Wang, Yuxi Shen, Shiliang Pu, Hui Mao |
|
code |
-1 |
Object Detection as Probabilistic Set Prediction |
Georg Hess, Christoffer Petersson, Lennart Svensson |
|
code |
-1 |
Weakly-Supervised Temporal Action Detection for Fine-Grained Videos with Hierarchical Atomic Actions |
Zhi Li, Lu He, Huijuan Xu |
|
code |
-1 |
Neural Correspondence Field for Object Pose Estimation |
Lin Huang, Tomas Hodan, Lingni Ma, Linguang Zhang, Luan Tran, Christopher D. Twigg, PoChen Wu, Junsong Yuan, Cem Keskin, Robert Wang |
|
code |
-1 |
On Label Granularity and Object Localization |
Elijah Cole, Kimberly Wilber, Grant Van Horn, Xuan Yang, Marco Fornoni, Pietro Perona, Serge J. Belongie, Andrew G. Howard, Oisin Mac Aodha |
|
code |
-1 |
OIMNet++: Prototypical Normalization and Localization-Aware Learning for Person Search |
Sanghoon Lee, Youngmin Oh, Donghyeon Baek, Junghyup Lee, Bumsub Ham |
|
code |
-1 |
Out-of-Distribution Identification: Let Detector Tell Which I Am Not Sure |
Ruoqi Li, Chongyang Zhang, Hao Zhou, Chao Shi, Yan Luo |
|
code |
-1 |
Learning with Free Object Segments for Long-Tailed Instance Segmentation |
Cheng Zhang, TaiYu Pan, Tianle Chen, Jike Zhong, Wenjin Fu, WeiLun Chao |
|
code |
-1 |
Autoregressive Uncertainty Modeling for 3D Bounding Box Prediction |
Yuxuan Liu, Nikhil Mishra, Maximilian Sieb, Yide Shentu, Pieter Abbeel, Xi Chen |
|
code |
-1 |
3D Random Occlusion and Multi-layer Projection for Deep Multi-camera Pedestrian Localization |
Rui Qiu, Ming Xu, Yuyao Yan, Jeremy S. Smith, Xi Yang |
|
code |
-1 |
A Simple Single-Scale Vision Transformer for Object Detection and Instance Segmentation |
Wuyang Chen, Xianzhi Du, Fan Yang, Lucas Beyer, Xiaohua Zhai, TsungYi Lin, Huizhong Chen, Jing Li, Xiaodan Song, Zhangyang Wang, Denny Zhou |
|
code |
-1 |
Simple Open-Vocabulary Object Detection |
Matthias Minderer, Alexey A. Gritsenko, Austin Stone, Maxim Neumann, Dirk Weissenborn, Alexey Dosovitskiy, Aravindh Mahendran, Anurag Arnab, Mostafa Dehghani, Zhuoran Shen, Xiao Wang, Xiaohua Zhai, Thomas Kipf, Neil Houlsby |
|
code |
-1 |
UnrealEgo: A New Dataset for Robust Egocentric 3D Human Motion Capture |
Hiroyasu Akada, Jian Wang, Soshi Shimada, Masaki Takahashi, Christian Theobalt, Vladislav Golyanik |
|
code |
-1 |
Skeleton-Parted Graph Scattering Networks for 3D Human Motion Prediction |
Maosen Li, Siheng Chen, Zijing Zhang, Lingxi Xie, Qi Tian, Ya Zhang |
|
code |
-1 |
Rethinking Keypoint Representations: Modeling Keypoints and Poses as Objects for Multi-person Human Pose Estimation |
William J. McNally, Kanav Vats, Alexander Wong, John McPhee |
|
code |
-1 |
VirtualPose: Learning Generalizable 3D Human Pose Models from Virtual Data |
Jiajun Su, Chunyu Wang, Xiaoxuan Ma, Wenjun Zeng, Yizhou Wang |
|
code |
-1 |
Poseur: Direct Human Pose Regression with Transformers |
Weian Mao, Yongtao Ge, Chunhua Shen, Zhi Tian, Xinlong Wang, Zhibin Wang, Anton van den Hengel |
|
code |
-1 |
SimCC: A Simple Coordinate Classification Perspective for Human Pose Estimation |
Yanjie Li, Sen Yang, Peidong Liu, Shoukui Zhang, Yunxiao Wang, Zhicheng Wang, Wankou Yang, ShuTao Xia |
|
code |
-1 |
Regularizing Vector Embedding in Bottom-Up Human Pose Estimation |
Haixin Wang, Lu Zhou, Yingying Chen, Ming Tang, Jinqiao Wang |
|
code |
-1 |
A Visual Navigation Perspective for Category-Level Object Pose Estimation |
Jiaxin Guo, Fangxun Zhong, Rong Xiong, Yunhui Liu, Yue Wang, Yiyi Liao |
|
code |
-1 |
Faster VoxelPose: Real-time 3D Human Pose Estimation by Orthographic Projection |
Hang Ye, Wentao Zhu, Chunyu Wang, Rujie Wu, Yizhou Wang |
|
code |
-1 |
Learning to Fit Morphable Models |
Vasileios Choutas, Federica Bogo, Jingjing Shen, Julien P. C. Valentin |
|
code |
-1 |
EgoBody: Human Body Shape and Motion of Interacting People from Head-Mounted Devices |
Siwei Zhang, Qianli Ma, Yan Zhang, Zhiyin Qian, Taein Kwon, Marc Pollefeys, Federica Bogo, Siyu Tang |
|
code |
-1 |
Grasp'D: Differentiable Contact-Rich Grasp Synthesis for Multi-Fingered Hands |
Dylan Turpin, Liquan Wang, Eric Heiden, YunChun Chen, Miles Macklin, Stavros Tsogkas, Sven J. Dickinson, Animesh Garg |
|
code |
-1 |
AutoAvatar: Autoregressive Neural Fields for Dynamic Avatar Modeling |
Ziqian Bai, Timur M. Bagautdinov, Javier Romero, Michael Zollhöfer, Ping Tan, Shunsuke Saito |
|
code |
-1 |
Deep Radial Embedding for Visual Sequence Learning |
Yuecong Min, Peiqi Jiao, Yanan Li, Xiaotao Wang, Lei Lei, Xiujuan Chai, Xilin Chen |
|
code |
-1 |
SAGA: Stochastic Whole-Body Grasping with Contact |
Yan Wu, Jiahao Wang, Yan Zhang, Siwei Zhang, Otmar Hilliges, Fisher Yu, Siyu Tang |
|
code |
-1 |
Neural Capture of Animatable 3D Human from Monocular Video |
Gusi Te, Xiu Li, Xiao Li, Jinglu Wang, Wei Hu, Yan Lu |
|
code |
-1 |
General Object Pose Transformation Network from Unpaired Data |
Yukun Su, Guosheng Lin, Ruizhou Sun, Qingyao Wu |
|
code |
-1 |
Compositional Human-Scene Interaction Synthesis with Semantic Control |
Kaifeng Zhao, Shaofei Wang, Yan Zhang, Thabo Beeler, Siyu Tang |
|
code |
-1 |
PressureVision: Estimating Hand Pressure from a Single RGB Image |
Patrick Grady, Chengcheng Tang, Samarth Brahmbhatt, Christopher D. Twigg, Chengde Wan, James Hays, Charles C. Kemp |
|
code |
-1 |
PoseScript: 3D Human Poses from Natural Language |
Ginger Delmas, Philippe Weinzaepfel, Thomas Lucas, Francesc MorenoNoguer, Grégory Rogez |
|
code |
-1 |
DProST: Dynamic Projective Spatial Transformer Network for 6D Pose Estimation |
Jaewoo Park, Nam Ik Cho |
|
code |
-1 |
3D Interacting Hand Pose Estimation by Hand De-occlusion and Removal |
Hao Meng, Sheng Jin, Wentao Liu, Chen Qian, Mengxiang Lin, Wanli Ouyang, Ping Luo |
|
code |
-1 |
Pose for Everything: Towards Category-Agnostic Pose Estimation |
Lumin Xu, Sheng Jin, Wang Zeng, Wentao Liu, Chen Qian, Wanli Ouyang, Ping Luo, Xiaogang Wang |
|
code |
-1 |
PoseGPT: Quantization-Based 3D Human Motion Generation and Forecasting |
Thomas Lucas, Fabien Baradel, Philippe Weinzaepfel, Grégory Rogez |
|
code |
-1 |
DH-AUG: DH Forward Kinematics Model Driven Augmentation for 3D Human Pose Estimation |
Linzhi Huang, Jiahao Liang, Weihong Deng |
|
code |
-1 |
Estimating Spatially-Varying Lighting in Urban Scenes with Disentangled Representation |
Jiajun Tang, Yongjie Zhu, Haoyu Wang, Jun Hoong Chan, Si Li, Boxin Shi |
|
code |
-1 |
Boosting Event Stream Super-Resolution with a Recurrent Neural Network |
Wenming Weng, Yueyi Zhang, Zhiwei Xiong |
|
code |
-1 |
Projective Parallel Single-Pixel Imaging to Overcome Global Illumination in 3D Structure Light Scanning |
Yuxi Li, Huijie Zhao, Hongzhi Jiang, Xudong Li |
|
code |
-1 |
Semantic-Sparse Colorization Network for Deep Exemplar-Based Colorization |
Yunpeng Bai, Chao Dong, Zenghao Chai, Andong Wang, Zhengzhuo Xu, Chun Yuan |
|
code |
-1 |
Practical and Scalable Desktop-Based High-Quality Facial Capture |
Alexandros Lattas, Yiming Lin, Jayanth Kannan, Ekin Ozturk, Luca Filipi, Giuseppe Claudio Guarnera, Gaurav Chawla, Abhijeet Ghosh |
|
code |
-1 |
FAST-VQA: Efficient End-to-End Video Quality Assessment with Fragment Sampling |
Haoning Wu, Chaofeng Chen, Jingwen Hou, Liang Liao, Annan Wang, Wenxiu Sun, Qiong Yan, Weisi Lin |
|
code |
-1 |
Physically-Based Editing of Indoor Scene Lighting from a Single Image |
Zhengqin Li, Jia Shi, Sai Bi, Rui Zhu, Kalyan Sunkavalli, Milos Hasan, Zexiang Xu, Ravi Ramamoorthi, Manmohan Chandraker |
|
code |
-1 |
LEDNet: Joint Low-Light Enhancement and Deblurring in the Dark |
Shangchen Zhou, Chongyi Li, Chen Change Loy |
|
code |
-1 |
MPIB: An MPI-Based Bokeh Rendering Framework for Realistic Partial Occlusion Effects |
Juewen Peng, Jianming Zhang, Xianrui Luo, Hao Lu, Ke Xian, Zhiguo Cao |
|
code |
-1 |
Real-RawVSR: Real-World Raw Video Super-Resolution with a Benchmark Dataset |
Huanjing Yue, Zhiming Zhang, JingYu Yang |
|
code |
-1 |
Transform Your Smartphone into a DSLR Camera: Learning the ISP in the Wild |
Ardhendu Shekhar Tripathi, Martin Danelljan, Samarth Shukla, Radu Timofte, Luc Van Gool |
|
code |
-1 |
Learning Deep Non-blind Image Deconvolution Without Ground Truths |
Yuhui Quan, Zhuojie Chen, Huan Zheng, Hui Ji |
|
code |
-1 |
NEST: Neural Event Stack for Event-Based Image Enhancement |
Minggui Teng, Chu Zhou, Hanyue Lou, Boxin Shi |
|
code |
-1 |
Editable Indoor Lighting Estimation |
Henrique Weber, Mathieu Garon, JeanFrançois Lalonde |
|
code |
-1 |
Fast Two-Step Blind Optical Aberration Correction |
Thomas Eboli, JeanMichel Morel, Gabriele Facciolo |
|
code |
-1 |
Seeing Far in the Dark with Patterned Flash |
Zhanghao Sun, Jian Wang, Yicheng Wu, Shree Nayar |
|
code |
-1 |
PseudoClick: Interactive Image Segmentation with Click Imitation |
Qin Liu, Meng Zheng, Benjamin Planche, Srikrishna Karanam, Terrence Chen, Marc Niethammer, Ziyan Wu |
|
code |
-1 |
WaveGAN: Frequency-Aware GAN for High-Fidelity Few-Shot Image Generation |
Mengping Yang, Zhe Wang, Ziqiu Chi, Wenyi Feng |
|
code |
-1 |
End-to-End Visual Editing with a Generatively Pre-trained Artist |
Andrew Brown, ChengYang Fu, Omkar M. Parkhi, Tamara L. Berg, Andrea Vedaldi |
|
code |
-1 |
High-Fidelity GAN Inversion with Padding Space |
Qingyan Bai, Yinghao Xu, Jiapeng Zhu, Weihao Xia, Yujiu Yang, Yujun Shen |
|
code |
-1 |
Designing One Unified Framework for High-Fidelity Face Reenactment and Swapping |
Chao Xu, Jiangning Zhang, Yue Han, Guanzhong Tian, Xianfang Zeng, Ying Tai, Yabiao Wang, Chengjie Wang, Yong Liu |
|
code |
-1 |
Sobolev Training for Implicit Neural Representations with Approximated Image Derivatives |
Wentao Yuan, Qingtian Zhu, Xiangyue Liu, Yikang Ding, Haotian Zhang, Chi Zhang |
|
code |
-1 |
Make-A-Scene: Scene-Based Text-to-Image Generation with Human Priors |
Oran Gafni, Adam Polyak, Oron Ashual, Shelly Sheynin, Devi Parikh, Yaniv Taigman |
|
code |
-1 |
3D-FM GAN: Towards 3D-Controllable Face Manipulation |
Yuchen Liu, Zhixin Shu, Yijun Li, Zhe Lin, Richard Zhang, SunYuan Kung |
|
code |
-1 |
Multi-Curve Translator for High-Resolution Photorealistic Image Translation |
Yuda Song, Hui Qian, Xin Du |
|
code |
-1 |
Deep Bayesian Video Frame Interpolation |
Zhiyang Yu, Yu Zhang, Xujie Xiang, Dongqing Zou, Xijun Chen, Jimmy S. Ren |
|
code |
-1 |
Cross Attention Based Style Distribution for Controllable Person Image Synthesis |
Xinyue Zhou, Mingyu Yin, Xinyuan Chen, Li Sun, Changxin Gao, Qingli Li |
|
code |
-1 |
KeypointNeRF: Generalizing Image-Based Volumetric Avatars Using Relative Spatial Encoding of Keypoints |
Marko Mihajlovic, Aayush Bansal, Michael Zollhöfer, Siyu Tang, Shunsuke Saito |
|
code |
-1 |
ViewFormer: NeRF-Free Neural Rendering from Few Images Using Transformers |
Jonás Kulhánek, Erik Derner, Torsten Sattler, Robert Babuska |
|
code |
-1 |
L-Tracing: Fast Light Visibility Estimation on Neural Surfaces by Sphere Tracing |
Ziyu Chen, Chenjing Ding, Jianfei Guo, Dongliang Wang, Yikang Li, Xuan Xiao, Wei Wu, Li Song |
|
code |
-1 |
A Perceptual Quality Metric for Video Frame Interpolation |
Qiqi Hou, Abhijay Ghildyal, Feng Liu |
|
code |
-1 |
Adaptive Feature Interpolation for Low-Shot Image Generation |
Mengyu Dai, Haibin Hang, Xiaoyang Guo |
|
code |
-1 |
PalGAN: Image Colorization with Palette Generative Adversarial Networks |
Yi Wang, Menghan Xia, Lu Qi, Jing Shao, Yu Qiao |
|
code |
-1 |
Fast-Vid2Vid: Spatial-Temporal Compression for Video-to-Video Synthesis |
Long Zhuo, Guangcong Wang, Shikai Li, Wayne Wu, Ziwei Liu |
|
code |
-1 |
Learning Prior Feature and Attention Enhanced Image Inpainting |
Chenjie Cao, Qiaole Dong, Yanwei Fu |
|
code |
-1 |
Temporal-MPI: Enabling Multi-plane Images for Dynamic Scene Modelling via Temporal Basis Learning |
Wenpeng Xing, Jie Chen |
|
code |
-1 |
3D-Aware Semantic-Guided Generative Model for Human Synthesis |
Jichao Zhang, Enver Sangineto, Hao Tang, Aliaksandr Siarohin, Zhun Zhong, Nicu Sebe, Wei Wang |
|
code |
-1 |
Temporally Consistent Semantic Video Editing |
Yiran Xu, Badour AlBahar, JiaBin Huang |
|
code |
-1 |
Error Compensation Framework for Flow-Guided Video Inpainting |
Jaeyeon Kang, Seoung Wug Oh, Seon Joo Kim |
|
code |
-1 |
Scraping Textures from Natural Images for Synthesis and Editing |
Xueting Li, Xiaolong Wang, MingHsuan Yang, Alexei A. Efros, Sifei Liu |
|
code |
-1 |
Single Stage Virtual Try-On Via Deformable Attention Flows |
Shuai Bai, Huiling Zhou, Zhikang Li, Chang Zhou, Hongxia Yang |
|
code |
-1 |
Improving GANs for Long-Tailed Data Through Group Spectral Regularization |
Harsh Rangwani, Naman Jaswani, Tejan Karmali, Varun Jampani, R. Venkatesh Babu |
|
code |
-1 |
Hierarchical Semantic Regularization of Latent Spaces in StyleGANs |
Tejan Karmali, Rishubh Parihar, Susmit Agrawal, Harsh Rangwani, Varun Jampani, Maneesh Kumar Singh, R. Venkatesh Babu |
|
code |
-1 |
IntereStyle: Encoding an Interest Region for Robust StyleGAN Inversion |
Seung Jun Moon, GyeongMoon Park |
|
code |
-1 |
StyleLight: HDR Panorama Generation for Lighting Estimation and Editing |
Guangcong Wang, Yinuo Yang, Chen Change Loy, Ziwei Liu |
|
code |
-1 |
Contrastive Monotonic Pixel-Level Modulation |
Kun Lu, Rongpeng Li, Honggang Zhang |
|
code |
-1 |
Learning Cross-Video Neural Representations for High-Quality Frame Interpolation |
Wentao Shangguan, Yu Sun, Weijie Gan, Ulugbek S. Kamilov |
|
code |
-1 |
Learning Continuous Implicit Representation for Near-Periodic Patterns |
Bowei Chen, Tiancheng Zhi, Martial Hebert, Srinivasa G. Narasimhan |
|
code |
-1 |
End-to-End Graph-Constrained Vectorized Floorplan Generation with Panoptic Refinement |
Jiachen Liu, Yuan Xue, José Pinto Duarte, Krishnendra Shekhawat, Zihan Zhou, Xiaolei Huang |
|
code |
-1 |
Few-Shot Image Generation with Mixup-Based Distance Learning |
Chaerin Kong, Jeesoo Kim, Donghoon Han, Nojun Kwak |
|
code |
-1 |
A Style-Based GAN Encoder for High Fidelity Reconstruction of Images and Videos |
Xu Yao, Alasdair Newson, Yann Gousseau, Pierre Hellier |
|
code |
-1 |
FakeCLR: Exploring Contrastive Learning for Solving Latent Discontinuity in Data-Efficient GANs |
Ziqiang Li, Chaoyue Wang, Heliang Zheng, Jing Zhang, Bin Li |
|
code |
-1 |
BlobGAN: Spatially Disentangled Scene Representations |
Dave Epstein, Taesung Park, Richard Zhang, Eli Shechtman, Alexei A. Efros |
|
code |
-1 |
Unified Implicit Neural Stylization |
Zhiwen Fan, Yifan Jiang, Peihao Wang, Xinyu Gong, Dejia Xu, Zhangyang Wang |
|
code |
-1 |
GAN with Multivariate Disentangling for Controllable Hair Editing |
Xuyang Guo, Meina Kan, Tianle Chen, Shiguang Shan |
|
code |
-1 |
Discovering Transferable Forensic Features for CNN-Generated Images Detection |
Keshigeyan Chandrasegaran, NgocTrung Tran, Alexander Binder, NgaiMan Cheung |
|
code |
-1 |
Harmonizer: Learning to Perform White-Box Image and Video Harmonization |
Zhanghan Ke, Chunyi Sun, Lei Zhu, Ke Xu, Rynson W. H. Lau |
|
code |
-1 |
Text2LIVE: Text-Driven Layered Image and Video Editing |
Omer BarTal, Dolev OfriAmar, Rafail Fridman, Yoni Kasten, Tali Dekel |
|
code |
-1 |
Digging into Radiance Grid for Real-Time View Synthesis with Detail Preservation |
Jian Zhang, Jinchi Huang, Bowen Cai, Huan Fu, Mingming Gong, Chaohui Wang, Jiaming Wang, Hongchen Luo, Rongfei Jia, Binqiang Zhao, Xing Tang |
|
code |
-1 |
StyleGAN-Human: A Data-Centric Odyssey of Human Generation |
Jianglin Fu, Shikai Li, Yuming Jiang, KwanYee Lin, Chen Qian, Chen Change Loy, Wayne Wu, Ziwei Liu |
|
code |
-1 |
ColorFormer: Image Colorization via Color Memory Assisted Hybrid-Attention Transformer |
Xiaozhong Ji, Boyuan Jiang, Donghao Luo, Guangpin Tao, Wenqing Chu, Zhifeng Xie, Chengjie Wang, Ying Tai |
|
code |
-1 |
EAGAN: Efficient Two-Stage Evolutionary Architecture Search for GANs |
Guohao Ying, Xin He, Bin Gao, Bo Han, Xiaowen Chu |
|
code |
-1 |
Weakly-Supervised Stitching Network for Real-World Panoramic Image Generation |
DaeYoung Song, Geonsoo Lee, Heekyung Lee, GiMun Um, Donghyeon Cho |
|
code |
-1 |
DynaST: Dynamic Sparse Transformer for Exemplar-Guided Image Generation |
Songhua Liu, Jingwen Ye, Sucheng Ren, Xinchao Wang |
|
code |
-1 |
Multimodal Conditional Image Synthesis with Product-of-Experts GANs |
Xun Huang, Arun Mallya, TingChun Wang, MingYu Liu |
|
code |
-1 |
Auto-regressive Image Synthesis with Integrated Quantization |
Fangneng Zhan, Yingchen Yu, Rongliang Wu, Jiahui Zhang, Kaiwen Cui, Changgong Zhang, Shijian Lu |
|
code |
-1 |
JoJoGAN: One Shot Face Stylization |
Min Jin Chong, David A. Forsyth |
|
code |
-1 |
VecGAN: Image-to-Image Translation with Interpretable Latent Directions |
Yusuf Dalva, Said Fahri Altindis, Aysegul Dundar |
|
code |
-1 |
Any-Resolution Training for High-Resolution Image Synthesis |
Lucy Chai, Michaël Gharbi, Eli Shechtman, Phillip Isola, Richard Zhang |
|
code |
-1 |
CCPL: Contrastive Coherence Preserving Loss for Versatile Style Transfer |
Zijie Wu, Zhen Zhu, Junping Du, Xiang Bai |
|
code |
-1 |
CANF-VC: Conditional Augmented Normalizing Flows for Video Compression |
YungHan Ho, ChihPeng Chang, PengYu Chen, Alessandro Gnutti, WenHsiao Peng |
|
code |
-1 |
Bi-level Feature Alignment for Versatile Image Translation and Manipulation |
Fangneng Zhan, Yingchen Yu, Rongliang Wu, Jiahui Zhang, Kaiwen Cui, Aoran Xiao, Shijian Lu, Chunyan Miao |
|
code |
-1 |
High-Fidelity Image Inpainting with GAN Inversion |
Yongsheng Yu, Libo Zhang, Heng Fan, Tiejian Luo |
|
code |
-1 |
DeltaGAN: Towards Diverse Few-Shot Image Generation with Sample-Specific Delta |
Yan Hong, Li Niu, Jianfu Zhang, Liqing Zhang |
|
code |
-1 |
Image Inpainting with Cascaded Modulation GAN and Object-Aware Training |
Haitian Zheng, Zhe Lin, Jingwan Lu, Scott Cohen, Eli Shechtman, Connelly Barnes, Jianming Zhang, Ning Xu, Sohrab Amirghodsi, Jiebo Luo |
|
code |
-1 |
StyleFace: Towards Identity-Disentangled Face Generation on Megapixels |
Yuchen Luo, Junwei Zhu, Keke He, Wenqing Chu, Ying Tai, Chengjie Wang, Junchi Yan |
|
code |
-1 |
Video Extrapolation in Space and Time |
Yunzhi Zhang, Jiajun Wu |
|
code |
-1 |
Contrastive Learning for Diverse Disentangled Foreground Generation |
Yuheng Li, Yijun Li, Jingwan Lu, Eli Shechtman, Yong Jae Lee, Krishna Kumar Singh |
|
code |
-1 |
BIPS: Bi-modal Indoor Panorama Synthesis via Residual Depth-Aided Adversarial Learning |
Changgyoon Oh, Wonjune Cho, Yujeong Chae, Daehee Park, Lin Wang, KukJin Yoon |
|
code |
-1 |
Augmentation of rPPG Benchmark Datasets: Learning to Remove and Embed rPPG Signals via Double Cycle Consistent Learning from Unpaired Facial Videos |
ChengJu Hsieh, WeiHao Chung, ChiouTing Hsu |
|
code |
-1 |
Geometry-Aware Single-Image Full-Body Human Relighting |
Chaonan Ji, Tao Yu, Kaiwen Guo, Jingxin Liu, Yebin Liu |
|
code |
-1 |
3D-Aware Indoor Scene Synthesis with Depth Priors |
Zifan Shi, Yujun Shen, Jiapeng Zhu, DitYan Yeung, Qifeng Chen |
|
code |
-1 |
Deep Portrait Delighting |
Joshua Weir, Junhong Zhao, Andrew Chalmers, Taehyun Rhee |
|
code |
-1 |
Vector Quantized Image-to-Image Translation |
YuJie Chen, ShinI Cheng, WeiChen Chiu, HungYu Tseng, HsinYing Lee |
|
code |
-1 |
The Surprisingly Straightforward Scene Text Removal Method with Gated Attention and Region of Interest Generation: A Comprehensive Prominent Model Analysis |
Hyeonsu Lee, Chankyu Choi |
|
code |
-1 |
Free-Viewpoint RGB-D Human Performance Capture and Rendering |
Phong NguyenHa, Nikolaos Sarafianos, Christoph Lassner, Janne Heikkilä, Tony Tung |
|
code |
-1 |
Multiview Regenerative Morphing with Dual Flows |
ChihJung Tsai, Cheng Sun, HwannTzong Chen |
|
code |
-1 |
Hallucinating Pose-Compatible Scenes |
Tim Brooks, Alexei A. Efros |
|
code |
-1 |
Motion and Appearance Adaptation for Cross-domain Motion Transfer |
Borun Xu, Biao Wang, Jinhong Deng, Jiale Tao, Tiezheng Ge, Yuning Jiang, Wen Li, Lixin Duan |
|
code |
-1 |
Layered Controllable Video Generation |
Jiahui Huang, Yuhe Jin, Kwang Moo Yi, Leonid Sigal |
|
code |
-1 |
Custom Structure Preservation in Face Aging |
Guillermo GomezTrenado, Stéphane Lathuilière, Pablo Mesejo, Oscar Cordón |
|
code |
-1 |
Spatio-Temporal Deformable Attention Network for Video Deblurring |
Huicong Zhang, Haozhe Xie, Hongxun Yao |
|
code |
-1 |
NeuMesh: Learning Disentangled Neural Mesh-Based Implicit Field for Geometry and Texture Editing |
Bangbang Yang, Chong Bao, Junyi Zeng, Hujun Bao, Yinda Zhang, Zhaopeng Cui, Guofeng Zhang |
|
code |
-1 |
NeRF for Outdoor Scene Relighting |
Viktor Rudnev, Mohamed Elgharib, William A. P. Smith, Lingjie Liu, Vladislav Golyanik, Christian Theobalt |
|
code |
-1 |
CoGS: Controllable Generation and Search from Sketch and Style |
Cusuh Ham, Gemma Canet Tarres, Tu Bui, James Hays, Zhe Lin, John P. Collomosse |
|
code |
-1 |
HairNet: Hairstyle Transfer with Pose Changes |
Peihao Zhu, Rameen Abdal, John Femiani, Peter Wonka |
|
code |
-1 |
Unbiased Multi-modality Guidance for Image Inpainting |
Yongsheng Yu, Dawei Du, Libo Zhang, Tiejian Luo |
|
code |
-1 |
Intelli-Paint: Towards Developing More Human-Intelligible Painting Agents |
Jaskirat Singh, Cameron Smith, Jose Echevarria, Liang Zheng |
|
code |
-1 |
Motion Transformer for Unsupervised Image Animation |
Jiale Tao, Biao Wang, Tiezheng Ge, Yuning Jiang, Wen Li, Lixin Duan |
|
code |
-1 |
NÜWA: Visual Synthesis Pre-training for Neural visUal World creAtion |
Chenfei Wu, Jian Liang, Lei Ji, Fan Yang, Yuejian Fang, Daxin Jiang, Nan Duan |
|
code |
-1 |
EleGANt: Exquisite and Locally Editable GAN for Makeup Transfer |
Chenyu Yang, Wanrong He, Yingqing Xu, Yang Gao |
|
code |
-1 |
Dynamic Dual Trainable Bounds for Ultra-low Precision Super-Resolution Networks |
Yunshan Zhong, Mingbao Lin, Xunchao Li, Ke Li, Yunhang Shen, Fei Chao, Yongjian Wu, Rongrong Ji |
|
code |
-1 |
OSFormer: One-Stage Camouflaged Instance Segmentation with Transformers |
Jialun Pei, Tianyang Cheng, DengPing Fan, He Tang, Chuanbo Chen, Luc Van Gool |
|
code |
-1 |
Highly Accurate Dichotomous Image Segmentation |
Xuebin Qin, Hang Dai, Xiaobin Hu, DengPing Fan, Ling Shao, Luc Van Gool |
|
code |
-1 |
Boosting Supervised Dehazing Methods via Bi-level Patch Reweighting |
Xingyu Jiang, Hongkun Dou, Chengwei Fu, Bingquan Dai, Tianrun Xu, Yue Deng |
|
code |
-1 |
Flow-Guided Transformer for Video Inpainting |
Kaidong Zhang, Jingjing Fu, Dong Liu |
|
code |
-1 |
Shift-Tolerant Perceptual Similarity Metric |
Abhijay Ghildyal, Feng Liu |
|
code |
-1 |
Perception-Distortion Balanced ADMM Optimization for Single-Image Super-Resolution |
Yuehan Zhang, Bo Ji, Jia Hao, Angela Yao |
|
code |
-1 |
VQFR: Blind Face Restoration with Vector-Quantized Dictionary and Parallel Decoder |
Yuchao Gu, Xintao Wang, Liangbin Xie, Chao Dong, Gen Li, Ying Shan, MingMing Cheng |
|
code |
-1 |
Uncertainty Learning in Kernel Estimation for Multi-stage Blind Image Super-Resolution |
Zhenxuan Fang, Weisheng Dong, Xin Li, Jinjian Wu, Leida Li, Guangming Shi |
|
code |
-1 |
Learning Spatio-Temporal Downsampling for Effective Video Upscaling |
Xiaoyu Xiang, Yapeng Tian, Vijay Rengarajan, Lucas D. Young, Bo Zhu, Rakesh Ranjan |
|
code |
-1 |
Learning Local Implicit Fourier Representation for Image Warping |
Jaewon Lee, Kwang Pyo Choi, Kyong Hwan Jin |
|
code |
-1 |
SepLUT: Separable Image-Adaptive Lookup Tables for Real-Time Image Enhancement |
Canqian Yang, Meiguang Jin, Yi Xu, Rui Zhang, Ying Chen, Huaida Liu |
|
code |
-1 |
Blind Image Decomposition |
Junlin Han, Weihao Li, Pengfei Fang, Chunyi Sun, Jie Hong, Mohammad Ali Armin, Lars Petersson, Hongdong Li |
|
code |
-1 |
MuLUT: Cooperating Multiple Look-Up Tables for Efficient Image Super-Resolution |
Jiacheng Li, Chang Chen, Zhen Cheng, Zhiwei Xiong |
|
code |
-1 |
Learning Spatiotemporal Frequency-Transformer for Compressed Video Super-Resolution |
Zhongwei Qiu, Huan Yang, Jianlong Fu, Dongmei Fu |
|
code |
-1 |
Spatial-Frequency Domain Information Integration for Pan-Sharpening |
Man Zhou, Jie Huang, Keyu Yan, Hu Yu, Xueyang Fu, Aiping Liu, Xian Wei, Feng Zhao |
|
code |
-1 |
Adaptive Patch Exiting for Scalable Single Image Super-Resolution |
Shizun Wang, Jiaming Liu, Kaixin Chen, Xiaoqi Li, Ming Lu, Yandong Guo |
|
code |
-1 |
Efficient Meta-Tuning for Content-Aware Neural Video Delivery |
Xiaoqi Li, Jiaming Liu, Shizun Wang, Cheng Lyu, Ming Lu, Yurong Chen, Anbang Yao, Yandong Guo, Shanghang Zhang |
|
code |
-1 |
Reference-Based Image Super-Resolution with Deformable Attention Transformer |
Jiezhang Cao, Jingyun Liang, Kai Zhang, Yawei Li, Yulun Zhang, Wenguan Wang, Luc Van Gool |
|
code |
-1 |
Local Color Distributions Prior for Image Enhancement |
Haoyuan Wang, Ke Xu, Rynson W. H. Lau |
|
code |
-1 |
L-CoDer: Language-Based Colorization with Color-Object Decoupling Transformer |
Zheng Chang, Shuchen Weng, Yu Li, Si Li, Boxin Shi |
|
code |
-1 |
From Face to Natural Image: Learning Real Degradation for Blind Image Super-Resolution |
Xiaoming Li, Chaofeng Chen, Xianhui Lin, Wangmeng Zuo, Lei Zhang |
|
code |
-1 |
Towards Interpretable Video Super-Resolution via Alternating Optimization |
Jiezhang Cao, Jingyun Liang, Kai Zhang, Wenguan Wang, Qin Wang, Yulun Zhang, Hao Tang, Luc Van Gool |
|
code |
-1 |
Event-Based Fusion for Motion Deblurring with Cross-modal Attention |
Lei Sun, Christos Sakaridis, Jingyun Liang, Qi Jiang, Kailun Yang, Peng Sun, Yaozu Ye, Kaiwei Wang, Luc Van Gool |
|
code |
-1 |
Fast and High Quality Image Denoising via Malleable Convolution |
Yifan Jiang, Bartlomiej Wronski, Ben Mildenhall, Jonathan T. Barron, Zhangyang Wang, Tianfan Xue |
|
code |
-1 |
TAPE: Task-Agnostic Prior Embedding for Image Restoration |
Lin Liu, Lingxi Xie, Xiaopeng Zhang, Shanxin Yuan, Xiangyu Chen, Wengang Zhou, Houqiang Li, Qi Tian |
|
code |
-1 |
Uncertainty Inspired Underwater Image Enhancement |
Zhenqi Fu, Wu Wang, Yue Huang, Xinghao Ding, KaiKuang Ma |
|
code |
-1 |
Hourglass Attention Network for Image Inpainting |
Ye Deng, Siqi Hui, Rongye Meng, Sanping Zhou, Jinjun Wang |
|
code |
-1 |
Unfolded Deep Kernel Estimation for Blind Image Super-Resolution |
Hongyi Zheng, Hongwei Yong, Lei Zhang |
|
code |
-1 |
Event-guided Deblurring of Unknown Exposure Time Videos |
Taewoo Kim, Jeongmin Lee, Lin Wang, KukJin Yoon |
|
code |
-1 |
ReCoNet: Recurrent Correction Network for Fast and Efficient Multi-modality Image Fusion |
Zhanbo Huang, Jinyuan Liu, Xin Fan, Risheng Liu, Wei Zhong, Zhongxuan Luo |
|
code |
-1 |
Content Adaptive Latents and Decoder for Neural Image Compression |
Guanbo Pan, Guo Lu, Zhihao Hu, Dong Xu |
|
code |
-1 |
Efficient and Degradation-Adaptive Network for Real-World Image Super-Resolution |
Jie Liang, Hui Zeng, Lei Zhang |
|
code |
-1 |
Unidirectional Video Denoising by Mimicking Backward Recurrent Modules with Look-Ahead Forward Ones |
Junyi Li, Xiaohe Wu, Zhenxing Niu, Wangmeng Zuo |
|
code |
-1 |
Self-supervised Learning for Real-World Super-Resolution from Dual Zoomed Observations |
Zhilu Zhang, Ruohao Wang, Hongzhi Zhang, Yunjin Chen, Wangmeng Zuo |
|
code |
-1 |
Secrets of Event-Based Optical Flow |
Shintaro Shiba, Yoshimitsu Aoki, Guillermo Gallego |
|
code |
-1 |
Towards Efficient and Scale-Robust Ultra-High-Definition Image Demoiréing |
Xin Yu, Peng Dai, Wenbo Li, Lan Ma, Jiajun Shen, Jia Li, Xiaojuan Qi |
|
code |
-1 |
ERDN: Equivalent Receptive Field Deformable Network for Video Deblurring |
Bangrui Jiang, Zhihuai Xie, Zhen Xia, Songnan Li, Shan Liu |
|
code |
-1 |
Rethinking Generic Camera Models for Deep Single Image Camera Calibration to Recover Rotation and Fisheye Distortion |
Nobuhiko Wakai, Satoshi Sato, Yasunori Ishii, Takayoshi Yamashita |
|
code |
-1 |
ART-SS: An Adaptive Rejection Technique for Semi-supervised Restoration for Adverse Weather-Affected Images |
Rajeev Yasarla, Carey E. Priebe, Vishal M. Patel |
|
code |
-1 |
Fusion from Decomposition: A Self-Supervised Decomposition Approach for Image Fusion |
Pengwei Liang, Junjun Jiang, Xianming Liu, Jiayi Ma |
|
code |
-1 |
Learning Degradation Representations for Image Deblurring |
Dasong Li, Yi Zhang, Ka Chun Cheung, Xiaogang Wang, Hongwei Qin, Hongsheng Li |
|
code |
-1 |
Watermark Vaccine: Adversarial Attacks to Prevent Watermark Removal |
Xinwei Liu, Jian Liu, Yang Bai, Jindong Gu, Tao Chen, Xiaojun Jia, Xiaochun Cao |
|
code |
-1 |
Explaining Deepfake Detection by Analysing Image Matching |
Shichao Dong, Jin Wang, Jiajun Liang, Haoqiang Fan, Renhe Ji |
|
code |
-1 |
FrequencyLowCut Pooling - Plug and Play Against Catastrophic Overfitting |
Julia Grabinski, Steffen Jung, Janis Keuper, Margret Keuper |
|
code |
-1 |
TAFIM: Targeted Adversarial Attacks Against Facial Image Manipulations |
Shivangi Aneja, Lev Markhasin, Matthias Nießner |
|
code |
-1 |
FingerprintNet: Synthesized Fingerprints for Generated Image Detection |
Yonghyun Jeong, Doyeon Kim, Youngmin Ro, Pyounggeon Kim, Jongwon Choi |
|
code |
-1 |
Detecting Generated Images by Real Images |
Bo Liu, Fan Yang, Xiuli Bi, Bin Xiao, Weisheng Li, Xinbo Gao |
|
code |
-1 |
An Information Theoretic Approach for Attention-Driven Face Forgery Detection |
Ke Sun, Hong Liu, Taiping Yao, Xiaoshuai Sun, Shen Chen, Shouhong Ding, Rongrong Ji |
|
code |
-1 |
Exploring Disentangled Content Information for Face Forgery Detection |
Jiahao Liang, Huafeng Shi, Weihong Deng |
|
code |
-1 |
RepMix: Representation Mixing for Robust Attribution of Synthesized Images |
Tu Bui, Ning Yu, John P. Collomosse |
|
code |
-1 |
Totems: Physical Objects for Verifying Visual Integrity |
Jingwei Ma, Lucy Chai, Minyoung Huh, Tongzhou Wang, SerNam Lim, Phillip Isola, Antonio Torralba |
|
code |
-1 |
Dual-Stream Knowledge-Preserving Hashing for Unsupervised Video Retrieval |
Pandeng Li, Hongtao Xie, Jiannan Ge, Lei Zhang, Shaobo Min, Yongdong Zhang |
|
code |
-1 |
PASS: Part-Aware Self-Supervised Pre-Training for Person Re-Identification |
Kuan Zhu, Haiyun Guo, Tianyi Yan, Yousong Zhu, Jinqiao Wang, Ming Tang |
|
code |
-1 |
Adaptive Cross-domain Learning for Generalizable Person Re-identification |
Pengyi Zhang, Huanzhang Dou, Yunlong Yu, Xi Li |
|
code |
-1 |
Multi-query Video Retrieval |
Zeyu Wang, Yu Wu, Karthik Narasimhan, Olga Russakovsky |
|
code |
-1 |
Hierarchical Average Precision Training for Pertinent Image Retrieval |
Elias Ramzi, Nicolas Audebert, Nicolas Thome, Clément Rambour, Xavier Bitot |
|
code |
-1 |
Learning Semantic Correspondence with Sparse Annotations |
Shuaiyi Huang, Luyu Yang, Bo He, Songyang Zhang, Xuming He, Abhinav Shrivastava |
|
code |
-1 |
Dynamically Transformed Instance Normalization Network for Generalizable Person Re-Identification |
Bingliang Jiao, Lingqiao Liu, Liying Gao, Guosheng Lin, Lu Yang, Shizhou Zhang, Peng Wang, Yanning Zhang |
|
code |
-1 |
Domain Adaptive Person Search |
Junjie Li, Yichao Yan, Guanshuo Wang, Fufu Yu, Qiong Jia, Shouhong Ding |
|
code |
-1 |
TS2-Net: Token Shift and Selection Transformer for Text-Video Retrieval |
Yuqi Liu, Pengfei Xiong, Luhui Xu, Shengming Cao, Qin Jin |
|
code |
-1 |
Unstructured Feature Decoupling for Vehicle Re-identification |
Wen Qian, Hao Luo, Silong Peng, Fan Wang, Chen Chen, Hao Li |
|
code |
-1 |
Deep Hash Distillation for Image Retrieval |
Young Kyun Jang, Geonmo Gu, ByungSoo Ko, Isaac Kang, Nam Ik Cho |
|
code |
-1 |
Mimic Embedding via Adaptive Aggregation: Learning Generalizable Person Re-identification |
Boqiang Xu, Jian Liang, Lingxiao He, Zhenan Sun |
|
code |
-1 |
Granularity-Aware Adaptation for Image Retrieval Over Multiple Tasks |
Jon Almazán, ByungSoo Ko, Geonmo Gu, Diane Larlus, Yannis Kalantidis |
|
code |
-1 |
Learning Audio-Video Modalities from Image Captions |
Arsha Nagrani, Paul Hongsuck Seo, Bryan Seybold, Anja Hauth, Santiago Manen, Chen Sun, Cordelia Schmid |
|
code |
-1 |
RVSL: Robust Vehicle Similarity Learning in Real Hazy Scenes Based on Semi-supervised Learning |
WeiTing Chen, IHsiang Chen, ChihYuan Yeh, HaoHsiang Yang, HuaEn Chang, JianJiun Ding, SyYen Kuo |
|
code |
-1 |
Lightweight Attentional Feature Fusion: A New Baseline for Text-to-Video Retrieval |
Fan Hu, Aozhu Chen, Ziyue Wang, Fangming Zhou, Jianfeng Dong, Xirong Li |
|
code |
-1 |
Modality Synergy Complement Learning with Cascaded Aggregation for Visible-Infrared Person Re-Identification |
Yiyuan Zhang, Sanyuan Zhao, Yuhao Kang, Jianbing Shen |
|
code |
-1 |
Cross-Modality Transformer for Visible-Infrared Person Re-Identification |
Kongzhu Jiang, Tianzhu Zhang, Xiang Liu, Bingqiao Qian, Yongdong Zhang, Feng Wu |
|
code |
-1 |
Audio-Visual Mismatch-Aware Video Retrieval via Association and Adjustment |
Sangmin Lee, Sungjune Park, Yong Man Ro |
|
code |
-1 |
Connecting Compression Spaces with Transformer for Approximate Nearest Neighbor Search |
Haokui Zhang, Buzhou Tang, Wenze Hu, Xiaoyu Wang |
|
code |
-1 |
SEMICON: A Learning-to-Hash Solution for Large-Scale Fine-Grained Image Retrieval |
Yang Shen, Xuhao Sun, XiuShen Wei, QingYuan Jiang, Jian Yang |
|
code |
-1 |
CAViT: Contextual Alignment Vision Transformer for Video Object Re-identification |
Jinlin Wu, Lingxiao He, Wu Liu, Yang Yang, Zhen Lei, Tao Mei, Stan Z. Li |
|
code |
-1 |
Text-Based Temporal Localization of Novel Events |
Sudipta Paul, Niluthpol Chowdhury Mithun, Amit K. RoyChowdhury |
|
code |
-1 |
Reliability-Aware Prediction via Uncertainty Learning for Person Image Retrieval |
Zhaopeng Dou, Zhongdao Wang, Weihua Chen, Yali Li, Shengjin Wang |
|
code |
-1 |
Relighting4D: Neural Relightable Human from Videos |
Zhaoxi Chen, Ziwei Liu |
|
code |
-1 |
Real-Time Intermediate Flow Estimation for Video Frame Interpolation |
Zhewei Huang, Tianyuan Zhang, Wen Heng, Boxin Shi, Shuchang Zhou |
|
code |
-1 |
PixelFolder: An Efficient Progressive Pixel Synthesis Network for Image Generation |
Jing He, Yiyi Zhou, Qi Zhang, Jun Peng, Yunhang Shen, Xiaoshuai Sun, Chao Chen, Rongrong Ji |
|
code |
-1 |
StyleSwap: Style-Based Generator Empowers Robust Face Swapping |
Zhiliang Xu, Hang Zhou, Zhibin Hong, Ziwei Liu, Jiaming Liu, Zhizhi Guo, Junyu Han, Jingtuo Liu, Errui Ding, Jingdong Wang |
|
code |
-1 |
Paint2Pix: Interactive Painting Based Progressive Image Synthesis and Editing |
Jaskirat Singh, Liang Zheng, Cameron Smith, Jose Echevarria |
|
code |
-1 |
FurryGAN: High Quality Foreground-Aware Image Synthesis |
Jeongmin Bae, Mingi Kwon, Youngjung Uh |
|
code |
-1 |
SCAM! Transferring Humans Between Images with Semantic Cross Attention Modulation |
Nicolas Dufour, David Picard, Vicky Kalogeiton |
|
code |
-1 |
Sem2NeRF: Converting Single-View Semantic Masks to Neural Radiance Fields |
Yuedong Chen, Qianyi Wu, Chuanxia Zheng, TatJen Cham, Jianfei Cai |
|
code |
-1 |
Editing Out-of-Domain GAN Inversion via Differential Activations |
Haorui Song, Yong Du, Tianyi Xiang, Junyu Dong, Jing Qin, Shengfeng He |
|
code |
-1 |
On the Robustness of Quality Measures for GANs |
Motasem Alfarra, Juan C. Pérez, Anna Frühstück, Philip H. S. Torr, Peter Wonka, Bernard Ghanem |
|
code |
-1 |
Sound-Guided Semantic Video Generation |
Seung Hyun Lee, Gyeongrok Oh, Wonmin Byeon, Chan Young Kim, Won Jeong Ryoo, Sang Ho Yoon, Hyunjun Cho, Jihyun Bae, Jinkyu Kim, Sangpil Kim |
|
code |
-1 |
Inpainting at Modern Camera Resolution by Guided PatchMatch with Auto-curation |
Lingzhi Zhang, Connelly Barnes, Kevin Wampler, Sohrab Amirghodsi, Eli Shechtman, Zhe Lin, Jianbo Shi |
|
code |
-1 |
Controllable Video Generation Through Global and Local Motion Dynamics |
Aram Davtyan, Paolo Favaro |
|
code |
-1 |
StyleHEAT: One-Shot High-Resolution Editable Talking Face Generation via Pre-trained StyleGAN |
Fei Yin, Yong Zhang, Xiaodong Cun, Mingdeng Cao, Yanbo Fan, Xuan Wang, Qingyan Bai, Baoyuan Wu, Jue Wang, Yujiu Yang |
|
code |
-1 |
Long Video Generation with Time-Agnostic VQGAN and Time-Sensitive Transformer |
Songwei Ge, Thomas Hayes, Harry Yang, Xi Yin, Guan Pang, David Jacobs, JiaBin Huang, Devi Parikh |
|
code |
-1 |
Combining Internal and External Constraints for Unrolling Shutter in Videos |
Eyal Naor, Itai Antebi, Shai Bagon, Michal Irani |
|
code |
-1 |
WISE: Whitebox Image Stylization by Example-Based Learning |
Winfried Lötzsch, Max Reimann, Martin Büßemeyer, Amir Semmo, Jürgen Döllner, Matthias Trapp |
|
code |
-1 |
Neural Radiance Transfer Fields for Relightable Novel-View Synthesis with Global Illumination |
Linjie Lyu, Ayush Tewari, Thomas Leimkühler, Marc Habermann, Christian Theobalt |
|
code |
-1 |
Transformers as Meta-learners for Implicit Neural Representations |
Yinbo Chen, Xiaolong Wang |
|
code |
-1 |
Style Your Hair: Latent Optimization for Pose-Invariant Hairstyle Transfer via Local-Style-Aware Hair Alignment |
Taewoo Kim, Chaeyeon Chung, Yoonseo Kim, Sunghyun Park, Kangyeol Kim, Jaegul Choo |
|
code |
-1 |
High-Resolution Virtual Try-On with Misalignment and Occlusion-Handled Conditions |
Sangyun Lee, Gyojung Gu, Sunghyun Park, Seunghwan Choi, Jaegul Choo |
|
code |
-1 |
A Codec Information Assisted Framework for Efficient Compressed Video Super-Resolution |
Hengsheng Zhang, Xueyi Zou, Jiaming Guo, Youliang Yan, Rong Xie, Li Song |
|
code |
-1 |
Injecting 3D Perception of Controllable NeRF-GAN into StyleGAN for Editable Portrait Image Synthesis |
Jeonggi Kwak, Yuanming Li, Dongsik Yoon, Donghyeon Kim, David K. Han, Hanseok Ko |
|
code |
-1 |
AdaNeRF: Adaptive Sampling for Real-Time Rendering of Neural Radiance Fields |
Andreas Kurz, Thomas Neff, Zhaoyang Lv, Michael Zollhöfer, Markus Steinberger |
|
code |
-1 |
Improving the Perceptual Quality of 2D Animation Interpolation |
Shuhong Chen, Matthias Zwicker |
|
code |
-1 |
Selective TransHDR: Transformer-Based Selective HDR Imaging Using Ghost Region Mask |
Jou Won Song, Ye In Park, Kyeongbo Kong, Jaeho Kwak, SukJu Kang |
|
code |
-1 |
Learning Series-Parallel Lookup Tables for Efficient Image Super-Resolution |
Cheng Ma, Jingyi Zhang, Jie Zhou, Jiwen Lu |
|
code |
-1 |
GeoAug: Data Augmentation for Few-Shot NeRF with Geometry Constraints |
Di Chen, Yu Liu, Lianghua Huang, Bin Wang, Pan Pan |
|
code |
-1 |
DoodleFormer: Creative Sketch Drawing with Transformers |
Ankan Kumar Bhunia, Salman Khan, Hisham Cholakkal, Rao Muhammad Anwer, Fahad Shahbaz Khan, Jorma Laaksonen, Michael Felsberg |
|
code |
-1 |
Implicit Neural Representations for Variable Length Human Motion Generation |
Pablo Cervantes, Yusuke Sekikawa, Ikuro Sato, Koichi Shinoda |
|
code |
-1 |
Learning Object Placement via Dual-Path Graph Completion |
Siyuan Zhou, Liu Liu, Li Niu, Liqing Zhang |
|
code |
-1 |
Expanded Adaptive Scaling Normalization for End to End Image Compression |
Chajin Shin, Hyeongmin Lee, Hanbin Son, Sangjin Lee, Dogyoon Lee, Sangyoun Lee |
|
code |
-1 |
Generator Knows What Discriminator Should Learn in Unconditional GANs |
Gayoung Lee, Hyunsu Kim, Junho Kim, Seonghyeon Kim, JungWoo Ha, Yunjey Choi |
|
code |
-1 |
Compositional Visual Generation with Composable Diffusion Models |
Nan Liu, Shuang Li, Yilun Du, Antonio Torralba, Joshua B. Tenenbaum |
|
code |
-1 |
ManiFest: Manifold Deformation for Few-Shot Image Translation |
Fabio Pizzati, JeanFrançois Lalonde, Raoul de Charette |
|
code |
-1 |
Supervised Attribute Information Removal and Reconstruction for Image Manipulation |
Nannan Li, Bryan A. Plummer |
|
code |
-1 |
BLT: Bidirectional Layout Transformer for Controllable Layout Generation |
Xiang Kong, Lu Jiang, Huiwen Chang, Han Zhang, Yuan Hao, Haifeng Gong, Irfan Essa |
|
code |
-1 |
Diverse Generation from a Single Video Made Possible |
Niv Haim, Ben Feinstein, Niv Granot, Assaf Shocher, Shai Bagon, Tali Dekel, Michal Irani |
|
code |
-1 |
Rayleigh EigenDirections (REDs): Nonlinear GAN Latent Space Traversals for Multidimensional Features |
Guha Balakrishnan, Raghudeep Gadde, Aleix Martinez, Pietro Perona |
|
code |
-1 |
Bridging the Domain Gap Towards Generalization in Automatic Colorization |
Hyejin Lee, Daehee Kim, Daeun Lee, Jinkyu Kim, Jaekoo Lee |
|
code |
-1 |
Generating Natural Images with Direct Patch Distributions Matching |
Ariel Elnekave, Yair Weiss |
|
code |
-1 |
Context-Consistent Semantic Image Editing with Style-Preserved Modulation |
Wuyang Luo, Su Yang, Hong Wang, Bo Long, Weishan Zhang |
|
code |
-1 |
Eliminating Gradient Conflict in Reference-based Line-Art Colorization |
Zekun Li, Zhengyang Geng, Zhao Kang, Wenyu Chen, Yibo Yang |
|
code |
-1 |
Unsupervised Learning of Efficient Geometry-Aware Neural Articulated Representations |
Atsuhiro Noguchi, Xiao Sun, Stephen Lin, Tatsuya Harada |
|
code |
-1 |
JPEG Artifacts Removal via Contrastive Representation Learning |
Xi Wang, Xueyang Fu, Yurui Zhu, ZhengJun Zha |
|
code |
-1 |
Unpaired Deep Image Dehazing Using Contrastive Disentanglement Learning |
Xiang Chen, Zhentao Fan, Pengpeng Li, Longgang Dai, Caihua Kong, Zhuoran Zheng, Yufeng Huang, Yufeng Li |
|
code |
-1 |
Efficient Long-Range Attention Network for Image Super-Resolution |
Xindong Zhang, Hui Zeng, Shi Guo, Lei Zhang |
|
code |
-1 |
FlowFormer: A Transformer Architecture for Optical Flow |
Zhaoyang Huang, Xiaoyu Shi, Chao Zhang, Qiang Wang, Ka Chun Cheung, Hongwei Qin, Jifeng Dai, Hongsheng Li |
|
code |
-1 |
Coarse-to-Fine Sparse Transformer for Hyperspectral Image Reconstruction |
Yuanhao Cai, Jing Lin, Xiaowan Hu, Haoqian Wang, Xin Yuan, Yulun Zhang, Radu Timofte, Luc Van Gool |
|
code |
-1 |
Learning Shadow Correspondence for Video Shadow Detection |
Xinpeng Ding, Jingwen Yang, Xiaowei Hu, Xiaomeng Li |
|
code |
-1 |
Metric Learning Based Interactive Modulation for Real-World Super-Resolution |
Chong Mou, Yanze Wu, Xintao Wang, Chao Dong, Jian Zhang, Ying Shan |
|
code |
-1 |
Explicit Model Size Control and Relaxation via Smooth Regularization for Mixed-Precision Quantization |
Vladimir Chikin, Kirill Solodskikh, Irina Zhelavskaya |
|
code |
-1 |
BASQ: Branch-wise Activation-clipping Search Quantization for Sub-4-bit Neural Networks |
HanByul Kim, Eunhyeok Park, Sungjoo Yoo |
|
code |
-1 |
You Already Have It: A Generator-Free Low-Precision DNN Training Framework Using Stochastic Rounding |
Geng Yuan, SungEn Chang, Qing Jin, Alec Lu, Yanyu Li, Yushu Wu, Zhenglun Kong, Yanyue Xie, Peiyan Dong, Minghai Qin, Xiaolong Ma, Xulong Tang, Zhenman Fang, Yanzhi Wang |
|
code |
-1 |
Real Spike: Learning Real-Valued Spikes for Spiking Neural Networks |
Yufei Guo, Liwen Zhang, Yuanpei Chen, Xinyi Tong, Xiaode Liu, YingLei Wang, Xuhui Huang, Zhe Ma |
|
code |
-1 |
FedLTN: Federated Learning for Sparse and Personalized Lottery Ticket Networks |
Vaikkunth Mugunthan, Eric Lin, Vignesh Gokul, Christian Lau, Lalana Kagal, Steven D. Pieper |
|
code |
-1 |
Theoretical Understanding of the Information Flow on Continual Learning Performance |
Joshua Andle, Salimeh Yasaei Sekeh |
|
code |
-1 |
Exploring Lottery Ticket Hypothesis in Spiking Neural Networks |
Youngeun Kim, Yuhang Li, Hyoungseob Park, Yeshwanth Venkatesha, Ruokai Yin, Priyadarshini Panda |
|
code |
-1 |
On the Angular Update and Hyperparameter Tuning of a Scale-Invariant Network |
Juseung Yun, Janghyeon Lee, Hyounguk Shon, Eojindl Yi, Seung Hwan Kim, Junmo Kim |
|
code |
-1 |
LANA: Latency Aware Network Acceleration |
Pavlo Molchanov, Jimmy Hall, Hongxu Yin, Jan Kautz, Nicolò Fusi, Arash Vahdat |
|
code |
-1 |
RDO-Q: Extremely Fine-Grained Channel-Wise Quantization via Rate-Distortion Optimization |
Zhe Wang, Jie Lin, Xue Geng, Mohamed M. Sabry Aly, Vijay Chandrasekhar |
|
code |
-1 |
U-Boost NAS: Utilization-Boosted Differentiable Neural Architecture Search |
Ahmet Caner Yüzügüler, Nikolaos Dimitriadis, Pascal Frossard |
|
code |
-1 |
PTQ4ViT: Post-training Quantization for Vision Transformers with Twin Uniform Quantization |
Zhihang Yuan, Chenhao Xue, Yiqi Chen, Qiang Wu, Guangyu Sun |
|
code |
-1 |
Bitwidth-Adaptive Quantization-Aware Neural Network Training: A Meta-Learning Approach |
Jiseok Youn, Jaehun Song, HyungSin Kim, Saewoong Bahk |
|
code |
-1 |
Understanding the Dynamics of DNNs Using Graph Modularity |
Yao Lu, Wen Yang, Yunzhe Zhang, Zuohui Chen, Jinyin Chen, Qi Xuan, Zhen Wang, Xiaoniu Yang |
|
code |
-1 |
Latent Discriminant Deterministic Uncertainty |
Gianni Franchi, Xuanlong Yu, Andrei Bursuc, Emanuel Aldea, Séverine Dubuisson, David Filliat |
|
code |
-1 |
Making Heads or Tails: Towards Semantically Consistent Visual Counterfactuals |
Simon Vandenhende, Dhruv Mahajan, Filip Radenovic, Deepti Ghadiyaram |
|
code |
-1 |
HIVE: Evaluating the Human Interpretability of Visual Explanations |
Sunnie S. Y. Kim, Nicole Meister, Vikram V. Ramaswamy, Ruth Fong, Olga Russakovsky |
|
code |
-1 |
BayesCap: Bayesian Identity Cap for Calibrated Uncertainty in Frozen Neural Networks |
Uddeshya Upadhyay, Shyamgopal Karthik, Yanbei Chen, Massimiliano Mancini, Zeynep Akata |
|
code |
-1 |
SESS: Saliency Enhancing with Scaling and Sliding |
Osman Tursun, Simon Denman, Sridha Sridharan, Clinton Fookes |
|
code |
-1 |
No Token Left Behind: Explainability-Aided Image Classification and Generation |
Roni Paiss, Hila Chefer, Lior Wolf |
|
code |
-1 |
Interpretable Image Classification with Differentiable Prototypes Assignment |
Dawid Rymarczyk, Lukasz Struski, Michal Górszczak, Koryna Lewandowska, Jacek Tabor, Bartosz Zielinski |
|
code |
-1 |
Contributions of Shape, Texture, and Color in Visual Recognition |
Yunhao Ge, Yao Xiao, Zhi Xu, Xingrui Wang, Laurent Itti |
|
code |
-1 |
STEEX: Steering Counterfactual Explanations with Semantics |
Paul Jacob, Éloi Zablocki, Hédi BenYounes, Mickaël Chen, Patrick Pérez, Matthieu Cord |
|
code |
-1 |
Are Vision Transformers Robust to Patch Perturbations? |
Jindong Gu, Volker Tresp, Yao Qin |
|
code |
-1 |
A Dataset Generation Framework for Evaluating Megapixel Image Classifiers and Their Explanations |
Gautam Machiraju, Sylvia K. Plevritis, Parag Mallick |
|
code |
-1 |
Cartoon Explanations of Image Classifiers |
Stefan Kolek, Duc Anh Nguyen, Ron Levie, Joan Bruna, Gitta Kutyniok |
|
code |
-1 |
Shap-CAM: Visual Explanations for Convolutional Neural Networks Based on Shapley Value |
Quan Zheng, ZiWei Wang, Jie Zhou, Jiwen Lu |
|
code |
-1 |
Privacy-Preserving Face Recognition with Learnable Privacy Budgets in Frequency Domain |
Jiazhen Ji, Huan Wang, Yuge Huang, Jiaxiang Wu, Xingkun Xu, Shouhong Ding, Shengchuan Zhang, Liujuan Cao, Rongrong Ji |
|
code |
-1 |
Contrast-Phys: Unsupervised Video-Based Remote Physiological Measurement via Spatiotemporal Contrast |
Zhaodong Sun, Xiaobai Li |
|
code |
-1 |
Source-Free Domain Adaptation with Contrastive Domain Alignment and Self-supervised Exploration for Face Anti-spoofing |
Yuchen Liu, Yabo Chen, Wenrui Dai, Mengran Gou, ChunTing Huang, Hongkai Xiong |
|
code |
-1 |
On Mitigating Hard Clusters for Face Clustering |
Yingjie Chen, Huasong Zhong, Chong Chen, Chen Shen, Jianqiang Huang, Tao Wang, Yun Liang, Qianru Sun |
|
code |
-1 |
OneFace: One Threshold for All |
Jiaheng Liu, Zhipeng Yu, Haoyu Qin, Yichao Wu, Ding Liang, Gangming Zhao, Ke Xu |
|
code |
-1 |
Label2Label: A Language Modeling Framework for Multi-attribute Learning |
Wanhua Li, Zhexuan Cao, Jianjiang Feng, Jie Zhou, Jiwen Lu |
|
code |
-1 |
AgeTransGAN for Facial Age Transformation with Rectified Performance Metrics |
GeeSern Hsu, RuiCang Xie, ZhiTing Chen, YuHong Lin |
|
code |
-1 |
Hierarchical Contrastive Inconsistency Learning for Deepfake Video Detection |
Zhihao Gu, Taiping Yao, Yang Chen, Shouhong Ding, Lizhuang Ma |
|
code |
-1 |
Rethinking Robust Representation Learning Under Fine-Grained Noisy Faces |
Bingqi Ma, Guanglu Song, Boxiao Liu, Yu Liu |
|
code |
-1 |
Teaching Where to Look: Attention Similarity Knowledge Distillation for Low Resolution Face Recognition |
Sungho Shin, Joosoon Lee, Junseok Lee, Yeonguk Yu, Kyoobin Lee |
|
code |
-1 |
Teaching with Soft Label Smoothing for Mitigating Noisy Labels in Facial Expressions |
Tohar Lukov, Na Zhao, Gim Hee Lee, SerNam Lim |
|
code |
-1 |
Learning Dynamic Facial Radiance Fields for Few-Shot Talking Head Synthesis |
Shuai Shen, Wanhua Li, Zheng Zhu, Yueqi Duan, Jie Zhou, Jiwen Lu |
|
code |
-1 |
CoupleFace: Relation Matters for Face Recognition Distillation |
Jiaheng Liu, Haoyu Qin, Yichao Wu, Jinyang Guo, Ding Liang, Ke Xu |
|
code |
-1 |
Controllable and Guided Face Synthesis for Unconstrained Face Recognition |
Feng Liu, Minchul Kim, Anil K. Jain, Xiaoming Liu |
|
code |
-1 |
Towards Robust Face Recognition with Comprehensive Search |
Manyuan Zhang, Guanglu Song, Yu Liu, Hongsheng Li |
|
code |
-1 |
Towards Unbiased Label Distribution Learning for Facial Pose Estimation Using Anisotropic Spherical Gaussian |
Zhiwen Cao, Dongfang Liu, Qifan Wang, Yingjie Victor Chen |
|
code |
-1 |
ByteTrack: Multi-object Tracking by Associating Every Detection Box |
Yifu Zhang, Peize Sun, Yi Jiang, Dongdong Yu, Fucheng Weng, Zehuan Yuan, Ping Luo, Wenyu Liu, Xinggang Wang |
|
code |
-1 |
Robust Multi-object Tracking by Marginal Inference |
Yifu Zhang, Chunyu Wang, Xinggang Wang, Wenjun Zeng, Wenyu Liu |
|
code |
-1 |
PolarMOT: How Far Can Geometric Relations Take us in 3D Multi-object Tracking? |
Aleksandr Kim, Guillem Brasó, Aljosa Osep, Laura LealTaixé |
|
code |
-1 |
Particle Video Revisited: Tracking Through Occlusions Using Point Trajectories |
Adam W. Harley, Zhaoyuan Fang, Katerina Fragkiadaki |
|
code |
-1 |
Tracking Objects as Pixel-Wise Distributions |
Zelin Zhao, Ze Wu, Yueqing Zhuang, Boxun Li, Jiaya Jia |
|
code |
-1 |
CMT: Context-Matching-Guided Transformer for 3D Tracking in Point Clouds |
Zhiyang Guo, Yunyao Mao, Wengang Zhou, Min Wang, Houqiang Li |
|
code |
-1 |
Towards Generic 3D Tracking in RGBD Videos: Benchmark and Baseline |
Jinyu Yang, Zhongqun Zhang, Zhe Li, Hyung Jin Chang, Ales Leonardis, Feng Zheng |
|
code |
-1 |
Hierarchical Latent Structure for Multi-modal Vehicle Trajectory Forecasting |
Dooseop Choi, KyoungWook Min |
|
code |
-1 |
AiATrack: Attention in Attention for Transformer Visual Tracking |
Shenyuan Gao, Chunluan Zhou, Chao Ma, Xinggang Wang, Junsong Yuan |
|
code |
-1 |
Disentangling Architecture and Training for Optical Flow |
Deqing Sun, Charles Herrmann, Fitsum A. Reda, Michael Rubinstein, David J. Fleet, William T. Freeman |
|
code |
-1 |
A Perturbation-Constrained Adversarial Attack for Evaluating the Robustness of Optical Flow |
Jenny Schmalfuss, Philipp Scholze, Andrés Bruhn |
|
code |
-1 |
Robust Landmark-Based Stent Tracking in X-ray Fluoroscopy |
Luojie Huang, Yikang Liu, Li Chen, Eric Z. Chen, Xiao Chen, Shanhui Sun |
|
code |
-1 |
Social ODE: Multi-agent Trajectory Forecasting with Neural Ordinary Differential Equations |
Song Wen, Hao Wang, Dimitris N. Metaxas |
|
code |
-1 |
Social-SSL: Self-supervised Cross-Sequence Representation Learning Based on Transformers for Multi-agent Trajectory Prediction |
LiWu Tsao, YanKai Wang, HaoSiang Lin, HongHan Shuai, LaiKuan Wong, WenHuang Cheng |
|
code |
-1 |
Diverse Human Motion Prediction Guided by Multi-level Spatial-Temporal Anchors |
Sirui Xu, YuXiong Wang, LiangYan Gui |
|
code |
-1 |
Learning Pedestrian Group Representations for Multi-modal Trajectory Prediction |
Inhwan Bae, JinHwi Park, HaeGon Jeon |
|
code |
-1 |
Sequential Multi-view Fusion Network for Fast LiDAR Point Motion Estimation |
Gang Zhang, Xiaoyan Li, Zhenhua Wang |
|
code |
-1 |
E-Graph: Minimal Solution for Rigid Rotation with Extensibility Graphs |
Yanyan Li, Federico Tombari |
|
code |
-1 |
Point Cloud Compression with Range Image-Based Entropy Model for Autonomous Driving |
Sukai Wang, Ming Liu |
|
code |
-1 |
Joint Feature Learning and Relation Modeling for Tracking: A One-Stream Framework |
Botao Ye, Hong Chang, Bingpeng Ma, Shiguang Shan, Xilin Chen |
|
code |
-1 |
MotionCLIP: Exposing Human Motion Generation to CLIP Space |
Guy Tevet, Brian Gordon, Amir Hertz, Amit H. Bermano, Daniel CohenOr |
|
code |
-1 |
Backbone is All Your Need: A Simplified Architecture for Visual Object Tracking |
Boyu Chen, Peixia Li, Lei Bai, Lei Qiao, Qiuhong Shen, Bo Li, Weihao Gan, Wei Wu, Wanli Ouyang |
|
code |
-1 |
Aware of the History: Trajectory Forecasting with the Local Behavior Data |
Yiqi Zhong, Zhenyang Ni, Siheng Chen, Ulrich Neumann |
|
code |
-1 |
Optical Flow Training Under Limited Label Budget via Active Learning |
Shuai Yuan, Xian Sun, Hannah Halin Kim, Shuzhi Yu, Carlo Tomasi |
|
code |
-1 |
Hierarchical Feature Embedding for Visual Tracking |
Zhixiong Pi, Weitao Wan, Chong Sun, Changxin Gao, Nong Sang, Chen Li |
|
code |
-1 |
Tackling Background Distraction in Video Object Segmentation |
Suhwan Cho, Heansung Lee, Minhyeok Lee, Chaewon Park, Sungjun Jang, Minjung Kim, Sangyoun Lee |
|
code |
-1 |
Social-Implicit: Rethinking Trajectory Prediction Evaluation and The Effectiveness of Implicit Maximum Likelihood Estimation |
Abduallah A. Mohamed, Deyao Zhu, Warren Vu, Mohamed Elhoseiny, Christian G. Claudel |
|
code |
-1 |
TEMOS: Generating Diverse Human Motions from Textual Descriptions |
Mathis Petrovich, Michael J. Black, Gül Varol |
|
code |
-1 |
Tracking Every Thing in the Wild |
Siyuan Li, Martin Danelljan, Henghui Ding, Thomas E. Huang, Fisher Yu |
|
code |
-1 |
HULC: 3D HUman Motion Capture with Pose Manifold SampLing and Dense Contact Guidance |
Soshi Shimada, Vladislav Golyanik, Zhi Li, Patrick Pérez, Weipeng Xu, Christian Theobalt |
|
code |
-1 |
Towards Sequence-Level Training for Visual Tracking |
Minji Kim, Seungkwan Lee, Jungseul Ok, Bohyung Han, Minsu Cho |
|
code |
-1 |
Learned Monocular Depth Priors in Visual-Inertial Initialization |
Yunwen Zhou, Abhishek Kar, Eric Turner, Adarsh Kowdle, Chao X. Guo, Ryan C. DuToit, Konstantine Tsotsos |
|
code |
-1 |
Robust Visual Tracking by Segmentation |
Matthieu Paul, Martin Danelljan, Christoph Mayer, Luc Van Gool |
|
code |
-1 |
MeshLoc: Mesh-Based Visual Localization |
Vojtech Panek, Zuzana Kukelova, Torsten Sattler |
|
code |
-1 |
S2F2: Single-Stage Flow Forecasting for Future Multiple Trajectories Prediction |
YuWen Chen, HsuanKung Yang, ChuChi Chiu, ChunYi Lee |
|
code |
-1 |
Large-Displacement 3D Object Tracking with Hybrid Non-local Optimization |
Xuhui Tian, Xinran Lin, Fan Zhong, Xueying Qin |
|
code |
-1 |
FEAR: Fast, Efficient, Accurate and Robust Visual Tracker |
Vasyl Borsuk, Roman Vei, Orest Kupyn, Tetiana Martyniuk, Igor Krashenyi, Jiri Matas |
|
code |
-1 |
PREF: Predictability Regularized Neural Motion Fields |
Liangchen Song, Xuan Gong, Benjamin Planche, Meng Zheng, David S. Doermann, Junsong Yuan, Terrence Chen, Ziyan Wu |
|
code |
-1 |
View Vertically: A Hierarchical Network for Trajectory Prediction via Fourier Spectrums |
Conghao Wong, Beihao Xia, Ziming Hong, Qinmu Peng, Wei Yuan, Qiong Cao, Yibo Yang, Xinge You |
|
code |
-1 |
HVC-Net: Unifying Homography, Visibility, and Confidence Learning for Planar Object Tracking |
Haoxian Zhang, Yonggen Ling |
|
code |
-1 |
RamGAN: Region Attentive Morphing GAN for Region-Level Makeup Transfer |
Jianfeng Xiang, Junliang Chen, Wenshuang Liu, Xianxu Hou, Linlin Shen |
|
code |
-1 |
SinNeRF: Training Neural Radiance Fields on Complex Scenes from a Single Image |
Dejia Xu, Yifan Jiang, Peihao Wang, Zhiwen Fan, Humphrey Shi, Zhangyang Wang |
|
code |
-1 |
Entropy-Driven Sampling and Training Scheme for Conditional Diffusion Generation |
Guangcong Zheng, Shengming Li, Hui Wang, Taiping Yao, Yang Chen, Shouhong Ding, Xi Li |
|
code |
-1 |
Making the Most of Text Semantics to Improve Biomedical Vision-Language Processing |
Benedikt Boecking, Naoto Usuyama, Shruthi Bannur, Daniel C. Castro, Anton Schwaighofer, Stephanie L. Hyland, Maria Wetscherek, Tristan Naumann, Aditya V. Nori, Javier AlvarezValle, Hoifung Poon, Ozan Oktay |
|
code |
-1 |
Generative Negative Text Replay for Continual Vision-Language Pretraining |
Shipeng Yan, Lanqing Hong, Hang Xu, Jianhua Han, Tinne Tuytelaars, Zhenguo Li, Xuming He |
|
code |
-1 |
Video Graph Transformer for Video Question Answering |
Junbin Xiao, Pan Zhou, TatSeng Chua, Shuicheng Yan |
|
code |
-1 |
Trace Controlled Text to Image Generation |
Kun Yan, Lei Ji, Chenfei Wu, Jianmin Bao, Ming Zhou, Nan Duan, Shuai Ma |
|
code |
-1 |
Video Question Answering with Iterative Video-Text Co-tokenization |
A. J. Piergiovanni, Kairo Morton, Weicheng Kuo, Michael S. Ryoo, Anelia Angelova |
|
code |
-1 |
Rethinking Data Augmentation for Robust Visual Question Answering |
Long Chen, Yuhang Zheng, Jun Xiao |
|
code |
-1 |
Explicit Image Caption Editing |
Zhen Wang, Long Chen, Wenbo Ma, Guangxing Han, Yulei Niu, Jian Shao, Jun Xiao |
|
code |
-1 |
Can Shuffling Video Benefit Temporal Bias Problem: A Novel Training Framework for Temporal Grounding |
Jiachang Hao, Haifeng Sun, Pengfei Ren, Jingyu Wang, Qi Qi, Jianxin Liao |
|
code |
-1 |
Reliable Visual Question Answering: Abstain Rather Than Answer Incorrectly |
Spencer Whitehead, Suzanne Petryk, Vedaad Shakib, Joseph Gonzalez, Trevor Darrell, Anna Rohrbach, Marcus Rohrbach |
|
code |
-1 |
GRIT: Faster and Better Image Captioning Transformer Using Dual Visual Features |
VanQuang Nguyen, Masanori Suganuma, Takayuki Okatani |
|
code |
-1 |
Selective Query-Guided Debiasing for Video Corpus Moment Retrieval |
Sunjae Yoon, Ji Woo Hong, Eunseop Yoon, Dahyun Kim, Junyeong Kim, Hee Suk Yoon, Chang D. Yoo |
|
code |
-1 |
Spatial and Visual Perspective-Taking via View Rotation and Relation Reasoning for Embodied Reference Understanding |
Cheng Shi, Sibei Yang |
|
code |
-1 |
Object-Centric Unsupervised Image Captioning |
Zihang Meng, David Yang, Xuefei Cao, Ashish Shah, SerNam Lim |
|
code |
-1 |
Contrastive Vision-Language Pre-training with Limited Resources |
Quan Cui, Boyan Zhou, Yu Guo, Weidong Yin, Hao Wu, Osamu Yoshie, Yubo Chen |
|
code |
-1 |
Learning Linguistic Association Towards Efficient Text-Video Retrieval |
Sheng Fang, Shuhui Wang, Junbao Zhuo, Xinzhe Han, Qingming Huang |
|
code |
-1 |
ASSISTER: Assistive Navigation via Conditional Instruction Generation |
Zanming Huang, Zhongkai Shangguan, Jimuyang Zhang, Gilad Bar, Matthew Boyd, Eshed OhnBar |
|
code |
-1 |
X-DETR: A Versatile Architecture for Instance-wise Vision-Language Tasks |
Zhaowei Cai, Gukyeong Kwon, Avinash Ravichandran, Erhan Bas, Zhuowen Tu, Rahul Bhotika, Stefano Soatto |
|
code |
-1 |
Learning Disentanglement with Decoupled Labels for Vision-Language Navigation |
Wenhao Cheng, Xingping Dong, Salman H. Khan, Jianbing Shen |
|
code |
-1 |
Switch-BERT: Learning to Model Multimodal Interactions by Switching Attention and Input |
Qingpei Guo, Kaisheng Yao, Wei Chu |
|
code |
-1 |
Word-Level Fine-Grained Story Visualization |
Bowen Li |
|
code |
-1 |
Unifying Event Detection and Captioning as Sequence Generation via Pre-training |
Qi Zhang, Yuqing Song, Qin Jin |
|
code |
-1 |
Multimodal Transformer with Variable-Length Memory for Vision-and-Language Navigation |
Chuang Lin, Yi Jiang, Jianfei Cai, Lizhen Qu, Gholamreza Haffari, Zehuan Yuan |
|
code |
-1 |
Fine-Grained Visual Entailment |
Christopher Thomas, Yipeng Zhang, ShihFu Chang |
|
code |
-1 |
Bottom Up Top Down Detection Transformers for Language Grounding in Images and Point Clouds |
Ayush Jain, Nikolaos Gkanatsios, Ishita Mediratta, Katerina Fragkiadaki |
|
code |
-1 |
New Datasets and Models for Contextual Reasoning in Visual Dialog |
Yifeng Zhang, Ming Jiang, Qi Zhao |
|
code |
-1 |
VisageSynTalk: Unseen Speaker Video-to-Speech Synthesis via Speech-Visage Feature Selection |
Joanna Hong, Minsu Kim, Yong Man Ro |
|
code |
-1 |
Classification-Regression for Chart Comprehension |
Matan Levy, Rami BenAri, Dani Lischinski |
|
code |
-1 |
AssistQ: Affordance-Centric Question-Driven Task Completion for Egocentric Assistant |
Benita Wong, Joya Chen, You Wu, Stan Weixian Lei, Dongxing Mao, Difei Gao, Mike Zheng Shou |
|
code |
-1 |
FindIt: Generalized Localization with Natural Language Queries |
Weicheng Kuo, Fred Bertsch, Wei Li, A. J. Piergiovanni, Mohammad Saffar, Anelia Angelova |
|
code |
-1 |
UniTAB: Unifying Text and Box Outputs for Grounded Vision-Language Modeling |
Zhengyuan Yang, Zhe Gan, Jianfeng Wang, Xiaowei Hu, Faisal Ahmed, Zicheng Liu, Yumao Lu, Lijuan Wang |
|
code |
-1 |
Scaling Open-Vocabulary Image Segmentation with Image-Level Labels |
Golnaz Ghiasi, Xiuye Gu, Yin Cui, TsungYi Lin |
|
code |
-1 |
The Abduction of Sherlock Holmes: A Dataset for Visual Abductive Reasoning |
Jack Hessel, Jena D. Hwang, Jae Sung Park, Rowan Zellers, Chandra Bhagavatula, Anna Rohrbach, Kate Saenko, Yejin Choi |
|
code |
-1 |
Speaker-Adaptive Lip Reading with User-Dependent Padding |
Minsu Kim, Hyunjun Kim, Yong Man Ro |
|
code |
-1 |
TISE: Bag of Metrics for Text-to-Image Synthesis Evaluation |
Tan M. Dinh, Rang Nguyen, BinhSon Hua |
|
code |
-1 |
SemAug: Semantically Meaningful Image Augmentations for Object Detection Through Language Grounding |
Morgan Heisler, Amin BanitalebiDehkordi, Yong Zhang |
|
code |
-1 |
Referring Object Manipulation of Natural Images with Conditional Classifier-Free Guidance |
Myungsub Choi |
|
code |
-1 |
NewsStories: Illustrating Articles with Visual Summaries |
Reuben Tan, Bryan A. Plummer, Kate Saenko, J. P. Lewis, Avneesh Sud, Thomas Leung |
|
code |
-1 |
Webly Supervised Concept Expansion for General Purpose Vision Models |
Amita Kamath, Christopher Clark, Tanmay Gupta, Eric Kolve, Derek Hoiem, Aniruddha Kembhavi |
|
code |
-1 |
FedVLN: Privacy-Preserving Federated Vision-and-Language Navigation |
Kaiwen Zhou, Xin Eric Wang |
|
code |
-1 |
CODER: Coupled Diversity-Sensitive Momentum Contrastive Learning for Image-Text Retrieval |
Haoran Wang, Dongliang He, Wenhao Wu, Boyang Xia, Min Yang, Fu Li, Yunlong Yu, Zhong Ji, Errui Ding, Jingdong Wang |
|
code |
-1 |
Language-Driven Artistic Style Transfer |
TsuJui Fu, Xin Eric Wang, William Yang Wang |
|
code |
-1 |
Single-Stream Multi-level Alignment for Vision-Language Pretraining |
Zaid Khan, B. G. Vijay Kumar, Xiang Yu, Samuel Schulter, Manmohan Chandraker, Yun Fu |
|
code |
-1 |
Most and Least Retrievable Images in Visual-Language Query Systems |
Liuwan Zhu, Rui Ning, Jiang Li, Chunsheng Xin, Hongyi Wu |
|
code |
-1 |
Sports Video Analysis on Large-Scale Data |
Dekun Wu, He Zhao, Xingce Bao, Richard P. Wildes |
|
code |
-1 |
Grounding Visual Representations with Texts for Domain Generalization |
Seonwoo Min, Nokyung Park, Siwon Kim, Seunghyun Park, Jinkyu Kim |
|
code |
-1 |
Bridging the Visual Semantic Gap in VLN via Semantically Richer Instructions |
Joaquín Ossandón, Benjamín Earle, Álvaro Soto |
|
code |
-1 |
StoryDALL-E: Adapting Pretrained Text-to-Image Transformers for Story Continuation |
Adyasha Maharana, Darryl Hannan, Mohit Bansal |
|
code |
-1 |
VQGAN-CLIP: Open Domain Image Generation and Editing with Natural Language Guidance |
Katherine Crowson, Stella Biderman, Daniel Kornis, Dashiell Stander, Eric Hallahan, Louis Castricato, Edward Raff |
|
code |
-1 |
Semantic-Aware Implicit Neural Audio-Driven Video Portrait Generation |
Xian Liu, Yinghao Xu, Qianyi Wu, Hang Zhou, Wayne Wu, Bolei Zhou |
|
code |
-1 |
End-to-End Active Speaker Detection |
Juan León Alcázar, Moritz Cordes, Chen Zhao, Bernard Ghanem |
|
code |
-1 |
Emotion Recognition for Multiple Context Awareness |
Dingkang Yang, Shuai Huang, Shunli Wang, Yang Liu, Peng Zhai, Liuzhen Su, Mingcheng Li, Lihua Zhang |
|
code |
-1 |
Adaptive Fine-Grained Sketch-Based Image Retrieval |
Ayan Kumar Bhunia, Aneeshan Sain, Parth Hiren Shah, Animesh Gupta, Pinaki Nath Chowdhury, Tao Xiang, YiZhe Song |
|
code |
-1 |
Quantized GAN for Complex Music Generation from Dance Videos |
Ye Zhu, Kyle Olszewski, Yu Wu, Panos Achlioptas, Menglei Chai, Yan Yan, Sergey Tulyakov |
|
code |
-1 |
Uncertainty-Aware Multi-modal Learning via Cross-Modal Random Network Prediction |
Hu Wang, Jianpeng Zhang, Yuanhong Chen, Congbo Ma, Jodie Avery, Louise Hull, Gustavo Carneiro |
|
code |
-1 |
Localizing Visual Sounds the Easy Way |
Shentong Mo, Pedro Morgado |
|
code |
-1 |
Learning Visual Styles from Audio-Visual Associations |
Tingle Li, Yichen Liu, Andrew Owens, Hang Zhao |
|
code |
-1 |
Remote Respiration Monitoring of Moving Person Using Radio Signals |
JaeHo Choi, KiBong Kang, KyungTae Kim |
|
code |
-1 |
Camera Pose Estimation and Localization with Active Audio Sensing |
Karren Yang, Michael Firman, Eric Brachmann, Clément Godard |
|
code |
-1 |
PACS: A Dataset for Physical Audiovisual CommonSense Reasoning |
Samuel Yu, Peter Wu, Paul Pu Liang, Ruslan Salakhutdinov, LouisPhilippe Morency |
|
code |
-1 |
VoViT: Low Latency Graph-Based Audio-Visual Voice Separation Transformer |
Juan F. Montesinos, Venkatesh S. Kadandale, Gloria Haro |
|
code |
-1 |
Telepresence Video Quality Assessment |
Zhenqiang Ying, Deepti Ghadiyaram, Alan C. Bovik |
|
code |
-1 |
MultiMAE: Multi-modal Multi-task Masked Autoencoders |
Roman Bachmann, David Mizrahi, Andrei Atanov, Amir Zamir |
|
code |
-1 |
AudioScopeV2: Audio-Visual Attention Architectures for Calibrated Open-Domain On-Screen Sound Separation |
Efthymios Tzinis, Scott Wisdom, Tal Remez, John R. Hershey |
|
code |
-1 |
Audio-Visual Segmentation |
Jinxing Zhou, Jianyuan Wang, Jiayi Zhang, Weixuan Sun, Jing Zhang, Stan Birchfield, Dan Guo, Lingpeng Kong, Meng Wang, Yiran Zhong |
|
code |
-1 |
Unsupervised Night Image Enhancement: When Layer Decomposition Meets Light-Effects Suppression |
Yeying Jin, Wenhan Yang, Robby T. Tan |
|
code |
-1 |
Relationformer: A Unified Framework for Image-to-Graph Generation |
Suprosanna Shit, Rajat Koner, Bastian Wittmann, Johannes C. Paetzold, Ivan Ezhov, Hongwei Li, Jiazhen Pan, Sahand Sharifzadeh, Georgios Kaissis, Volker Tresp, Bjoern H. Menze |
|
code |
-1 |
GAMa: Cross-View Video Geo-Localization |
Shruti Vyas, Chen Chen, Mubarak Shah |
|
code |
-1 |
Revisiting a kNN-Based Image Classification System with High-Capacity Storage |
Kengo Nakata, Youyang Ng, Daisuke Miyashita, Asuka Maki, YuChieh Lin, Jun Deguchi |
|
code |
-1 |
Geometric Representation Learning for Document Image Rectification |
Hao Feng, Wengang Zhou, Jiajun Deng, Yuechen Wang, Houqiang Li |
|
code |
-1 |
S2-VER: Semi-supervised Visual Emotion Recognition |
Guoli Jia, Jufeng Yang |
|
code |
-1 |
Image Coding for Machines with Omnipotent Feature Learning |
Ruoyu Feng, Xin Jin, Zongyu Guo, Runsen Feng, Yixin Gao, Tianyu He, Zhizheng Zhang, Simeng Sun, Zhibo Chen |
|
code |
-1 |
Feature Representation Learning for Unsupervised Cross-Domain Image Retrieval |
Conghui Hu, Gim Hee Lee |
|
code |
-1 |
Fashionformer: A Simple, Effective and Unified Baseline for Human Fashion Segmentation and Recognition |
Shilin Xu, Xiangtai Li, Jingbo Wang, Guangliang Cheng, Yunhai Tong, Dacheng Tao |
|
code |
-1 |
Semantic-Guided Multi-mask Image Harmonization |
Xuqian Ren, Yifan Liu |
|
code |
-1 |
Learning an Isometric Surface Parameterization for Texture Unwrapping |
Sagnik Das, Ke Ma, Zhixin Shu, Dimitris Samaras |
|
code |
-1 |
Towards Regression-Free Neural Networks for Diverse Compute Platforms |
Rahul Duggal, Hao Zhou, Shuo Yang, Jun Fang, Yuanjun Xiong, Wei Xia |
|
code |
-1 |
Relationship Spatialization for Depth Estimation |
Xiaoyu Xu, Jiayan Qiu, Xinchao Wang, Zhou Wang |
|
code |
-1 |
Image2Point: 3D Point-Cloud Understanding with 2D Image Pretrained Models |
Chenfeng Xu, Shijia Yang, Tomer Galanti, Bichen Wu, Xiangyu Yue, Bohan Zhai, Wei Zhan, Peter Vajda, Kurt Keutzer, Masayoshi Tomizuka |
|
code |
-1 |
FAR: Fourier Aerial Video Recognition |
Divya Kothandaraman, Tianrui Guan, Xijun Wang, Shuowen Hu, Ming C. Lin, Dinesh Manocha |
|
code |
-1 |
Translating a Visual LEGO Manual to a Machine-Executable Plan |
Ruocheng Wang, Yunzhi Zhang, Jiayuan Mao, ChinYi Cheng, Jiajun Wu |
|
code |
-1 |
Fabric Material Recovery from Video Using Multi-scale Geometric Auto-Encoder |
Junbang Liang, Ming C. Lin |
|
code |
-1 |
MegBA: A GPU-Based Distributed Library for Large-Scale Bundle Adjustment |
Jie Ren, Wenteng Liang, Ran Yan, Luo Mai, Shiwen Liu, Xiao Liu |
|
code |
-1 |
The One Where They Reconstructed 3D Humans and Environments in TV Shows |
Georgios Pavlakos, Ethan Weber, Matthew Tancik, Angjoo Kanazawa |
|
code |
-1 |
SITTA: Single Image Texture Translation for Data Augmentation |
Boyi Li, Yin Cui, TsungYi Lin, Serge J. Belongie |
|
code |
-1 |
Learning from Noisy Labels with Coarse-to-Fine Sample Credibility Modeling |
Boshen Zhang, Yuxi Li, Yuanpeng Tu, Jinlong Peng, Yabiao Wang, Cunlin Wu, Yang Xiao, Cairong Zhao |
|
code |
-1 |
PLMCL: Partial-Label Momentum Curriculum Learning for Multi-label Image Classification |
Rabab Abdelfattah, Xin Zhang, Zhenyao Wu, Xinyi Wu, Xiaofeng Wang, Song Wang |
|
code |
-1 |
Open-Vocabulary Semantic Segmentation Using Test-Time Distillation |
Nir Zabari, Yedid Hoshen |
|
code |
-1 |
SW-VAE: Weakly Supervised Learn Disentangled Representation via Latent Factor Swapping |
Jiageng Zhu, Hanchen Xie, Wael AbdAlmageed |
|
code |
-1 |
Learning Multiple Probabilistic Degradation Generators for Unsupervised Real World Image Super Resolution |
Sangyun Lee, Sewoong Ahn, Kwangjin Yoon |
|
code |
-1 |
Out-of-Distribution Detection Without Class Labels |
Niv Cohen, Ron Abutbul, Yedid Hoshen |
|
code |
-1 |
Unsupervised Domain Adaptive Object Detection with Class Label Shift Weighted Local Features |
Andong Tan, Niklas Hanselmann, Shuxiao Ding, Federico Tombari, Marius Cordts |
|
code |
-1 |
OpenCoS: Contrastive Semi-supervised Learning for Handling Open-Set Unlabeled Data |
Jongjin Park, Sukmin Yun, Jongheon Jeong, Jinwoo Shin |
|
code |
-1 |
Semi-supervised Domain Adaptation by Similarity Based Pseudo-Label Injection |
Abhay Rawat, Isha Dua, Saurav Gupta, Rahul Tallamraju |
|
code |
-1 |
Evaluating Image Super-Resolution Performance on Mobile Devices: An Online Benchmark |
Xindong Zhang, Hui Zeng, Lei Zhang |
|
code |
-1 |
Style Adaptive Semantic Image Editing with Transformers |
Edward Günther, Rui Gong, Luc Van Gool |
|
code |
-1 |
Third Time's the Charm? Image and Video Editing with StyleGAN3 |
Yuval Alaluf, Or Patashnik, Zongze Wu, Asif Zamir, Eli Shechtman, Dani Lischinski, Daniel CohenOr |
|
code |
-1 |
CNSNet: A Cleanness-Navigated-Shadow Network for Shadow Removal |
Qianhao Yu, Naishan Zheng, Jie Huang, Feng Zhao |
|
code |
-1 |
Unifying Conditional and Unconditional Semantic Image Synthesis with OCO-GAN |
Marlène Careil, Stéphane Lathuilière, Camille Couprie, Jakob Verbeek |
|
code |
-1 |
Efficient Image Super-Resolution Using Vast-Receptive-Field Attention |
Lin Zhou, Haoming Cai, Jinjin Gu, Zheyuan Li, Yingqi Liu, Xiangyu Chen, Yu Qiao, Chao Dong |
|
code |
-1 |
Unsupervised Scene Sketch to Photo Synthesis |
Jiayun Wang, Sangryul Jeon, Stella X. Yu, Xi Zhang, Himanshu Arora, Yu Lou |
|
code |
-1 |
U-shape Transformer for Underwater Image Enhancement |
Lintao Peng, Chunli Zhu, Liheng Bian |
|
code |
-1 |
Hybrid Transformer Based Feature Fusion for Self-Supervised Monocular Depth Estimation |
Snehal Singh Tomar, Maitreya Suin, A. N. Rajagopalan |
|
code |
-1 |
Towards Real-World Video Deblurring by Exploring Blur Formation Process |
Mingdeng Cao, Zhihang Zhong, Yanbo Fan, Jiahao Wang, Yong Zhang, Jue Wang, Yujiu Yang, Yinqiang Zheng |
|
code |
-1 |
Unified Transformer Network for Multi-Weather Image Restoration |
Ashutosh Kulkarni, Shruti S. Phutke, Subrahmanyam Murala |
|
code |
-1 |
DSR: Towards Drone Image Super-Resolution |
Xiaoyu Lin, Baran Ozaydin, Vidit Vidit, Majed El Helou, Sabine Süsstrunk |
|
code |
-1 |
CEN-HDR: Computationally Efficient Neural Network for Real-Time High Dynamic Range Imaging |
Steven Tel, Barthélémy Heyrman, Dominique Ginhac |
|
code |
-1 |
Image Super-Resolution with Deep Variational Autoencoders |
Darius Chira, Ilian Haralampiev, Ole Winther, Andrea Dittadi, Valentin Liévin |
|
code |
-1 |
Light Field Angular Super-Resolution via Dense Correspondence Field Reconstruction |
Yu Mo, Yingqian Wang, Longguang Wang, JunGang Yang, Wei An |
|
code |
-1 |
Adaptive Mask-Based Pyramid Network for Realistic Bokeh Rendering |
Konstantinos Georgiadis, Albert SaàGarriga, Mehmet Kerim Yucel, Anastasios Drosou, Bruno Manganelli |
|
code |
-1 |
RISPNet: A Network for Reversed Image Signal Processing |
Xiaoyi Dong, Yu Zhu, Chenghua Li, Peisong Wang, Jian Cheng |
|
code |
-1 |
CIDBNet: A Consecutively-Interactive Dual-Branch Network for JPEG Compressed Image Super-Resolution |
Xiaoran Qin, Yu Zhu, Chenghua Li, Peisong Wang, Jian Cheng |
|
code |
-1 |
XCAT - Lightweight Quantized Single Image Super-Resolution Using Heterogeneous Group Convolutions and Cross Concatenation |
Mustafa Ayazoglu, Bahri Batuhan Bilecen |
|
code |
-1 |
Learned Reverse ISP with Soft Supervision |
Beiji Zou, Yue Zhang |
|
code |
-1 |
LiteDepth: Digging into Fast and Accurate Depth Estimation on Mobile Devices |
Zhenyu Li, Zehui Chen, Jialei Xu, Xianming Liu, Junjun Jiang |
|
code |
-1 |
MSSNet: Multi-Scale-Stage Network for Single Image Deblurring |
Kiyeon Kim, Seungyong Lee, Sunghyun Cho |
|
code |
-1 |
RCBSR: Re-parameterization Convolution Block for Super-Resolution |
Si Gao, Chengjian Zheng, Xiaofeng Zhang, Shaoli Liu, Biao Wu, Kaidi Lu, Diankai Zhang, Ning Wang |
|
code |
-1 |
Multi-patch Learning: Looking More Pixels in the Training Phase |
Lei Li, Jingzhu Tang, Ming Chen, Shijie Zhao, Junlin Li, Li Zhang |
|
code |
-1 |
Fast Nearest Convolution for Real-Time Efficient Image Super-Resolution |
Ziwei Luo, Youwei Li, Lei Yu, Qi Wu, Zhihong Wen, Haoqiang Fan, Shuaicheng Liu |
|
code |
-1 |
Real-Time Channel Mixing Net for Mobile Image Super-Resolution |
Garas Gendy, Nabil Sabor, Jingchao Hou, Guanghui He |
|
code |
-1 |
Sliding Window Recurrent Network for Efficient Video Super-Resolution |
Wenyi Lian, Wenjing Lian |
|
code |
-1 |
EESRNet: A Network for Energy Efficient Super-Resolution |
Shijie Yue, Chenghua Li, Zhengyang Zhuge, Ruixia Song |
|
code |
-1 |
Bokeh-Loss GAN: Multi-stage Adversarial Training for Realistic Edge-Aware Bokeh |
Brian Lee, Fei Lei, Huaijin G. Chen, Alexis Baudron |
|
code |
-1 |
Residual Feature Distillation Channel Spatial Attention Network for ISP on Smartphone |
Jiesi Zheng, Zhihao Fan, Xun Wu, Yaqi Wu, Feng Zhang |
|
code |
-1 |
HST: Hierarchical Swin Transformer for Compressed Image Super-Resolution |
Bingchen Li, Xin Li, Yiting Lu, Sen Liu, Ruoyu Feng, Zhibo Chen |
|
code |
-1 |
Swin2SR: SwinV2 Transformer for Compressed Image Super-Resolution and Restoration |
Marcos V. Conde, UiJin Choi, Maxime Burchi, Radu Timofte |
|
code |
-1 |
Reversing Image Signal Processors by Reverse Style Transferring |
Furkan Kinli, Baris Özcan, Furkan Kiraç |
|
code |
-1 |
Overexposure Mask Fusion: Generalizable Reverse ISP Multi-step Refinement |
Jinha Kim, Jun Jiang, Jinwei Gu |
|
code |
-1 |
CAIR: Fast and Lightweight Multi-scale Color Attention Network for Instagram Filter Removal |
WoonHa Yeo, WangTaek Oh, KyungSu Kang, YoungIl Kim, HanCheol Ryu |
|
code |
-1 |
MicroISP: Processing 32MP Photos on Mobile Devices with Deep Learning |
Andrey Ignatov, Anastasia Sycheva, Radu Timofte, Yu Tseng, YuSyuan Xu, PoHsiang Yu, ChengMing Chiang, HsienKai Kuo, MinHung Chen, ChiaMing Cheng, Luc Van Gool |
|
code |
-1 |
Real-Time Under-Display Cameras Image Restoration and HDR on Mobile Devices |
Marcos V. Conde, FlorinAlexandru Vasluianu, Sabari Nathan, Radu Timofte |
|
code |
-1 |
Globally Optimal Event-Based Divergence Estimation for Ventral Landing |
Sofia McLeod, Gabriele Meoni, Dario Izzo, Anne Mergy, Daqi Liu, Yasir Latif, Ian D. Reid, TatJun Chin |
|
code |
-1 |
Transfer Learning for On-Orbit Ship Segmentation |
Vincenzo Fanizza, David Rijlaarsdam, Pablo Tomás Toledano González, José Luis EspinosaAranda |
|
code |
-1 |
Spacecraft Pose Estimation Based on Unsupervised Domain Adaptation and on a 3D-Guided Loss Combination |
Juan Ignacio Bravo PérezVillar, Álvaro GarcíaMartín, Jesús Bescós |
|
code |
-1 |
MaRF: Representing Mars as Neural Radiance Fields |
Lorenzo Giusti, Josue Garcia, Steven Cozine, Darrick Suen, Christina Nguyen, Ryan Alimo |
|
code |
-1 |
Asynchronous Kalman Filter for Event-Based Star Tracking |
Yonhon Ng, Yasir Latif, TatJun Chin, Robert E. Mahony |
|
code |
-1 |
Using Moffat Profiles to Register Astronomical Images |
Mason Schuckman, Roy Prouty, David Chapman, Don Engel |
|
code |
-1 |
Mixed-Domain Training Improves Multi-mission Terrain Segmentation |
Grace Vincent, Alice Yepremyan, Jingdao Chen, Edwin Goh |
|
code |
-1 |
CubeSat-CDT: A Cross-Domain Dataset for 6-DoF Trajectory Estimation of a Symmetric Spacecraft |
Mohamed Adel Musallam, Arunkumar Rathinam, Vincent Gaudillière, Miguel Ortiz del Castillo, Djamila Aouada |
|
code |
-1 |
Data Lifecycle Management in Evolving Input Distributions for Learning-based Aerospace Applications |
Somrita Banerjee, Apoorva Sharma, Edward Schmerling, Max Spolaor, Michael Nemerouf, Marco Pavone |
|
code |
-1 |
Strong Gravitational Lensing Parameter Estimation with Vision Transformer |
KuanWei Huang, Geoff ChihFan Chen, PoWen Chang, ShengChieh Lin, ChiaJung Hsu, Vishal Thengane, Joshua YaoYu Lin |
|
code |
-1 |
End-to-end Neural Estimation of Spacecraft Pose with Intermediate Detection of Keypoints |
Antoine Legrand, Renaud Detry, Christophe De Vleeschouwer |
|
code |
-1 |
Improving Contrastive Learning on Visually Homogeneous Mars Rover Images |
Isaac Ronald Ward, Charles Moore, Kai Pak, Jingdao Chen, Edwin Goh |
|
code |
-1 |
Monocular 6-DoF Pose Estimation for Non-cooperative Spacecrafts Using Riemannian Regression Network |
Sunhao Chu, Yuxiao Duan, Klaus Schilling, Shufan Wu |
|
code |
-1 |
HyperNST: Hyper-Networks for Neural Style Transfer |
Dan Ruta, Andrew Gilbert, Saeid Motiian, Baldo Faieta, Zhe Lin, John P. Collomosse |
|
code |
-1 |
DEArt: Dataset of European Art |
Artem Reshetnikov, MariaCristina V. Marinescu, Joaquim Moré López |
|
code |
-1 |
How Well Do Vision Transformers (VTs) Transfer to the Non-natural Image Domain? An Empirical Study Involving Art Classification |
Vincent Tonkes, Matthia Sabatelli |
|
code |
-1 |
On-the-Go Reflectance Transformation Imaging with Ordinary Smartphones |
Mara Pistellato, Filippo Bergamasco |
|
code |
-1 |
Is GPT-3 All You Need for Visual Question Answering in Cultural Heritage? |
Pietro Bongini, Federico Becattini, Alberto Del Bimbo |
|
code |
-1 |
Automatic Analysis of Human Body Representations in Western Art |
Shu Zhao, Alkim Almila Akdag Salah, Albert Ali Salah |
|
code |
-1 |
ArtFacePoints: High-Resolution Facial Landmark Detection in Paintings and Prints |
Aline Sindel, Andreas Maier, Vincent Christlein |
|
code |
-1 |
TransPatch: A Transformer-based Generator for Accelerating Transferable Patch Generation in Adversarial Attacks Against Object Detection Models |
Jinghao Wang, Chenling Cui, Xuejun Wen, Jie Shi |
|
code |
-1 |
Feature-Level Augmentation to Improve Robustness of Deep Neural Networks to Affine Transformations |
Adrian Sandru, MarianaIuliana Georgescu, Radu Tudor Ionescu |
|
code |
-1 |
Benchmarking Robustness Beyond lp Norm Adversaries |
Akshay Agarwal, Nalini K. Ratha, Mayank Vatsa, Richa Singh |
|
code |
-1 |
Masked Faces with Faced Masks |
Jiayi Zhu, Qing Guo, Felix JuefeiXu, Yihao Huang, Yang Liu, Geguang Pu |
|
code |
-1 |
Adversarially Robust Panoptic Segmentation (ARPaS) Benchmark |
Laura Alexandra Daza, Jordi PontTuset, Pablo Arbeláez |
|
code |
-1 |
BadDet: Backdoor Attacks on Object Detection |
ShihHan Chan, Yinpeng Dong, Jun Zhu, Xiaolu Zhang, Jun Zhou |
|
code |
-1 |
Universal, Transferable Adversarial Perturbations for Visual Object Trackers |
Krishna Kanth Nakka, Mathieu Salzmann |
|
code |
-1 |
Why Is the Video Analytics Accuracy Fluctuating, and What Can We Do About It? |
Sibendu Paul, Kunal Rao, Giuseppe Coviello, Murugan Sankaradas, Oliver Po, Y. Charlie Hu, Srimat Chakradhar |
|
code |
-1 |
SkeleVision: Towards Adversarial Resiliency of Person Tracking with Multi-Task Learning |
Nilaksh Das, ShengYun Peng, Duen Horng Chau |
|
code |
-1 |
Unrestricted Black-Box Adversarial Attack Using GAN with Limited Queries |
Dongbin Na, Sangwoo Ji, Jong Kim |
|
code |
-1 |
Truth-Table Net: A New Convolutional Architecture Encodable by Design into SAT Formulas |
Adrien Benamira, Thomas Peyrin, Bryan Hooi KuenYew |
|
code |
-1 |
Attribution-Based Confidence Metric for Detection of Adversarial Attacks on Breast Histopathological Images |
Steven Lawrence Fernandes, Senka Krivic, Poonam Sharma, Sumit Kumar Jha |
|
code |
-1 |
Improving Adversarial Robustness by Penalizing Natural Accuracy |
Kshitij Chandna |
|
code |
-1 |
4D-StOP: Panoptic Segmentation of 4D LiDAR Using Spatio-Temporal Object Proposal Generation and Aggregation |
Lars Kreuzberg, Idil Esen Zulfikar, Sabarinath Mahadevan, Francis Engelmann, Bastian Leibe |
|
code |
-1 |
BlindSpotNet: Seeing Where We Cannot See |
Taichi Fukuda, Kotaro Hasegawa, Shinya Ishizaki, Shohei Nobuhara, Ko Nishino |
|
code |
-1 |
Gesture Recognition with Keypoint and Radar Stream Fusion for Automated Vehicles |
Adrian Holzbock, Nicolai Kern, Christian Waldschmidt, Klaus Dietmayer, Vasileios Belagiannis |
|
code |
-1 |
An Improved Lightweight Network Based on YOLOv5s for Object Detection in Autonomous Driving |
Guofa Li, Yingjie Zhang, Delin Ouyang, Xingda Qu |
|
code |
-1 |
Plausibility Verification for 3D Object Detectors Using Energy-Based Optimization |
Abhishek Vivekanandan, Niels Maier, J. Marius Zöllner |
|
code |
-1 |
Lane Change Classification and Prediction with Action Recognition Networks |
Kai Liang, Jun Wang, Abhir Bhalerao |
|
code |
-1 |
Joint Prediction of Amodal and Visible Semantic Segmentation for Automated Driving |
Jasmin Breitenstein, Jonas Löhdefink, Tim Fingscheidt |
|
code |
-1 |
Human-Vehicle Cooperative Visual Perception for Autonomous Driving Under Complex Traffic Environments |
Yiyue Zhao, Cailin Lei, Yu Shen, Yuchuan Du, Qijun Chen |
|
code |
-1 |
MCIP: Multi-Stream Network for Pedestrian Crossing Intention Prediction |
JeSeok Ham, Kangmin Bae, Jinyoung Moon |
|
code |
-1 |
SimpleTrack: Understanding and Rethinking 3D Multi-object Tracking |
Ziqi Pang, Zhichao Li, Naiyan Wang |
|
code |
-1 |
Ego-Motion Compensation of Range-Beam-Doppler Radar Data for Object Detection |
Michael Meyer, Marc Unzueta, Georg Kuschk, Sven Tomforde |
|
code |
-1 |
RPR-Net: A Point Cloud-Based Rotation-Aware Large Scale Place Recognition Network |
Zhaoxin Fan, Zhenbo Song, Wenping Zhang, Hongyan Liu, Jun He, Xiaoyong Du |
|
code |
-1 |
Learning 3D Semantics From Pose-Noisy 2D Images with Hierarchical Full Attention Network |
Yuhang He, Lin Chen, Junkun Xie, Long Chen |
|
code |
-1 |
SIM2E: Benchmarking the Group Equivariant Capability of Correspondence Matching Algorithms |
Shuai Su, Zhongkai Zhao, Yixin Fei, Shuda Li, Qijun Chen, Rui Fan |
|
code |
-1 |
Talisman: Targeted Active Learning for Object Detection with Rare Classes and Slices Using Submodular Mutual Information |
Suraj Kothawade, Saikat Ghosh, Sumit Shekhar, Yu Xiang, Rishabh K. Iyer |
|
code |
-1 |
An Efficient Person Clustering Algorithm for Open Checkout-free Groceries |
Junde Wu, Yu Zhang, Rao Fu, Yuanpei Liu, Jing Gao |
|
code |
-1 |
POP: Mining POtential Performance of New Fashion Products via Webly Cross-modal Query Expansion |
Christian Joppi, Geri Skenderi, Marco Cristani |
|
code |
-1 |
Pose Forecasting in Industrial Human-Robot Collaboration |
Alessio Sampieri, Guido Maria D'Amely di Melendugno, Andrea Avogaro, Federico Cunico, Francesco Setti, Geri Skenderi, Marco Cristani, Fabio Galasso |
|
code |
-1 |
Actor-Centered Representations for Action Localization in Streaming Videos |
Sathyanarayanan N. Aakur, Sudeep Sarkar |
|
code |
-1 |
Bandwidth-Aware Adaptive Codec for DNN Inference Offloading in IoT |
Xiufeng Xie, Ning Zhou, Wentao Zhu, Ji Liu |
|
code |
-1 |
Domain Knowledge-Informed Self-supervised Representations for Workout Form Assessment |
Paritosh Parmar, Amol Gharat, Helge Rhodin |
|
code |
-1 |
Responsive Listening Head Generation: A Benchmark Dataset and Baseline |
Mohan Zhou, Yalong Bai, Wei Zhang, Ting Yao, Tiejun Zhao, Tao Mei |
|
code |
-1 |
Towards Scale-Aware, Robust, and Generalizable Unsupervised Monocular Depth Estimation by Integrating IMU Motion Dynamics |
Sen Zhang, Jing Zhang, Dacheng Tao |
|
code |
-1 |
TIPS: Text-Induced Pose Synthesis |
Prasun Roy, Subhankar Ghosh, Saumik Bhattacharya, Umapada Pal, Michael Blumenstein |
|
code |
-1 |
Addressing Heterogeneity in Federated Learning via Distributional Transformation |
Haolin Yuan, Bo Hui, Yuchen Yang, Philippe Burlina, Neil Zhenqiang Gong, Yinzhi Cao |
|
code |
-1 |
Where in the World Is This Image? Transformer-Based Geo-localization in the Wild |
Shraman Pramanick, Ewa Magdalena Nowara, Joshua Gleason, Carlos Domingo Castillo, Rama Chellappa |
|
code |
-1 |
Colorization for in situ Marine Plankton Images |
Guannan Guo, Qi Lin, Tao Chen, Zhenghui Feng, Zheng Wang, Jianping Li |
|
code |
-1 |
Efficient Deep Visual and Inertial Odometry with Adaptive Visual Modality Selection |
Mingyu Yang, Yu Chen, HunSeok Kim |
|
code |
-1 |
A Sketch is Worth a Thousand Words: Image Retrieval with Text and Sketch |
Patsorn Sangkloy, Wittawat Jitkrittum, Diyi Yang, James Hays |
|
code |
-1 |
A Cloud 3D Dataset and Application-Specific Learned Image Compression in Cloud 3D |
Tianyi Liu, Sen He, Vinodh Kumaran Jayakumar, Wei Wang |
|
code |
-1 |
AutoTransition: Learning to Recommend Video Transition Effects |
Yaojie Shen, Libo Zhang, Kai Xu, Xiaojie Jin |
|
code |
-1 |
Online Segmentation of LiDAR Sequences: Dataset and Algorithm |
Romain Loiseau, Mathieu Aubry, Loïc Landrieu |
|
code |
-1 |
Open-world Semantic Segmentation for LIDAR Point Clouds |
Jun Cen, Peng Yun, Shiwei Zhang, Junhao Cai, Di Luan, Mingqian Tang, Ming Liu, Michael Yu Wang |
|
code |
-1 |
KING: Generating Safety-Critical Driving Scenarios for Robust Imitation via Kinematics Gradients |
Niklas Hanselmann, Katrin Renz, Kashyap Chitta, Apratim Bhattacharyya, Andreas Geiger |
|
code |
-1 |
Differentiable Raycasting for Self-Supervised Occupancy Forecasting |
Tarasha Khurana, Peiyun Hu, Achal Dave, Jason Ziglar, David Held, Deva Ramanan |
|
code |
-1 |
InAction: Interpretable Action Decision Making for Autonomous Driving |
Taotao Jing, Haifeng Xia, Renran Tian, Haoran Ding, Xiao Luo, Joshua E. Domeyer, Rini Sherony, Zhengming Ding |
|
code |
-1 |
CramNet: Camera-Radar Fusion with Ray-Constrained Cross-Attention for Robust 3D Object Detection |
JyhJing Hwang, Henrik Kretzschmar, Joshua Manela, Sean Rafferty, Nicholas ArmstrongCrews, Tiffany Chen, Dragomir Anguelov |
|
code |
-1 |
CODA: A Real-World Road Corner Case Dataset for Object Detection in Autonomous Driving |
Kaican Li, Kai Chen, Haoyu Wang, Lanqing Hong, Chaoqiang Ye, Jianhua Han, Yukuai Chen, Wei Zhang, Chunjing Xu, DitYan Yeung, Xiaodan Liang, Zhenguo Li, Hang Xu |
|
code |
-1 |
Motion Inspired Unsupervised Perception and Prediction in Autonomous Driving |
Mahyar Najibi, Jingwei Ji, Yin Zhou, Charles R. Qi, Xinchen Yan, Scott Ettinger, Dragomir Anguelov |
|
code |
-1 |
StretchBEV: Stretching Future Instance Prediction Spatially and Temporally |
Adil Kaan Akan, Fatma Güney |
|
code |
-1 |
RCLane: Relay Chain Prediction for Lane Detection |
Shenghua Xu, Xinyue Cai, Bin Zhao, Li Zhang, Hang Xu, Yanwei Fu, Xiangyang Xue |
|
code |
-1 |
Drive&Segment: Unsupervised Semantic Segmentation of Urban Scenes via Cross-Modal Distillation |
Antonín Vobecký, David Hurych, Oriane Siméoni, Spyros Gidaris, Andrei Bursuc, Patrick Pérez, Josef Sivic |
|
code |
-1 |
CenterFormer: Center-Based Transformer for 3D Object Detection |
Zixiang Zhou, Xiangchen Zhao, Yu Wang, Panqu Wang, Hassan Foroosh |
|
code |
-1 |
Physical Attack on Monocular Depth Estimation with Optimal Adversarial Patches |
Zhiyuan Cheng, James Liang, Hongjun Choi, Guanhong Tao, Zhiwen Cao, Dongfang Liu, Xiangyu Zhang |
|
code |
-1 |
ST-P3: End-to-End Vision-Based Autonomous Driving via Spatial-Temporal Feature Learning |
Shengchao Hu, Li Chen, Penghao Wu, Hongyang Li, Junchi Yan, Dacheng Tao |
|
code |
-1 |
PersFormer: 3D Lane Detection via Perspective Transformer and the OpenLane Benchmark |
Li Chen, Chonghao Sima, Yang Li, Zehan Zheng, Jiajie Xu, Xiangwei Geng, Hongyang Li, Conghui He, Jianping Shi, Yu Qiao, Junchi Yan |
|
code |
-1 |
PointFix: Learning to Fix Domain Bias for Robust Online Stereo Adaptation |
Kwonyoung Kim, Jungin Park, Jiyoung Lee, Dongbo Min, Kwanghoon Sohn |
|
code |
-1 |
BRNet: Exploring Comprehensive Features for Monocular Depth Estimation |
Wencheng Han, Junbo Yin, Xiaogang Jin, Xiangdong Dai, Jianbing Shen |
|
code |
-1 |
SiamDoGe: Domain Generalizable Semantic Segmentation Using Siamese Network |
Zhenyao Wu, Xinyi Wu, Xiaoping Zhang, Lili Ju, Song Wang |
|
code |
-1 |
Context-Aware Streaming Perception in Dynamic Environments |
GurEyal Sela, Ionel Gog, Justin Wong, Kumar Krishna Agrawal, Xiangxi Mo, Sukrit Kalra, Peter Schafhalter, Eric Leong, Xin Wang, Bharathan Balaji, Joseph Gonzalez, Ion Stoica |
|
code |
-1 |
SpOT: Spatiotemporal Modeling for 3D Object Tracking |
Colton Stearns, Davis Rempe, Jie Li, Rares Ambrus, Sergey Zakharov, Vitor Guizilini, Yanchao Yang, Leonidas J. Guibas |
|
code |
-1 |
Multimodal Transformer for Automatic 3D Annotation and Object Detection |
Chang Liu, Xiaoyan Qian, Binxiao Huang, Xiaojuan Qi, Edmund Y. Lam, SiewChong Tan, Ngai Wong |
|
code |
-1 |
Dynamic 3D Scene Analysis by Point Cloud Accumulation |
Shengyu Huang, Zan Gojcic, Jiahui Huang, Andreas Wieser, Konrad Schindler |
|
code |
-1 |
Homogeneous Multi-modal Feature Fusion and Interaction for 3D Object Detection |
Xin Li, Botian Shi, Yuenan Hou, Xingjiao Wu, Tianlong Ma, Yikang Li, Liang He |
|
code |
-1 |
JPerceiver: Joint Perception Network for Depth, Pose and Layout Estimation in Driving Scenes |
Haimei Zhao, Jing Zhang, Sen Zhang, Dacheng Tao |
|
code |
-1 |
Semi-supervised 3D Object Detection with Proficient Teachers |
Junbo Yin, Jin Fang, Dingfu Zhou, Liangjun Zhang, ChengZhong Xu, Jianbing Shen, Wenguan Wang |
|
code |
-1 |
Point Cloud Compression with Sibling Context and Surface Priors |
Zhili Chen, Zian Qian, Sukai Wang, Qifeng Chen |
|
code |
-1 |
Lane Detection Transformer Based on Multi-frame Horizontal and Vertical Attention and Visual Transformer Module |
Han Zhang, Yunchao Gu, Xinliang Wang, Junjun Pan, Minghui Wang |
|
code |
-1 |
ProposalContrast: Unsupervised Pre-training for LiDAR-Based 3D Object Detection |
Junbo Yin, Dingfu Zhou, Liangjun Zhang, Jin Fang, ChengZhong Xu, Jianbing Shen, Wenguan Wang |
|
code |
-1 |
PreTraM: Self-supervised Pre-training via Connecting Trajectory and Map |
Chenfeng Xu, Tian Li, Chen Tang, Lingfeng Sun, Kurt Keutzer, Masayoshi Tomizuka, Alireza Fathi, Wei Zhan |
|
code |
-1 |
Master of All: Simultaneous Generalization of Urban-Scene Segmentation to All Adverse Weather Conditions |
Nikhil Reddy, Abhinav Singhal, Abhishek Kumar, Mahsa Baktashmotlagh, Chetan Arora |
|
code |
-1 |
LESS: Label-Efficient Semantic Segmentation for LiDAR Point Clouds |
Minghua Liu, Yin Zhou, Charles R. Qi, Boqing Gong, Hao Su, Dragomir Anguelov |
|
code |
-1 |
Visual Cross-View Metric Localization with Dense Uncertainty Estimates |
Zimin Xia, Olaf Booij, Marco Manfredi, Julian F. P. Kooij |
|
code |
-1 |
V2X-ViT: Vehicle-to-Everything Cooperative Perception with Vision Transformer |
Runsheng Xu, Hao Xiang, Zhengzhong Tu, Xin Xia, MingHsuan Yang, Jiaqi Ma |
|
code |
-1 |
DevNet: Self-supervised Monocular Depth Learning via Density Volume Construction |
Kaichen Zhou, Lanqing Hong, Changhao Chen, Hang Xu, Chaoqiang Ye, Qingyong Hu, Zhenguo Li |
|
code |
-1 |
Action-Based Contrastive Learning for Trajectory Prediction |
Marah Halawa, Olaf Hellwich, Pia Bideau |
|
code |
-1 |
Radatron: Accurate Detection Using Multi-resolution Cascaded MIMO Radar |
Sohrab Madani, Jayden Guan, Waleed Ahmed, Saurabh Gupta, Haitham Hassanieh |
|
code |
-1 |
LiDAR Distillation: Bridging the Beam-Induced Domain Gap for 3D Object Detection |
Yi Wei, Zibu Wei, Yongming Rao, Jiaxin Li, Jie Zhou, Jiwen Lu |
|
code |
-1 |
Efficient Point Cloud Segmentation with Geometry-Aware Sparse Networks |
Maosheng Ye, Rui Wan, Shuangjie Xu, Tongyi Cao, Qifeng Chen |
|
code |
-1 |
FH-Net: A Fast Hierarchical Network for Scene Flow Estimation on Real-World Point Clouds |
Lihe Ding, Shaocong Dong, Tingfa Xu, Xinli Xu, Jie Wang, Jianan Li |
|
code |
-1 |
SpatialDETR: Robust Scalable Transformer-Based 3D Object Detection From Multi-view Camera Images With Global Cross-Sensor Attention |
Simon Doll, Richard Schulz, Lukas Schneider, Viviane Benzin, Markus Enzweiler, Hendrik P. A. Lensch |
|
code |
-1 |
Pixel-Wise Energy-Biased Abstention Learning for Anomaly Segmentation on Complex Urban Driving Scenes |
Yu Tian, Yuyuan Liu, Guansong Pang, Fengbei Liu, Yuanhong Chen, Gustavo Carneiro |
|
code |
-1 |
Rethinking Closed-Loop Training for Autonomous Driving |
Chris Zhang, Runsheng Guo, Wenyuan Zeng, Yuwen Xiong, Binbin Dai, Rui Hu, Mengye Ren, Raquel Urtasun |
|
code |
-1 |
SLiDE: Self-supervised LiDAR De-snowing Through Reconstruction Difficulty |
Gwangtak Bae, Byungjun Kim, Seongyong Ahn, Jihong Min, Inwook Shim |
|
code |
-1 |
Generative Meta-Adversarial Network for Unseen Object Navigation |
Sixian Zhang, Weijie Li, Xinhang Song, Yubing Bai, Shuqiang Jiang |
|
code |
-1 |
Object Manipulation via Visual Target Localization |
Kiana Ehsani, Ali Farhadi, Aniruddha Kembhavi, Roozbeh Mottaghi |
|
code |
-1 |
MoDA: Map Style Transfer for Self-supervised Domain Adaptation of Embodied Agents |
Eun Sun Lee, Junho Kim, SangWon Park, Young Min Kim |
|
code |
-1 |
Housekeep: Tidying Virtual Households Using Commonsense Reasoning |
Yash Kant, Arun Ramachandran, Sriram Yenamandra, Igor Gilitschenski, Dhruv Batra, Andrew Szot, Harsh Agrawal |
|
code |
-1 |
Domain Randomization-Enhanced Depth Simulation and Restoration for Perceiving and Grasping Specular and Transparent Objects |
Qiyu Dai, Jiyao Zhang, Qiwei Li, Tianhao Wu, Hao Dong, Ziyuan Liu, Ping Tan, He Wang |
|
code |
-1 |
Resolving Copycat Problems in Visual Imitation Learning via Residual Action Prediction |
ChiaChi Chuang, Donglin Yang, Chuan Wen, Yang Gao |
|
code |
-1 |
OPD: Single-View 3D Openable Part Detection |
Hanxiao Jiang, Yongsen Mao, Manolis Savva, Angel X. Chang |
|
code |
-1 |
AirDet: Few-Shot Detection Without Fine-Tuning for Autonomous Exploration |
Bowen Li, Chen Wang, Pranay Reddy, Seungchan Kim, Sebastian A. Scherer |
|
code |
-1 |
TransGrasp: Grasp Pose Estimation of a Category of Objects by Transferring Grasps from Only One Labeled Instance |
Hongtao Wen, Jianhang Yan, Wanli Peng, Yi Sun |
|
code |
-1 |
StARformer: Transformer with State-Action-Reward Representations for Visual Reinforcement Learning |
Jinghuan Shang, Kumara Kahatapitiya, Xiang Li, Michael S. Ryoo |
|
code |
-1 |
TIDEE: Tidying Up Novel Rooms Using Visuo-Semantic Commonsense Priors |
Gabriel Sarch, Zhaoyuan Fang, Adam W. Harley, Paul Schydlo, Michael J. Tarr, Saurabh Gupta, Katerina Fragkiadaki |
|
code |
-1 |
Learning Efficient Multi-agent Cooperative Visual Exploration |
Chao Yu, Xinyi Yang, Jiaxuan Gao, Huazhong Yang, Yu Wang, Yi Wu |
|
code |
-1 |
Zero-Shot Category-Level Object Pose Estimation |
Walter Goodwin, Sagar Vaze, Ioannis Havoutis, Ingmar Posner |
|
code |
-1 |
Sim-to-Real 6D Object Pose Estimation via Iterative Self-training for Robotic Bin Picking |
Kai Chen, Rui Cao, Stephen James, Yichuan Li, YunHui Liu, Pieter Abbeel, Qi Dou |
|
code |
-1 |
Active Audio-Visual Separation of Dynamic Sound Sources |
Sagnik Majumder, Kristen Grauman |
|
code |
-1 |
DexMV: Imitation Learning for Dexterous Manipulation from Human Videos |
Yuzhe Qin, YuehHua Wu, Shaowei Liu, Hanwen Jiang, Ruihan Yang, Yang Fu, Xiaolong Wang |
|
code |
-1 |
Sim-2-Sim Transfer for Vision-and-Language Navigation in Continuous Environments |
Jacob Krantz, Stefan Lee |
|
code |
-1 |
Style-Agnostic Reinforcement Learning |
Juyong Lee, Seokjun Ahn, Jaesik Park |
|
code |
-1 |
Self-supervised Interactive Object Segmentation Through a Singulation-and-Grasping Approach |
Houjian Yu, Changhyun Choi |
|
code |
-1 |
Learning from Unlabeled 3D Environments for Vision-and-Language Navigation |
Shizhe Chen, PierreLouis Guhur, Makarand Tapaswi, Cordelia Schmid, Ivan Laptev |
|
code |
-1 |
BodySLAM: Joint Camera Localisation, Mapping, and Human Motion Tracking |
Dorian Henning, Tristan Laidlow, Stefan Leutenegger |
|
code |
-1 |
FusionVAE: A Deep Hierarchical Variational Autoencoder for RGB Image Fusion |
Fabian Duffhauss, Ngo Anh Vien, Hanna Ziesche, Gerhard Neumann |
|
code |
-1 |
Learning Algebraic Representation for Systematic Generalization in Abstract Reasoning |
Chi Zhang, Sirui Xie, Baoxiong Jia, Ying Nian Wu, SongChun Zhu, Yixin Zhu |
|
code |
-1 |
Video Dialog as Conversation About Objects Living in Space-Time |
HoangAnh Pham, Thao Minh Le, Vuong Le, Tu Minh Phuong, Truyen Tran |
|
code |
-1 |
Improving Vision Transformers by Revisiting High-Frequency Components |
Jiawang Bai, Li Yuan, ShuTao Xia, Shuicheng Yan, Zhifeng Li, Wei Liu |
|
code |
-1 |
Recurrent Bilinear Optimization for Binary Neural Networks |
Sheng Xu, Yanjing Li, Tiancheng Wang, Teli Ma, Baochang Zhang, Peng Gao, Yu Qiao, Jinhu Lü, Guodong Guo |
|
code |
-1 |
Neural Architecture Search for Spiking Neural Networks |
Youngeun Kim, Yuhang Li, Hyoungseob Park, Yeshwanth Venkatesha, Priyadarshini Panda |
|
code |
-1 |
Where to Focus: Investigating Hierarchical Attention Relationship for Fine-Grained Visual Classification |
Yang Liu, Lei Zhou, Pengcheng Zhang, Xiao Bai, Lin Gu, Xiaohan Yu, Jun Zhou, Edwin R. Hancock |
|
code |
-1 |
DaViT: Dual Attention Vision Transformers |
Mingyu Ding, Bin Xiao, Noel Codella, Ping Luo, Jingdong Wang, Lu Yuan |
|
code |
-1 |
Optimal Transport for Label-Efficient Visible-Infrared Person Re-Identification |
Jiangming Wang, Zhizhong Zhang, Mingang Chen, Yi Zhang, Cong Wang, Bin Sheng, Yanyun Qu, Yuan Xie |
|
code |
-1 |
Locality Guidance for Improving Vision Transformers on Tiny Datasets |
Kehan Li, Runyi Yu, Zhennan Wang, Li Yuan, Guoli Song, Jie Chen |
|
code |
-1 |
Neighborhood Collective Estimation for Noisy Label Identification and Correction |
Jichang Li, Guanbin Li, Feng Liu, Yizhou Yu |
|
code |
-1 |
Few-Shot Class-Incremental Learning via Entropy-Regularized Data-Free Replay |
Huan Liu, Li Gu, Zhixiang Chi, Yang Wang, Yuanhao Yu, Jun Chen, Jin Tang |
|
code |
-1 |
Anti-retroactive Interference for Lifelong Learning |
Runqi Wang, Yuxiang Bao, Baochang Zhang, Jianzhuang Liu, Wentao Zhu, Guodong Guo |
|
code |
-1 |
Towards Calibrated Hyper-Sphere Representation via Distribution Overlap Coefficient for Long-Tailed Learning |
Hualiang Wang, Siming Fu, Xiaoxuan He, Hangxiang Fang, Zuozhu Liu, Haoji Hu |
|
code |
-1 |
Dynamic Metric Learning with Cross-Level Concept Distillation |
Wenzhao Zheng, Yuan Huang, Borui Zhang, Jie Zhou, Jiwen Lu |
|
code |
-1 |
MENet: A Memory-Based Network with Dual-Branch for Efficient Event Stream Processing |
Linhui Sun, Yifan Zhang, Ke Cheng, Jian Cheng, Hanqing Lu |
|
code |
-1 |
Out-of-distribution Detection with Boundary Aware Learning |
Sen Pei, Xin Zhang, Bin Fan, Gaofeng Meng |
|
code |
-1 |
Learning Hierarchy Aware Features for Reducing Mistake Severity |
Ashima Garg, Depanshu Sani, Saket Anand |
|
code |
-1 |
Learning to Detect Every Thing in an Open World |
Kuniaki Saito, Ping Hu, Trevor Darrell, Kate Saenko |
|
code |
-1 |
KVT: k-NN Attention for Boosting Vision Transformers |
Pichao Wang, Xue Wang, Fan Wang, Ming Lin, Shuning Chang, Hao Li, Rong Jin |
|
code |
-1 |
Registration Based Few-Shot Anomaly Detection |
Chaoqin Huang, Haoyan Guan, Aofan Jiang, Ya Zhang, Michael W. Spratling, YanFeng Wang |
|
code |
-1 |
Improving Robustness by Enhancing Weak Subnets |
Yong Guo, David Stutz, Bernt Schiele |
|
code |
-1 |
Learning Invariant Visual Representations for Compositional Zero-Shot Learning |
Tian Zhang, Kongming Liang, Ruoyi Du, Xian Sun, Zhanyu Ma, Jun Guo |
|
code |
-1 |
Improving Covariance Conditioning of the SVD Meta-layer by Orthogonality |
Yue Song, Nicu Sebe, Wei Wang |
|
code |
-1 |
Out-of-Distribution Detection with Semantic Mismatch Under Masking |
Yijun Yang, Ruiyuan Gao, Qiang Xu |
|
code |
-1 |
Data-Free Neural Architecture Search via Recursive Label Calibration |
Zechun Liu, Zhiqiang Shen, Yun Long, Eric P. Xing, KwangTing Cheng, Chas Leichner |
|
code |
-1 |
Learning from Multiple Annotator Noisy Labels via Sample-Wise Label Fusion |
Zhengqi Gao, FanKeng Sun, Mingran Yang, Sucheng Ren, Zikai Xiong, Marc Engeler, Antonio Burazer, Linda Wildling, Luca Daniel, Duane S. Boning |
|
code |
-1 |
Acknowledging the Unknown for Multi-label Learning with Single Positive Labels |
Donghao Zhou, Pengfei Chen, Qiong Wang, Guangyong Chen, PhengAnn Heng |
|
code |
-1 |
AutoMix: Unveiling the Power of Mixup for Stronger Classifiers |
Zicheng Liu, Siyuan Li, Di Wu, Zihan Liu, Zhiyuan Chen, Lirong Wu, Stan Z. Li |
|
code |
-1 |
MaxViT: Multi-axis Vision Transformer |
Zhengzhong Tu, Hossein Talebi, Han Zhang, Feng Yang, Peyman Milanfar, Alan C. Bovik, Yinxiao Li |
|
code |
-1 |
ScalableViT: Rethinking the Context-Oriented Generalization of Vision Transformer |
Rui Yang, Hailong Ma, Jie Wu, Yansong Tang, Xuefeng Xiao, Min Zheng, Xiu Li |
|
code |
-1 |
Three Things Everyone Should Know About Vision Transformers |
Hugo Touvron, Matthieu Cord, Alaaeldin ElNouby, Jakob Verbeek, Hervé Jégou |
|
code |
-1 |
DeiT III: Revenge of the ViT |
Hugo Touvron, Matthieu Cord, Hervé Jégou |
|
code |
-1 |
MixSKD: Self-Knowledge Distillation from Mixup for Image Recognition |
Chuanguang Yang, Zhulin An, Helong Zhou, Linhang Cai, Xiang Zhi, Jiwen Wu, Yongjun Xu, Qian Zhang |
|
code |
-1 |
Self-feature Distillation with Uncertainty Modeling for Degraded Image Recognition |
Zhou Yang, Weisheng Dong, Xin Li, Jinjian Wu, Leida Li, Guangming Shi |
|
code |
-1 |
Novel Class Discovery Without Forgetting |
K. J. Joseph, Sujoy Paul, Gaurav Aggarwal, Soma Biswas, Piyush Rai, Kai Han, Vineeth N. Balasubramanian |
|
code |
-1 |
SAFA: Sample-Adaptive Feature Augmentation for Long-Tailed Image Classification |
Yan Hong, Jianfu Zhang, Zhongyi Sun, Ke Yan |
|
code |
-1 |
Negative Samples are at Large: Leveraging Hard-Distance Elastic Loss for Re-identification |
Hyungtae Lee, Sungmin Eum, Heesung Kwon |
|
code |
-1 |
Discrete-Constrained Regression for Local Counting Models |
Haipeng Xiong, Angela Yao |
|
code |
-1 |
Breadcrumbs: Adversarial Class-Balanced Sampling for Long-Tailed Recognition |
Bo Liu, Haoxiang Li, Hao Kang, Gang Hua, Nuno Vasconcelos |
|
code |
-1 |
Chairs Can Be Stood On: Overcoming Object Bias in Human-Object Interaction Detection |
Guangzhi Wang, Yangyang Guo, Yongkang Wong, Mohan S. Kankanhalli |
|
code |
-1 |
A Fast Knowledge Distillation Framework for Visual Recognition |
Zhiqiang Shen, Eric P. Xing |
|
code |
-1 |
DICE: Leveraging Sparsification for Out-of-Distribution Detection |
Yiyou Sun, Yixuan Li |
|
code |
-1 |
Invariant Feature Learning for Generalized Long-Tailed Classification |
Kaihua Tang, Mingyuan Tao, Jiaxin Qi, Zhenguang Liu, Hanwang Zhang |
|
code |
-1 |
Sliced Recursive Transformer |
Zhiqiang Shen, Zechun Liu, Eric P. Xing |
|
code |
-1 |
Relative Contrastive Loss for Unsupervised Representation Learning |
Shixiang Tang, Feng Zhu, Lei Bai, Rui Zhao, Wanli Ouyang |
|
code |
-1 |
Fine-Grained Fashion Representation Learning by Online Deep Clustering |
Yang Jiao, Ning Xie, Yan Gao, ChienChih Wang, Yi Sun |
|
code |
-1 |
NashAE: Disentangling Representations Through Adversarial Covariance Minimization |
Eric C. Yeats, Frank Liu, David Womble, Hai Helen Li |
|
code |
-1 |
A Gyrovector Space Approach for Symmetric Positive Semi-definite Matrix Learning |
Xuan Son Nguyen |
|
code |
-1 |
Learning Visual Representation from Modality-Shared Contrastive Language-Image Pre-training |
Haoxuan You, Luowei Zhou, Bin Xiao, Noel Codella, Yu Cheng, Ruochen Xu, ShihFu Chang, Lu Yuan |
|
code |
-1 |
Contrasting Quadratic Assignments for Set-Based Representation Learning |
Artem Moskalev, Ivan Sosnovik, Volker Fischer, Arnold W. M. Smeulders |
|
code |
-1 |
Class-Incremental Learning with Cross-Space Clustering and Controlled Transfer |
Arjun Ashok, K. J. Joseph, Vineeth N. Balasubramanian |
|
code |
-1 |
Object Discovery and Representation Networks |
Olivier J. Hénaff, Skanda Koppula, Evan Shelhamer, Daniel Zoran, Andrew Jaegle, Andrew Zisserman, João Carreira, Relja Arandjelovic |
|
code |
-1 |
Trading Positional Complexity vs Deepness in Coordinate Networks |
Jianqiao Zheng, Sameera Ramasinghe, Xueqian Li, Simon Lucey |
|
code |
-1 |
MVDG: A Unified Multi-view Framework for Domain Generalization |
Jian Zhang, Lei Qi, Yinghuan Shi, Yang Gao |
|
code |
-1 |
Panoptic Scene Graph Generation |
Jingkang Yang, Yi Zhe Ang, Zujin Guo, Kaiyang Zhou, Wayne Zhang, Ziwei Liu |
|
code |
-1 |
Object-Compositional Neural Implicit Surfaces |
Qianyi Wu, Xian Liu, Yuedong Chen, Kejie Li, Chuanxia Zheng, Jianfei Cai, Jianmin Zheng |
|
code |
-1 |
RigNet: Repetitive Image Guided Network for Depth Completion |
Zhiqiang Yan, Kun Wang, Xiang Li, Zhenyu Zhang, Jun Li, Jian Yang |
|
code |
-1 |
FADE: Fusing the Assets of Decoder and Encoder for Task-Agnostic Upsampling |
Hao Lu, Wenze Liu, Hongtao Fu, Zhiguo Cao |
|
code |
-1 |
LiDAL: Inter-frame Uncertainty Based Active Learning for 3D LiDAR Semantic Segmentation |
Zeyu Hu, Xuyang Bai, Runze Zhang, Xin Wang, Guangyuan Sun, Hongbo Fu, ChiewLan Tai |
|
code |
-1 |
Hierarchical Memory Learning for Fine-Grained Scene Graph Generation |
Youming Deng, Yansheng Li, Yongjun Zhang, Xiang Xiang, Jian Wang, Jingdong Chen, Jiayi Ma |
|
code |
-1 |
DODA: Data-Oriented Sim-to-Real Domain Adaptation for 3D Semantic Segmentation |
Runyu Ding, Jihan Yang, Li Jiang, Xiaojuan Qi |
|
code |
-1 |
MTFormer: Multi-task Learning via Transformer and Cross-Task Reasoning |
Xiaogang Xu, Hengshuang Zhao, Vibhav Vineet, SerNam Lim, Antonio Torralba |
|
code |
-1 |
MonoPLFlowNet: Permutohedral Lattice FlowNet for Real-Scale 3D Scene Flow Estimation with Monocular Images |
Runfa Li, Truong Nguyen |
|
code |
-1 |
TO-Scene: A Large-Scale Dataset for Understanding 3D Tabletop Scenes |
Mutian Xu, Pei Chen, Haolin Liu, Xiaoguang Han |
|
code |
-1 |
Is It Necessary to Transfer Temporal Knowledge for Domain Adaptive Video Semantic Segmentation? |
Xinyi Wu, Zhenyao Wu, Jin Wan, Lili Ju, Song Wang |
|
code |
-1 |
Meta Spatio-Temporal Debiasing for Video Scene Graph Generation |
Li Xu, Haoxuan Qu, Jason Kuen, Jiuxiang Gu, Jun Liu |
|
code |
-1 |
Improving the Reliability for Confidence Estimation |
Haoxuan Qu, Yanchao Li, Lin Geng Foo, Jason Kuen, Jiuxiang Gu, Jun Liu |
|
code |
-1 |
Fine-Grained Scene Graph Generation with Data Transfer |
Ao Zhang, Yuan Yao, Qianyu Chen, Wei Ji, Zhiyuan Liu, Maosong Sun, TatSeng Chua |
|
code |
-1 |
Pose2Room: Understanding 3D Scenes from Human Activities |
Yinyu Nie, Angela Dai, Xiaoguang Han, Matthias Nießner |
|
code |
-1 |
Towards Hard-Positive Query Mining for DETR-Based Human-Object Interaction Detection |
Xubin Zhong, Changxing Ding, Zijian Li, Shaoli Huang |
|
code |
-1 |
Discovering Human-Object Interaction Concepts via Self-Compositional Learning |
Zhi Hou, Baosheng Yu, Dacheng Tao |
|
code |
-1 |
Primitive-Based Shape Abstraction via Nonparametric Bayesian Inference |
Yuwei Wu, Weixiao Liu, Sipu Ruan, Gregory S. Chirikjian |
|
code |
-1 |
Stereo Depth Estimation with Echoes |
Chenghao Zhang, Kun Tian, Bolin Ni, Gaofeng Meng, Bin Fan, Zhaoxiang Zhang, Chunhong Pan |
|
code |
-1 |
Inverted Pyramid Multi-task Transformer for Dense Scene Understanding |
Hanrong Ye, Dan Xu |
|
code |
-1 |
PETR: Position Embedding Transformation for Multi-view 3D Object Detection |
Yingfei Liu, Tiancai Wang, Xiangyu Zhang, Jian Sun |
|
code |
-1 |
S2Net: Stochastic Sequential Pointcloud Forecasting |
Xinshuo Weng, Junyu Nan, KuanHui Lee, Rowan McAllister, Adrien Gaidon, Nicholas Rhinehart, Kris M. Kitani |
|
code |
-1 |
RA-Depth: Resolution Adaptive Self-supervised Monocular Depth Estimation |
Mu He, Le Hui, Yikai Bian, Jian Ren, Jin Xie, Jian Yang |
|
code |
-1 |
PolyphonicFormer: Unified Query Learning for Depth-Aware Video Panoptic Segmentation |
Haobo Yuan, Xiangtai Li, Yibo Yang, Guangliang Cheng, Jing Zhang, Yunhai Tong, Lefei Zhang, Dacheng Tao |
|
code |
-1 |
SQN: Weakly-Supervised Semantic Segmentation of Large-Scale 3D Point Clouds |
Qingyong Hu, Bo Yang, Guangchi Fang, Yulan Guo, Ales Leonardis, Niki Trigoni, Andrew Markham |
|
code |
-1 |
PointMixer: MLP-Mixer for Point Cloud Understanding |
Jaesung Choe, Chunghyun Park, François Rameau, Jaesik Park, In So Kweon |
|
code |
-1 |
Initialization and Alignment for Adversarial Texture Optimization |
Xiaoming Zhao, Zhizhen Zhao, Alexander G. Schwing |
|
code |
-1 |
MOTR: End-to-End Multiple-Object Tracking with Transformer |
Fangao Zeng, Bin Dong, Yuang Zhang, Tiancai Wang, Xiangyu Zhang, Yichen Wei |
|
code |
-1 |
GALA: Toward Geometry-and-Lighting-Aware Object Search for Compositing |
Sijie Zhu, Zhe Lin, Scott Cohen, Jason Kuen, Zhifei Zhang, Chen Chen |
|
code |
-1 |
LaLaLoc++: Global Floor Plan Comprehension for Layout Localisation in Unvisited Environments |
Henry HowardJenkins, Victor Adrian Prisacariu |
|
code |
-1 |
3D-PL: Domain Adaptive Depth Estimation with 3D-Aware Pseudo-Labeling |
YuTing Yen, ChiaNi Lu, WeiChen Chiu, YiHsuan Tsai |
|
code |
-1 |
Panoptic-PartFormer: Learning a Unified Model for Panoptic Part Segmentation |
Xiangtai Li, Shilin Xu, Yibo Yang, Guangliang Cheng, Yunhai Tong, Dacheng Tao |
|
code |
-1 |
GOCA: Guided Online Cluster Assignment for Self-supervised Video Representation Learning |
Huseyin Coskun, Alireza Zareian, Joshua L. Moore, Federico Tombari, Chen Wang |
|
code |
-1 |
Constrained Mean Shift Using Distant yet Related Neighbors for Representation Learning |
K. L. Navaneet, Soroush Abbasi Koohpayegani, Ajinkya Tejankar, Kossar Pourahmadi, Akshayvarun Subramanya, Hamed Pirsiavash |
|
code |
-1 |
Revisiting the Critical Factors of Augmentation-Invariant Representation Learning |
Junqiang Huang, Xiangwen Kong, Xiangyu Zhang |
|
code |
-1 |
CA-SSL: Class-Agnostic Semi-Supervised Learning for Detection and Segmentation |
Lu Qi, Jason Kuen, Zhe Lin, Jiuxiang Gu, Fengyun Rao, Dian Li, Weidong Guo, Zhen Wen, MingHsuan Yang, Jiaya Jia |
|
code |
-1 |
Dual Adaptive Transformations for Weakly Supervised Point Cloud Segmentation |
Zhonghua Wu, Yicheng Wu, Guosheng Lin, Jianfei Cai, Chen Qian |
|
code |
-1 |
Semantic-Aware Fine-Grained Correspondence |
Yingdong Hu, Renhao Wang, Kaifeng Zhang, Yang Gao |
|
code |
-1 |
Self-Supervised Classification Network |
Elad Amrani, Leonid Karlinsky, Alexander M. Bronstein |
|
code |
-1 |
Data Invariants to Understand Unsupervised Out-of-Distribution Detection |
Lars Doorenbos, Raphael Sznitman, Pablo MárquezNeila |
|
code |
-1 |
Domain Invariant Masked Autoencoders for Self-supervised Learning from Multi-domains |
Haiyang Yang, Shixiang Tang, Meilin Chen, Yizhou Wang, Feng Zhu, Lei Bai, Rui Zhao, Wanli Ouyang |
|
code |
-1 |
Semi-supervised Object Detection via VC Learning |
Changrui Chen, Kurt Debattista, Jungong Han |
|
code |
-1 |
Completely Self-supervised Crowd Counting via Distribution Matching |
Deepak Babu Sam, Abhinav Agarwalla, Jimmy Joseph, Vishwanath A. Sindagi, R. Venkatesh Babu, Vishal M. Patel |
|
code |
-1 |
Coarse-To-Fine Incremental Few-Shot Learning |
Xiang Xiang, Yuwen Tan, Qian Wan, Jing Ma, Alan L. Yuille, Gregory D. Hager |
|
code |
-1 |
Learning Unbiased Transferability for Domain Adaptation by Uncertainty Modeling |
Jian Hu, Haowen Zhong, Fei Yang, Shaogang Gong, Guile Wu, Junchi Yan |
|
code |
-1 |
Learn2Augment: Learning to Composite Videos for Data Augmentation in Action Recognition |
Shreyank N. Gowda, Marcus Rohrbach, Frank Keller, Laura SevillaLara |
|
code |
-1 |
CYBORGS: Contrastively Bootstrapping Object Representations by Grounding in Segmentation |
Renhao Wang, Hang Zhao, Yang Gao |
|
code |
-1 |
PSS: Progressive Sample Selection for Open-World Visual Representation Learning |
Tianyue Cao, Yongxin Wang, Yifan Xing, Tianjun Xiao, Tong He, Zheng Zhang, Hao Zhou, Joseph Tighe |
|
code |
-1 |
Improving Self-supervised Lightweight Model Learning via Hard-Aware Metric Distillation |
Hao Liu, Mang Ye |
|
code |
-1 |
Object Discovery via Contrastive Learning for Weakly Supervised Object Detection |
Jinhwan Seo, Wonho Bae, Danica J. Sutherland, Junhyug Noh, Daijin Kim |
|
code |
-1 |
Stochastic Consensus: Enhancing Semi-Supervised Learning with Consistency of Stochastic Classifiers |
Hui Tang, Lin Sun, Kui Jia |
|
code |
-1 |
DiffuseMorph: Unsupervised Deformable Image Registration Using Diffusion Model |
Boah Kim, Inhwa Han, Jong Chul Ye |
|
code |
-1 |
Semi-Leak: Membership Inference Attacks Against Semi-supervised Learning |
Xinlei He, Hongbin Liu, Neil Zhenqiang Gong, Yang Zhang |
|
code |
-1 |
OpenLDN: Learning to Discover Novel Classes for Open-World Semi-Supervised Learning |
Mamshad Nayeem Rizve, Navid Kardan, Salman Khan, Fahad Shahbaz Khan, Mubarak Shah |
|
code |
-1 |
Embedding Contrastive Unsupervised Features to Cluster In- And Out-of-Distribution Noise in Corrupted Image Datasets |
Paul Albert, Eric Arazo, Noel E. O'Connor, Kevin McGuinness |
|
code |
-1 |
Unsupervised Few-Shot Image Classification by Learning Features into Clustering Space |
Shuo Li, Fang Liu, Zehua Hao, Kaibo Zhao, Licheng Jiao |
|
code |
-1 |
Towards Realistic Semi-supervised Learning |
Mamshad Nayeem Rizve, Navid Kardan, Mubarak Shah |
|
code |
-1 |
Masked Siamese Networks for Label-Efficient Learning |
Mahmoud Assran, Mathilde Caron, Ishan Misra, Piotr Bojanowski, Florian Bordes, Pascal Vincent, Armand Joulin, Mike Rabbat, Nicolas Ballas |
|
code |
-1 |
Natural Synthetic Anomalies for Self-supervised Anomaly Detection and Localization |
Hannah M. Schlüter, Jeremy Tan, Benjamin Hou, Bernhard Kainz |
|
code |
-1 |
Understanding Collapse in Non-contrastive Siamese Representation Learning |
Alexander C. Li, Alexei A. Efros, Deepak Pathak |
|
code |
-1 |
Federated Self-supervised Learning for Video Understanding |
Yasar Abbas Ur Rehman, Yan Gao, Jiajun Shen, Pedro Porto Buarque de Gusmão, Nicholas D. Lane |
|
code |
-1 |
Towards Efficient and Effective Self-supervised Learning of Visual Representations |
Sravanti Addepalli, Kaushal Bhogale, Priyam Dey, R. Venkatesh Babu |
|
code |
-1 |
DSR - A Dual Subspace Re-Projection Network for Surface Anomaly Detection |
Vitjan Zavrtanik, Matej Kristan, Danijel Skocaj |
|
code |
-1 |
PseudoAugment: Learning to Use Unlabeled Data for Data Augmentation in Point Clouds |
Zhaoqi Leng, Shuyang Cheng, Benjamin Caine, Weiyue Wang, Xiao Zhang, Mingxing Tan, Dragomir Anguelov |
|
code |
-1 |
MVSTER: Epipolar Transformer for Efficient Multi-view Stereo |
Xiaofeng Wang, Zheng Zhu, Guan Huang, Fangbo Qin, Yun Ye, Yijia He, Xu Chi, Xingang Wang |
|
code |
-1 |
RelPose: Predicting Probabilistic Relative Rotation for Single Objects in the Wild |
Jason Y. Zhang, Deva Ramanan, Shubham Tulsiani |
|
code |
-1 |
R2L: Distilling Neural Radiance Field to Neural Light Field for Efficient Novel View Synthesis |
Huan Wang, Jian Ren, Zeng Huang, Kyle Olszewski, Menglei Chai, Yun Fu, Sergey Tulyakov |
|
code |
-1 |
KD-MVS: Knowledge Distillation Based Self-supervised Learning for Multi-view Stereo |
Yikang Ding, Qingtian Zhu, Xiangyue Liu, Wentao Yuan, Haotian Zhang, Chi Zhang |
|
code |
-1 |
SALVe: Semantic Alignment Verification for Floorplan Reconstruction from Sparse Panoramas |
John Lambert, Yuguang Li, Ivaylo Boyadzhiev, Lambert Wixson, Manjunath Narayana, Will Hutchcroft, James Hays, Frank Dellaert, Sing Bing Kang |
|
code |
-1 |
RC-MVSNet: Unsupervised Multi-View Stereo with Neural Rendering |
Di Chang, Aljaz Bozic, Tong Zhang, Qingsong Yan, Yingcong Chen, Sabine Süsstrunk, Matthias Nießner |
|
code |
-1 |
Box2Mask: Weakly Supervised 3D Semantic Instance Segmentation using Bounding Boxes |
Julian Chibane, Francis Engelmann, Tuan Anh Tran, Gerard PonsMoll |
|
code |
-1 |
NeILF: Neural Incident Light Field for Physically-based Material Estimation |
Yao Yao, Jingyang Zhang, Jingbo Liu, Yihang Qu, Tian Fang, David McKinnon, Yanghai Tsin, Long Quan |
|
code |
-1 |
ARF: Artistic Radiance Fields |
Kai Zhang, Nicholas I. Kolkin, Sai Bi, Fujun Luan, Zexiang Xu, Eli Shechtman, Noah Snavely |
|
code |
-1 |
Multiview Stereo with Cascaded Epipolar RAFT |
Zeyu Ma, Zachary Teed, Jia Deng |
|
code |
-1 |
Accelerating Score-Based Generative Models with Preconditioned Diffusion Sampling |
Hengyuan Ma, Li Zhang, Xiatian Zhu, Jianfeng Feng |
|
code |
-1 |
Learning to Generate Realistic LiDAR Point Clouds |
Vlas Zyrianov, Xiyue Zhu, Shenlong Wang |
|
code |
-1 |
RFNet-4D: Joint Object Reconstruction and Flow Estimation from 4D Point Clouds |
TuanAnh Vu, Duc Thanh Nguyen, BinhSon Hua, QuangHieu Pham, SaiKit Yeung |
|
code |
-1 |
Diverse Image Inpainting with Normalizing Flow |
Cairong Wang, Yiming Zhu, Chun Yuan |
|
code |
-1 |
Improved Masked Image Generation with Token-Critic |
José Lezama, Huiwen Chang, Lu Jiang, Irfan Essa |
|
code |
-1 |
TREND: Truncated Generalized Normal Density Estimation of Inception Embeddings for GAN Evaluation |
Junghyuk Lee, JongSeok Lee |
|
code |
-1 |
Exploring Gradient-Based Multi-directional Controls in GANs |
Zikun Chen, Ruowei Jiang, Brendan Duke, Han Zhao, Parham Aarabi |
|
code |
-1 |
Spatially Invariant Unsupervised 3D Object-Centric Learning and Scene Decomposition |
Tianyu Wang, Miaomiao Liu, Kee Siong Ng |
|
code |
-1 |
Neural Scene Decoration from a Single Photograph |
HongWing Pang, Yingshu Chen, PhuocHieu Le, BinhSon Hua, Duc Thanh Nguyen, SaiKit Yeung |
|
code |
-1 |
Outpainting by Queries |
Kai Yao, Penglei Gao, Xi Yang, Jie Sun, Rui Zhang, Kaizhu Huang |
|
code |
-1 |
Unleashing Transformers: Parallel Token Prediction with Discrete Absorbing Diffusion for Fast High-Resolution Image Generation from Vector-Quantized Codes |
Sam BondTaylor, Peter Hessey, Hiroshi Sasaki, Toby P. Breckon, Chris G. Willcocks |
|
code |
-1 |
ChunkyGAN: Real Image Inversion via Segments |
Adéla Subrtová, David Futschik, Jan Cech, Michal Lukác, Eli Shechtman, Daniel Sýkora |
|
code |
-1 |
GAN Cocktail: Mixing GANs Without Dataset Access |
Omri Avrahami, Dani Lischinski, Ohad Fried |
|
code |
-1 |
Geometry-Guided Progressive NeRF for Generalizable and Efficient Neural Human Rendering |
Mingfei Chen, Jianfeng Zhang, Xiangyu Xu, Lijuan Liu, Yujun Cai, Jiashi Feng, Shuicheng Yan |
|
code |
-1 |
Controllable Shadow Generation Using Pixel Height Maps |
Yichen Sheng, Yifan Liu, Jianming Zhang, Wei Yin, A. Cengiz Öztireli, He Zhang, Zhe Lin, Eli Shechtman, Bedrich Benes |
|
code |
-1 |
Learning Where to Look - Generative NAS is Surprisingly Efficient |
Jovita Lukasik, Steffen Jung, Margret Keuper |
|
code |
-1 |
Subspace Diffusion Generative Models |
Bowen Jing, Gabriele Corso, Renato Berlinghieri, Tommi S. Jaakkola |
|
code |
-1 |
DuelGAN: A Duel Between Two Discriminators Stabilizes the GAN Training |
Jiaheng Wei, Minghao Liu, Jiahao Luo, Andrew Zhu, James Davis, Yang Liu |
|
code |
-1 |
MINER: Multiscale Implicit Neural Representation |
Vishwanath Saragadam, Jasper Tan, Guha Balakrishnan, Richard G. Baraniuk, Ashok Veeraraghavan |
|
code |
-1 |
An Embedded Feature Whitening Approach to Deep Neural Network Optimization |
Hongwei Yong, Lei Zhang |
|
code |
-1 |
Q-FW: A Hybrid Classical-Quantum Frank-Wolfe for Quadratic Binary Optimization |
Alp Yurtsever, Tolga Birdal, Vladislav Golyanik |
|
code |
-1 |
Self-supervised Learning of Visual Graph Matching |
Chang Liu, Shaofeng Zhang, Xiaokang Yang, Junchi Yan |
|
code |
-1 |
Scalable Learning to Optimize: A Learned Optimizer Can Train Big Models |
Xuxi Chen, Tianlong Chen, Yu Cheng, Weizhu Chen, Ahmed Hassan Awadallah, Zhangyang Wang |
|
code |
-1 |
QISTA-ImageNet: A Deep Compressive Image Sensing Framework Solving ℓ q-Norm Optimization Problem |
GangXuan Lin, ShihWei Hu, ChunShien Lu |
|
code |
-1 |
R-DFCIL: Relation-Guided Representation Learning for Data-Free Class Incremental Learning |
Qiankun Gao, Chen Zhao, Bernard Ghanem, Jian Zhang |
|
code |
-1 |
Domain Generalization by Mutual-Information Regularization with Pre-trained Models |
Junbum Cha, Kyungjae Lee, Sungrae Park, Sanghyuk Chun |
|
code |
-1 |
Predicting Is Not Understanding: Recognizing and Addressing Underspecification in Machine Learning |
Damien Teney, Maxime Peyrard, Ehsan Abbasnejad |
|
code |
-1 |
Neural-Sim: Learning to Generate Training Data with NeRF |
Yunhao Ge, Harkirat S. Behl, Jiashu Xu, Suriya Gunasekar, Neel Joshi, Yale Song, Xin Wang, Laurent Itti, Vibhav Vineet |
|
code |
-1 |
Bayesian Optimization with Clustering and Rollback for CNN Auto Pruning |
Hanwei Fan, Jiandong Mu, Wei Zhang |
|
code |
-1 |
Learned Variational Video Color Propagation |
Markus Hofinger, Erich Kobler, Alexander Effland, Thomas Pock |
|
code |
-1 |
Continual Variational Autoencoder Learning via Online Cooperative Memorization |
Fei Ye, Adrian G. Bors |
|
code |
-1 |
Learning to Learn with Smooth Regularization |
Yuanhao Xiong, ChoJui Hsieh |
|
code |
-1 |
Incremental Task Learning with Incremental Rank Updates |
Rakib Hyder, Ken Shao, Boyu Hou, Panos P. Markopoulos, Ashley PraterBennette, M. Salman Asif |
|
code |
-1 |
Batch-Efficient EigenDecomposition for Small and Medium Matrices |
Yue Song, Nicu Sebe, Wei Wang |
|
code |
-1 |
Ensemble Learning Priors Driven Deep Unfolding for Scalable Video Snapshot Compressive Imaging |
Chengshuai Yang, Shiyu Zhang, Xin Yuan |
|
code |
-1 |
Approximate Discrete Optimal Transport Plan with Auxiliary Measure Method |
Dongsheng An, Na Lei, Xianfeng Gu |
|
code |
-1 |
Improving Generalization in Federated Learning by Seeking Flat Minima |
Debora Caldarola, Barbara Caputo, Marco Ciccone |
|
code |
-1 |
Semidefinite Relaxations of Truncated Least-Squares in Robust Rotation Search: Tight or Not |
Liangzu Peng, Mahyar Fazlyab, René Vidal |
|
code |
-1 |
Transfer Without Forgetting |
Matteo Boschini, Lorenzo Bonicelli, Angelo Porrello, Giovanni Bellitto, Matteo Pennisi, Simone Palazzo, Concetto Spampinato, Simone Calderara |
|
code |
-1 |
AdaBest: Minimizing Client Drift in Federated Learning via Adaptive Bias Estimation |
Farshid Varno, Marzie Saghayi, Laya Rafiee Sevyeri, Sharut Gupta, Stan Matwin, Mohammad Havaei |
|
code |
-1 |
Tackling Long-Tailed Category Distribution Under Domain Shifts |
Xiao Gu, Yao Guo, Zeju Li, Jianing Qiu, Qi Dou, Yuxuan Liu, Benny Lo, GuangZhong Yang |
|
code |
-1 |
Doubly-Fused ViT: Fuse Information from Vision Transformer Doubly with Local Representation |
Li Gao, Dong Nie, Bo Li, Xiaofeng Ren |
|
code |
-1 |
Salient Object Detection for Point Clouds |
Songlin Fan, Wei Gao, Ge Li |
|
code |
-1 |
Learning Semantic Segmentation from Multiple Datasets with Label Shifts |
Dongwan Kim, YiHsuan Tsai, Yumin Suh, Masoud Faraki, Sparsh Garg, Manmohan Chandraker, Bohyung Han |
|
code |
-1 |
Weakly Supervised 3D Scene Segmentation with Region-Level Boundary Awareness and Instance Discrimination |
Kangcheng Liu, Yuzhi Zhao, Qiang Nie, Zhi Gao, Ben M. Chen |
|
code |
-1 |
Towards Open-Vocabulary Scene Graph Generation with Prompt-Based Finetuning |
Tao He, Lianli Gao, Jingkuan Song, YuanFang Li |
|
code |
-1 |
Variance-Aware Weight Initialization for Point Convolutional Neural Networks |
Pedro Hermosilla, Michael Schelling, Tobias Ritschel, Timo Ropinski |
|
code |
-1 |
Break and Make: Interactive Structural Understanding Using LEGO Bricks |
Aaron Walsman, Muru Zhang, Klemen Kotar, Karthik Desingh, Ali Farhadi, Dieter Fox |
|
code |
-1 |
Bi-PointFlowNet: Bidirectional Learning for Point Cloud Based Scene Flow Estimation |
Wencan Cheng, Jong Hwan Ko |
|
code |
-1 |
3DG-STFM: 3D Geometric Guided Student-Teacher Feature Matching |
Runyu Mao, Chen Bai, Yatong An, Fengqing Zhu, Cheng Lu |
|
code |
-1 |
Video Restoration Framework and Its Meta-adaptations to Data-Poor Conditions |
Prashant W. Patil, Sunil Gupta, Santu Rana, Svetha Venkatesh |
|
code |
-1 |
MonteBoxFinder: Detecting and Filtering Primitives to Fit a Noisy Point Cloud |
Michaël Ramamonjisoa, Sinisa Stekovic, Vincent Lepetit |
|
code |
-1 |
Scene Text Recognition with Permuted Autoregressive Sequence Models |
Darwin Bautista, Rowel Atienza |
|
code |
-1 |
When Counting Meets HMER: Counting-Aware Network for Handwritten Mathematical Expression Recognition |
Bohan Li, Ye Yuan, Dingkang Liang, Xiao Liu, Zhilong Ji, Jinfeng Bai, Wenyu Liu, Xiang Bai |
|
code |
-1 |
Detecting Tampered Scene Text in the Wild |
Yuxin Wang, Hongtao Xie, Mengting Xing, Jing Wang, Shenggao Zhu, Yongdong Zhang |
|
code |
-1 |
Optimal Boxes: Boosting End-to-End Scene Text Recognition by Adjusting Annotated Bounding Boxes via Reinforcement Learning |
Jingqun Tang, Wenming Qian, Luchuan Song, Xiena Dong, Lan Li, Xiang Bai |
|
code |
-1 |
GLASS: Global to Local Attention for Scene-Text Spotting |
Roi Ronen, Shahar Tsiper, Oron Anschel, Inbal Lavi, Amir Markovitz, R. Manmatha |
|
code |
-1 |
COO: Comic Onomatopoeia Dataset for Recognizing Arbitrary or Truncated Texts |
Jeonghun Baek, Yusuke Matsui, Kiyoharu Aizawa |
|
code |
-1 |
Language Matters: A Weakly Supervised Vision-Language Pre-training Approach for Scene Text Detection and Spotting |
Chuhui Xue, Wenqing Zhang, Yu Hao, Shijian Lu, Philip H. S. Torr, Song Bai |
|
code |
-1 |
Toward Understanding WordArt: Corner-Guided Transformer for Scene Text Recognition |
Xudong Xie, Ling Fu, Zhifei Zhang, Zhaowen Wang, Xiang Bai |
|
code |
-1 |
Levenshtein OCR |
Cheng Da, Peng Wang, Cong Yao |
|
code |
-1 |
Multi-granularity Prediction for Scene Text Recognition |
Peng Wang, Cheng Da, Cong Yao |
|
code |
-1 |
Dynamic Low-Resolution Distillation for Cost-Efficient End-to-End Text Spotting |
Ying Chen, Liang Qiao, Zhanzhan Cheng, Shiliang Pu, Yi Niu, Xi Li |
|
code |
-1 |
Contextual Text Block Detection Towards Scene Text Understanding |
Chuhui Xue, Jiaxing Huang, Wenqing Zhang, Shijian Lu, Changhu Wang, Song Bai |
|
code |
-1 |
CoMER: Modeling Coverage for Transformer-Based Handwritten Mathematical Expression Recognition |
Wenqi Zhao, Liangcai Gao |
|
code |
-1 |
Don't Forget Me: Accurate Background Recovery for Text Removal via Modeling Local-Global Context |
Chongyu Liu, Lianwen Jin, Yuliang Liu, Canjie Luo, Bangdong Chen, Fengjun Guo, Kai Ding |
|
code |
-1 |
TextAdaIN: Paying Attention to Shortcut Learning in Text Recognizers |
Oren Nuriel, Sharon Fogel, Ron Litman |
|
code |
-1 |
Multi-modal Text Recognition Networks: Interactive Enhancements Between Visual and Semantic Features |
Byeonghu Na, Yoonsik Kim, Sungrae Park |
|
code |
-1 |
SGBANet: Semantic GAN and Balanced Attention Network for Arbitrarily Oriented Scene Text Recognition |
Dajian Zhong, Shujing Lyu, Palaiahnakote Shivakumara, Bing Yin, Jiajia Wu, Umapada Pal, Yue Lu |
|
code |
-1 |
Pure Transformer with Integrated Experts for Scene Text Recognition |
Yew Lee Tan, Adams WaiKin Kong, JungJae Kim |
|
code |
-1 |
OCR-Free Document Understanding Transformer |
Geewook Kim, Teakgyu Hong, Moonbin Yim, JeongYeon Nam, Jinyoung Park, Jinyeong Yim, Wonseok Hwang, Sangdoo Yun, Dongyoon Han, Seunghyun Park |
|
code |
-1 |
CAR: Class-Aware Regularizations for Semantic Segmentation |
Ye Huang, Di Kang, Liang Chen, Xuefei Zhe, Wenjing Jia, Linchao Bao, Xiangjian He |
|
code |
-1 |
Style-Hallucinated Dual Consistency Learning for Domain Generalized Semantic Segmentation |
Yuyang Zhao, Zhun Zhong, Na Zhao, Nicu Sebe, Gim Hee Lee |
|
code |
-1 |
SeqFormer: Sequential Transformer for Video Instance Segmentation |
Junfeng Wu, Yi Jiang, Song Bai, Wenqing Zhang, Xiang Bai |
|
code |
-1 |
Saliency Hierarchy Modeling via Generative Kernels for Salient Object Detection |
Wenhu Zhang, Liangli Zheng, Huanyu Wang, Xintian Wu, Xi Li |
|
code |
-1 |
In Defense of Online Models for Video Instance Segmentation |
Junfeng Wu, Qihao Liu, Yi Jiang, Song Bai, Alan L. Yuille, Xiang Bai |
|
code |
-1 |
Active Pointly-Supervised Instance Segmentation |
Chufeng Tang, Lingxi Xie, Gang Zhang, Xiaopeng Zhang, Qi Tian, Xiaolin Hu |
|
code |
-1 |
A Transformer-Based Decoder for Semantic Segmentation with Multi-level Context Mining |
Bowen Shi, Dongsheng Jiang, Xiaopeng Zhang, Han Li, Wenrui Dai, Junni Zou, Hongkai Xiong, Qi Tian |
|
code |
-1 |
XMem: Long-Term Video Object Segmentation with an Atkinson-Shiffrin Memory Model |
Ho Kei Cheng, Alexander G. Schwing |
|
code |
-1 |
Self-Distillation for Robust LiDAR Semantic Segmentation in Autonomous Driving |
Jiale Li, Hang Dai, Yong Ding |
|
code |
-1 |
2DPASS: 2D Priors Assisted Semantic Segmentation on LiDAR Point Clouds |
Yan Xu, Jiantao Gao, Chaoda Zheng, Chao Zheng, Ruimao Zhang, Shuguang Cui, Zhen Li |
|
code |
-1 |
Extract Free Dense Labels from CLIP |
Chong Zhou, Chen Change Loy, Bo Dai |
|
code |
-1 |
3D Compositional Zero-Shot Learning with DeCompositional Consensus |
Muhammad Ferjad Naeem, Evin Pinar Örnek, Yongqin Xian, Luc Van Gool, Federico Tombari |
|
code |
-1 |
Video Mask Transfiner for High-Quality Video Instance Segmentation |
Lei Ke, Henghui Ding, Martin Danelljan, YuWing Tai, ChiKeung Tang, Fisher Yu |
|
code |
-1 |
SimpleRecon: 3D Reconstruction Without 3D Convolutions |
Mohamed Sayed, John Gibson, Jamie Watson, Victor Prisacariu, Michael Firman, Clément Godard |
|
code |
-1 |
Structure and Motion from Casual Videos |
Zhoutong Zhang, Forrester Cole, Zhengqi Li, Michael Rubinstein, Noah Snavely, William T. Freeman |
|
code |
-1 |
What Matters for 3D Scene Flow Network |
Guangming Wang, Yunzhe Hu, Zhe Liu, Yiyang Zhou, Masayoshi Tomizuka, Wei Zhan, Hesheng Wang |
|
code |
-1 |
Correspondence Reweighted Translation Averaging |
Lalit Manam, Venu Madhav Govindu |
|
code |
-1 |
Neural Strands: Learning Hair Geometry and Appearance from Multi-view Images |
Radu Alexandru Rosu, Shunsuke Saito, Ziyan Wang, Chenglei Wu, Sven Behnke, Giljoo Nam |
|
code |
-1 |
GraphCSPN: Geometry-Aware Depth Completion via Dynamic GCNs |
Xin Liu, Xiaofei Shao, Bo Wang, Yali Li, Shengjin Wang |
|
code |
-1 |
Objects Can Move: 3D Change Detection by Geometric Transformation Consistency |
Aikaterini Adam, Torsten Sattler, Konstantinos Karantzalos, Tomás Pajdla |
|
code |
-1 |
Language-Grounded Indoor 3D Semantic Segmentation in the Wild |
Dávid Rozenberszki, Or Litany, Angela Dai |
|
code |
-1 |
Beyond Periodicity: Towards a Unifying Framework for Activations in Coordinate-MLPs |
Sameera Ramasinghe, Simon Lucey |
|
code |
-1 |
Deforming Radiance Fields with Cages |
Tianhan Xu, Tatsuya Harada |
|
code |
-1 |
FLEX: Extrinsic Parameters-free Multi-view 3D Human Motion Reconstruction |
Brian Gordon, Sigal Raab, Guy Azov, Raja Giryes, Daniel CohenOr |
|
code |
-1 |
MODE: Multi-view Omnidirectional Depth Estimation with 360$^\circ $ Cameras |
Ming Li, Xueqian Jin, Xuejiao Hu, Jingzhao Dai, Sidan Du, Yang Li |
|
code |
-1 |
GigaDepth: Learning Depth from Structured Light with Branching Neural Networks |
Simon Schreiberhuber, JeanBaptiste Weibel, Timothy Patten, Markus Vincze |
|
code |
-1 |
ActiveNeRF: Learning Where to See with Uncertainty Estimation |
Xuran Pan, Zihang Lai, Shiji Song, Gao Huang |
|
code |
-1 |
PoserNet: Refining Relative Camera Poses Exploiting Object Detections |
Matteo Taiana, Matteo Toso, Stuart James, Alessio Del Bue |
|
code |
-1 |
Gaussian Activated Neural Radiance Fields for High Fidelity Reconstruction and Pose Estimation |
ShinFang Chng, Sameera Ramasinghe, Jamie Sherrah, Simon Lucey |
|
code |
-1 |
Unbiased Gradient Estimation for Differentiable Surface Splatting via Poisson Sampling |
Jan U. Müller, Michael Weinmann, Reinhard Klein |
|
code |
-1 |
Towards Learning Neural Representations from Shadows |
Kushagra Tiwary, Tzofi Klinghoffer, Ramesh Raskar |
|
code |
-1 |
Class-Incremental Novel Class Discovery |
Subhankar Roy, Mingxuan Liu, Zhun Zhong, Nicu Sebe, Elisa Ricci |
|
code |
-1 |
Unknown-Oriented Learning for Open Set Domain Adaptation |
Jie Liu, Xiaoqing Guo, Yixuan Yuan |
|
code |
-1 |
Prototype-Guided Continual Adaptation for Class-Incremental Unsupervised Domain Adaptation |
Hongbin Lin, Yifan Zhang, Zhen Qiu, Shuaicheng Niu, Chuang Gan, Yanxia Liu, Mingkui Tan |
|
code |
-1 |
DecoupleNet: Decoupled Network for Domain Adaptive Semantic Segmentation |
Xin Lai, Zhuotao Tian, Xiaogang Xu, YingCong Chen, Shu Liu, Hengshuang Zhao, Liwei Wang, Jiaya Jia |
|
code |
-1 |
Class-Agnostic Object Counting Robust to Intraclass Diversity |
Shenjian Gong, Shanshan Zhang, Jian Yang, Dengxin Dai, Bernt Schiele |
|
code |
-1 |
Burn After Reading: Online Adaptation for Cross-domain Streaming Data |
Luyu Yang, Mingfei Gao, Zeyuan Chen, Ran Xu, Abhinav Shrivastava, Chetan Ramaiah |
|
code |
-1 |
Mind the Gap in Distilling StyleGANs |
Guodong Xu, Yuenan Hou, Ziwei Liu, Chen Change Loy |
|
code |
-1 |
Improving Test-Time Adaptation Via Shift-Agnostic Weight Regularization and Nearest Source Prototypes |
Sungha Choi, Seunghan Yang, Seokeon Choi, Sungrack Yun |
|
code |
-1 |
Learning Instance-Specific Adaptation for Cross-Domain Segmentation |
Yuliang Zou, Zizhao Zhang, ChunLiang Li, Han Zhang, Tomas Pfister, JiaBin Huang |
|
code |
-1 |
RegionCL: Exploring Contrastive Region Pairs for Self-supervised Representation Learning |
Yufei Xu, Qiming Zhang, Jing Zhang, Dacheng Tao |
|
code |
-1 |
Long-Tailed Class Incremental Learning |
Xialei Liu, YuSong Hu, XuSheng Cao, Andrew D. Bagdanov, Ke Li, MingMing Cheng |
|
code |
-1 |
DLCFT: Deep Linear Continual Fine-Tuning for General Incremental Learning |
Hyounguk Shon, Janghyeon Lee, Seung Hwan Kim, Junmo Kim |
|
code |
-1 |
Adversarial Partial Domain Adaptation by Cycle Inconsistency |
KunYu Lin, Jiaming Zhou, Yukun Qiu, WeiShi Zheng |
|
code |
-1 |
Combating Label Distribution Shift for Active Domain Adaptation |
Sehyun Hwang, Sohyun Lee, Sungyeon Kim, Jungseul Ok, Suha Kwak |
|
code |
-1 |
GIPSO: Geometrically Informed Propagation for Online Adaptation in 3D LiDAR Segmentation |
Cristiano Saltori, Evgeny Krivosheev, Stéphane Lathuilière, Nicu Sebe, Fabio Galasso, Giuseppe Fiameni, Elisa Ricci, Fabio Poiesi |
|
code |
-1 |
CoSMix: Compositional Semantic Mix for Domain Adaptation in 3D LiDAR Segmentation |
Cristiano Saltori, Fabio Galasso, Giuseppe Fiameni, Nicu Sebe, Elisa Ricci, Fabio Poiesi |
|
code |
-1 |
A Unified Framework for Domain Adaptive Pose Estimation |
Donghyun Kim, Kaihong Wang, Kate Saenko, Margrit Betke, Stan Sclaroff |
|
code |
-1 |
A Broad Study of Pre-training for Domain Generalization and Adaptation |
Donghyun Kim, Kaihong Wang, Stan Sclaroff, Kate Saenko |
|
code |
-1 |
Prior Knowledge Guided Unsupervised Domain Adaptation |
Tao Sun, Cheng Lu, Haibin Ling |
|
code |
-1 |
GCISG: Guided Causal Invariant Learning for Improved Syn-to-Real Generalization |
Gilhyun Nam, Gyeongjae Choi, Kyungmin Lee |
|
code |
-1 |
AcroFOD: An Adaptive Method for Cross-Domain Few-Shot Object Detection |
Yipeng Gao, Lingxiao Yang, Yunmu Huang, Song Xie, Shiyong Li, WeiShi Zheng |
|
code |
-1 |
Unsupervised Domain Adaptation for One-Stage Object Detector Using Offsets to Bounding Box |
Jayeon Yoo, Inseop Chung, Nojun Kwak |
|
code |
-1 |
Visual Prompt Tuning |
Menglin Jia, Luming Tang, BorChun Chen, Claire Cardie, Serge J. Belongie, Bharath Hariharan, SerNam Lim |
|
code |
-1 |
Quasi-Balanced Self-Training on Noise-Aware Synthesis of Object Point Clouds for Closing Domain Gap |
Yongwei Chen, Zihao Wang, Longkun Zou, Ke Chen, Kui Jia |
|
code |
-1 |
Cross-domain Ensemble Distillation for Domain Generalization |
Kyungmoon Lee, Sungyeon Kim, Suha Kwak |
|
code |
-1 |
Centrality and Consistency: Two-Stage Clean Samples Identification for Learning with Instance-Dependent Noisy Labels |
Ganlong Zhao, Guanbin Li, Yipeng Qin, Feng Liu, Yizhou Yu |
|
code |
-1 |
Hyperspherical Learning in Multi-Label Classification |
Bo Ke, Yunquan Zhu, Mengtian Li, Xiujun Shu, Ruizhi Qiao, Bo Ren |
|
code |
-1 |
When Active Learning Meets Implicit Semantic Data Augmentation |
Zhuangzhuang Chen, Jin Zhang, Pan Wang, Jie Chen, Jianqiang Li |
|
code |
-1 |
VL-LTR: Learning Class-wise Visual-Linguistic Representation for Long-Tailed Visual Recognition |
Changyao Tian, Wenhai Wang, Xizhou Zhu, Jifeng Dai, Yu Qiao |
|
code |
-1 |
Class Is Invariant to Context and Vice Versa: On Learning Invariance for Out-Of-Distribution Generalization |
Jiaxin Qi, Kaihua Tang, Qianru Sun, XianSheng Hua, Hanwang Zhang |
|
code |
-1 |
Hierarchical Semi-supervised Contrastive Learning for Contamination-Resistant Anomaly Detection |
Gaoang Wang, Yibing Zhan, Xinchao Wang, Mingli Song, Klara Nahrstedt |
|
code |
-1 |
Tracking by Associating Clips |
Sanghyun Woo, Kwanyong Park, Seoung Wug Oh, In So Kweon, JoonYoung Lee |
|
code |
-1 |
RealPatch: A Statistical Matching Framework for Model Patching with Real Samples |
Sara Romiti, Christopher Inskip, Viktoriia Sharmanska, Novi Quadrianto |
|
code |
-1 |
Background-Insensitive Scene Text Recognition with Text Semantic Segmentation |
Liang Zhao, Zhenyao Wu, Xinyi Wu, Greg Wilsbacher, Song Wang |
|
code |
-1 |
Semantic Novelty Detection via Relational Reasoning |
Francesco Cappio Borlino, Silvia Bucci, Tatiana Tommasi |
|
code |
-1 |
Improving Closed and Open-Vocabulary Attribute Prediction Using Transformers |
Khoi Pham, Kushal Kafle, Zhe Lin, Zhihong Ding, Scott Cohen, Quan Tran, Abhinav Shrivastava |
|
code |
-1 |
Training Vision Transformers with only 2040 Images |
YunHao Cao, Hao Yu, Jianxin Wu |
|
code |
-1 |
Bridging Images and Videos: A Simple Learning Framework for Large Vocabulary Video Object Detection |
Sanghyun Woo, Kwanyong Park, Seoung Wug Oh, In So Kweon, JoonYoung Lee |
|
code |
-1 |
TDAM: Top-Down Attention Module for Contextually Guided Feature Selection in CNNs |
Shantanu Jaiswal, Basura Fernando, Cheston Tan |
|
code |
-1 |
Automatic Check-Out via Prototype-Based Classifier Learning from Single-Product Exemplars |
Hao Chen, XiuShen Wei, Faen Zhang, Yang Shen, Hui Xu, Liang Xiao |
|
code |
-1 |
Overcoming Shortcut Learning in a Target Domain by Generalizing Basic Visual Factors from a Source Domain |
Piyapat Saranrittichai, Chaithanya Kumar Mummadi, Claudia Blaiotta, Mauricio Munoz, Volker Fischer |
|
code |
-1 |
Photo-realistic Neural Domain Randomization |
Sergey Zakharov, Rares Ambrus, Vitor Guizilini, Wadim Kehl, Adrien Gaidon |
|
code |
-1 |
Wave-ViT: Unifying Wavelet and Transformers for Visual Representation Learning |
Ting Yao, Yingwei Pan, Yehao Li, ChongWah Ngo, Tao Mei |
|
code |
-1 |
Tailoring Self-Supervision for Supervised Learning |
WonJun Moon, JiHwan Kim, JaePil Heo |
|
code |
-1 |
Difficulty-Aware Simulator for Open Set Recognition |
WonJun Moon, Jun Ho Park, Hyun Seok Seong, CheolHo Cho, JaePil Heo |
|
code |
-1 |
Few-Shot Class-Incremental Learning from an Open-Set Perspective |
Can Peng, Kun Zhao, Tianren Wang, Meng Li, Brian C. Lovell |
|
code |
-1 |
FOSTER: Feature Boosting and Compression for Class-Incremental Learning |
FuYun Wang, DaWei Zhou, HanJia Ye, DeChuan Zhan |
|
code |
-1 |
Visual Knowledge Tracing |
Neehar Kondapaneni, Pietro Perona, Oisin Mac Aodha |
|
code |
-1 |
S3C: Self-Supervised Stochastic Classifiers for Few-Shot Class-Incremental Learning |
Jayateja Kalla, Soma Biswas |
|
code |
-1 |
Improving Fine-Grained Visual Recognition in Low Data Regimes via Self-boosting Attention Mechanism |
Yangyang Shu, Baosheng Yu, Haiming Xu, Lingqiao Liu |
|
code |
-1 |
VSA: Learning Varied-Size Window Attention in Vision Transformers |
Qiming Zhang, Yufei Xu, Jing Zhang, Dacheng Tao |
|
code |
-1 |
Unbiased Manifold Augmentation for Coarse Class Subdivision |
Baoming Yan, Ke Gao, Bo Gao, Lin Wang, Jiang Yang, Xiaobo Li |
|
code |
-1 |
DenseHybrid: Hybrid Anomaly Detection for Dense Open-Set Recognition |
Matej Grcic, Petra Bevandic, Sinisa Segvic |
|
code |
-1 |
Rethinking Confidence Calibration for Failure Prediction |
Fei Zhu, Zhen Cheng, XuYao Zhang, ChengLin Liu |
|
code |
-1 |
Uncertainty-Guided Source-Free Domain Adaptation |
Subhankar Roy, Martin Trapp, Andrea Pilzer, Juho Kannala, Nicu Sebe, Elisa Ricci, Arno Solin |
|
code |
-1 |
Should All Proposals Be Treated Equally in Object Detection? |
Yunsheng Li, Yinpeng Chen, Xiyang Dai, Dongdong Chen, Mengchen Liu, Pei Yu, Ying Jin, Lu Yuan, Zicheng Liu, Nuno Vasconcelos |
|
code |
-1 |
ViP: Unified Certified Detection and Recovery for Patch Attack with Vision Transformers |
Junbo Li, Huan Zhang, Cihang Xie |
|
code |
-1 |
incDFM: Incremental Deep Feature Modeling for Continual Novelty Detection |
Amanda Rios, Nilesh A. Ahuja, Ibrahima J. Ndiour, Ergin Utku Genc, Laurent Itti, Omesh Tickoo |
|
code |
-1 |
IGFormer: Interaction Graph Transformer for Skeleton-Based Human Interaction Recognition |
Yunsheng Pang, Qiuhong Ke, Hossein Rahmani, James Bailey, Jun Liu |
|
code |
-1 |
PRIME: A Few Primitives Can Boost Robustness to Common Corruptions |
Apostolos Modas, Rahul Rade, Guillermo OrtizJiménez, SeyedMohsen MoosaviDezfooli, Pascal Frossard |
|
code |
-1 |
Rotation Regularization Without Rotation |
Takumi Kobayashi |
|
code |
-1 |
Towards Accurate Open-Set Recognition via Background-Class Regularization |
Wonwoo Cho, Jaegul Choo |
|
code |
-1 |
In Defense of Image Pre-Training for Spatiotemporal Recognition |
Xianhang Li, Huiyu Wang, Chen Wei, Jieru Mei, Alan L. Yuille, Yuyin Zhou, Cihang Xie |
|
code |
-1 |
Augmenting Deep Classifiers with Polynomial Neural Networks |
Grigorios G. Chrysos, Markos Georgopoulos, Jiankang Deng, Jean Kossaifi, Yannis Panagakis, Anima Anandkumar |
|
code |
-1 |
Learning with Noisy Labels by Efficient Transition Matrix Estimation to Combat Label Miscorrection |
Seong Min Kye, Kwanghee Choi, Joonyoung Yi, Buru Chang |
|
code |
-1 |
Online Task-free Continual Learning with Dynamic Sparse Distributed Memory |
Julien Pourcel, NgocSon Vu, Robert M. French |
|
code |
-1 |
Contrastive Deep Supervision |
Linfeng Zhang, Xin Chen, Junbo Zhang, Runpei Dong, Kaisheng Ma |
|
code |
-1 |
Discriminability-Transferability Trade-Off: An Information-Theoretic Perspective |
Quan Cui, Bingchen Zhao, ZhaoMin Chen, Borui Zhao, Renjie Song, Boyan Zhou, Jiajun Liang, Osamu Yoshie |
|
code |
-1 |
LocVTP: Video-Text Pre-training for Temporal Localization |
Meng Cao, Tianyu Yang, Junwu Weng, Can Zhang, Jue Wang, Yuexian Zou |
|
code |
-1 |
Few-Shot End-to-End Object Detection via Constantly Concentrated Encoding Across Heads |
Jiawei Ma, Guangxing Han, Shiyuan Huang, Yuncong Yang, ShihFu Chang |
|
code |
-1 |
Implicit Neural Representations for Image Compression |
Yannick Strümpler, Janis Postels, Ren Yang, Luc Van Gool, Federico Tombari |
|
code |
-1 |
LiP-Flow: Learning Inference-Time Priors for Codec Avatars via Normalizing Flows in Latent Space |
Emre Aksan, Shugao Ma, Akin Caliskan, Stanislav Pidhorskyi, Alexander Richard, ShihEn Wei, Jason M. Saragih, Otmar Hilliges |
|
code |
-1 |
Learning to Drive by Watching YouTube Videos: Action-Conditioned Contrastive Policy Pretraining |
Qihang Zhang, Zhenghao Peng, Bolei Zhou |
|
code |
-1 |
Learning Ego 3D Representation as Ray Tracing |
Jiachen Lu, Zheyuan Zhou, Xiatian Zhu, Hang Xu, Li Zhang |
|
code |
-1 |
Static and Dynamic Concepts for Self-supervised Video Representation Learning |
Rui Qian, Shuangrui Ding, Xian Liu, Dahua Lin |
|
code |
-1 |
SphereFed: Hyperspherical Federated Learning |
Xin Dong, Sai Qian Zhang, Ang Li, H. T. Kung |
|
code |
-1 |
Hierarchically Self-supervised Transformer for Human Skeleton Representation Learning |
Yuxiao Chen, Long Zhao, Jianbo Yuan, Yu Tian, Zhaoyang Xia, Shijie Geng, Ligong Han, Dimitris N. Metaxas |
|
code |
-1 |
Posterior Refinement on Metric Matrix Improves Generalization Bound in Metric Learning |
Mingda Wang, Canqian Yang, Yi Xu |
|
code |
-1 |
Balancing Stability and Plasticity Through Advanced Null Space in Continual Learning |
Yajing Kong, Liu Liu, Zhen Wang, Dacheng Tao |
|
code |
-1 |
DisCo: Remedying Self-supervised Learning on Lightweight Models with Distilled Contrastive Learning |
Yuting Gao, JiaXin Zhuang, Shaohui Lin, Hao Cheng, Xing Sun, Ke Li, Chunhua Shen |
|
code |
-1 |
CoSCL: Cooperation of Small Continual Learners is Stronger Than a Big One |
Liyuan Wang, Xingxing Zhang, Qian Li, Jun Zhu, Yi Zhong |
|
code |
-1 |
Manifold Adversarial Learning for Cross-domain 3D Shape Representation |
Hao Huang, Cheng Chen, Yi Fang |
|
code |
-1 |
Fast-MoCo: Boost Momentum-Based Contrastive Learning with Combinatorial Patches |
Yuanzheng Ci, Chen Lin, Lei Bai, Wanli Ouyang |
|
code |
-1 |
LoRD: Local 4D Implicit Representation for High-Fidelity Dynamic Human Modeling |
Boyan Jiang, Xinlin Ren, Mingsong Dou, Xiangyang Xue, Yanwei Fu, Yinda Zhang |
|
code |
-1 |
On the Versatile Uses of Partial Distance Correlation in Deep Learning |
Xingjian Zhen, Zihang Meng, Rudrasis Chakraborty, Vikas Singh |
|
code |
-1 |
Self-Regulated Feature Learning via Teacher-free Feature Distillation |
Lujun Li |
|
code |
-1 |
Balancing Between Forgetting and Acquisition in Incremental Subpopulation Learning |
Mingfu Liang, Jiahuan Zhou, Wei Wei, Ying Wu |
|
code |
-1 |
Counterfactual Intervention Feature Transfer for Visible-Infrared Person Re-identification |
Xulin Li, Yan Lu, Bin Liu, Yating Liu, Guojun Yin, Qi Chu, Jinyang Huang, Feng Zhu, Rui Zhao, Nenghai Yu |
|
code |
-1 |
DAS: Densely-Anchored Sampling for Deep Metric Learning |
Lizhao Liu, Shangxin Huang, Zhuangwei Zhuang, Ran Yang, Mingkui Tan, Yaowei Wang |
|
code |
-1 |
Learn from All: Erasing Attention Consistency for Noisy Label Facial Expression Recognition |
Yuhang Zhang, Chengrui Wang, Xu Ling, Weihong Deng |
|
code |
-1 |
A Non-isotropic Probabilistic Take on Proxy-based Deep Metric Learning |
Michael Kirchhof, Karsten Roth, Zeynep Akata, Enkelejda Kasneci |
|
code |
-1 |
TokenMix: Rethinking Image Mixing for Data Augmentation in Vision Transformers |
Jihao Liu, Boxiao Liu, Hang Zhou, Hongsheng Li, Yu Liu |
|
code |
-1 |
UFO: Unified Feature Optimization |
Teng Xi, Yifan Sun, Deli Yu, Bi Li, Nan Peng, Gang Zhang, Xinyu Zhang, Zhigang Wang, Jinwen Chen, Jian Wang, Lufei Liu, Haocheng Feng, Junyu Han, Jingtuo Liu, Errui Ding, Jingdong Wang |
|
code |
-1 |
Sound Localization by Self-supervised Time Delay Estimation |
Ziyang Chen, David F. Fouhey, Andrew Owens |
|
code |
-1 |
X-Learner: Learning Cross Sources and Tasks for Universal Visual Representation |
Yinan He, Gengshi Huang, Siyu Chen, Jianing Teng, Kun Wang, Zhenfei Yin, Lu Sheng, Ziwei Liu, Yu Qiao, Jing Shao |
|
code |
-1 |
SLIP: Self-supervision Meets Language-Image Pre-training |
Norman Mu, Alexander Kirillov, David A. Wagner, Saining Xie |
|
code |
-1 |
Discovering Deformable Keypoint Pyramids |
Jianing Qian, Anastasios Panagopoulos, Dinesh Jayaraman |
|
code |
-1 |
Neural Video Compression Using GANs for Detail Synthesis and Propagation |
Fabian Mentzer, Eirikur Agustsson, Johannes Ballé, David Minnen, Nick Johnston, George Toderici |
|
code |
-1 |
A Contrastive Objective for Learning Disentangled Representations |
Jonathan Kahana, Yedid Hoshen |
|
code |
-1 |
PT4AL: Using Self-supervised Pretext Tasks for Active Learning |
John Seon Keun Yi, Minseok Seo, Jongchan Park, DongGeol Choi |
|
code |
-1 |
ParC-Net: Position Aware Circular Convolution with Merits from ConvNets and Transformer |
Haokui Zhang, Wenze Hu, Xiaoyu Wang |
|
code |
-1 |
DualPrompt: Complementary Prompting for Rehearsal-Free Continual Learning |
Zifeng Wang, Zizhao Zhang, Sayna Ebrahimi, Ruoxi Sun, Han Zhang, ChenYu Lee, Xiaoqi Ren, Guolong Su, Vincent Perot, Jennifer G. Dy, Tomas Pfister |
|
code |
-1 |
Unifying Visual Contrastive Learning for Object Recognition from a Graph Perspective |
Shixiang Tang, Feng Zhu, Lei Bai, Rui Zhao, Chenyu Wang, Wanli Ouyang |
|
code |
-1 |
Decoupled Contrastive Learning |
ChunHsiao Yeh, ChengYao Hong, YenChi Hsu, TyngLuh Liu, Yubei Chen, Yann LeCun |
|
code |
-1 |
Joint Learning of Localized Representations from Medical Images and Reports |
Philip Müller, Georgios Kaissis, Congyu Zou, Daniel Rueckert |
|
code |
-1 |
The Challenges of Continuous Self-Supervised Learning |
Senthil Purushwalkam, Pedro Morgado, Abhinav Gupta |
|
code |
-1 |
Conditional Stroke Recovery for Fine-Grained Sketch-Based Image Retrieval |
Zhixin Ling, Zhen Xing, Jian Zhou, Xiangdong Zhou |
|
code |
-1 |
Identifying Hard Noise in Long-Tailed Sample Distribution |
Xuanyu Yi, Kaihua Tang, XianSheng Hua, JooHwee Lim, Hanwang Zhang |
|
code |
-1 |
Interpretable Open-Set Domain Adaptation via Angular Margin Separation |
Xinhao Li, Jingjing Li, Zhekai Du, Lei Zhu, Wen Li |
|
code |
-1 |
TACS: Taxonomy Adaptive Cross-Domain Semantic Segmentation |
Rui Gong, Martin Danelljan, Dengxin Dai, Danda Pani Paudel, Ajad Chhatkuli, Fisher Yu, Luc Van Gool |
|
code |
-1 |
Prototypical Contrast Adaptation for Domain Adaptive Semantic Segmentation |
Zhengkai Jiang, Yuxi Li, Ceyuan Yang, Peng Gao, Yabiao Wang, Ying Tai, Chengjie Wang |
|
code |
-1 |
RBC: Rectifying the Biased Context in Continual Semantic Segmentation |
Hanbin Zhao, Fengyu Yang, Xinghe Fu, Xi Li |
|
code |
-1 |
Factorizing Knowledge in Neural Networks |
Xingyi Yang, Jingwen Ye, Xinchao Wang |
|
code |
-1 |
Contrastive Vicinal Space for Unsupervised Domain Adaptation |
Jaemin Na, Dongyoon Han, Hyung Jin Chang, Wonjun Hwang |
|
code |
-1 |
Cross-Modal Knowledge Transfer Without Task-Relevant Source Data |
Sk Miraj Ahmed, Suhas Lohit, KuanChuan Peng, Michael Jones, Amit K. RoyChowdhury |
|
code |
-1 |
Online Domain Adaptation for Semantic Segmentation in Ever-Changing Conditions |
Theodoros Panagiotakopoulos, Pier Luigi Dovesi, Linus HärenstamNielsen, Matteo Poggi |
|
code |
-1 |
Source-Free Video Domain Adaptation by Learning Temporal Consistency for Action Recognition |
Yuecong Xu, Jianfei Yang, Haozhi Cao, Keyu Wu, Min Wu, Zhenghua Chen |
|
code |
-1 |
BMD: A General Class-Balanced Multicentric Dynamic Prototype Strategy for Source-Free Domain Adaptation |
Sanqing Qu, Guang Chen, Jing Zhang, Zhijun Li, Wei He, Dacheng Tao |
|
code |
-1 |
Generalized Brain Image Synthesis with Transferable Convolutional Sparse Coding Networks |
Yawen Huang, Feng Zheng, Xu Sun, Yuexiang Li, Ling Shao, Yefeng Zheng |
|
code |
-1 |
Incomplete Multi-view Domain Adaptation via Channel Enhancement and Knowledge Transfer |
Haifeng Xia, Pu Wang, Zhengming Ding |
|
code |
-1 |
DistPro: Searching a Fast Knowledge Distillation Process via Meta Optimization |
Xueqing Deng, Dawei Sun, Shawn D. Newsam, Peng Wang |
|
code |
-1 |
ML-BPM: Multi-teacher Learning with Bidirectional Photometric Mixing for Open Compound Domain Adaptation in Semantic Segmentation |
Fei Pan, Sungsu Hur, Seokju Lee, Junsik Kim, In So Kweon |
|
code |
-1 |
PACTran: PAC-Bayesian Metrics for Estimating the Transferability of Pretrained Models to Classification Tasks |
Nan Ding, Xi Chen, Tomer Levinboim, Soravit Changpinyo, Radu Soricut |
|
code |
-1 |
Personalized Education: Blind Knowledge Distillation |
Xiang Deng, Jian Zheng, Zhongfei Zhang |
|
code |
-1 |
Not All Models Are Equal: Predicting Model Transferability in a Self-challenging Fisher Space |
Wenqi Shao, Xun Zhao, Yixiao Ge, Zhaoyang Zhang, Lei Yang, Xiaogang Wang, Ying Shan, Ping Luo |
|
code |
-1 |
How Stable Are Transferability Metrics Evaluations? |
Andrea Agostinelli, Michal Pándy, Jasper R. R. Uijlings, Thomas Mensink, Vittorio Ferrari |
|
code |
-1 |
Attention Diversification for Domain Generalization |
Rang Meng, Xianfeng Li, Weijie Chen, Shicai Yang, Jie Song, Xinchao Wang, Lei Zhang, Mingli Song, Di Xie, Shiliang Pu |
|
code |
-1 |
ESS: Learning Event-Based Semantic Segmentation from Still Images |
Zhaoning Sun, Nico Messikommer, Daniel Gehrig, Davide Scaramuzza |
|
code |
-1 |
An Efficient Spatio-Temporal Pyramid Transformer for Action Detection |
Yuetian Weng, Zizheng Pan, Mingfei Han, Xiaojun Chang, Bohan Zhuang |
|
code |
-1 |
Human Trajectory Prediction via Neural Social Physics |
Jiangbei Yue, Dinesh Manocha, He Wang |
|
code |
-1 |
Towards Open Set Video Anomaly Detection |
Yuansheng Zhu, Wentao Bao, Qi Yu |
|
code |
-1 |
EclipSE: Efficient Long-Range Video Retrieval Using Sight and Sound |
YanBo Lin, Jie Lei, Mohit Bansal, Gedas Bertasius |
|
code |
-1 |
Joint-Modal Label Denoising for Weakly-Supervised Audio-Visual Video Parsing |
Haoyue Cheng, Zhaoyang Liu, Hang Zhou, Chen Qian, Wayne Wu, Limin Wang |
|
code |
-1 |
Less Than Few: Self-shot Video Instance Segmentation |
Pengwan Yang, Yuki M. Asano, Pascal Mettes, Cees G. M. Snoek |
|
code |
-1 |
Adaptive Face Forgery Detection in Cross Domain |
Luchuan Song, Zheng Fang, Xiaodan Li, Xiaoyi Dong, Zhenchao Jin, Yuefeng Chen, Siwei Lyu |
|
code |
-1 |
Real-Time Online Video Detection with Temporal Smoothing Transformers |
Yue Zhao, Philipp Krähenbühl |
|
code |
-1 |
TallFormer: Temporal Action Localization with a Long-Memory Transformer |
Feng Cheng, Gedas Bertasius |
|
code |
-1 |
Mining Relations Among Cross-Frame Affinities for Video Semantic Segmentation |
Guolei Sun, Yun Liu, Hao Tang, Ajad Chhatkuli, Le Zhang, Luc Van Gool |
|
code |
-1 |
TL;DW? Summarizing Instructional Videos with Task Relevance and Cross-Modal Saliency |
Medhini Narasimhan, Arsha Nagrani, Chen Sun, Michael Rubinstein, Trevor Darrell, Anna Rohrbach, Cordelia Schmid |
|
code |
-1 |
Rethinking Learning Approaches for Long-Term Action Anticipation |
Megha Nawhal, Akash Abdu Jyothi, Greg Mori |
|
code |
-1 |
DualFormer: Local-Global Stratified Transformer for Efficient Video Recognition |
Yuxuan Liang, Pan Zhou, Roger Zimmermann, Shuicheng Yan |
|
code |
-1 |
Hierarchical Feature Alignment Network for Unsupervised Video Object Segmentation |
Gensheng Pei, Fumin Shen, Yazhou Yao, GuoSen Xie, Zhenmin Tang, Jinhui Tang |
|
code |
-1 |
PAC-Net: Highlight Your Video via History Preference Modeling |
Hang Wang, Penghao Zhou, Chong Zhou, Zhao Zhang, Xing Sun |
|
code |
-1 |
How Severe Is Benchmark-Sensitivity in Video Self-supervised Learning? |
Fida Mohammad Thoker, Hazel Doughty, Piyush Bagad, Cees G. M. Snoek |
|
code |
-1 |
A Sliding Window Scheme for Online Temporal Action Localization |
Young Hwi Kim, Hyolim Kang, Seon Joo Kim |
|
code |
-1 |
ERA: Expert Retrieval and Assembly for Early Action Prediction |
Lin Geng Foo, Tianjiao Li, Hossein Rahmani, Qiuhong Ke, Jun Liu |
|
code |
-1 |
Dual Perspective Network for Audio-Visual Event Localization |
Varshanth R. Rao, Md Ibrahim Khalil, Haoda Li, Peng Dai, Juwei Lu |
|
code |
-1 |
NSNet: Non-saliency Suppression Sampler for Efficient Video Recognition |
Boyang Xia, Wenhao Wu, Haoran Wang, Rui Su, Dongliang He, Haosen Yang, Xiaoran Fan, Wanli Ouyang |
|
code |
-1 |
Video Activity Localisation with Uncertainties in Temporal Boundary |
Jiabo Huang, Hailin Jin, Shaogang Gong, Yang Liu |
|
code |
-1 |
Temporal Saliency Query Network for Efficient Video Recognition |
Boyang Xia, Zhihao Wang, Wenhao Wu, Haoran Wang, Jungong Han |
|
code |
-1 |
Reversed Image Signal Processing and RAW Reconstruction. AIM 2022 Challenge Report |
Marcos V. Conde, Radu Timofte, Yibin Huang, Jingyang Peng, Chang Chen, Cheng Li, Eduardo PérezPellitero, Fenglong Song, Furui Bai, Shuai Liu, Chaoyu Feng, Xiaotao Wang, Lei Lei, Yu Zhu, Chenghua Li, Yingying Jiang, Yong A, Peisong Wang, Cong Leng, Jian Cheng, Xiaoyu Liu, Zhicun Yin, Zhilu Zhang, Junyi Li, Ming Liu, Wangmeng Zuo, Jun Jiang, Jinha Kim, Yue Zhang, Beiji Zou, Zhikai Zong, Xiaoxiao Liu, Juan MarínVega, Michael Sloth, Peter SchneiderKamp, Richard Röttger, Furkan Kinli, Baris Özcan, Furkan Kiraç, Li Leyi, S. M. Nadim Uddin, Dipon Kumar Ghosh, Yong Ju Jung |
|
code |
-1 |
AIM 2022 Challenge on Instagram Filter Removal: Methods and Results |
Furkan Kinli, Sami Mentes, Baris Özcan, Furkan Kiraç, Radu Timofte, Yi Zuo, Zitao Wang, Xiaowen Zhang, Yu Zhu, Chenghua Li, Cong Leng, Jian Cheng, Shuai Liu, Chaoyu Feng, Furui Bai, Xiaotao Wang, Lei Lei, Tianzhi Ma, Zihan Gao, Wenxin He, WoonHa Yeo, WangTaek Oh, YoungIl Kim, HanCheol Ryu, Gang He, Shaoyi Long, S. M. A. Sharif, Rizwan Ali Naqvi, Sungjun Kim, Guisik Kim, Seohyeon Lee, Sabari Nathan, Priya Kansal |
|
code |
-1 |
Learned Smartphone ISP on Mobile GPUs with Deep Learning, Mobile AI & AIM 2022 Challenge: Report |
Andrey Ignatov, Radu Timofte, Shuai Liu, Chaoyu Feng, Furui Bai, Xiaotao Wang, Lei Lei, Ziyao Yi, Yan Xiang, Zibin Liu, Shaoqing Li, Keming Shi, Dehui Kong, Ke Xu, Minsu Kwon, Yaqi Wu, Jiesi Zheng, Zhihao Fan, Xun Wu, Feng Zhang, Albert No, Minhyeok Cho, Zewen Chen, Xiaze Zhang, Ran Li, Juan Wang, Zhiming Wang, Marcos V. Conde, UiJin Choi, Georgy Perevozchikov, Egor I. Ershov, Zheng Hui, Mengchuan Dong, Xin Lou, Wei Zhou, Cong Pang, Haina Qin, Mingxuan Cai |
|
code |
-1 |
Efficient Single-Image Depth Estimation on Mobile Devices, Mobile AI & AIM 2022 Challenge: Report |
Andrey Ignatov, Grigory Malivenko, Radu Timofte, Lukasz Treszczotko, Xin Chang, Piotr Ksiazek, Michal Lopuszynski, Maciej Pioro, Rafal Rudnicki, Maciej Smyl, Yujie Ma, Zhenyu Li, Zehui Chen, Jialei Xu, Xianming Liu, Junjun Jiang, XueChao Shi, Difan Xu, Yanan Li, Xiaotao Wang, Lei Lei, Ziyu Zhang, Yicheng Wang, Zilong Huang, Guozhong Luo, Gang Yu, Bin Fu, Jiaqi Li, Yiran Wang, Zihao Huang, Zhiguo Cao, Marcos V. Conde, Denis Sapozhnikov, Byeong Hyun Lee, Dongwon Park, Seongmin Hong, Joonhee Lee, Seunggyu Lee, Se Young Chun |
|
code |
-1 |
Efficient and Accurate Quantized Image Super-Resolution on Mobile NPUs, Mobile AI & AIM 2022 Challenge: Report |
Andrey Ignatov, Radu Timofte, Maurizio Denna, Abdel Younes, Ganzorig Gankhuyag, Jingang Huh, Myeong Kyun Kim, Kihwan Yoon, HyeonCheol Moon, Seungho Lee, Yoonsik Choe, Jinwoo Jeong, Sungjei Kim, Maciej Smyl, Tomasz Latkowski, Pawel Kubik, Michal Sokolski, Yujie Ma, Jiahao Chao, Zhou Zhou, Hongfan Gao, Zhengfeng Yang, Zhenbing Zeng, Zhengyang Zhuge, Chenghua Li, Dan Zhu, Mengdi Sun, Ran Duan, Yan Gao, Lingshun Kong, Long Sun, Xiang Li, Xingdong Zhang, Jiawei Zhang, Yaqi Wu, Jinshan Pan, Gaocheng Yu, Jin Zhang, Feng Zhang, Zhe Ma, Hongbin Wang, Hojin Cho, Steve Kim, Huaen Li, Yanbo Ma, Ziwei Luo, Youwei Li, Lei Yu, Zhihong Wen, Qi Wu, Haoqiang Fan, Shuaicheng Liu, Lize Zhang, Zhikai Zong, Jeremy Kwon, Junxi Zhang, Mengyuan Li, Nianxiang Fu, Guanchen Ding, Han Zhu, Zhenzhong Chen, Gen Li, Yuanfan Zhang, Lei Sun, Dafeng Zhang, Neo Yang, Fitz Liu, Jerry Zhao, Mustafa Ayazoglu, Bahri Batuhan Bilecen, Shota Hirose, Kasidis Arunruangsirilert, Luo Ao, Ho Chun Leung, Andrew Wei, Jie Liu, Qiang Liu, Dahai Yu, Ao Li, Lei Luo, Ce Zhu, Seongmin Hong, Dongwon Park, Joonhee Lee, Byeong Hyun Lee, Seunggyu Lee, Se Young Chun, Ruiyuan He, Xuhao Jiang, Haihang Ruan, Xinjian Zhang, Jing Liu, Garas Gendy, Nabil Sabor, Jingchao Hou, Guanghui He |
|
code |
-1 |
Power Efficient Video Super-Resolution on Mobile NPUs with Deep Learning, Mobile AI & AIM 2022 Challenge: Report |
Andrey Ignatov, Radu Timofte, ChengMing Chiang, HsienKai Kuo, YuSyuan Xu, ManYu Lee, Allen Lu, ChiaMing Cheng, ChihCheng Chen, JiaYing Yong, HongHan Shuai, WenHuang Cheng, Zhuang Jia, Tianyu Xu, Yijian Zhang, Long Bao, Heng Sun, Diankai Zhang, Si Gao, Shaoli Liu, Biao Wu, Xiaofeng Zhang, Chengjian Zheng, Kaidi Lu, Ning Wang, Xiao Sun, Haodong Wu, Xuncheng Liu, Weizhan Zhang, Caixia Yan, Haipeng Du, Qinghua Zheng, Qi Wang, Wangdu Chen, Ran Duan, Mengdi Sun, Dan Zhu, Guannan Chen, Hojin Cho, Steve Kim, Shijie Yue, Chenghua Li, Zhengyang Zhuge, Wei Chen, Wenxu Wang, Yufeng Zhou, Xiaochen Cai, Hengxing Cai, Kele Xu, Li Liu, Zehua Cheng, Wenyi Lian, Wenjing Lian |
|
code |
-1 |
Realistic Bokeh Effect Rendering on Mobile GPUs, Mobile AI & AIM 2022 Challenge: Report |
Andrey Ignatov, Radu Timofte, Jin Zhang, Feng Zhang, Gaocheng Yu, Zhe Ma, Hongbin Wang, Minsu Kwon, Haotian Qian, Wentao Tong, Pan Mu, Ziping Wang, Guangjing Yan, Brian Lee, Lei Fei, Huaijin Chen, Hyebin Cho, Byeongjun Kwon, Munchurl Kim, Mingyang Qian, Huixin Ma, Yanan Li, Xiaotao Wang, Lei Lei |
|
code |
-1 |
AIM 2022 Challenge on Super-Resolution of Compressed Image and Video: Dataset, Methods and Results |
Ren Yang, Radu Timofte, Xin Li, Qi Zhang, Lin Zhang, Fanglong Liu, Dongliang He, Fu Li, He Zheng, Weihang Yuan, Pavel Ostyakov, Dmitry Vyal, Magauiya Zhussip, Xueyi Zou, Youliang Yan, Lei Li, Jingzhu Tang, Ming Chen, Shijie Zhao, Yu Zhu, Xiaoran Qin, Chenghua Li, Cong Leng, Jian Cheng, Claudio Rota, Marco Buzzelli, Simone Bianco, Raimondo Schettini, Dafeng Zhang, Feiyu Huang, Shizhuo Liu, Xiaobing Wang, Zhezhu Jin, Bingchen Li, Xin Li, Mingxi Li, Ding Liu, Wenbin Zou, Peijie Dong, Tian Ye, Yunchen Zhang, Ming Tan, Xin Niu, Mustafa Ayazoglu, Marcos Conde, UiJin Choi, Zhuang Jia, Tianyu Xu, Yijian Zhang, Mao Ye, Dengyan Luo, Xiaofeng Pan, Liuhan Peng |
|
code |
-1 |
Swin-Unet: Unet-Like Pure Transformer for Medical Image Segmentation |
Hu Cao, Yueyue Wang, Joy Chen, Dongsheng Jiang, Xiaopeng Zhang, Qi Tian, Manning Wang |
|
code |
-1 |
Self-attention Capsule Network for Tissue Classification in Case of Challenging Medical Image Statistics |
Assaf Hoogi, Brian Wilcox, Yachee Gupta, Daniel L. Rubin |
|
code |
-1 |
ReLaX: Retinal Layer Attribution for Guided Explanations of Automated Optical Coherence Tomography Classification |
Evan Wen, ReBecca Sorenson, Max Ehrlich |
|
code |
-1 |
Neural Registration and Segmentation of White Matter Tracts in Multi-modal Brain MRI |
Noa Barzilay, Ilya Nelkenbaum, Eli Konen, Nahum Kiryati, Arnaldo Mayer |
|
code |
-1 |
Complementary Phase Encoding for Pair-Wise Neural Deblurring of Accelerated Brain MRI |
Gali Hod, Michael Green, Mark Waserman, Eli Konen, Shai Shrot, Ilya Nelkenbaum, Nahum Kiryati, Arnaldo Mayer |
|
code |
-1 |
Frequency Dropout: Feature-Level Regularization via Randomized Filtering |
Mobarakol Islam, Ben Glocker |
|
code |
-1 |
PVBM: A Python Vasculature Biomarker Toolbox Based on Retinal Blood Vessel Segmentation |
Jonathan Fhima, Jan Van Eijgen, Ingeborg Stalmans, Yevgeniy Men, Moti Freiman, Joachim A. Behar |
|
code |
-1 |
Simultaneous Detection and Classification of Partially and Weakly Supervised Cells |
Alona Golts, Ido Livneh, Yaniv Zohar, Aaron Ciechanover, Michael Elad |
|
code |
-1 |
Deep-ASPECTS: A Segmentation-Assisted Model for Stroke Severity Measurement |
Ujjwal Upadhyay, Mukul Ranjan, Satish Golla, Swetha Tanamala, Preetham Sreenivas, Sasank Chilamkurthy, Jeyaraj Pandian, Jason Tarpley |
|
code |
-1 |
ExSwin-Unet: An Unbalanced Weighted Unet with Shifted Window and External Attentions for Fetal Brain MRI Image Segmentation |
Yufei Wen, Chongxin Liang, Jingyin Lin, Huisi Wu, Jing Qin |
|
code |
-1 |
Contour Dice Loss for Structures with Fuzzy and Complex Boundaries in Fetal MRI |
Bella SpecktorFadida, Bossmat Yehuda, Daphna LinkSourani, Liat BenSira, Dafna BenBashat, Leo Joskowicz |
|
code |
-1 |
Multi-scale Multi-task Distillation for Incremental 3D Medical Image Segmentation |
Mu Tian, Qinzhu Yang, Yi Gao |
|
code |
-1 |
A Data-Efficient Deep Learning Framework for Segmentation and Classification of Histopathology Images |
Pranav Singh, Jacopo Cirrone |
|
code |
-1 |
Bounded Future MS-TCN++ for Surgical Gesture Recognition |
Adam Goldbraikh, Netanell Avisdris, Carla M. Pugh, Shlomi Laufer |
|
code |
-1 |
Anatomy-Aware Contrastive Representation Learning for Fetal Ultrasound |
Zeyu Fu, Jianbo Jiao, Robail Yasrab, Lior Drukker, Aris T. Papageorghiou, J. Alison Noble |
|
code |
-1 |
Joint Calibrationless Reconstruction and Segmentation of Parallel MRI |
Aniket Pramanik, Mathews Jacob |
|
code |
-1 |
Patient-Level Microsatellite Stability Assessment from Whole Slide Images by Combining Momentum Contrast Learning and Group Patch Embeddings |
Daniel Shats, Hadar Hezi, Guy Shani, Yosef E. Maruvka, Moti Freiman |
|
code |
-1 |
Segmenting Glandular Biopsy Images Using the Separate Merged Objects Algorithm |
David Sabban, Ilan Shimshoni |
|
code |
-1 |
qDWI-Morph: Motion-Compensated Quantitative Diffusion-Weighted MRI Analysis for Fetal Lung Maturity Assessment |
Yael ZaffraniReznikov, Onur Afacan, Sila Kurugol, Simon K. Warfield, Moti Freiman |
|
code |
-1 |
Estimating Withdrawal Time in Colonoscopies |
Liran Katzir, Danny Veikherman, Valentin Dashinsky, Roman Goldenberg, Ilan Shimshoni, Nadav Rabani, Regev Cohen, Ori Kelner, Ehud Rivlin, Daniel Freedman |
|
code |
-1 |
Beyond Local Processing: Adapting CNNs for CT Reconstruction |
Bassel Hamoud, Yuval Bahat, Tomer Michaeli |
|
code |
-1 |
CL-GAN: Contrastive Learning-Based Generative Adversarial Network for Modality Transfer with Limited Paired Data |
Hajar Emami, Ming Dong, Carri GlideHurst |
|
code |
-1 |
IMPaSh: A Novel Domain-Shift Resistant Representation for Colorectal Cancer Tissue Classification |
Trinh Thi Le Vuong, Quoc Dang Vu, Mostafa Jahanifar, Simon Graham, Jin Tae Kwak, Nasir M. Rajpoot |
|
code |
-1 |
Surgical Workflow Recognition: From Analysis of Challenges to Architectural Study |
Tobias Czempiel, Aidean Sharghi, Magdalini Paschali, Nassir Navab, Omid Mohareri |
|
code |
-1 |
RVENet: A Large Echocardiographic Dataset for the Deep Learning-Based Assessment of Right Ventricular Function |
Bálint Magyar, Márton Tokodi, András Soós, Máté Tolvaj, Bálint Károly Lakatos, Alexandra Fábián, Elena Surkova, Béla Merkely, Attila Kovács, András Horváth |
|
code |
-1 |
Initialization and Alignment for Adversarial Texture Optimization |
Xiaoming Zhao, Zhizhen Zhao, Alexander G. Schwing |
|
code |
-1 |
SIGNet: Intrinsic Image Decomposition by a Semantic and Invariant Gradient Driven Network for Indoor Scenes |
Partha Das, Sezer Karaoglu, Arjan Gijsenij, Theo Gevers |
|
code |
-1 |
Implicit Map Augmentation for Relocalization |
Yuxin Hou, Tianwei Shen, TsunYi Yang, Daniel DeTone, Hyo Jin Kim, Chris Sweeney, Richard A. Newcombe |
|
code |
-1 |
Social Processes: Self-supervised Meta-learning Over Conversational Groups for Forecasting Nonverbal Social Cues |
Chirag Raman, Hayley Hung, Marco Loog |
|
code |
-1 |
Photo-Realistic 360$^{\circ }$ Head Avatars in the Wild |
Stanislaw Szymanowicz, Virginia Estellers, Tadas Baltrusaitis, Matthew Johnson |
|
code |
-1 |
AvatarGen: A 3D Generative Model for Animatable Human Avatars |
Jianfeng Zhang, Zihang Jiang, Dingdong Yang, Hongyi Xu, Yichun Shi, Guoxian Song, Zhongcong Xu, Xinchao Wang, Jiashi Feng |
|
code |
-1 |
INGeo: Accelerating Instant Neural Scene Reconstruction with Noisy Geometry Priors |
Chaojian Li, Bichen Wu, Albert Pumarola, Peizhao Zhang, Yingyan Lin, Peter Vajda |
|
code |
-1 |
Number-Adaptive Prototype Learning for 3D Point Cloud Semantic Segmentation |
Yangheng Zhao, Jun Wang, Xiaolong Li, Yue Hu, Ce Zhang, Yanfeng Wang, Siheng Chen |
|
code |
-1 |
Self-supervised 3D Human Pose Estimation in Static Video via Neural Rendering |
Luca Schmidtke, Benjamin Hou, Athanasios Vlontzos, Bernhard Kainz |
|
code |
-1 |
Racial Bias in the Beautyverse: Evaluation of Augmented-Reality Beauty Filters |
Piera Riccio, Nuria Oliver |
|
code |
-1 |
LWA-HAND: Lightweight Attention Hand for Interacting Hand Reconstruction |
Xinhan Di, Pengqian Yu |
|
code |
-1 |
Neural Mesh-Based Graphics |
Shubhendu Jena, Franck Multon, Adnane Boukhayma |
|
code |
-1 |
One-Shot Learning for Human Affordance Detection |
Abel PachecoOrtega, Walterio W. MayolCuevas |
|
code |
-1 |
Fast Two-View Motion Segmentation Using Christoffel Polynomials |
Bengisu Özbay, Octavia I. Camps, Mario Sznaier |
|
code |
-1 |
UCTNet: Uncertainty-Aware Cross-Modal Transformer Network for Indoor RGB-D Semantic Segmentation |
Xiaowen Ying, Mooi Choo Chuah |
|
code |
-1 |
Bi-directional Contrastive Learning for Domain Adaptive Semantic Segmentation |
Geon Lee, Chanho Eom, Wonkyung Lee, Hyekang Park, Bumsub Ham |
|
code |
-1 |
Learning Regional Purity for Instance Segmentation on 3D Point Clouds |
Shichao Dong, Guosheng Lin, TzuYi Hung |
|
code |
-1 |
Cross-Domain Few-Shot Semantic Segmentation |
Shuo Lei, Xuchao Zhang, Jianfeng He, Fanglan Chen, Bowen Du, ChangTien Lu |
|
code |
-1 |
Generative Subgraph Contrast for Self-Supervised Graph Representation Learning |
Yuehui Han, Le Hui, Haobo Jiang, Jianjun Qian, Jin Xie |
|
code |
-1 |
SdAE: Self-distillated Masked Autoencoder |
Yabo Chen, Yuchen Liu, Dongsheng Jiang, Xiaopeng Zhang, Wenrui Dai, Hongkai Xiong, Qi Tian |
|
code |
-1 |
Demystifying Unsupervised Semantic Correspondence Estimation |
Mehmet Aygün, Oisin Mac Aodha |
|
code |
-1 |
Open-Set Semi-Supervised Object Detection |
YenCheng Liu, ChihYao Ma, Xiaoliang Dai, Junjiao Tian, Peter Vajda, Zijian He, Zsolt Kira |
|
code |
-1 |
Vibration-Based Uncertainty Estimation for Learning from Limited Supervision |
Hengtong Hu, Lingxi Xie, Xinyue Huo, Richang Hong, Qi Tian |
|
code |
-1 |
Concurrent Subsidiary Supervision for Unsupervised Source-Free Domain Adaptation |
Jogendra Nath Kundu, Suvaansh Bhambri, Akshay R. Kulkarni, Hiran Sarkar, Varun Jampani, R. Venkatesh Babu |
|
code |
-1 |
Weakly Supervised Object Localization Through Inter-class Feature Similarity and Intra-class Appearance Consistency |
Jun Wei, Sheng Wang, S. Kevin Zhou, Shuguang Cui, Zhen Li |
|
code |
-1 |
Active Learning Strategies for Weakly-Supervised Object Detection |
Huy V. Vo, Oriane Siméoni, Spyros Gidaris, Andrei Bursuc, Patrick Pérez, Jean Ponce |
|
code |
-1 |
mc-BEiT: Multi-choice Discretization for Image BERT Pre-training |
Xiaotong Li, Yixiao Ge, Kun Yi, Zixuan Hu, Ying Shan, LingYu Duan |
|
code |
-1 |
Bootstrapped Masked Autoencoders for Vision BERT Pretraining |
Xiaoyi Dong, Jianmin Bao, Ting Zhang, Dongdong Chen, Weiming Zhang, Lu Yuan, Dong Chen, Fang Wen, Nenghai Yu |
|
code |
-1 |
Unsupervised Visual Representation Learning by Synchronous Momentum Grouping |
Bo Pang, Yifan Zhang, Yaoyi Li, Jia Cai, Cewu Lu |
|
code |
-1 |
Improving Few-Shot Part Segmentation Using Coarse Supervision |
Oindrila Saha, Zezhou Cheng, Subhransu Maji |
|
code |
-1 |
What to Hide from Your Students: Attention-Guided Masked Image Modeling |
Ioannis Kakogeorgiou, Spyros Gidaris, Bill Psomas, Yannis Avrithis, Andrei Bursuc, Konstantinos Karantzalos, Nikos Komodakis |
|
code |
-1 |
Pointly-Supervised Panoptic Segmentation |
Junsong Fan, Zhaoxiang Zhang, Tieniu Tan |
|
code |
-1 |
MVP: Multimodality-Guided Visual Pre-training |
Longhui Wei, Lingxi Xie, Wengang Zhou, Houqiang Li, Qi Tian |
|
code |
-1 |
Locally Varying Distance Transform for Unsupervised Visual Anomaly Detection |
WenYan Lin, Zhonghang Liu, Siying Liu |
|
code |
-1 |
HRDA: Context-Aware High-Resolution Domain-Adaptive Semantic Segmentation |
Lukas Hoyer, Dengxin Dai, Luc Van Gool |
|
code |
-1 |
SPot-the-Difference Self-supervised Pre-training for Anomaly Detection and Segmentation |
Yang Zou, Jongheon Jeong, Latha Pemula, Dongqing Zhang, Onkar Dabeer |
|
code |
-1 |
Dual-Domain Self-supervised Learning and Model Adaption for Deep Compressive Imaging |
Yuhui Quan, Xinran Qin, Tongyao Pang, Hui Ji |
|
code |
-1 |
Unsupervised Selective Labeling for More Effective Semi-supervised Learning |
Xudong Wang, Long Lian, Stella X. Yu |
|
code |
-1 |
Max Pooling with Vision Transformers Reconciles Class and Shape in Weakly Supervised Semantic Segmentation |
Simone Rossetti, Damiano Zappia, Marta Sanzari, Marco Schaerf, Fiora Pirri |
|
code |
-1 |
Dense Siamese Network for Dense Unsupervised Learning |
Wenwei Zhang, Jiangmiao Pang, Kai Chen, Chen Change Loy |
|
code |
-1 |
Multi-granularity Distillation Scheme Towards Lightweight Semi-supervised Semantic Segmentation |
Jie Qin, Jie Wu, Ming Li, Xuefeng Xiao, Min Zheng, Xingang Wang |
|
code |
-1 |
CP2: Copy-Paste Contrastive Pretraining for Semantic Segmentation |
Feng Wang, Huiyu Wang, Chen Wei, Alan L. Yuille, Wei Shen |
|
code |
-1 |
Self-Filtering: A Noise-Aware Sample Selection for Label Noise with Confidence Penalization |
Qi Wei, Haoliang Sun, Xiankai Lu, Yilong Yin |
|
code |
-1 |
RDA: Reciprocal Distribution Alignment for Robust Semi-supervised Learning |
Yue Duan, Lei Qi, Lei Wang, Luping Zhou, Yinghuan Shi |
|
code |
-1 |
MemSAC: Memory Augmented Sample Consistency for Large Scale Domain Adaptation |
Tarun Kalluri, Astuti Sharma, Manmohan Chandraker |
|
code |
-1 |
United Defocus Blur Detection and Deblurring via Adversarial Promoting Learning |
Wenda Zhao, Fei Wei, You He, Huchuan Lu |
|
code |
-1 |
Synergistic Self-supervised and Quantization Learning |
YunHao Cao, Peiqin Sun, Yechang Huang, Jianxin Wu, Shuchang Zhou |
|
code |
-1 |
Semi-supervised Vision Transformers |
Zejia Weng, Xitong Yang, Ang Li, Zuxuan Wu, YuGang Jiang |
|
code |
-1 |
Domain Adaptive Video Segmentation via Temporal Pseudo Supervision |
Yun Xing, Dayan Guan, Jiaxing Huang, Shijian Lu |
|
code |
-1 |
Diverse Learner: Exploring Diverse Supervision for Semi-supervised Object Detection |
Linfeng Li, Minyue Jiang, Yue Yu, Wei Zhang, Xiangru Lin, Yingying Li, Xiao Tan, Jingdong Wang, Errui Ding |
|
code |
-1 |
A Closer Look at Invariances in Self-supervised Pre-training for 3D Vision |
Lanxiao Li, Michael Heizmann |
|
code |
-1 |
ConMatch: Semi-supervised Learning with Confidence-Guided Consistency Regularization |
Jiwon Kim, Youngjo Min, Daehwan Kim, Gyuseong Lee, Junyoung Seo, Kwangrok Ryoo, Seungryong Kim |
|
code |
-1 |
FedX: Unsupervised Federated Learning with Cross Knowledge Distillation |
Sungwon Han, Sungwon Park, Fangzhao Wu, Sundong Kim, Chuhan Wu, Xing Xie, Meeyoung Cha |
|
code |
-1 |
W2N: Switching from Weak Supervision to Noisy Supervision for Object Detection |
Zitong Huang, Yiping Bao, Bowen Dong, Erjin Zhou, Wangmeng Zuo |
|
code |
-1 |
Decoupled Adversarial Contrastive Learning for Self-supervised Adversarial Robustness |
Chaoning Zhang, Kang Zhang, Chenshuang Zhang, Axi Niu, Jiu Feng, Chang D. Yoo, In So Kweon |
|
code |
-1 |
ARAH: Animatable Volume Rendering of Articulated Human SDFs |
Shaofei Wang, Katja Schwarz, Andreas Geiger, Siyu Tang |
|
code |
-1 |
ASpanFormer: Detector-Free Image Matching with Adaptive Span Transformer |
Hongkai Chen, Zixin Luo, Lei Zhou, Yurun Tian, Mingmin Zhen, Tian Fang, David McKinnon, Yanghai Tsin, Long Quan |
|
code |
-1 |
NDF: Neural Deformable Fields for Dynamic Human Modelling |
Ruiqi Zhang, Jie Chen |
|
code |
-1 |
Neural Density-Distance Fields |
Itsuki Ueda, Yoshihiro Fukuhara, Hirokatsu Kataoka, Hiroaki Aizawa, Hidehiko Shishido, Itaru Kitahara |
|
code |
-1 |
NeXT: Towards High Quality Neural Radiance Fields via Multi-skip Transformer |
Yunxiao Wang, Yanjie Li, Peidong Liu, Tao Dai, ShuTao Xia |
|
code |
-1 |
Learning Online Multi-sensor Depth Fusion |
Erik Sandström, Martin R. Oswald, Suryansh Kumar, Silvan Weder, Fisher Yu, Cristian Sminchisescu, Luc Van Gool |
|
code |
-1 |
BungeeNeRF: Progressive Neural Radiance Field for Extreme Multi-scale Scene Rendering |
Yuanbo Xiangli, Linning Xu, Xingang Pan, Nanxuan Zhao, Anyi Rao, Christian Theobalt, Bo Dai, Dahua Lin |
|
code |
-1 |
Decomposing the Tangent of Occluding Boundaries According to Curvatures and Torsions |
Huizong Yang, Anthony J. Yezzi |
|
code |
-1 |
NeuRIS: Neural Reconstruction of Indoor Scenes Using Normal Priors |
Jiepeng Wang, Peng Wang, Xiaoxiao Long, Christian Theobalt, Taku Komura, Lingjie Liu, Wenping Wang |
|
code |
-1 |
Generalizable Patch-Based Neural Rendering |
Mohammed Suhail, Carlos Esteves, Leonid Sigal, Ameesh Makadia |
|
code |
-1 |
Improving RGB-D Point Cloud Registration by Learning Multi-scale Local Linear Transformation |
Ziming Wang, Xiaoliang Huo, Zhenghao Chen, Jing Zhang, Lu Sheng, Dong Xu |
|
code |
-1 |
Real-Time Neural Character Rendering with Pose-Guided Multiplane Images |
Hao Ouyang, Bo Zhang, Pan Zhang, Hao Yang, Jiaolong Yang, Dong Chen, Qifeng Chen, Fang Wen |
|
code |
-1 |
SparseNeuS: Fast Generalizable Neural Surface Reconstruction from Sparse Views |
Xiaoxiao Long, Cheng Lin, Peng Wang, Taku Komura, Wenping Wang |
|
code |
-1 |
Disentangling Object Motion and Occlusion for Unsupervised Multi-frame Monocular Depth |
Ziyue Feng, Liang Yang, Longlong Jing, Haiyan Wang, Yingli Tian, Bing Li |
|
code |
-1 |
Depth Field Networks For Generalizable Multi-view Scene Representation |
Vitor Guizilini, Igor Vasiljevic, Jiading Fang, Rare Ambru, Greg Shakhnarovich, Matthew R. Walter, Adrien Gaidon |
|
code |
-1 |
Context-Enhanced Stereo Transformer |
Weiyu Guo, Zhaoshuo Li, Yongkui Yang, Zheng Wang, Russell H. Taylor, Mathias Unberath, Alan L. Yuille, Yingwei Li |
|
code |
-1 |
PCW-Net: Pyramid Combination and Warping Cost Volume for Stereo Matching |
Zhelun Shen, Yuchao Dai, Xibin Song, Zhibo Rao, Dingfu Zhou, Liangjun Zhang |
|
code |
-1 |
Gen6D: Generalizable Model-Free 6-DoF Object Pose Estimation from RGB Images |
Yuan Liu, Yilin Wen, Sida Peng, Cheng Lin, Xiaoxiao Long, Taku Komura, Wenping Wang |
|
code |
-1 |
Latency-Aware Collaborative Perception |
Zixing Lei, Shunli Ren, Yue Hu, Wenjun Zhang, Siheng Chen |
|
code |
-1 |
TensoRF: Tensorial Radiance Fields |
Anpei Chen, Zexiang Xu, Andreas Geiger, Jingyi Yu, Hao Su |
|
code |
-1 |
NeFSAC: Neurally Filtered Minimal Samples |
Luca Cavalli, Marc Pollefeys, Daniel Barath |
|
code |
-1 |
SNeS: Learning Probably Symmetric Neural Surfaces from Incomplete Data |
Eldar Insafutdinov, Dylan Campbell, João F. Henriques, Andrea Vedaldi |
|
code |
-1 |
HDR-Plenoxels: Self-Calibrating High Dynamic Range Radiance Fields |
Kim JunSeong, Kim YuJi, Moon YeBin, TaeHyun Oh |
|
code |
-1 |
NeuMan: Neural Human Radiance Field from a Single Video |
Wei Jiang, Kwang Moo Yi, Golnoosh Samei, Oncel Tuzel, Anurag Ranjan |
|
code |
-1 |
TAVA: Template-free Animatable Volumetric Actors |
Ruilong Li, Julian Tanke, Minh Vo, Michael Zollhöfer, Jürgen Gall, Angjoo Kanazawa, Christoph Lassner |
|
code |
-1 |
EASNet: Searching Elastic and Accurate Network Architecture for Stereo Matching |
Qiang Wang, Shaohuai Shi, Kaiyong Zhao, Xiaowen Chu |
|
code |
-1 |
Relative Pose from SIFT Features |
Daniel Barath, Zuzana Kukelova |
|
code |
-1 |
Selection and Cross Similarity for Event-Image Deep Stereo |
Hoonhee Cho, KukJin Yoon |
|
code |
-1 |
D3Net: A Unified Speaker-Listener Architecture for 3D Dense Captioning and Visual Grounding |
Dave Zhenyu Chen, Qirui Wu, Matthias Nießner, Angel X. Chang |
|
code |
-1 |
CIRCLE: Convolutional Implicit Reconstruction and Completion for Large-Scale Indoor Scene |
Haoxiang Chen, Jiahui Huang, TaiJiang Mu, ShiMin Hu |
|
code |
-1 |
ParticleSfM: Exploiting Dense Point Trajectories for Localizing Moving Cameras in the Wild |
Wang Zhao, Shaohui Liu, Hengkai Guo, Wenping Wang, YongJin Liu |
|
code |
-1 |
4DContrast: Contrastive Learning with Dynamic Correspondences for 3D Scene Understanding |
Yujin Chen, Matthias Nießner, Angela Dai |
|
code |
-1 |
Few 'Zero Level Set'-Shot Learning of Shape Signed Distance Functions in Feature Space |
Amine Ouasfi, Adnane Boukhayma |
|
code |
-1 |
Solution Space Analysis of Essential Matrix Based on Algebraic Error Minimization |
Gaku Nakano |
|
code |
-1 |
Approximate Differentiable Rendering with Algebraic Surfaces |
Leonid Keselman, Martial Hebert |
|
code |
-1 |
CoVisPose: Co-visibility Pose Transformer for Wide-Baseline Relative Pose Estimation in 360$^\circ $ Indoor Panoramas |
Will Hutchcroft, Yuguang Li, Ivaylo Boyadzhiev, Zhiqiang Wan, Haiyan Wang, Sing Bing Kang |
|
code |
-1 |
Affine Correspondences Between Multi-camera Systems for 6DOF Relative Pose Estimation |
Banglei Guan, Ji Zhao |
|
code |
-1 |
GraphFit: Learning Multi-scale Graph-Convolutional Representation for Point Cloud Normal Estimation |
Keqiang Li, Mingyang Zhao, Huaiyu Wu, DongMing Yan, Zhen Shen, FeiYue Wang, Gang Xiong |
|
code |
-1 |
IS-MVSNet: Importance Sampling-Based MVSNet |
Likang Wang, Yue Gong, Xinjun Ma, Qirui Wang, Kaixuan Zhou, Lei Chen |
|
code |
-1 |
Point Scene Understanding via Disentangled Instance Mesh Reconstruction |
Jiaxiang Tang, Xiaokang Chen, Jingbo Wang, Gang Zeng |
|
code |
-1 |
DiffuStereo: High Quality Human Reconstruction via Diffusion-Based Stereo Using Sparse Cameras |
Ruizhi Shao, Zerong Zheng, Hongwen Zhang, Jingxiang Sun, Yebin Liu |
|
code |
-1 |
Space-Partitioning RANSAC |
Daniel Barath, Gábor Valasek |
|
code |
-1 |
Box-Supervised Instance Segmentation with Level Set Evolution |
Wentong Li, Wenyu Liu, Jianke Zhu, Miaomiao Cui, XianSheng Hua, Lei Zhang |
|
code |
-1 |
Point Primitive Transformer for Long-Term 4D Point Cloud Video Understanding |
Hao Wen, Yunze Liu, Jingwei Huang, Bo Duan, Li Yi |
|
code |
-1 |
Adaptive Agent Transformer for Few-Shot Segmentation |
Yuan Wang, Rui Sun, Zhe Zhang, Tianzhu Zhang |
|
code |
-1 |
Waymo Open Dataset: Panoramic Video Panoptic Segmentation |
Jieru Mei, Alex Zihao Zhu, Xinchen Yan, Hang Yan, Siyuan Qiao, LiangChieh Chen, Henrik Kretzschmar |
|
code |
-1 |
TransFGU: A Top-Down Approach to Fine-Grained Unsupervised Semantic Segmentation |
Zhaoyuan Yin, Pichao Wang, Fan Wang, Xianzhe Xu, Hanling Zhang, Hao Li, Rong Jin |
|
code |
-1 |
AdaAfford: Learning to Adapt Manipulation Affordance for 3D Articulated Objects via Few-Shot Interactions |
Yian Wang, Ruihai Wu, Kaichun Mo, Jiaqi Ke, Qingnan Fan, Leonidas J. Guibas, Hao Dong |
|
code |
-1 |
Cost Aggregation with 4D Convolutional Swin Transformer for Few-Shot Segmentation |
Sunghwan Hong, Seokju Cho, Jisu Nam, Stephen Lin, Seungryong Kim |
|
code |
-1 |
Fine-Grained Egocentric Hand-Object Segmentation: Dataset, Model, and Applications |
Lingzhi Zhang, Shenghao Zhou, Simon Stent, Jianbo Shi |
|
code |
-1 |
Perceptual Artifacts Localization for Inpainting |
Lingzhi Zhang, Yuqian Zhou, Connelly Barnes, Sohrab Amirghodsi, Zhe Lin, Eli Shechtman, Jianbo Shi |
|
code |
-1 |
2D Amodal Instance Segmentation Guided by 3D Shape Prior |
Zhixuan Li, Weining Ye, Tingting Jiang, Tiejun Huang |
|
code |
-1 |
Data Efficient 3D Learner via Knowledge Transferred from 2D Model |
PingChung Yu, Cheng Sun, Min Sun |
|
code |
-1 |
Adaptive Spatial-BCE Loss for Weakly Supervised Semantic Segmentation |
Tong Wu, Guangyu Gao, Junshi Huang, Xiaolin Wei, Xiaoming Wei, Chi Harold Liu |
|
code |
-1 |
Dense Gaussian Processes for Few-Shot Segmentation |
Joakim Johnander, Johan Edstedt, Michael Felsberg, Fahad Shahbaz Khan, Martin Danelljan |
|
code |
-1 |
3D Instances as 1D Kernels |
Yizheng Wu, Min Shi, Shuaiyuan Du, Hao Lu, Zhiguo Cao, Weicai Zhong |
|
code |
-1 |
TransMatting: Enhancing Transparent Objects Matting with Transformers |
Huanqia Cai, Fanglei Xue, Lele Xu, Lili Guo |
|
code |
-1 |
MVSalNet: Multi-view Augmentation for RGB-D Salient Object Detection |
Jiayuan Zhou, Lijun Wang, Huchuan Lu, Kaining Huang, Xinchu Shi, Bocong Liu |
|
code |
-1 |
k-means Mask Transformer |
Qihang Yu, Huiyu Wang, Siyuan Qiao, Maxwell D. Collins, Yukun Zhu, Hartwig Adam, Alan L. Yuille, LiangChieh Chen |
|
code |
-1 |
SegPGD: An Effective and Efficient Adversarial Attack for Evaluating and Boosting Segmentation Robustness |
Jindong Gu, Hengshuang Zhao, Volker Tresp, Philip H. S. Torr |
|
code |
-1 |
Adversarial Erasing Framework via Triplet with Gated Pyramid Pooling Layer for Weakly Supervised Semantic Segmentation |
SungHoon Yoon, Hyeokjun Kweon, Jegyeong Cho, Shinjeong Kim, KukJin Yoon |
|
code |
-1 |
Continual Semantic Segmentation via Structure Preserving and Projected Feature Alignment |
Zihan Lin, Zilei Wang, Yixin Zhang |
|
code |
-1 |
Interclass Prototype Relation for Few-Shot Segmentation |
Atsuro Okazawa |
|
code |
-1 |
Slim Scissors: Segmenting Thin Object from Synthetic Background |
Kunyang Han, Jun Hao Liew, Jiashi Feng, Huawei Tian, Yao Zhao, Yunchao Wei |
|
code |
-1 |
Abstracting Sketches Through Simple Primitives |
Stephan Alaniz, Massimiliano Mancini, Anjan Dutta, Diego Marcos, Zeynep Akata |
|
code |
-1 |
Multi-scale and Cross-scale Contrastive Learning for Semantic Segmentation |
Theodoros Pissas, Claudio S. Ravasio, Lyndon Da Cruz, Christos Bergeles |
|
code |
-1 |
One-Trimap Video Matting |
Hongje Seong, Seoung Wug Oh, Brian Price, Euntai Kim, JoonYoung Lee |
|
code |
-1 |
$\mathrm {D^2ADA}$: Dynamic Density-Aware Active Domain Adaptation for Semantic Segmentation |
TsungHan Wu, YiSyuan Liou, ShaoJi Yuan, HsinYing Lee, TungI Chen, KuanChih Huang, Winston H. Hsu |
|
code |
-1 |
Learning Quality-aware Dynamic Memory for Video Object Segmentation |
Yong Liu, Ran Yu, Fei Yin, Xinyuan Zhao, Wei Zhao, Weihao Xia, Yujiu Yang |
|
code |
-1 |
Learning Implicit Feature Alignment Function for Semantic Segmentation |
Hanzhe Hu, Yinbo Chen, Jiarui Xu, Shubhankar Borse, Hong Cai, Fatih Porikli, Xiaolong Wang |
|
code |
-1 |
Quantum Motion Segmentation |
Federica Arrigoni, Willi Menapace, Marcel Seelbach Benkner, Elisa Ricci, Vladislav Golyanik |
|
code |
-1 |
Instance as Identity: A Generic Online Paradigm for Video Instance Segmentation |
Feng Zhu, Zongxin Yang, Xin Yu, Yi Yang, Yunchao Wei |
|
code |
-1 |
Laplacian Mesh Transformer: Dual Attention and Topology Aware Network for 3D Mesh Classification and Segmentation |
XiaoJuan Li, Jie Yang, FangLue Zhang |
|
code |
-1 |
Geodesic-Former: A Geodesic-Guided Few-Shot 3D Point Cloud Instance Segmenter |
Tuan Ngo, Khoi Nguyen |
|
code |
-1 |
Union-Set Multi-source Model Adaptation for Semantic Segmentation |
Zongyao Li, Ren Togo, Takahiro Ogawa, Miki Haseyama |
|
code |
-1 |
Point MixSwap: Attentional Point Cloud Mixing via Swapping Matched Structural Divisions |
Ardian Umam, ChengKun Yang, YungYu Chuang, JenHui Chuang, YenYu Lin |
|
code |
-1 |
BATMAN: Bilateral Attention Transformer in Motion-Appearance Neighboring Space for Video Object Segmentation |
Ye Yu, Jialing Yuan, Gaurav Mittal, Fuxin Li, Mei Chen |
|
code |
-1 |
SPSN: Superpixel Prototype Sampling Network for RGB-D Salient Object Detection |
Minhyeok Lee, Chaewon Park, Suhwan Cho, Sangyoun Lee |
|
code |
-1 |
Global Spectral Filter Memory Network for Video Object Segmentation |
Yong Liu, Ran Yu, Jiahao Wang, Xinyuan Zhao, Yitong Wang, Yansong Tang, Yujiu Yang |
|
code |
-1 |
Video Instance Segmentation via Multi-Scale Spatio-Temporal Split Attention Transformer |
Omkar Thawakar, Sanath Narayan, Jiale Cao, Hisham Cholakkal, Rao Muhammad Anwer, Muhammad Haris Khan, Salman Khan, Michael Felsberg, Fahad Shahbaz Khan |
|
code |
-1 |
RankSeg: Adaptive Pixel Classification with Image Category Ranking for Segmentation |
Haodi He, Yuhui Yuan, Xiangyu Yue, Han Hu |
|
code |
-1 |
Learning Topological Interactions for Multi-Class Medical Image Segmentation |
Saumya Gupta, Xiaoling Hu, James Kaan, Michael Jin, Mutshipay Mpoy, Katherine Chung, Gagandeep Singh, Mary M. Saltz, Tahsin M. Kurç, Joel H. Saltz, Apostolos Tassiopoulos, Prateek Prasanna, Chao Chen |
|
code |
-1 |
Unsupervised Segmentation in Real-World Images via Spelke Object Inference |
Honglin Chen, Rahul Venkatesh, Yoni Friedman, Jiajun Wu, Joshua B. Tenenbaum, Daniel L. K. Yamins, Daniel M. Bear |
|
code |
-1 |
A Simple Baseline for Open-Vocabulary Semantic Segmentation with Pre-trained Vision-Language Model |
Mengde Xu, Zheng Zhang, Fangyun Wei, Yutong Lin, Yue Cao, Han Hu, Xiang Bai |
|
code |
-1 |
Efficient One-Stage Video Object Detection by Exploiting Temporal Consistency |
Guanxiong Sun, Yang Hua, Guosheng Hu, Neil Robertson |
|
code |
-1 |
Leveraging Action Affinity and Continuity for Semi-supervised Temporal Action Segmentation |
Guodong Ding, Angela Yao |
|
code |
-1 |
Spotting Temporally Precise, Fine-Grained Events in Video |
James Hong, Haotian Zhang, Michaël Gharbi, Matthew Fisher, Kayvon Fatahalian |
|
code |
-1 |
Unified Fully and Timestamp Supervised Temporal Action Segmentation via Sequence to Sequence Translation |
Nadine Behrmann, S. Alireza Golestaneh, Zico Kolter, Jürgen Gall, Mehdi Noroozi |
|
code |
-1 |
Efficient Video Transformers with Spatial-Temporal Token Selection |
Junke Wang, Xitong Yang, Hengduo Li, Li Liu, Zuxuan Wu, YuGang Jiang |
|
code |
-1 |
Long Movie Clip Classification with State-Space Video Models |
Md Mohaiminul Islam, Gedas Bertasius |
|
code |
-1 |
Prompting Visual-Language Models for Efficient Video Understanding |
Chen Ju, Tengda Han, Kunhao Zheng, Ya Zhang, Weidi Xie |
|
code |
-1 |
Asymmetric Relation Consistency Reasoning for Video Relation Grounding |
Huan Li, Ping Wei, Jiapeng Li, Zeyu Ma, Jiahui Shang, Nanning Zheng |
|
code |
-1 |
Self-supervised Social Relation Representation for Human Group Detection |
Jiacheng Li, Ruize Han, Haomin Yan, Zekun Qian, Wei Feng, Song Wang |
|
code |
-1 |
K-centered Patch Sampling for Efficient Video Recognition |
Seong Hyeon Park, Jihoon Tack, Byeongho Heo, JungWoo Ha, Jinwoo Shin |
|
code |
-1 |
A Deep Moving-Camera Background Model |
Guy Erez, Ron Shapira Weber, Oren Freifeld |
|
code |
-1 |
GraphVid: It only Takes a Few Nodes to Understand a Video |
Eitan Kosman, Dotan Di Castro |
|
code |
-1 |
Delta Distillation for Efficient Video Processing |
Amirhossein Habibian, Haitam Ben Yahia, Davide Abati, Efstratios Gavves, Fatih Porikli |
|
code |
-1 |
MorphMLP: An Efficient MLP-Like Backbone for Spatial-Temporal Representation Learning |
David Junhao Zhang, Kunchang Li, Yali Wang, Yunpeng Chen, Shashwat Chandra, Yu Qiao, Luoqi Liu, Mike Zheng Shou |
|
code |
-1 |
COMPOSER: Compositional Reasoning of Group Activity in Videos with Keypoint-Only Modality |
Honglu Zhou, Asim Kadav, Aviv Shamsian, Shijie Geng, Farley Lai, Long Zhao, Ting Liu, Mubbasir Kapadia, Hans Peter Graf |
|
code |
-1 |
E-NeRV: Expedite Neural Video Representation with Disentangled Spatial-Temporal Context |
Zizhang Li, Mengmeng Wang, Huaijin Pi, Kechun Xu, Jianbiao Mei, Yong Liu |
|
code |
-1 |
TDViT: Temporal Dilated Video Transformer for Dense Video Tasks |
Guanxiong Sun, Yang Hua, Guosheng Hu, Neil Robertson |
|
code |
-1 |
Semi-supervised Learning of Optical Flow by Flow Supervisor |
Woobin Im, Sebin Lee, SungEui Yoon |
|
code |
-1 |
Flow Graph to Video Grounding for Weakly-Supervised Multi-step Localization |
Nikita Dvornik, Isma Hadji, Hai Pham, Dhaivat Bhatt, Brais Martínez, Afsaneh Fazly, Allan D. Jepson |
|
code |
-1 |
Deep 360$^\circ $ Optical Flow Estimation Based on Multi-projection Fusion |
Yiheng Li, Connelly Barnes, Kun Huang, FangLue Zhang |
|
code |
-1 |
MaCLR: Motion-Aware Contrastive Learning of Representations for Videos |
Fanyi Xiao, Joseph Tighe, Davide Modolo |
|
code |
-1 |
Learning Long-Term Spatial-Temporal Graphs for Active Speaker Detection |
Kyle Min, Sourya Roy, Subarna Tripathi, Tanaya Guha, Somdeb Majumdar |
|
code |
-1 |
Frozen CLIP Models are Efficient Video Learners |
Ziyi Lin, Shijie Geng, Renrui Zhang, Peng Gao, Gerard de Melo, Xiaogang Wang, Jifeng Dai, Yu Qiao, Hongsheng Li |
|
code |
-1 |
PIP: Physical Interaction Prediction via Mental Simulation with Span Selection |
Jiafei Duan, Samson Yu, Soujanya Poria, Bihan Wen, Cheston Tan |
|
code |
-1 |
Panoramic Vision Transformer for Saliency Detection in 360$^\circ $ Videos |
Heeseung Yun, Sehun Lee, Gunhee Kim |
|
code |
-1 |
Bayesian Tracking of Video Graphs Using Joint Kalman Smoothing and Registration |
Aditi Basu Bal, Ramy Mounir, Sathyanarayanan N. Aakur, Sudeep Sarkar, Anuj Srivastava |
|
code |
-1 |
Motion Sensitive Contrastive Learning for Self-supervised Video Representation |
Jingcheng Ni, Nan Zhou, Jie Qin, Qian Wu, Junqi Liu, Boxun Li, Di Huang |
|
code |
-1 |
Dynamic Temporal Filtering in Video Models |
Fuchen Long, Zhaofan Qiu, Yingwei Pan, Ting Yao, ChongWah Ngo, Tao Mei |
|
code |
-1 |
Tip-Adapter: Training-Free Adaption of CLIP for Few-Shot Classification |
Renrui Zhang, Wei Zhang, Rongyao Fang, Peng Gao, Kunchang Li, Jifeng Dai, Yu Qiao, Hongsheng Li |
|
code |
-1 |
Temporal Lift Pooling for Continuous Sign Language Recognition |
Lianyu Hu, Liqing Gao, Zekang Liu, Wei Feng |
|
code |
-1 |
MORE: Multi-Order RElation Mining for Dense Captioning in 3D Scenes |
Yang Jiao, Shaoxiang Chen, Zequn Jie, Jingjing Chen, Lin Ma, YuGang Jiang |
|
code |
-1 |
SiRi: A Simple Selective Retraining Mechanism for Transformer-Based Visual Grounding |
Mengxue Qu, Yu Wu, Wu Liu, Qiqi Gong, Xiaodan Liang, Olga Russakovsky, Yao Zhao, Yunchao Wei |
|
code |
-1 |
Cross-Modal Prototype Driven Network for Radiology Report Generation |
Jun Wang, Abhir Bhalerao, Yulan He |
|
code |
-1 |
TM2T: Stochastic and Tokenized Modeling for the Reciprocal Generation of 3D Human Motions and Texts |
Chuan Guo, Xinxin Zuo, Sen Wang, Li Cheng |
|
code |
-1 |
SeqTR: A Simple Yet Universal Network for Visual Grounding |
Chaoyang Zhu, Yiyi Zhou, Yunhang Shen, Gen Luo, Xingjia Pan, Mingbao Lin, Chao Chen, Liujuan Cao, Xiaoshuai Sun, Rongrong Ji |
|
code |
-1 |
VTC: Improving Video-Text Retrieval with User Comments |
Laura Hanu, James Thewlis, Yuki M. Asano, Christian Rupprecht |
|
code |
-1 |
FashionViL: Fashion-Focused Vision-and-Language Representation Learning |
Xiao Han, Licheng Yu, Xiatian Zhu, Li Zhang, YiZhe Song, Tao Xiang |
|
code |
-1 |
Weakly Supervised Grounding for VQA in Vision-Language Transformers |
Aisha Urooj Khan, Hilde Kuehne, Chuang Gan, Niels da Vitoria Lobo, Mubarak Shah |
|
code |
-1 |
Automatic Dense Annotation of Large-Vocabulary Sign Language Videos |
Liliane Momeni, Hannah Bull, K. R. Prajwal, Samuel Albanie, Gül Varol, Andrew Zisserman |
|
code |
-1 |
MILES: Visual BERT Pre-training with Injected Language Semantics for Video-Text Retrieval |
Yuying Ge, Yixiao Ge, Xihui Liu, Jinpeng Wang, Jianping Wu, Ying Shan, Xiaohu Qie, Ping Luo |
|
code |
-1 |
GEB+: A Benchmark for Generic Event Boundary Captioning, Grounding and Retrieval |
Yuxuan Wang, Difei Gao, Licheng Yu, Weixian Lei, Matt Feiszli, Mike Zheng Shou |
|
code |
-1 |
A Simple and Robust Correlation Filtering Method for Text-Based Person Search |
Wei Suo, Mengyang Sun, Kai Niu, Yiqi Gao, Peng Wang, Yanning Zhang, Qi Wu |
|
code |
-1 |
Towards Self-Supervised and Weight-preserving Neural Architecture Search |
Zhuowei Li, Yibo Gao, Zhenzhou Zha, Zhiqiang Hu, Qing Xia, Shaoting Zhang, Dimitris N. Metaxas |
|
code |
-1 |
MoQuad: Motion-focused Quadruple Construction for Video Contrastive Learning |
Yuan Liu, Jiacheng Chen, Hao Wu |
|
code |
-1 |
On the Effectiveness of ViT Features as Local Semantic Descriptors |
Shir Amir, Yossi Gandelsman, Shai Bagon, Tali Dekel |
|
code |
-1 |
Anomaly Detection Requires Better Representations |
Tal Reiss, Niv Cohen, Eliahu Horwitz, Ron Abutbul, Yedid Hoshen |
|
code |
-1 |
Leveraging Self-Supervised Training for Unintentional Action Recognition |
Enea Duka, Anna Kukleva, Bernt Schiele |
|
code |
-1 |
A Study on Self-Supervised Object Detection Pretraining |
Trung Dang, Simon Kornblith, Huy Thong Nguyen, Peter Chin, Maryam Khademi |
|
code |
-1 |
Internet Curiosity: Directed Unsupervised Learning on Uncurated Internet Data |
Alexander C. Li, Ellis Brown, Alexei A. Efros, Deepak Pathak |
|
code |
-1 |
Towards Autonomous Grading in the Real World |
Yakov Miron, Yuval Goldfracht, Dotan Di Castro |
|
code |
-1 |
Bootstrapping Autonomous Lane Changes with Self-supervised Augmented Runs |
Xiang Xiang |
|
code |
-1 |
Artifact-Based Domain Generalization of Skin Lesion Models |
Alceu Bissoto, Catarina Barata, Eduardo Valle, Sandra Avila |
|
code |
-1 |
An Evaluation of Self-supervised Pre-training for Skin-Lesion Analysis |
Levy G. Chaves, Alceu Bissoto, Eduardo Valle, Sandra Avila |
|
code |
-1 |
Skin_Hair Dataset: Setting the Benchmark for Effective Hair Inpainting Methods for Improving the Image Quality of Dermoscopic Images |
Joanna JaworekKorjakowska, Anna Wójcicka, Dariusz Kucharski, Andrzej Brodzicki, Connah Kendrick, Bill Cassidy, Moi Hoon Yap |
|
code |
-1 |
FairDisCo: Fairer AI in Dermatology via Disentanglement Contrastive Learning |
Siyi Du, Ben Hers, Nourhan Bayasi, Ghassan Hamarneh, Rafeef Garbi |
|
code |
-1 |
CIRCLe: Color Invariant Representation Learning for Unbiased Classification of Skin Lesions |
Arezou Pakzad, Kumar Abhishek, Ghassan Hamarneh |
|
code |
-1 |
Distinctive Image Captioning via CLIP Guided Group Optimization |
Youyuan Zhang, Jiuniu Wang, Hao Wu, Wenjia Xu |
|
code |
-1 |
OCR-IDL: OCR Annotations for Industry Document Library Dataset |
Ali Furkan Biten, Rubèn Tito, Lluís Gómez, Ernest Valveny, Dimosthenis Karatzas |
|
code |
-1 |
Self-paced Learning to Improve Text Row Detection in Historical Documents with Missing Labels |
Mihaela Gaman, Lida Ghadamiyan, Radu Tudor Ionescu, Marius Popescu |
|
code |
-1 |
On Calibration of Scene-Text Recognition Models |
Ron Slossberg, Oron Anschel, Amir Markovitz, Ron Litman, Aviad Aberdam, Shahar Tsiper, Shai Mazor, Jon Wu, R. Manmatha |
|
code |
-1 |
End-to-End Document Recognition and Understanding with Dessurt |
Brian L. Davis, Bryan S. Morse, Brian L. Price, Chris Tensmeyer, Curtis Wigington, Vlad I. Morariu |
|
code |
-1 |
Task Grouping for Multilingual Text Recognition |
Jing Huang, Kevin J. Liang, Rama Kovvuri, Tal Hassner |
|
code |
-1 |
Incorporating Self-attention Mechanism and Multi-task Learning into Scene Text Detection |
Ning Ding, Liangrui Peng, Changsong Liu, Yuqi Zhang, Ruixue Zhang, Jie Li |
|
code |
-1 |
Doc2Graph: A Task Agnostic Document Understanding Framework Based on Graph Neural Networks |
Andrea Gemelli, Sanket Biswas, Enrico Civitelli, Josep Lladós, Simone Marinai |
|
code |
-1 |
MUST-VQA: MUltilingual Scene-Text VQA |
Emanuele Vivoli, Ali Furkan Biten, Andrés Mafla, Dimosthenis Karatzas, Lluís Gómez |
|
code |
-1 |
Out-of-Vocabulary Challenge Report |
Sergi GarciaBordils, Andrés Mafla, Ali Furkan Biten, Oren Nuriel, Aviad Aberdam, Shai Mazor, Ron Litman, Dimosthenis Karatzas |
|
code |
-1 |
Towards Structured Noise Models for Unsupervised Denoising |
Benjamin Salmon, Alexander Krull |
|
code |
-1 |
Comparison of Semi-supervised Learning Methods for High Content Screening Quality Control |
Umar Masud, Ethan Cohen, Ihab Bendidi, Guillaume Bollot, Auguste Genovesio |
|
code |
-1 |
Discriminative Attribution from Paired Images |
Nils Eckstein, Habib Bukhari, Alexander S. Bates, Gregory S. X. E. Jefferis, Jan Funke |
|
code |
-1 |
Learning with Minimal Effort: Leveraging in Silico Labeling for Cell and Nucleus Segmentation |
Thomas Bonte, Maxence Philbert, Emeline Coleno, Edouard Bertrand, Arthur Imbert, Thomas Walter |
|
code |
-1 |
Towards Better Guided Attention and Human Knowledge Insertion in Deep Convolutional Neural Networks |
Ankit Gupta, IdaMaria Sintorn |
|
code |
-1 |
Characterization of AI Model Configurations for Model Reuse |
Peter Bajcsy, Michael Majurski, Thomas E. Cleveland IV, Manuel J. Carrasco, Walid Keyrouz |
|
code |
-1 |
Empirical Evaluation of Deep Learning Approaches for Landmark Detection in Fish Bioimages |
Navdeep Kumar, Claudia Di Biagio, Zachary Dellacqua, Ratish Raman, Arianna Martini, Clara Boglione, Marc Muller, Pierre Geurts, Raphaël Marée |
|
code |
-1 |
PointFISH: Learning Point Cloud Representations for RNA Localization Patterns |
Arthur Imbert, Florian Müller, Thomas Walter |
|
code |
-1 |
N2V2 - Fixing Noise2Void Checkerboard Artifacts with Modified Sampling Strategies and a Tweaked Network Architecture |
Eva Höck, TimOliver Buchholz, Anselm Brachmann, Florian Jug, Alexander Freytag |
|
code |
-1 |
Object Detection in Aerial Images with Uncertainty-Aware Graph Network |
Jongha Kim, Jinheon Baek, Sung Ju Hwang |
|
code |
-1 |
STC: Spatio-Temporal Contrastive Learning for Video Instance Segmentation |
Zhengkai Jiang, Zhangxuan Gu, Jinlong Peng, Hang Zhou, Liang Liu, Yabiao Wang, Ying Tai, Chengjie Wang, Liqing Zhang |
|
code |
-1 |
Mitigating Representation Bias in Action Recognition: Algorithms and Benchmarks |
Haodong Duan, Yue Zhao, Kai Chen, Yuanjun Xiong, Dahua Lin |
|
code |
-1 |
SegTAD: Precise Temporal Action Detection via Semantic Segmentation |
Chen Zhao, Merey Ramazanova, Mengmeng Xu, Bernard Ghanem |
|
code |
-1 |
Text-Driven Stylization of Video Objects |
Sebastian Loeschcke, Serge J. Belongie, Sagie Benaim |
|
code |
-1 |
MND: A New Dataset and Benchmark of Movie Scenes Classified by Their Narrative Function |
Chang Liu, Armin Shmilovici, Mark Last |
|
code |
-1 |
Are All Combinations Equal? Combining Textual and Visual Features with Multiple Space Learning for Text-Based Video Retrieval |
Damianos Galanopoulos, Vasileios Mezaris |
|
code |
-1 |
Scene-Adaptive Temporal Stabilisation for Video Colourisation Using Deep Video Priors |
Marc Górriz Blanch, Noel E. O'Connor, Marta Mrak |
|
code |
-1 |
Movie Lens: Discovering and Characterizing Editing Patterns in the Analysis of Short Movie Sequences |
Bartolomeo Vacchetti, Tania Cerquitelli |
|
code |
-1 |
SKDCGN: Source-free Knowledge Distillation of Counterfactual Generative Networks Using cGANs |
Sameer Ambekar, Matteo Tafuro, Ankit Ankit, Diego van der Mast, Mark Alence, Christos Athanasiadis |
|
code |
-1 |
C-3PO: Towards Rotation Equivariant Feature Detection and Description |
Piyush Bagad, Floor Eijkelboom, Mark Fokkema, Danilo de Goede, Paul Hilders, Miltiadis Kofinas |
|
code |
-1 |
Towards Flexible Inductive Bias via Progressive Reparameterization Scheduling |
Yunsung Lee, Gyuseong Lee, Kwangrok Ryoo, Hyojun Go, Jihye Park, Seungryong Kim |
|
code |
-1 |
Zero-Shot Image Enhancement with Renovated Laplacian Pyramid |
Shunsuke Takao |
|
code |
-1 |
Beyond a Video Frame Interpolator: A Space Decoupled Learning Approach to Continuous Image Transition |
Tao Yang, Peiran Ren, Xuansong Xie, XianSheng Hua, Lei Zhang |
|
code |
-1 |
Diversified Dynamic Routing for Vision Tasks |
Botos Csaba, Adel Bibi, Yanwei Li, Philip H. S. Torr, SerNam Lim |
|
code |
-1 |
MIPI 2022 Challenge on RGB+ToF Depth Completion: Dataset and Report |
Wenxiu Sun, Qingpeng Zhu, Chongyi Li, Ruicheng Feng, Shangchen Zhou, Jun Jiang, Qingyu Yang, Chen Change Loy, Jinwei Gu, Dewang Hou, Kai Zhao, Liying Lu, Yu Li, Huaijia Lin, Ruizheng Wu, Jiangbo Lu, Jiaya Jia, Qiang Liu, Haosong Yue, Danyang Cao, Lehang Yu, Jiaxuan Quan, Jixiang Liang, Yufei Wang, Yuchao Dai, Peng Yang, Hu Yan, Houbiao Liu, Siyuan Su, Xuanhe Li, Rui Ren, Yunlong Liu, Yufan Zhu, Dong Lao, Alex Wong, Katie Chang |
|
code |
-1 |
MIPI 2022 Challenge on Quad-Bayer Re-mosaic: Dataset and Report |
Qingyu Yang, Guang Yang, Jun Jiang, Chongyi Li, Ruicheng Feng, Shangchen Zhou, Wenxiu Sun, Qingpeng Zhu, Chen Change Loy, Jinwei Gu, Zhen Wang, Daoyu Li, Yuzhe Zhang, Lintao Peng, Xuyang Chang, Yinuo Zhang, Yaqi Wu, Xun Wu, Zhihao Fan, Chengjie Xia, Feng Zhang, Haijin Zeng, Kai Feng, Yongqiang Zhao, Hiêp Quang Luong, Jan Aelterman, Anh Minh Truong, Wilfried Philips, Xiaohong Liu, Jun Jia, Hanchi Sun, Guangtao Zhai, Longan Xiao, Qihang Xu, Ting Jiang, Qi Wu, Chengzhi Jiang, Mingyan Han, Xinpeng Li, Wenjie Lin, Youwei Li, Haoqiang Fan, Shuaicheng Liu, Rongyuan Wu, Lingchen Sun, Qiaosi Yi |
|
code |
-1 |
MIPI 2022 Challenge on RGBW Sensor Re-mosaic: Dataset and Report |
Qingyu Yang, Guang Yang, Jun Jiang, Chongyi Li, Ruicheng Feng, Shangchen Zhou, Wenxiu Sun, Qingpeng Zhu, Chen Change Loy, Jinwei Gu, Lingchen Sun, Rongyuan Wu, Qiaosi Yi, Rongjian Xu, Xiaohui Liu, Zhilu Zhang, Xiaohe Wu, Ruohao Wang, Junyi Li, Wangmeng Zuo, Faming Fang |
|
code |
-1 |
MIPI 2022 Challenge on RGBW Sensor Fusion: Dataset and Report |
Qingyu Yang, Guang Yang, Jun Jiang, Chongyi Li, Ruicheng Feng, Shangchen Zhou, Wenxiu Sun, Qingpeng Zhu, Chen Change Loy, Jinwei Gu, Zhen Wang, Daoyu Li, Yuzhe Zhang, Lintao Peng, Xuyang Chang, Yinuo Zhang, Liheng Bian, Bing Li, Jie Huang, Mingde Yao, Ruikang Xu, Feng Zhao, Xiaohui Liu, Rongjian Xu, Zhilu Zhang, Xiaohe Wu, Ruohao Wang, Junyi Li, Wangmeng Zuo, Zhuang Jia, DongJae Lee, Ting Jiang, Qi Wu, Chengzhi Jiang, Mingyan Han, Xinpeng Li, Wenjie Lin, Youwei Li, Haoqiang Fan, Shuaicheng Liu |
|
code |
-1 |
MIPI 2022 Challenge on Under-Display Camera Image Restoration: Methods and Results |
Ruicheng Feng, Chongyi Li, Shangchen Zhou, Wenxiu Sun, Qingpeng Zhu, Jun Jiang, Qingyu Yang, Chen Change Loy, Jinwei Gu, Yurui Zhu, Xi Wang, Xueyang Fu, Xiaowei Hu, Jinfan Hu, Xina Liu, Xiangyu Chen, Chao Dong, Dafeng Zhang, Feiyu Huang, Shizhuo Liu, Xiaobing Wang, Zhezhu Jin, Xuhao Jiang, Guangqi Shao, Xiaotao Wang, Lei Lei, Zhao Zhang, Suiyi Zhao, Huan Zheng, Yangcheng Gao, Yanyan Wei, Jiahuan Ren, Tao Huang, Zhenxuan Fang, Mengluan Huang, Junwei Xu, Yong Zhang, Yuechi Yang, Qidi Shu, Zhiwen Yang, Shaocong Li, Mingde Yao, Ruikang Xu, Yuanshen Guan, Jie Huang, Zhiwei Xiong, Hangyan Zhu, Ming Liu, Shaohui Liu, Wangmeng Zuo, Zhuang Jia, Binbin Song, Ziqi Song, Guiting Mao, Ben Hou, Zhimou Liu, Yi Ke, Dengpei Ouyang, Dekui Han, Jinghao Zhang, Qi Zhu, Naishan Zheng, Feng Zhao, Wu Jin, Marcos V. Conde, Sabari Nathan, Radu Timofte, Tianyi Xu, Jun Xu, P. S. Hrishikesh, Densen Puthussery, C. V. Jiji, Biao Jiang, Yuhan Ding, WanZhang Li, Xiaoyue Feng, Sijing Chen, Tianheng Zhong, Jiyang Lu, Hongming Chen, Zhentao Fan, Xiang Chen |
|
code |
-1 |
Continuous Spectral Reconstruction from RGB Images via Implicit Neural Representation |
Ruikang Xu, Mingde Yao, Chang Chen, Lizhi Wang, Zhiwei Xiong |
|
code |
-1 |
Event-Based Image Deblurring with Dynamic Motion Awareness |
Patricia Vitoria, Stamatios Georgoulis, Stepan Tulyakov, Alfredo Bochicchio, Julius Erbach, Yuanyou Li |
|
code |
-1 |
UDC-UNet: Under-Display Camera Image Restoration via U-shape Dynamic Network |
Xina Liu, Jinfan Hu, Xiangyu Chen, Chao Dong |
|
code |
-1 |
Enhanced Coarse-to-Fine Network for Image Restoration from Under-Display Cameras |
Yurui Zhu, Xi Wang, Xueyang Fu, Xiaowei Hu |
|
code |
-1 |
Learning to Joint Remosaic and Denoise in Quad Bayer CFA via Universal Multi-scale Channel Attention Network |
Xun Wu, Zhihao Fan, Jiesi Zheng, Yaqi Wu, Feng Zhang |
|
code |
-1 |
Learning an Efficient Multimodal Depth Completion Model |
Dewang Hou, Yuanyuan Du, Kai Zhao, Yang Zhao |
|
code |
-1 |
Learning Rich Information for Quad Bayer Remosaicing and Denoising |
Jun Jia, Hanchi Sun, Xiaohong Liu, Longan Xiao, Qihang Xu, Guangtao Zhai |
|
code |
-1 |
Depth Completion Using Laplacian Pyramid-Based Depth Residuals |
Haosong Yue, Qiang Liu, Zhong Liu, Jing Zhang, Xingming Wu |
|
code |
-1 |
PSUMNet: Unified Modality Part Streams Are All You Need for Efficient Pose-Based Action Recognition |
Neel Trivedi, Ravi Kiran Sarvadevabhatla |
|
code |
-1 |
YOLO5Face: Why Reinventing a Face Detector |
Delong Qi, Weijun Tan, Qi Yao, Jingfeng Liu |
|
code |
-1 |
Counterfactual Fairness for Facial Expression Recognition |
Jiaee Cheong, Sinan Kalkan, Hatice Gunes |
|
code |
-1 |
Improved Cross-Dataset Facial Expression Recognition by Handling Data Imbalance and Feature Confusion |
Manogna Sreenivas, Sawa Takamuku, Soma Biswas, Aditya Chepuri, Balasubramanian Vengatesan, Naotake Natori |
|
code |
-1 |
Video-Based Gait Analysis for Spinal Deformity |
Himanshu Kumar Suman, Tanmay Tulsidas Verlekar |
|
code |
-1 |
TSCom-Net: Coarse-to-Fine 3D Textured Shape Completion Network |
Ahmet Serdar Karadeniz, Sk Aziz Ali, Anis Kacem, Elona Dupont, Djamila Aouada |
|
code |
-1 |
Deep Learning-Based Assessment of Facial Periodic Affect in Work-Like Settings |
Siyang Song, Yiming Luo, Vincenzo Ronca, Gianluca Borghini, Hesam Sagha, Vera Barbara Rick, Alexander Mertens, Hatice Gunes |
|
code |
-1 |
Supervision by Landmarks: An Enhanced Facial De-occlusion Network for VR-Based Applications |
Surabhi Gupta, Sai Sagar Jinka, Avinash Sharma, Anoop M. Namboodiri |
|
code |
-1 |
Consistency-Based Self-supervised Learning for Temporal Anomaly Localization |
Aniello Panariello, Angelo Porrello, Simone Calderara, Rita Cucchiara |
|
code |
-1 |
Perspective Reconstruction of Human Faces by Joint Mesh and Landmark Regression |
Jia Guo, Jinke Yu, Alexandros Lattas, Jiankang Deng |
|
code |
-1 |
Pixel2ISDF: Implicit Signed Distance Fields Based Human Body Model from Multi-view and Multi-pose Images |
Jianchuan Chen, Wentao Yi, Tiantian Wang, Xing Li, Liqian Ma, Yangyu Fan, Huchuan Lu |
|
code |
-1 |
UnconFuse: Avatar Reconstruction from Unconstrained Images |
Han Huang, Liliang Chen, Xihao Wang |
|
code |
-1 |
HiFace: Hybrid Task Learning for Face Reconstruction from Single Image |
Wei Xu, Zhihong Fu, Zhixing Chen, Qili Deng, Mingtao Fu, Xijin Zhang, Yuan Gao, Daniel K. Du, Min Zheng |
|
code |
-1 |
Multi-view Canonical Pose 3D Human Body Reconstruction Based on Volumetric TSDF |
Xi Li |
|
code |
-1 |
End to End Face Reconstruction via Differentiable PnP |
Yiren Lu, Huawei Wei |
|
code |
-1 |
One Ontology to Rule Them All: Corner Case Scenarios for Autonomous Driving |
Daniel Bogdoll, Stefani Guneshka, J. Marius Zöllner |
|
code |
-1 |
Parametric and Multivariate Uncertainty Calibration for Regression and Object Detection |
Fabian Küppers, Jonas Schneider, Anselm Haselhoff |
|
code |
-1 |
Reliable Multimodal Trajectory Prediction via Error Aligned Uncertainty Optimization |
Neslihan Kose, Ranganath Krishnan, Akash Dhamasia, Omesh Tickoo, Michael Paulitsch |
|
code |
-1 |
PAI3D: Painting Adaptive Instance-Prior for 3D Object Detection |
Hao Liu, Zhuoran Xu, Dan Wang, Baofeng Zhang, Guan Wang, Bo Dong, Xin Wen, Xinyu Xu |
|
code |
-1 |
Validation of Pedestrian Detectors by Classification of Visual Detection Impairing Factors |
Korbinian Hagn, Oliver Grau |
|
code |
-1 |
Probing Contextual Diversity for Dense Out-of-Distribution Detection |
Silvio Galesso, María Alejandra Bravo, Mehdi Naouar, Thomas Brox |
|
code |
-1 |
Adversarial Vulnerability of Temporal Feature Networks for Object Detection |
Svetlana Pavlitskaya, Nikolai Polley, Michael Weber, J. Marius Zöllner |
|
code |
-1 |
Towards Improved Intermediate Layer Variational Inference for Uncertainty Estimation |
Ahmed Hammam, Frank Bonarens, Seyed Eghbal Ghobadi, Christoph Stiller |
|
code |
-1 |
Explainable Sparse Attention for Memory-Based Trajectory Predictors |
Francesco Marchetti, Federico Becattini, Lorenzo Seidenari, Alberto Del Bimbo |
|
code |
-1 |
Cycle-Consistent World Models for Domain Independent Latent Imagination |
Sidney Bender, Tim Joseph, J. Marius Zöllner |
|
code |
-1 |
Strengthening Skeletal Action Recognizers via Leveraging Temporal Patterns |
Zhenyue Qin, Pan Ji, Dongwoo Kim, Yang Liu, Saeed Anwar, Tom Gedeon |
|
code |
-1 |
Which Expert Knows Best? Modulating Soft Learning with Online Batch Confidence for Domain Adaptive Person Re-Identification |
Andrea Zunino, Christopher Murray, Richard Blythman, Vittorio Murino |
|
code |
-1 |
Cross-Modality Attention and Multimodal Fusion Transformer for Pedestrian Detection |
WeiYu Lee, Ljubomir Jovanov, Wilfried Philips |
|
code |
-1 |
See Finer, See More: Implicit Modality Alignment for Text-Based Person Retrieval |
Xiujun Shu, Wei Wen, Haoqian Wu, Keyu Chen, Yiran Song, Ruizhi Qiao, Bo Ren, Xiao Wang |
|
code |
-1 |
Look at Adjacent Frames: Video Anomaly Detection Without Offline Training |
Yuqi Ouyang, Guodong Shen, Victor Sanchez |
|
code |
-1 |
SOMPT22: A Surveillance Oriented Multi-pedestrian Tracking Dataset |
Fatih Emre Simsek, Cevahir Cigla, Koray Kayabol |
|
code |
-1 |
Detection of Fights in Videos: A Comparison Study of Anomaly Detection and Action Recognition |
Weijun Tan, Jingfeng Liu |
|
code |
-1 |
Privacy-Preserving Person Detection Using Low-Resolution Infrared Cameras |
Thomas Dubail, Fidel Alejandro Guerrero Peña, Heitor Rapela Medeiros, Masih Aminbeidokhti, Eric Granger, Marco Pedersoli |
|
code |
-1 |
Gait Recognition from Occluded Sequences in Surveillance Sites |
Dhritimaan Das, Ayush Agarwal, Pratik Chattopadhyay |
|
code |
-1 |
Visible-Infrared Person Re-Identification Using Privileged Intermediate Information |
Mahdi Alehdaghi, Arthur Josi, Rafael M. O. Cruz, Eric Granger |
|
code |
-1 |
Video in 10 Bits: Few-Bit VideoQA for Efficiency and Privacy |
Shiyuan Huang, Robinson Piramuthu, ShihFu Chang, Gunnar A. Sigurdsson |
|
code |
-1 |
ChaLearn LAP Seasons in Drift Challenge: Dataset, Design and Results |
Anders Skaarup Johansen, Júlio C. S. Jacques Júnior, Kamal Nasrollahi, Sergio Escalera, Thomas B. Moeslund |
|
code |
-1 |
YORO - Lightweight End to End Visual Grounding |
ChihHui Ho, Srikar Appalaraju, Bhavan Jasani, R. Manmatha, Nuno Vasconcelos |
|
code |
-1 |
Localization Uncertainty Estimation for Anchor-Free Object Detection |
Youngwan Lee, JoongWon Hwang, HyungIl Kim, Kimin Yun, Yongjin Kwon, Yuseok Bae, Sung Ju Hwang |
|
code |
-1 |
Variational Depth Networks: Uncertainty-Aware Monocular Self-supervised Depth Estimation |
Georgi Dikov, Joris van Vugt |
|
code |
-1 |
Unsupervised Joint Image Transfer and Uncertainty Quantification Using Patch Invariant Networks |
Christoph Angermann, Markus Haltmeier, Ahsan Raza Siyal |
|
code |
-1 |
Uncertainty Quantification Using Query-Based Object Detectors |
Meet P. Vadera, Colin Samplawski, Benjamin M. Marlin |
|
code |
-1 |
CenDerNet: Center and Curvature Representations for Render-and-Compare 6D Pose Estimation |
Peter De Roovere, Rembert Daems, Jonathan Croenen, Taoufik Bourgana, Joris de Hoog, Francis Wyffels |
|
code |
-1 |
Trans6D: Transformer-Based 6D Object Pose Estimation and Refinement |
Zhongqun Zhang, Wei Chen, Linfang Zheng, Ales Leonardis, Hyung Jin Chang |
|
code |
-1 |
Learning to Estimate Multi-view Pose from Object Silhouettes |
Yoni Kasten, True Price, David Geraghty, JanMichael Frahm |
|
code |
-1 |
TransNet: Category-Level Transparent Object Pose Estimation |
Huijie Zhang, Anthony Opipari, Xiaotong Chen, Jiyue Zhu, Zeren Yu, Odest Chadwicke Jenkins |
|
code |
-1 |
Fuse and Attend: Generalized Embedding Learning for Art and Sketches |
Ujjal Kr Dutta |
|
code |
-1 |
3D Shape Reconstruction from Free-Hand Sketches |
Jiayun Wang, Jierui Lin, Qian Yu, Runtao Liu, Yubei Chen, Stella X. Yu |
|
code |
-1 |
Abstract Images Have Different Levels of Retrievability Per Reverse Image Search Engine |
Shawn M. Jones, Diane Oyen |
|
code |
-1 |
ECCV 2022 Sign Spotting Challenge: Dataset, Design and Results |
Manuel VázquezEnríquez, José Luis AlbaCastro, Laura Docío Fernández, Júlio C. S. Jacques Júnior, Sergio Escalera |
|
code |
-1 |
Hierarchical I3D for Sign Spotting |
Ryan Wong, Necati Cihan Camgöz, Richard Bowden |
|
code |
-1 |
Multi-modal Sign Language Spotting by Multi/One-Shot Learning |
Landong Liu, Wengang Zhou, Weichao Zhao, Hezhen Hu, Houqiang Li |
|
code |
-1 |
Sign Spotting via Multi-modal Fusion and Testing Time Transferring |
Hongyu Fu, Chen Liu, Xingqun Qi, Beibei Lin, Lincheng Li, Li Zhang, Xin Yu |
|
code |
-1 |
Domain-Conditioned Normalization for Test-Time Domain Generalization |
Yuxuan Jiang, Yanfeng Wang, Ruipeng Zhang, Qinwei Xu, Ya Zhang, Xin Chen, Qi Tian |
|
code |
-1 |
Unleashing the Potential of Adaptation Models via Go-getting Domain Labels |
Xin Jin, Tianyu He, Xu Shen, Songhua Wu, Tongliang Liu, Jingwen Ye, Xinchao Wang, Jianqiang Huang, Zhibo Chen, XianSheng Hua |
|
code |
-1 |
ModSelect: Automatic Modality Selection for Synthetic-to-Real Domain Generalization |
Zdravko Marinov, Alina Roitberg, David Schneider, Rainer Stiefelhagen |
|
code |
-1 |
Consistency Regularization for Domain Adaptation |
Kian Boon Koh, Basura Fernando |
|
code |
-1 |
CAT: Controllable Attribute Translation for Fair Facial Attribute Classification |
Jiazhi Li, Wael AbdAlmageed |
|
code |
-1 |
Weakly Supervised Invariant Representation Learning via Disentangling Known and Unknown Nuisance Factors |
Jiageng Zhu, Hanchen Xie, Wael AbdAlmageed |
|
code |
-1 |
Learning Visual Explanations for DCNN-Based Image Classifiers Using an Attention Mechanism |
Ioanna Gkartzonika, Nikolaos Gkalelis, Vasileios Mezaris |
|
code |
-1 |
Self-supervised Orientation-Guided Deep Network for Segmentation of Carbon Nanotubes in SEM Imagery |
Nguyen P. Nguyen, Ramakrishna Surya, Matthew R. Maschmann, Prasad Calyam, Kannappan Palaniappan, Filiz Bunyak |
|
code |
-1 |
The Tenth Visual Object Tracking VOT2022 Challenge Results |
Matej Kristan, Ales Leonardis, Jirí Matas, Michael Felsberg, Roman P. Pflugfelder, JoniKristian Kämäräinen, Hyung Jin Chang, Martin Danelljan, Luka Cehovin Zajc, Alan Lukezic, Ondrej Drbohlav, Johanna Björklund, Yushan Zhang, Zhongqun Zhang, Song Yan, Wenyan Yang, Dingding Cai, Christoph Mayer, Gustavo Fernández, Kang Ben, Goutam Bhat, Hong Chang, Guangqi Chen, Jiaye Chen, Shengyong Chen, Xilin Chen, Xin Chen, Xiuyi Chen, Yiwei Chen, YuHsi Chen, Zhixing Chen, Yangming Cheng, Angelo Ciaramella, Yutao Cui, Benjamin Dzubur, Mohana Murali Dasari, Qili Deng, Debajyoti Dhar, Shangzhe Di, Emanuel Di Nardo, Daniel K. Du, Matteo Dunnhofer, Heng Fan, ZhenHua Feng, Zhihong Fu, Shang Gao, Rama Krishna Gorthi, Eric Granger, Q. H. Gu, Himanshu Gupta, Jianfeng He, Keji He, Yan Huang, Deepak Jangid, Rongrong Ji, Cheng Jiang, Yingjie Jiang, Felix Järemo Lawin, Ze Kang, Madhu Kiran, Josef Kittler, Simiao Lai, Xiangyuan Lan, Dongwook Lee, Hyunjeong Lee, Seohyung Lee, Hui Li, Ming Li, Wangkai Li, Xi Li, Xianxian Li, Xiao Li, Zhe Li, Liting Lin, Haibin Ling, Bo Liu, Chang Liu, Si Liu, Huchuan Lu, Rafael M. O. Cruz, Bingpeng Ma, Chao Ma, Jie Ma, Yinchao Ma, Niki Martinel, Alireza Memarmoghadam, Christian Micheloni, Payman Moallem, Le Thanh NguyenMeidine, Siyang Pan, ChangBeom Park, Danda Pani Paudel, Matthieu Paul, Houwen Peng, Andreas Robinson, Litu Rout, Shiguang Shan, Kristian Simonato, Tianhui Song, Xiaoning Song, Chao Sun, Jingna Sun, Zhangyong Tang, Radu Timofte, ChiYi Tsai, Luc Van Gool, Om Prakash Verma, Dong Wang, Fei Wang, Liang Wang, Liangliang Wang, Lijun Wang, Limin Wang, Qiang Wang, Gangshan Wu, Jinlin Wu, Xiaojun Wu, Fei Xie, Tianyang Xu, Wei Xu, Yong Xu, Yuanyou Xu, Wanli Xue, Zizheng Xun, Bin Yan, Dawei Yang, Jinyu Yang, Wankou Yang, Xiaoyun Yang, Yi Yang, Yichun Yang, Zongxin Yang, Botao Ye, Fisher Yu, Hongyuan Yu, Jiaqian Yu, Qianjin Yu, Weichen Yu, Kang Ze, Jiang Zhai, Chengwei Zhang, Chunhu Zhang, Kaihua Zhang, Tianzhu Zhang, Wenkang Zhang, Zhibin Zhang, Zhipeng Zhang, Jie Zhao, ShaoChuan Zhao, Feng Zheng, Haixia Zheng, Min Zheng, Bineng Zhong, Jiawen Zhu, Xuefeng Zhu, Yueting Zhuang |
|
code |
-1 |
Efficient Visual Tracking via Hierarchical Cross-Attention Transformer |
Xin Chen, Ben Kang, Dong Wang, Dongdong Li, Huchuan Lu |
|
code |
-1 |
Learning Dual-Fused Modality-Aware Representations for RGBD Tracking |
Shang Gao, Jinyu Yang, Zhe Li, Feng Zheng, Ales Leonardis, Jingkuan Song |
|
code |
-1 |
EdgeNeXt: Efficiently Amalgamated CNN-Transformer Architecture for Mobile Vision Applications |
Muhammad Maaz, Abdelrahman Shaker, Hisham Cholakkal, Salman H. Khan, Syed Waqas Zamir, Rao Muhammad Anwer, Fahad Shahbaz Khan |
|
code |
-1 |
Continual Inference: A Library for Efficient Online Inference with Deep Neural Networks in PyTorch |
Lukas Hedegaard, Alexandros Iosifidis |
|
code |
-1 |
Hydra Attention: Efficient Attention with Many Heads |
Daniel Bolya, ChengYang Fu, Xiaoliang Dai, Peizhao Zhang, Judy Hoffman |
|
code |
-1 |
BiTAT: Neural Network Binarization with Task-Dependent Aggregated Transformation |
Geon Park, Jaehong Yoon, Haiyang Zhang, Xing Zhang, Sung Ju Hwang, Yonina C. Eldar |
|
code |
-1 |
Power Awareness in Low Precision Neural Networks |
Nurit SpingarnEliezer, Ron Banner, Hilla BenYaacov, Elad Hoffer, Tomer Michaeli |
|
code |
-1 |
Augmenting Legacy Networks for Flexible Inference |
Jason Clemons, Iuri Frosio, Maying Shen, Jose M. Alvarez, Stephen W. Keckler |
|
code |
-1 |
Deep Neural Network Compression for Image Inpainting |
Soyeong Kim, DoYeon Kim, Jaekyun Moon |
|
code |
-1 |
QFT: Post-training Quantization via Fast Joint Finetuning of All Degrees of Freedom |
Alexander Finkelstein, Ella Fuchs, Idan Tal, Mark Grobman, Niv Vosco, Eldad Meller |
|
code |
-1 |
Searching for N:M Fine-grained Sparsity of Weights and Activations in Neural Networks |
Ruth AkivaHochman, Shahaf E. Finder, Javier S. Turek, Eran Treister |
|
code |
-1 |
Image Illumination Enhancement for Construction Worker Pose Estimation in Low-light Conditions |
Xinyu Chen, Yantao Yu |
|
code |
-1 |
Towards an Error-free Deep Occupancy Detector for Smart Camera Parking System |
TungLam Duong, VanDuc Le, TienCuong Bui, HaiThien To |
|
code |
-1 |
CrackSeg9k: A Collection and Benchmark for Crack Segmentation Datasets and Frameworks |
Shreyas Kulkarni, Shreyas Singh, Dhananjay Balakrishnan, Siddharth Sharma, Saipraneeth Devunuri, Sai Chowdeswara Rao Korlapati |
|
code |
-1 |
PriSeg: IFC-Supported Primitive Instance Geometry Segmentation with Unsupervised Clustering |
Zhiqi Hu, Ioannis K. Brilakis |
|
code |
-1 |
Depth Contrast: Self-supervised Pretraining on 3DPM Images for Mining Material Classification |
Prakash Chandra Chhipa, Richa Upadhyay, Rajkumar Saini, Lars Lindqvist, Richard Nordenskjöld, Seiichi Uchida, Marcus Liwicki |
|
code |
-1 |
Facilitating Construction Scene Understanding Knowledge Sharing and Reuse via Lifelong Site Object Detection |
Ruoxin Xiong, Yuansheng Zhu, Yanyu Wang, Pengkun Liu, Pingbo Tang |
|
code |
-1 |
Model-Assisted Labeling via Explainability for Visual Inspection of Civil Infrastructures |
Klára Janousková, Mattia Rigotti, Ioana Giurgiu, Cristiano Malossi |
|
code |
-1 |
A Hyperspectral and RGB Dataset for Building Façade Segmentation |
Nariman Habili, Ernest Kwan, Weihao Li, Christfried Webers, Jeremy Oorloff, Mohammad Ali Armin, Lars Petersson |
|
code |
-1 |
Improving Object Detection in VHR Aerial Orthomosaics |
Tanguy Ophoff, Kristof Van Beeck, Toon Goedemé |
|
code |
-1 |
Active Learning for Imbalanced Civil Infrastructure Data |
Thomas Frick, Diego Antognini, Mattia Rigotti, Ioana Giurgiu, Benjamin F. Grewe, Cristiano Malossi |
|
code |
-1 |
UAV-Based Visual Remote Sensing for Automated Building Inspection |
Kushagra Srivastava, Dhruv Patel, Aditya Kumar Jha, Mohhit Kumar Jha, Jaskirat Singh, Ravi Kiran Sarvadevabhatla, Pradeep Kumar Ramancharla, Harikumar Kandath, K. Madhava Krishna |
|
code |
-1 |
ConSLAM: Periodically Collected Real-World Construction Dataset for SLAM and Progress Monitoring |
Maciej Trzeciak, Kacper Pluta, Yasmin Fathy, Lucio Alcalde, Stanley Chee, Antony Bromley, Ioannis K. Brilakis, Pierre Alliez |
|
code |
-1 |
NeuralSI: Structural Parameter Identification in Nonlinear Dynamical Systems |
Xuyang Li, Hamed Bolandi, Talal Salem, Nizar Lajnef, Vishnu Naresh Boddeti |
|
code |
-1 |
A Geometric-Relational Deep Learning Framework for BIM Object Classification |
Hairong Luo, Ge Gao, Han Huang, Ziyi Ke, Cheng Peng, Ming Gu |
|
code |
-1 |
Generating Construction Safety Observations via CLIP-Based Image-Language Embedding |
Wei Lun Tsai, Jacob J. Lin, ShangHsien Hsieh |
|
code |
-1 |
Harmonization of Diffusion MRI Data Obtained with Multiple Head Coils Using Hybrid CNNs |
Leon Weninger, Sandro Romanzetti, Julia Ebert, Kathrin Reetz, Dorit Merhof |
|
code |
-1 |
CCRL: Contrastive Cell Representation Learning |
Ramin Nakhli, Amirali Darbandsari, Hossein Farahani, Ali Bashashati |
|
code |
-1 |
Automatic Grading of Cervical Biopsies by Combining Full and Self-supervision |
Mélanie Lubrano, Tristan Lazard, Guillaume Balezo, Yaëlle BellahsenHarrar, Cécile Badoual, Sylvain Berlemont, Thomas Walter |
|
code |
-1 |
When CNN Meet with ViT: Towards Semi-supervised Learning for Multi-class Medical Image Semantic Segmentation |
Ziyang Wang, Tianze Li, JianQing Zheng, Baoru Huang |
|
code |
-1 |
Using Whole Slide Image Representations from Self-supervised Contrastive Learning for Melanoma Concordance Regression |
Sean Grullon, Vaughn Spurrier, Jiayi Zhao, Corey Chivers, Yang Jiang, Kiran Motaparthi, Jason B. Lee, Michael J. Bonham, Julianna D. Ianni |
|
code |
-1 |
Explainable Model for Localization of Spiculation in Lung Nodules |
Mirtha Lucas, Miguel Lerma, Jacob Furst, Daniela Raicu |
|
code |
-1 |
Self-supervised Pretraining for 2D Medical Image Segmentation |
András Kalapos, Bálint GyiresTóth |
|
code |
-1 |
CMC_v2: Towards More Accurate COVID-19 Detection with Discriminative Video Priors |
Junlin Hou, Jilan Xu, Nan Zhang, Yi Wang, Yuejie Zhang, Xiaobo Zhang, Rui Feng |
|
code |
-1 |
COVID Detection and Severity Prediction with 3D-ConvNeXt and Custom Pretrainings |
Daniel Kienzle, Julian Lorenz, Robin Schön, Katja Ludwig, Rainer Lienhart |
|
code |
-1 |
Two-Stage COVID19 Classification Using BERT Features |
Weijun Tan, Qi Yao, Jingfeng Liu |
|
code |
-1 |
PVT-COV19D: COVID-19 Detection Through Medical Image Classification Based on Pyramid Vision Transformer |
Lilang Zheng, Jiaxuan Fang, Xiaorun Tang, Hanzhang Li, Jiaxin Fan, Tianyi Wang, Rui Zhou, Zhaoyan Yan |
|
code |
-1 |
Boosting COVID-19 Severity Detection with Infection-Aware Contrastive Mixup Classification |
Junlin Hou, Jilan Xu, Nan Zhang, Yuejie Zhang, Xiaobo Zhang, Rui Feng |
|
code |
-1 |
Variability Matters: Evaluating Inter-Rater Variability in Histopathology for Robust Cell Detection |
Cholmin Kang, Chunggi Lee, Heon Song, Minuk Ma, Sérgio Pereira |
|
code |
-1 |
FUSION: Fully Unsupervised Test-Time Stain Adaptation via Fused Normalization Statistics |
Nilanjan Chattopadhyay, Shiv Gehlot, Nitin Singhal |
|
code |
-1 |
Relieving Pixel-Wise Labeling Effort for Pathology Image Segmentation with Self-training |
Romain Mormont, Mehdi Testouri, Raphaël Marée, Pierre Geurts |
|
code |
-1 |
CNR-IEMN-CD and CNR-IEMN-CSD Approaches for Covid-19 Detection and Covid-19 Severity Detection from 3D CT-scans |
Fares Bougourzi, Cosimo Distante, Fadi Dornaika, Abdelmalik TalebAhmed |
|
code |
-1 |
Representation Learning with Information Theory to Detect COVID-19 and Its Severity |
Abel Díaz Berenguer, Tanmoy Mukherjee, Yifei Da, Matías Nicolás Bossa, Maryna Kvasnytsia, Jef Vandemeulebroucke, Nikos Deligiannis, Hichem Sahli |
|
code |
-1 |
Spatial-Slice Feature Learning Using Visual Transformer and Essential Slices Selection Module for COVID-19 Detection of CT Scans in the Wild |
ChihChung Hsu, ChiHan Tsai, GuanLin Chen, SinDi Ma, ShenChieh Tai |
|
code |
-1 |
Multi-scale Attention-Based Multiple Instance Learning for Classification of Multi-gigapixel Histology Images |
Made Satria Wibawa, KwokWai Lo, Lawrence Young, Nasir M. Rajpoot |
|
code |
-1 |
A Deep Wavelet Network for High-Resolution Microscopy Hyperspectral Image Reconstruction |
Qian Wang, Zhao Chen |
|
code |
-1 |
Using a 3D ResNet for Detecting the Presence and Severity of COVID-19 from CT Scans |
Robert Turnbull |
|
code |
-1 |
AI-MIA: COVID-19 Detection and Severity Analysis Through Medical Imaging |
Dimitrios Kollias, Anastasios Arsenos, Stefanos D. Kollias |
|
code |
-1 |
Medical Image Segmentation: A Review of Modern Architectures |
Natalia Salpea, Paraskevi K. Tzouveli, Dimitrios Kollias |
|
code |
-1 |
Medical Image Super Resolution by Preserving Interpretable and Disentangled Features |
Dwarikanath Mahapatra, Behzad Bozorgtabar, Mauricio Reyes |
|
code |
-1 |
Multi-label Attention Map Assisted Deep Feature Learning for Medical Image Classification |
Dwarikanath Mahapatra, Mauricio Reyes |
|
code |
-1 |
Unsupervised Domain Adaptation Using Feature Disentanglement and GCNs for Medical Image Classification |
Dwarikanath Mahapatra, Steven Korevaar, Behzad Bozorgtabar, Ruwan B. Tennakoon |
|
code |
-1 |