Skip to content

Latest commit

 

History

History
690 lines (689 loc) · 207 KB

iclr2020.md

File metadata and controls

690 lines (689 loc) · 207 KB

ICLR2020 Paper List

论文 作者 摘要 代码 引用数
Large Batch Optimization for Deep Learning: Training BERT in 76 minutes Yang You, Jing Li, Sashank J. Reddi, Jonathan Hseu, Sanjiv Kumar, Srinadh Bhojanapalli, Xiaodan Song, James Demmel, Kurt Keutzer, ChoJui Hsieh code -1
SELF: Learning to Filter Noisy Labels with Self-Ensembling Duc Tam Nguyen, Chaithanya Kumar Mummadi, ThiPhuongNhung Ngo, Thi Hoai Phuong Nguyen, Laura Beggel, Thomas Brox code -1
Reinforcement Learning Based Graph-to-Sequence Model for Natural Question Generation Yu Chen, Lingfei Wu, Mohammed J. Zaki code -1
Sharing Knowledge in Multi-Task Deep Reinforcement Learning Carlo D'Eramo, Davide Tateo, Andrea Bonarini, Marcello Restelli, Jan Peters code -1
On the Weaknesses of Reinforcement Learning for Neural Machine Translation Leshem Choshen, Lior Fox, Zohar Aizenbud, Omri Abend code -1
StructPool: Structured Graph Pooling via Conditional Random Fields Hao Yuan, Shuiwang Ji code -1
Learning deep graph matching with channel-independent embedding and Hungarian attention Tianshu Yu, Runzhong Wang, Junchi Yan, Baoxin Li code -1
Graph inference learning for semi-supervised classification Chunyan Xu, Zhen Cui, Xiaobin Hong, Tong Zhang, Jian Yang, Wei Liu code -1
SQIL: Imitation Learning via Reinforcement Learning with Sparse Rewards Siddharth Reddy, Anca D. Dragan, Sergey Levine code -1
Neural Oblivious Decision Ensembles for Deep Learning on Tabular Data Sergei Popov, Stanislav Morozov, Artem Babenko code -1
Mutual Mean-Teaching: Pseudo Label Refinery for Unsupervised Domain Adaptation on Person Re-identification Yixiao Ge, Dapeng Chen, Hongsheng Li code -1
Automatically Discovering and Learning New Visual Categories with Ranking Statistics Kai Han, SylvestreAlvise Rebuffi, Sébastien Ehrhardt, Andrea Vedaldi, Andrew Zisserman code -1
Maxmin Q-learning: Controlling the Estimation Bias of Q-learning Qingfeng Lan, Yangchen Pan, Alona Fyshe, Martha White code -1
Federated Adversarial Domain Adaptation Xingchao Peng, Zijun Huang, Yizhe Zhu, Kate Saenko code -1
Depth-Adaptive Transformer Maha Elbayad, Jiatao Gu, Edouard Grave, Michael Auli code -1
DeepHoyer: Learning Sparser Neural Network with Differentiable Scale-Invariant Sparsity Measures Huanrui Yang, Wei Wen, Hai Li code -1
Evaluating The Search Phase of Neural Architecture Search Kaicheng Yu, Christian Sciuto, Martin Jaggi, Claudiu Musat, Mathieu Salzmann code -1
Diverse Trajectory Forecasting with Determinantal Point Processes Ye Yuan, Kris M. Kitani code -1
ProxSGD: Training Structured Neural Networks under Regularization and Constraints Yang Yang, Yaxiong Yuan, Avraam Chatzimichailidis, Ruud J. G. van Sloun, Lei Lei, Symeon Chatzinotas code -1
LAMOL: LAnguage MOdeling for Lifelong Language Learning FanKeng Sun, ChengHao Ho, HungYi Lee code -1
Learning Expensive Coordination: An Event-Based Deep RL Approach Zhenyu Shi, Runsheng Yu, Xinrun Wang, Rundong Wang, Youzhi Zhang, Hanjiang Lai, Bo An code -1
Curvature Graph Network Ze Ye, Kin Sum Liu, Tengfei Ma, Jie Gao, Chao Chen code -1
Distance-Based Learning from Errors for Confidence Calibration Chen Xing, Sercan Ömer Arik, Zizhao Zhang, Tomas Pfister code -1
Deep Learning of Determinantal Point Processes via Proper Spectral Sub-gradient Tianshu Yu, Yikang Li, Baoxin Li code -1
N-BEATS: Neural basis expansion analysis for interpretable time series forecasting Boris N. Oreshkin, Dmitri Carpov, Nicolas Chapados, Yoshua Bengio code -1
Automated Relational Meta-learning Huaxiu Yao, Xian Wu, Zhiqiang Tao, Yaliang Li, Bolin Ding, Ruirui Li, Zhenhui Li code -1
To Relieve Your Headache of Training an MRF, Take AdVIL Chongxuan Li, Chao Du, Kun Xu, Max Welling, Jun Zhu, Bo Zhang code -1
Linear Symmetric Quantization of Neural Networks for Low-precision Integer Hardware Xiandong Zhao, Ying Wang, Xuyi Cai, Cheng Liu, Lei Zhang code -1
Weakly Supervised Clustering by Exploiting Unique Class Count Mustafa Umit Oner, Hwee Kuan Lee, WingKin Sung code -1
Scalable and Order-robust Continual Learning with Additive Parameter Decomposition Jaehong Yoon, Saehoon Kim, Eunho Yang, Sung Ju Hwang code -1
Continual Learning with Adaptive Weights (CLAW) Tameem Adel, Han Zhao, Richard E. Turner code -1
Transferable Perturbations of Deep Feature Distributions Nathan Inkawhich, Kevin J. Liang, Lawrence Carin, Yiran Chen code -1
A Learning-based Iterative Method for Solving Vehicle Routing Problems Hao Lu, Xingwen Zhang, Shuang Yang code -1
Poly-encoders: Architectures and Pre-training Strategies for Fast and Accurate Multi-sentence Scoring Samuel Humeau, Kurt Shuster, MarieAnne Lachaux, Jason Weston code -1
AutoQ: Automated Kernel-Wise Neural Network Quantization Qian Lou, Feng Guo, Minje Kim, Lantao Liu, Lei Jiang code -1
Understanding Architectures Learnt by Cell-based Neural Architecture Search Yao Shu, Wei Wang, Shaofeng Cai code -1
SVQN: Sequential Variational Soft Q-Learning Networks Shiyu Huang, Hang Su, Jun Zhu, Ting Chen code -1
Ranking Policy Gradient Kaixiang Lin, Jiayu Zhou code -1
On Mutual Information Maximization for Representation Learning Michael Tschannen, Josip Djolonga, Paul K. Rubenstein, Sylvain Gelly, Mario Lucic code -1
Observational Overfitting in Reinforcement Learning Xingyou Song, Yiding Jiang, Stephen Tu, Yilun Du, Behnam Neyshabur code -1
Enhancing Transformation-Based Defenses Against Adversarial Attacks with a Distribution Classifier Connie Kou, Hwee Kuan Lee, EeChien Chang, Teck Khim Ng code -1
Additive Powers-of-Two Quantization: An Efficient Non-uniform Discretization for Neural Networks Yuhang Li, Xin Dong, Wei Wang code -1
Lazy-CFR: fast and near-optimal regret minimization for extensive games with imperfect information Yichi Zhou, Tongzheng Ren, Jialian Li, Dong Yan, Jun Zhu code -1
Knowledge Consistency between Neural Networks and Beyond Ruofan Liang, Tianlin Li, Longfei Li, Jing Wang, Quanshi Zhang code -1
Image-guided Neural Object Rendering Justus Thies, Michael Zollhöfer, Christian Theobalt, Marc Stamminger, Matthias Nießner code -1
Implicit Bias of Gradient Descent based Adversarial Training on Separable Data Yan Li, Ethan X. Fang, Huan Xu, Tuo Zhao code -1
TabFact: A Large-scale Dataset for Table-based Fact Verification Wenhu Chen, Hongmin Wang, Jianshu Chen, Yunkai Zhang, Hong Wang, Shiyang Li, Xiyou Zhou, William Yang Wang code -1
ES-MAML: Simple Hessian-Free Meta Learning Xingyou Song, Wenbo Gao, Yuxiang Yang, Krzysztof Choromanski, Aldo Pacchiano, Yunhao Tang code -1
Neural Stored-program Memory Hung Le, Truyen Tran, Svetha Venkatesh code -1
Hierarchical Foresight: Self-Supervised Learning of Long-Horizon Tasks via Visual Subgoal Generation Suraj Nair, Chelsea Finn code -1
Multi-agent Reinforcement Learning for Networked System Control Tianshu Chu, Sandeep Chinchali, Sachin Katti code -1
FSPool: Learning Set Representations with Featurewise Sort Pooling Yan Zhang, Jonathon S. Hare, Adam PrügelBennett code -1
Are Pre-trained Language Models Aware of Phrases? Simple but Strong Baselines for Grammar Induction Taeuk Kim, Jihun Choi, Daniel Edmiston, Sanggoo Lee code -1
Dynamically Pruned Message Passing Networks for Large-scale Knowledge Graph Reasoning Xiaoran Xu, Wei Feng, Yunsheng Jiang, Xiaohui Xie, Zhiqing Sun, ZhiHong Deng code -1
Mixup Inference: Better Exploiting Mixup to Defend Adversarial Attacks Tianyu Pang, Kun Xu, Jun Zhu code -1
Theory and Evaluation Metrics for Learning Disentangled Representations Kien Do, Truyen Tran code -1
Measuring Compositional Generalization: A Comprehensive Method on Realistic Data Daniel Keysers, Nathanael Schärli, Nathan Scales, Hylke Buisman, Daniel Furrer, Sergii Kashubin, Nikola Momchev, Danila Sinopalnikov, Lukasz Stafiniak, Tibor Tihon, Dmitry Tsarkov, Xiao Wang, Marc van Zee, Olivier Bousquet code -1
Rethinking Softmax Cross-Entropy Loss for Adversarial Robustness Tianyu Pang, Kun Xu, Yinpeng Dong, Chao Du, Ning Chen, Jun Zhu code -1
The Implicit Bias of Depth: How Incremental Learning Drives Generalization Daniel Gissin, Shai ShalevShwartz, Amit Daniely code -1
The Variational Bandwidth Bottleneck: Stochastic Evaluation on an Information Budget Anirudh Goyal, Yoshua Bengio, Matthew M. Botvinick, Sergey Levine code -1
Learning the Arrow of Time for Problems in Reinforcement Learning Nasim Rahaman, Steffen Wolf, Anirudh Goyal, Roman Remme, Yoshua Bengio code -1
Reinforcement Learning with Competitive Ensembles of Information-Constrained Primitives Anirudh Goyal, Shagun Sodhani, Jonathan Binas, Xue Bin Peng, Sergey Levine, Yoshua Bengio code -1
Robust Local Features for Improving the Generalization of Adversarial Training Chuanbiao Song, Kun He, Jiadong Lin, Liwei Wang, John E. Hopcroft code -1
Analysis of Video Feature Learning in Two-Stream CNNs on the Example of Zebrafish Swim Bout Classification Bennet Breier, Arno Onken code -1
Learning Disentangled Representations for CounterFactual Regression Negar Hassanpour, Russell Greiner code -1
Exploration in Reinforcement Learning with Deep Covering Options Yuu Jinnai, Jee Won Park, Marlos C. Machado, George Dimitri Konidaris code -1
Ae-OT: a New Generative Model based on Extended Semi-discrete Optimal transport Dongsheng An, Yang Guo, Na Lei, Zhongxuan Luo, ShingTung Yau, Xianfeng Gu code -1
Logic and the 2-Simplicial Transformer James Clift, Dmitry Doryn, Daniel Murfet, James Wallbridge code -1
Watch, Try, Learn: Meta-Learning from Demonstrations and Rewards Allan Zhou, Eric Jang, Daniel Kappler, Alexander Herzog, Mohi Khansari, Paul Wohlhart, Yunfei Bai, Mrinal Kalakrishnan, Sergey Levine, Chelsea Finn code -1
Fooling Detection Alone is Not Enough: Adversarial Attack against Multiple Object Tracking Yunhan Jia, Yantao Lu, Junjie Shen, Qi Alfred Chen, Hao Chan, Zhenyu Zhong, Tao Wei code -1
DivideMix: Learning with Noisy Labels as Semi-supervised Learning Junnan Li, Richard Socher, Steven C. H. Hoi code -1
Improving Adversarial Robustness Requires Revisiting Misclassified Examples Yisen Wang, Difan Zou, Jinfeng Yi, James Bailey, Xingjun Ma, Quanquan Gu code -1
V-MPO: On-Policy Maximum a Posteriori Policy Optimization for Discrete and Continuous Control H. Francis Song, Abbas Abdolmaleki, Jost Tobias Springenberg, Aidan Clark, Hubert Soyer, Jack W. Rae, Seb Noury, Arun Ahuja, Siqi Liu, Dhruva Tirumala, Nicolas Heess, Dan Belov, Martin A. Riedmiller, Matthew M. Botvinick code -1
Interpretable Complex-Valued Neural Networks for Privacy Protection Liyao Xiang, Hao Zhang, Haotian Ma, Yifan Zhang, Jie Ren, Quanshi Zhang code -1
Accelerating SGD with momentum for over-parameterized learning Chaoyue Liu, Mikhail Belkin code -1
A critical analysis of self-supervision, or what we can learn from a single image Yuki Markus Asano, Christian Rupprecht, Andrea Vedaldi code -1
Disentangling Factors of Variations Using Few Labels Francesco Locatello, Michael Tschannen, Stefan Bauer, Gunnar Rätsch, Bernhard Schölkopf, Olivier Bachem code -1
Functional vs. parametric equivalence of ReLU networks Mary Phuong, Christoph H. Lampert code -1
Input Complexity and Out-of-distribution Detection with Likelihood-based Generative Models Joan Serrà, David Álvarez, Vicenç Gómez, Olga Slizovskaia, José F. Núñez, Jordi Luque code -1
RTFM: Generalising to New Environment Dynamics via Reading Victor Zhong, Tim Rocktäschel, Edward Grefenstette code -1
What graph neural networks cannot learn: depth vs width Andreas Loukas code -1
Progressive Memory Banks for Incremental Domain Adaptation Nabiha Asghar, Lili Mou, Kira A. Selby, Kevin D. Pantasdo, Pascal Poupart, Xin Jiang code -1
Automated curriculum generation through setter-solver interactions Sébastien Racanière, Andrew K. Lampinen, Adam Santoro, David P. Reichert, Vlad Firoiu, Timothy P. Lillicrap code -1
On Identifiability in Transformers Gino Brunner, Yang Liu, Damian Pascual, Oliver Richter, Massimiliano Ciaramita, Roger Wattenhofer code -1
Exploring Model-based Planning with Policy Networks Tingwu Wang, Jimmy Ba code -1
Learning Self-Correctable Policies and Value Functions from Demonstrations with Negative Sampling Yuping Luo, Huazhe Xu, Tengyu Ma code -1
Geometric Insights into the Convergence of Nonlinear TD Learning David Brandfonbrener, Joan Bruna code -1
Few-shot Text Classification with Distributional Signatures Yujia Bao, Menghua Wu, Shiyu Chang, Regina Barzilay code -1
Escaping Saddle Points Faster with Stochastic Momentum JunKun Wang, ChiHeng Lin, Jacob D. Abernethy code -1
Adversarial Policies: Attacking Deep Reinforcement Learning Adam Gleave, Michael Dennis, Cody Wild, Neel Kant, Sergey Levine, Stuart Russell code -1
VideoFlow: A Conditional Flow-Based Model for Stochastic Video Generation Manoj Kumar, Mohammad Babaeizadeh, Dumitru Erhan, Chelsea Finn, Sergey Levine, Laurent Dinh, Durk Kingma code -1
GLAD: Learning Sparse Graph Recovery Harsh Shrivastava, Xinshi Chen, Binghong Chen, Guanghui Lan, Srinivas Aluru, Han Liu, Le Song code -1
Pruned Graph Scattering Transforms Vassilis N. Ioannidis, Siheng Chen, Georgios B. Giannakis code -1
Pretrained Encyclopedia: Weakly Supervised Knowledge-Pretrained Language Model Wenhan Xiong, Jingfei Du, William Yang Wang, Veselin Stoyanov code -1
Can gradient clipping mitigate label noise? Aditya Krishna Menon, Ankit Singh Rawat, Sashank J. Reddi, Sanjiv Kumar code -1
Editable Neural Networks Anton Sinitsin, Vsevolod Plokhotnyuk, Dmitry V. Pyrkin, Sergei Popov, Artem Babenko code -1
Learning Execution through Neural Code fusion Zhan Shi, Kevin Swersky, Daniel Tarlow, Parthasarathy Ranganathan, Milad Hashemi code -1
FasterSeg: Searching for Faster Real-time Semantic Segmentation Wuyang Chen, Xinyu Gong, Xianming Liu, Qian Zhang, Yuan Li, Zhangyang Wang code -1
Difference-Seeking Generative Adversarial Network-Unseen Sample Generation Yi Lin Sung, SungHsien Hsieh, SooChang Pei, ChunShien Lu code -1
Stochastic AUC Maximization with Deep Neural Networks Mingrui Liu, Zhuoning Yuan, Yiming Ying, Tianbao Yang code -1
Semantically-Guided Representation Learning for Self-Supervised Monocular Depth Vitor Guizilini, Rui Hou, Jie Li, Rares Ambrus, Adrien Gaidon code -1
MACER: Attack-free and Scalable Robust Training via Maximizing Certified Radius Runtian Zhai, Chen Dan, Di He, Huan Zhang, Boqing Gong, Pradeep Ravikumar, ChoJui Hsieh, Liwei Wang code -1
Detecting and Diagnosing Adversarial Images with Class-Conditional Capsule Reconstructions Yao Qin, Nicholas Frosst, Sara Sabour, Colin Raffel, Garrison W. Cottrell, Geoffrey E. Hinton code -1
GAT: Generative Adversarial Training for Adversarial Example Detection and Robust Classification Xuwang Yin, Soheil Kolouri, Gustavo K. Rohde code -1
Variational Recurrent Models for Solving Partially Observable Control Tasks Dongqi Han, Kenji Doya, Jun Tani code -1
Population-Guided Parallel Policy Search for Reinforcement Learning Whiyoung Jung, Giseung Park, Youngchul Sung code -1
Compositional languages emerge in a neural iterated learning model Yi Ren, Shangmin Guo, Matthieu Labeau, Shay B. Cohen, Simon Kirby code -1
Black-Box Adversarial Attack with Transferable Model-based Embedding Zhichao Huang, Tong Zhang code -1
Mixout: Effective Regularization to Finetune Large-scale Pretrained Language Models Cheolhyoung Lee, Kyunghyun Cho, Wanmo Kang code -1
Q-learning with UCB Exploration is Sample Efficient for Infinite-Horizon MDP Yuanhao Wang, Kefan Dong, Xiaoyu Chen, Liwei Wang code -1
Deep Network Classification by Scattering and Homotopy Dictionary Learning John Zarka, Louis Thiry, Tomás Angles, Stéphane Mallat code -1
Data-Independent Neural Pruning via Coresets Ben Mussay, Margarita Osadchy, Vladimir Braverman, Samson Zhou, Dan Feldman code -1
Bounds on Over-Parameterization for Guaranteed Existence of Descent Paths in Shallow ReLU Networks Arsalan SharifNassab, Saber Salehkaleybar, S. Jamaloddin Golestani code -1
Novelty Detection Via Blurring SungIk Choi, SaeYoung Chung code -1
Piecewise linear activations substantially shape the loss surfaces of neural networks Fengxiang He, Bohan Wang, Dacheng Tao code -1
Relational State-Space Model for Stochastic Multi-Object Systems Fan Yang, Ling Chen, Fan Zhou, Yusong Gao, Wei Cao code -1
Learning Efficient Parameter Server Synchronization Policies for Distributed SGD Rong Zhu, Sheng Yang, Andreas Pfadler, Zhengping Qian, Jingren Zhou code -1
Action Semantics Network: Considering the Effects of Actions in Multiagent Systems Weixun Wang, Tianpei Yang, Yong Liu, Jianye Hao, Xiaotian Hao, Yujing Hu, Yingfeng Chen, Changjie Fan, Yang Gao code -1
Vid2Game: Controllable Characters Extracted from Real-World Videos Oran Gafni, Lior Wolf, Yaniv Taigman code -1
Self-Adversarial Learning with Comparative Discrimination for Text Generation Wangchunshu Zhou, Tao Ge, Ke Xu, Furu Wei, Ming Zhou code -1
Robust training with ensemble consensus Jisoo Lee, SaeYoung Chung code -1
Identifying through Flows for Recovering Latent Representations Shen Li, Bryan Hooi, Gim Hee Lee code -1
Certified Robustness for Top-k Predictions against Adversarial Perturbations via Randomized Smoothing Jinyuan Jia, Xiaoyu Cao, Binghui Wang, Neil Zhenqiang Gong code -1
Optimistic Exploration even with a Pessimistic Initialisation Tabish Rashid, Bei Peng, Wendelin Boehmer, Shimon Whiteson code -1
VL-BERT: Pre-training of Generic Visual-Linguistic Representations Weijie Su, Xizhou Zhu, Yue Cao, Bin Li, Lewei Lu, Furu Wei, Jifeng Dai code -1
Deformable Kernels: Adapting Effective Receptive Fields for Object Deformation Hang Gao, Xizhou Zhu, Stephen Lin, Jifeng Dai code -1
Ensemble Distribution Distillation Andrey Malinin, Bruno Mlodozeniec, Mark J. F. Gales code -1
Gap-Aware Mitigation of Gradient Staleness Saar Barkai, Ido Hakimi, Assaf Schuster code -1
Counterfactuals uncover the modular structure of deep generative models Michel Besserve, Arash Mehrjou, Rémy Sun, Bernhard Schölkopf code -1
Physics-as-Inverse-Graphics: Unsupervised Physical Parameter Estimation from Video Miguel Jaques, Michael Burke, Timothy M. Hospedales code -1
An Inductive Bias for Distances: Neural Nets that Respect the Triangle Inequality Silviu Pitis, Harris Chan, Kiarash Jamali, Jimmy Ba code -1
A Constructive Prediction of the Generalization Error Across Scales Jonathan S. Rosenfeld, Amir Rosenfeld, Yonatan Belinkov, Nir Shavit code -1
Scalable Neural Methods for Reasoning With a Symbolic Knowledge Base William W. Cohen, Haitian Sun, R. Alex Hofer, Matthew Siegler code -1
CLN2INV: Learning Loop Invariants with Continuous Logic Networks Gabriel Ryan, Justin Wong, Jianan Yao, Ronghui Gu, Suman Jana code -1
NAS evaluation is frustratingly hard Antoine Yang, Pedro M. Esperança, Fabio Maria Carlucci code -1
Efficient and Information-Preserving Future Frame Prediction and Beyond Wei Yu, Yichao Lu, Steve Easterbrook, Sanja Fidler code -1
Order Learning and Its Application to Age Estimation Kyungsun Lim, NyeongHo Shin, YoungYoon Lee, ChangSu Kim code -1
ReClor: A Reading Comprehension Dataset Requiring Logical Reasoning Weihao Yu, Zihang Jiang, Yanfei Dong, Jiashi Feng code -1
AssembleNet: Searching for Multi-Stream Neural Connectivity in Video Architectures Michael S. Ryoo, A. J. Piergiovanni, Mingxing Tan, Anelia Angelova code -1
Adversarially Robust Representations with Smooth Encoders A. Taylan Cemgil, Sumedh Ghaisas, Krishnamurthy (Dj) Dvijotham, Pushmeet Kohli code -1
From Variational to Deterministic Autoencoders Partha Ghosh, Mehdi S. M. Sajjadi, Antonio Vergari, Michael J. Black, Bernhard Schölkopf code -1
Computation Reallocation for Object Detection Feng Liang, Chen Lin, Ronghao Guo, Ming Sun, Wei Wu, Junjie Yan, Wanli Ouyang code -1
Finding and Visualizing Weaknesses of Deep Reinforcement Learning Agents Christian Rupprecht, Cyril Ibrahim, Christopher J. Pal code -1
A Fair Comparison of Graph Neural Networks for Graph Classification Federico Errica, Marco Podda, Davide Bacciu, Alessio Micheli code -1
Generalization bounds for deep convolutional neural networks Philip M. Long, Hanie Sedghi code -1
SAdam: A Variant of Adam for Strongly Convex Functions Guanghui Wang, Shiyin Lu, Quan Cheng, Weiwei Tu, Lijun Zhang code -1
Continual Learning with Bayesian Neural Networks for Non-Stationary Data Richard Kurle, Botond Cseke, Alexej Klushyn, Patrick van der Smagt, Stephan Günnemann code -1
Multiplicative Interactions and Where to Find Them Siddhant M. Jayakumar, Wojciech M. Czarnecki, Jacob Menick, Jonathan Schwarz, Jack W. Rae, Simon Osindero, Yee Whye Teh, Tim Harley, Razvan Pascanu code -1
Few-Shot Learning on graphs via super-Classes based on Graph spectral Measures Jatin Chauhan, Deepak Nathani, Manohar Kaul code -1
On Computation and Generalization of Generative Adversarial Imitation Learning Minshuo Chen, Yizhou Wang, Tianyi Liu, Zhuoran Yang, Xingguo Li, Zhaoran Wang, Tuo Zhao code -1
A Target-Agnostic Attack on Deep Models: Exploiting Security Vulnerabilities of Transfer Learning Shahbaz Rezaei, Xin Liu code -1
Low-Resource Knowledge-Grounded Dialogue Generation Xueliang Zhao, Wei Wu, Chongyang Tao, Can Xu, Dongyan Zhao, Rui Yan code -1
Deep 3D Pan via local adaptive "t-shaped" convolutions with global and local adaptive dilations Juan Luis Gonzalez Bello, Munchurl Kim code -1
Tree-Structured Attention with Hierarchical Accumulation XuanPhi Nguyen, Shafiq R. Joty, Steven C. H. Hoi, Richard Socher code -1
The asymptotic spectrum of the Hessian of DNN throughout training Arthur Jacot, Franck Gabriel, Clément Hongler code -1
Actor-Critic Provably Finds Nash Equilibria of Linear-Quadratic Mean-Field Games Zuyue Fu, Zhuoran Yang, Yongxin Chen, Zhaoran Wang code -1
In Search for a SAT-friendly Binarized Neural Network Architecture Nina Narodytska, Hongce Zhang, Aarti Gupta, Toby Walsh code -1
Generative Ratio Matching Networks Akash Srivastava, Kai Xu, Michael U. Gutmann, Charles Sutton code -1
Learning to Represent Programs with Property Signatures Augustus Odena, Charles Sutton code -1
V4D: 4D Convolutional Neural Networks for Video-level Representation Learning Shiwen Zhang, Sheng Guo, Weilin Huang, Matthew R. Scott, Limin Wang code -1
Option Discovery using Deep Skill Chaining Akhil Bagaria, George Konidaris code -1
Quantifying the Cost of Reliable Photo Authentication via High-Performance Learned Lossy Representations Pawel Korus, Nasir D. Memon code -1
On the Variance of the Adaptive Learning Rate and Beyond Liyuan Liu, Haoming Jiang, Pengcheng He, Weizhu Chen, Xiaodong Liu, Jianfeng Gao, Jiawei Han code -1
Dynamical Distance Learning for Semi-Supervised and Unsupervised Skill Discovery Kristian Hartikainen, Xinyang Geng, Tuomas Haarnoja, Sergey Levine code -1
A Theoretical Analysis of the Number of Shots in Few-Shot Learning Tianshi Cao, Marc T. Law, Sanja Fidler code -1
Unsupervised Model Selection for Variational Disentangled Representation Learning Sunny Duan, Loic Matthey, Andre Saraiva, Nick Watters, Chris Burgess, Alexander Lerchner, Irina Higgins code -1
Feature Interaction Interpretability: A Case for Explaining Ad-Recommendation Systems via Neural Interaction Detection Michael Tsang, Dehua Cheng, Hanpeng Liu, Xue Feng, Eric Zhou, Yan Liu code -1
Understanding the Limitations of Variational Mutual Information Estimators Jiaming Song, Stefano Ermon code -1
GENESIS: Generative Scene Inference and Sampling with Object-Centric Latent Representations Martin Engelcke, Adam R. Kosiorek, Oiwi Parker Jones, Ingmar Posner code -1
Language GANs Falling Short Massimo Caccia, Lucas Caccia, William Fedus, Hugo Larochelle, Joelle Pineau, Laurent Charlin code -1
Stochastic Conditional Generative Networks with Basis Decomposition Ze Wang, Xiuyuan Cheng, Guillermo Sapiro, Qiang Qiu code -1
Learned Step Size quantization Steven K. Esser, Jeffrey L. McKinstry, Deepika Bablani, Rathinakumar Appuswamy, Dharmendra S. Modha code -1
On the "steerability" of generative adversarial networks Ali Jahanian, Lucy Chai, Phillip Isola code -1
Reinforced active learning for image segmentation Arantxa Casanova, Pedro O. Pinheiro, Negar Rostamzadeh, Christopher J. Pal code -1
Sign Bits Are All You Need for Black-Box Attacks Abdullah AlDujaili, UnaMay O'Reilly code -1
Deep Semi-Supervised Anomaly Detection Lukas Ruff, Robert A. Vandermeulen, Nico Görnitz, Alexander Binder, Emmanuel Müller, KlausRobert Müller, Marius Kloft code -1
Budgeted Training: Rethinking Deep Neural Network Training Under Resource Constraints Mengtian Li, Ersin Yumer, Deva Ramanan code -1
Minimizing FLOPs to Learn Efficient Sparse Representations Biswajit Paria, ChihKuan Yeh, Ian EnHsu Yen, Ning Xu, Pradeep Ravikumar, Barnabás Póczos code -1
Reanalysis of Variance Reduced Temporal Difference Learning Tengyu Xu, Zhe Wang, Yi Zhou, Yingbin Liang code -1
Imitation Learning via Off-Policy Distribution Matching Ilya Kostrikov, Ofir Nachum, Jonathan Tompson code -1
Rapid Learning or Feature Reuse? Towards Understanding the Effectiveness of MAML Aniruddh Raghu, Maithra Raghu, Samy Bengio, Oriol Vinyals code -1
Augmenting Genetic Algorithms with Deep Neural Networks for Exploring the Chemical Space AkshatKumar Nigam, Pascal Friederich, Mario Krenn, Alán AspuruGuzik code -1
Improved Sample Complexities for Deep Neural Networks and Robust Classification via an All-Layer Margin Colin Wei, Tengyu Ma code -1
Identity Crisis: Memorization and Generalization Under Extreme Overparameterization Chiyuan Zhang, Samy Bengio, Moritz Hardt, Michael C. Mozer, Yoram Singer code -1
ReMixMatch: Semi-Supervised Learning with Distribution Matching and Augmentation Anchoring David Berthelot, Nicholas Carlini, Ekin D. Cubuk, Alex Kurakin, Kihyuk Sohn, Han Zhang, Colin Raffel code -1
Adaptive Structural Fingerprints for Graph Attention Networks Kai Zhang, Yaokang Zhu, Jun Wang, Jie Zhang code -1
CAQL: Continuous Action Q-Learning Moonkyung Ryu, Yinlam Chow, Ross Anderson, Christian Tjandraatmadja, Craig Boutilier code -1
Learning Heuristics for Quantified Boolean Formulas through Reinforcement Learning Gil Lederman, Markus N. Rabe, Sanjit Seshia, Edward A. Lee code -1
Pure and Spurious Critical Points: a Geometric Study of Linear Networks Matthew Trager, Kathlén Kohn, Joan Bruna code -1
Neural Text Generation With Unlikelihood Training Sean Welleck, Ilia Kulikov, Stephen Roller, Emily Dinan, Kyunghyun Cho, Jason Weston code -1
Semi-Supervised Generative Modeling for Controllable Speech Synthesis Raza Habib, Soroosh Mariooryad, Matt Shannon, Eric Battenberg, R. J. SkerryRyan, Daisy Stanton, David Kao, Tom Bagby code -1
Dynamic Time Lag Regression: Predicting What & When Mandar Chandorkar, Cyril Furtlehner, Bala Poduval, Enrico Camporeale, Michèle Sebag code -1
Scalable Model Compression by Entropy Penalized Reparameterization Deniz Oktay, Johannes Ballé, Saurabh Singh, Abhinav Shrivastava code -1
AMRL: Aggregated Memory For Reinforcement Learning Jacob Beck, Kamil Ciosek, Sam Devlin, Sebastian Tschiatschek, Cheng Zhang, Katja Hofmann code -1
Efficient Riemannian Optimization on the Stiefel Manifold via the Cayley Transform Jun Li, Fuxin Li, Sinisa Todorovic code -1
Unpaired Point Cloud Completion on Real Scans using Adversarial Training Xuelin Chen, Baoquan Chen, Niloy J. Mitra code -1
Adjustable Real-time Style Transfer Mohammad Babaeizadeh, Golnaz Ghiasi code -1
Stochastic Weight Averaging in Parallel: Large-Batch Training That Generalizes Well Vipul Gupta, Santiago Akle Serrano, Dennis DeCoste code -1
Short and Sparse Deconvolution - A Geometric Approach Yenson Lau, Qing Qu, HanWen Kuo, Pengcheng Zhou, Yuqian Zhang, John Wright code -1
Selection via Proxy: Efficient Data Selection for Deep Learning Cody Coleman, Christopher Yeh, Stephen Mussmann, Baharan Mirzasoleiman, Peter Bailis, Percy Liang, Jure Leskovec, Matei Zaharia code -1
Global Relational Models of Source Code Vincent J. Hellendoorn, Charles Sutton, Rishabh Singh, Petros Maniatis, David Bieber code -1
Detecting Extrapolation with Local Ensembles David Madras, James Atwood, Alexander D'Amour code -1
Learning to Link MariaFlorina Balcan, Travis Dick, Manuel Lang code -1
Adversarially robust transfer learning Ali Shafahi, Parsa Saadatpanah, Chen Zhu, Amin Ghiasi, Christoph Studer, David W. Jacobs, Tom Goldstein code -1
Overlearning Reveals Sensitive Attributes Congzheng Song, Vitaly Shmatikov code -1
Bridging Mode Connectivity in Loss Landscapes and Adversarial Robustness Pu Zhao, PinYu Chen, Payel Das, Karthikeyan Natesan Ramamurthy, Xue Lin code -1
Differentially Private Meta-Learning Jeffrey Li, Mikhail Khodak, Sebastian Caldas, Ameet Talwalkar code -1
One-Shot Pruning of Recurrent Neural Networks by Jacobian Spectrum Evaluation Matthew Shunshi Zhang, Bradly C. Stadie code -1
Meta-Dataset: A Dataset of Datasets for Learning to Learn from Few Examples Eleni Triantafillou, Tyler Zhu, Vincent Dumoulin, Pascal Lamblin, Utku Evci, Kelvin Xu, Ross Goroshin, Carles Gelada, Kevin Swersky, PierreAntoine Manzagol, Hugo Larochelle code -1
Are Transformers universal approximators of sequence-to-sequence functions? Chulhee Yun, Srinadh Bhojanapalli, Ankit Singh Rawat, Sashank J. Reddi, Sanjiv Kumar code -1
Pre-training Tasks for Embedding-based Large-scale Retrieval WeiCheng Chang, Felix X. Yu, YinWen Chang, Yiming Yang, Sanjiv Kumar code -1
Deep Imitative Models for Flexible Inference, Planning, and Control Nicholas Rhinehart, Rowan McAllister, Sergey Levine code -1
CM3: Cooperative Multi-goal Multi-stage Multi-agent Reinforcement Learning Jiachen Yang, Alireza Nakhaei, David Isele, Kikuo Fujimura, Hongyuan Zha code -1
Robust And Interpretable Blind Image Denoising Via Bias-Free Convolutional Neural Networks Sreyas Mohan, Zahra Kadkhodaie, Eero P. Simoncelli, Carlos FernandezGranda code -1
Towards Better Understanding of Adaptive Gradient Algorithms in Generative Adversarial Nets Mingrui Liu, Youssef Mroueh, Jerret Ross, Wei Zhang, Xiaodong Cui, Payel Das, Tianbao Yang code -1
DeepV2D: Video to Depth with Differentiable Structure from Motion Zachary Teed, Jia Deng code -1
Learning Space Partitions for Nearest Neighbor Search Yihe Dong, Piotr Indyk, Ilya P. Razenshteyn, Tal Wagner code -1
Playing the lottery with rewards and multiple languages: lottery tickets in RL and NLP Haonan Yu, Sergey Edunov, Yuandong Tian, Ari S. Morcos code -1
Sign-OPT: A Query-Efficient Hard-label Adversarial Attack Minhao Cheng, Simranjit Singh, Patrick H. Chen, PinYu Chen, Sijia Liu, ChoJui Hsieh code -1
RaCT: Toward Amortized Ranking-Critical Training For Collaborative Filtering Sam Lobel, Chunyuan Li, Jianfeng Gao, Lawrence Carin code -1
Intrinsic Motivation for Encouraging Synergistic Behavior Rohan Chitnis, Shubham Tulsiani, Saurabh Gupta, Abhinav Gupta code -1
Chameleon: Adaptive Code Optimization for Expedited Deep Neural Network Compilation Byung Hoon Ahn, Prannoy Pilligundla, Amir Yazdanbakhsh, Hadi Esmaeilzadeh code -1
Recurrent neural circuits for contour detection Drew Linsley, Junkyung Kim, Alekh Ashok, Thomas Serre code -1
Locality and Compositionality in Zero-Shot Learning Tristan Sylvain, Linda Petrini, R. Devon Hjelm code -1
Understanding Knowledge Distillation in Non-autoregressive Machine Translation Chunting Zhou, Jiatao Gu, Graham Neubig code -1
Thieves on Sesame Street! Model Extraction of BERT-based APIs Kalpesh Krishna, Gaurav Singh Tomar, Ankur P. Parikh, Nicolas Papernot, Mohit Iyyer code -1
Fast is better than free: Revisiting adversarial training Eric Wong, Leslie Rice, J. Zico Kolter code -1
DBA: Distributed Backdoor Attacks against Federated Learning Chulin Xie, Keli Huang, PinYu Chen, Bo Li code -1
DeFINE: Deep Factorized Input Token Embeddings for Neural Sequence Modeling Sachin Mehta, Rik KoncelKedziorski, Mohammad Rastegari, Hannaneh Hajishirzi code -1
Sampling-Free Learning of Bayesian Quantized Neural Networks Jiahao Su, Milan Cvitkovic, Furong Huang code -1
Learning to solve the credit assignment problem Benjamin James Lansdell, Prashanth Ravi Prakash, Konrad Paul Körding code -1
Four Things Everyone Should Know to Improve Batch Normalization Cecilia Summers, Michael J. Dinneen code -1
Pseudo-LiDAR++: Accurate Depth for 3D Object Detection in Autonomous Driving Yurong You, Yan Wang, WeiLun Chao, Divyansh Garg, Geoff Pleiss, Bharath Hariharan, Mark E. Campbell, Kilian Q. Weinberger code -1
SlowMo: Improving Communication-Efficient Distributed SGD with Slow Momentum Jianyu Wang, Vinayak Tantia, Nicolas Ballas, Michael G. Rabbat code -1
MetaPix: Few-Shot Video Retargeting Jessica Lee, Deva Ramanan, Rohit Girdhar code -1
Learning to Learn by Zeroth-Order Oracle Yangjun Ruan, Yuanhao Xiong, Sashank J. Reddi, Sanjiv Kumar, ChoJui Hsieh code -1
DD-PPO: Learning Near-Perfect PointGoal Navigators from 2.5 Billion Frames Erik Wijmans, Abhishek Kadian, Ari Morcos, Stefan Lee, Irfan Essa, Devi Parikh, Manolis Savva, Dhruv Batra code -1
PAC Confidence Sets for Deep Neural Networks via Calibrated Prediction Sangdon Park, Osbert Bastani, Nikolai Matni, Insup Lee code -1
Precision Gating: Improving Neural Network Efficiency with Dynamic Dual-Precision Activations Yichi Zhang, Ritchie Zhao, Weizhe Hua, Nayun Xu, G. Edward Suh, Zhiru Zhang code -1
Oblique Decision Trees from Derivatives of ReLU Networks GuangHe Lee, Tommi S. Jaakkola code -1
Span Recovery for Deep Neural Networks with Applications to Input Obfuscation Rajesh Jayaram, David P. Woodruff, Qiuyi Zhang code -1
Improving Neural Language Generation with Spectrum Control Lingxiao Wang, Jing Huang, Kevin Huang, Ziniu Hu, Guangtao Wang, Quanquan Gu code -1
Learn to Explain Efficiently via Neural Logic Inductive Learning Yuan Yang, Le Song code -1
Improved memory in recurrent neural networks with sequential non-normal dynamics A. Emin Orhan, Xaq Pitkow code -1
Neural Module Networks for Reasoning over Text Nitish Gupta, Kevin Lin, Dan Roth, Sameer Singh, Matt Gardner code -1
Higher-Order Function Networks for Learning Composable 3D Object Representations Eric Mitchell, Selim Engin, Volkan Isler, Daniel D. Lee code -1
Variational Hetero-Encoder Randomized GANs for Joint Image-Text Modeling Hao Zhang, Bo Chen, Long Tian, Zhengjue Wang, Mingyuan Zhou code -1
Towards Fast Adaptation of Neural Architectures with Meta Learning Dongze Lian, Yin Zheng, Yintao Xu, Yanxiong Lu, Leyu Lin, Peilin Zhao, Junzhou Huang, Shenghua Gao code -1
Graph Constrained Reinforcement Learning for Natural Language Action Spaces Prithviraj Ammanabrolu, Matthew J. Hausknecht code -1
Prediction, Consistency, Curvature: Representation Learning for Locally-Linear Control Nir Levine, Yinlam Chow, Rui Shu, Ang Li, Mohammad Ghavamzadeh, Hung Bui code -1
Augmenting Non-Collaborative Dialog Systems with Explicit Semantic and Strategic Dialog History Yiheng Zhou, Yulia Tsvetkov, Alan W. Black, Zhou Yu code -1
BERTScore: Evaluating Text Generation with BERT Tianyi Zhang, Varsha Kishore, Felix Wu, Kilian Q. Weinberger, Yoav Artzi code -1
Neural Execution of Graph Algorithms Petar Velickovic, Rex Ying, Matilde Padovano, Raia Hadsell, Charles Blundell code -1
On the Need for Topology-Aware Generative Models for Manifold-Based Defenses Uyeong Jang, Susmit Jha, Somesh Jha code -1
FSNet: Compression of Deep Convolutional Neural Networks by Filter Summary Yingzhen Yang, Jiahui Yu, Nebojsa Jojic, Jun Huan, Thomas S. Huang code -1
Capsules with Inverted Dot-Product Attention Routing YaoHung Hubert Tsai, Nitish Srivastava, Hanlin Goh, Ruslan Salakhutdinov code -1
Composition-based Multi-Relational Graph Convolutional Networks Shikhar Vashishth, Soumya Sanyal, Vikram Nitin, Partha P. Talukdar code -1
Gradient-Based Neural DAG Learning Sébastien Lachapelle, Philippe Brouillard, Tristan Deleu, Simon LacosteJulien code -1
The Local Elasticity of Neural Networks Hangfeng He, Weijie J. Su code -1
Composing Task-Agnostic Policies with Deep Reinforcement Learning Ahmed Hussain Qureshi, Jacob J. Johnson, Yuzhe Qin, Taylor Henderson, Byron Boots, Michael C. Yip code -1
Convergence of Gradient Methods on Bilinear Zero-Sum Games Guojun Zhang, Yaoliang Yu code -1
Discovering Motor Programs by Recomposing Demonstrations Tanmay Shankar, Shubham Tulsiani, Lerrel Pinto, Abhinav Gupta code -1
Learning from Explanations with Neural Execution Tree Ziqi Wang, Yujia Qin, Wenxuan Zhou, Jun Yan, Qinyuan Ye, Leonardo Neves, Zhiyuan Liu, Xiang Ren code -1
Jelly Bean World: A Testbed for Never-Ending Learning Emmanouil Antonios Platanios, Abulhair Saparov, Tom M. Mitchell code -1
Coherent Gradients: An Approach to Understanding Generalization in Gradient Descent-based Optimization Satrajit Chatterjee code -1
Probabilistic Connection Importance Inference and Lossless Compression of Deep Neural Networks Xin Xing, Long Sha, Pengyu Hong, Zuofeng Shang, Jun S. Liu code -1
MEMO: A Deep Network for Flexible Combination of Episodic Memories Andrea Banino, Adrià Puigdomènech Badia, Raphael Köster, Martin J. Chadwick, Vinícius Flores Zambaldi, Demis Hassabis, Caswell Barry, Matthew M. Botvinick, Dharshan Kumaran, Charles Blundell code -1
Economy Statistical Recurrent Units For Inferring Nonlinear Granger Causality Saurabh Khanna, Vincent Y. F. Tan code -1
Bayesian Meta Sampling for Fast Uncertainty Adaptation Zhenyi Wang, Yang Zhao, Ping Yu, Ruiyi Zhang, Changyou Chen code -1
Non-Autoregressive Dialog State Tracking Hung Le, Richard Socher, Steven C. H. Hoi code -1
Extreme Tensoring for Low-Memory Preconditioning Xinyi Chen, Naman Agarwal, Elad Hazan, Cyril Zhang, Yi Zhang code -1
RNNs Incrementally Evolving on an Equilibrium Manifold: A Panacea for Vanishing and Exploding Gradients? Anil Kag, Ziming Zhang, Venkatesh Saligrama code -1
The Early Phase of Neural Network Training Jonathan Frankle, David J. Schwab, Ari S. Morcos code -1
NeurQuRI: Neural Question Requirement Inspector for Answerability Prediction in Machine Reading Comprehension Seohyun Back, Sai Chetan Chinthakindi, Akhil Kedia, Haejun Lee, Jaegul Choo code -1
Towards Stabilizing Batch Statistics in Backward Propagation of Batch Normalization Junjie Yan, Ruosi Wan, Xiangyu Zhang, Wei Zhang, Yichen Wei, Jian Sun code -1
Single Episode Policy Transfer in Reinforcement Learning Jiachen Yang, Brenden K. Petersen, Hongyuan Zha, Daniel M. Faissol code -1
Generalization through Memorization: Nearest Neighbor Language Models Urvashi Khandelwal, Omer Levy, Dan Jurafsky, Luke Zettlemoyer, Mike Lewis code -1
Transformer-XH: Multi-Evidence Reasoning with eXtra Hop Attention Chen Zhao, Chenyan Xiong, Corby Rosset, Xia Song, Paul N. Bennett, Saurabh Tiwary code -1
Synthesizing Programmatic Policies that Inductively Generalize Jeevana Priya Inala, Osbert Bastani, Zenna Tavares, Armando SolarLezama code -1
Decoding As Dynamic Programming For Recurrent Autoregressive Models Najam Zaidi, Trevor Cohn, Gholamreza Haffari code -1
Deep Double Descent: Where Bigger Models and More Data Hurt Preetum Nakkiran, Gal Kaplun, Yamini Bansal, Tristan Yang, Boaz Barak, Ilya Sutskever code -1
Intriguing Properties of Adversarial Training at Scale Cihang Xie, Alan L. Yuille code -1
Shifted and Squeezed 8-bit Floating Point format for Low-Precision Training of Deep Neural Networks Léopold Cambier, Anahita Bhiwandiwalla, Ting Gong, Oguz H. Elibol, Mehran Nekuii, Hanlin Tang code -1
Distributed Bandit Learning: Near-Optimal Regret with Efficient Communication Yuanhao Wang, Jiachen Hu, Xiaoyu Chen, Liwei Wang code -1
Biologically inspired sleep algorithm for increased generalization and adversarial robustness in deep neural networks Timothy Tadros, Giri P. Krishnan, Ramyaa Ramyaa, Maxim Bazhenov code -1
A Closer Look at the Optimization Landscapes of Generative Adversarial Networks Hugo Berard, Gauthier Gidel, Amjad Almahairi, Pascal Vincent, Simon LacosteJulien code -1
On the Global Convergence of Training Deep Linear ResNets Difan Zou, Philip M. Long, Quanquan Gu code -1
Towards a Deep Network Architecture for Structured Smoothness Haroun Habeeb, Oluwasanmi Koyejo code -1
Revisiting Self-Training for Neural Sequence Generation Junxian He, Jiatao Gu, Jiajun Shen, Marc'Aurelio Ranzato code -1
Denoising and Regularization via Exploiting the Structural Bias of Convolutional Generators Reinhard Heckel, Mahdi Soltanolkotabi code -1
Variational Autoencoders for Highly Multivariate Spatial Point Processes Intensities Baichuan Yuan, Xiaowei Wang, Jianxin Ma, Chang Zhou, Andrea L. Bertozzi, Hongxia Yang code -1
Model-Augmented Actor-Critic: Backpropagating through Paths Ignasi Clavera, Yao Fu, Pieter Abbeel code -1
LambdaNet: Probabilistic Type Inference using Graph Neural Networks Jiayi Wei, Maruth Goyal, Greg Durrett, Isil Dillig code -1
From Inference to Generation: End-to-end Fully Self-supervised Generation of Human Face from Speech HyeongSeok Choi, Changdae Park, Kyogu Lee code -1
Learning from Unlabelled Videos Using Contrastive Predictive Neural 3D Mapping Adam W. Harley, Shrinidhi Kowshika Lakshmikanth, Fangyu Li, Xian Zhou, HsiaoYu Fish Tung, Katerina Fragkiadaki code -1
Decoupling Representation and Classifier for Long-Tailed Recognition Bingyi Kang, Saining Xie, Marcus Rohrbach, Zhicheng Yan, Albert Gordo, Jiashi Feng, Yannis Kalantidis code -1
Robust Reinforcement Learning for Continuous Control with Model Misspecification Daniel J. Mankowitz, Nir Levine, Rae Jeong, Abbas Abdolmaleki, Jost Tobias Springenberg, Yuanyuan Shi, Jackie Kay, Todd Hester, Timothy A. Mann, Martin A. Riedmiller code -1
Cross-lingual Alignment vs Joint Training: A Comparative Study and A Simple Unified Framework Zirui Wang, Jiateng Xie, Ruochen Xu, Yiming Yang, Graham Neubig, Jaime G. Carbonell code -1
Training Recurrent Neural Networks Online by Learning Explicit State Variables Somjit Nath, Vincent Liu, Alan Chan, Xin Li, Adam White, Martha White code -1
Uncertainty-guided Continual Learning with Bayesian Neural Networks Sayna Ebrahimi, Mohamed Elhoseiny, Trevor Darrell, Marcus Rohrbach code -1
Curriculum Loss: Robust Learning and Generalization against Label Corruption Yueming Lyu, Ivor W. Tsang code -1
Picking Winning Tickets Before Training by Preserving Gradient Flow Chaoqi Wang, Guodong Zhang, Roger B. Grosse code -1
Generative Models for Effective ML on Private, Decentralized Datasets Sean Augenstein, H. Brendan McMahan, Daniel Ramage, Swaroop Ramaswamy, Peter Kairouz, Mingqing Chen, Rajiv Mathews, Blaise Agüera y Arcas code -1
Inductive representation learning on temporal graphs Da Xu, Chuanwei Ruan, Evren Körpeoglu, Sushant Kumar, Kannan Achan code -1
BatchEnsemble: an Alternative Approach to Efficient Ensemble and Lifelong Learning Yeming Wen, Dustin Tran, Jimmy Ba code -1
Towards neural networks that provably know when they don't know Alexander Meinke, Matthias Hein code -1
Iterative energy-based projection on a normal data manifold for anomaly localization David Dehaene, Oriel Frigo, Sébastien Combrexelle, Pierre Eline code -1
Towards Stable and Efficient Training of Verifiably Robust Neural Networks Huan Zhang, Hongge Chen, Chaowei Xiao, Sven Gowal, Robert Stanforth, Bo Li, Duane S. Boning, ChoJui Hsieh code -1
Frequency-based Search-control in Dyna Yangchen Pan, Jincheng Mei, Amirmassoud Farahmand code -1
Learning representations for binary-classification without backpropagation Mathias Lechner code -1
Polylogarithmic width suffices for gradient descent to achieve arbitrarily small test error with shallow ReLU networks Ziwei Ji, Matus Telgarsky code -1
Physics-aware Difference Graph Networks for Sparsely-Observed Dynamics Sungyong Seo, Chuizheng Meng, Yan Liu code -1
HiLLoC: lossless image compression with hierarchical latent variable models James Townsend, Thomas Bird, Julius Kunze, David Barber code -1
IMPACT: Importance Weighted Asynchronous Architectures with Clipped Target Networks Michael Luo, Jiahao Yao, Richard Liaw, Eric Liang, Ion Stoica code -1
On Bonus Based Exploration Methods In The Arcade Learning Environment Adrien Ali Taïga, William Fedus, Marlos C. Machado, Aaron C. Courville, Marc G. Bellemare code -1
Adaptive Correlated Monte Carlo for Contextual Categorical Sequence Generation Xinjie Fan, Yizhe Zhang, Zhendong Wang, Mingyuan Zhou code -1
Smoothness and Stability in GANs Casey Chu, Kentaro Minami, Kenji Fukumizu code -1
SNOW: Subscribing to Knowledge via Channel Pooling for Transfer & Lifelong Learning of Convolutional Neural Networks Chungkuk Yoo, Bumsoo Kang, Minsik Cho code -1
Empirical Studies on the Properties of Linear Regions in Deep Neural Networks Xiao Zhang, Dongrui Wu code -1
Black-box Off-policy Estimation for Infinite-Horizon Reinforcement Learning Ali Mousavi, Lihong Li, Qiang Liu, Denny Zhou code -1
PairNorm: Tackling Oversmoothing in GNNs Lingxiao Zhao, Leman Akoglu code -1
Unsupervised Clustering using Pseudo-semi-supervised Learning Divam Gupta, Ramachandran Ramjee, Nipun Kwatra, Muthian Sivathanu code -1
Simple and Effective Regularization Methods for Training on Noisily Labeled Data with Generalization Guarantee Wei Hu, Zhiyuan Li, Dingli Yu code -1
Controlling generative models with continuous factors of variations Antoine Plumerault, Hervé Le Borgne, Céline Hudelot code -1
Symplectic ODE-Net: Learning Hamiltonian Dynamics with Control Yaofeng Desmond Zhong, Biswadip Dey, Amit Chakraborty code -1
Understanding l4-based Dictionary Learning: Interpretation, Stability, and Robustness Yuexiang Zhai, Hermish Mehta, Zhengyuan Zhou, Yi Ma code -1
Quantum Algorithms for Deep Convolutional Neural Networks Iordanis Kerenidis, Jonas Landman, Anupam Prakash code -1
Self-Supervised Learning of Appliance Usage ChenYu Hsu, Abbas Zeitoun, GuangHe Lee, Dina Katabi, Tommi S. Jaakkola code -1
Deep Graph Matching Consensus Matthias Fey, Jan Eric Lenssen, Christopher Morris, Jonathan Masci, Nils M. Kriege code -1
Beyond Linearization: On Quadratic and Higher-Order Approximation of Wide Neural Networks Yu Bai, Jason D. Lee code -1
Dynamic Sparse Training: Find Efficient Sparse Network From Scratch With Trainable Masked Layers Junjie Liu, Zhe Xu, Runbin Shi, Ray C. C. Cheung, Hayden KwokHay So code -1
Triple Wins: Boosting Accuracy, Robustness and Efficiency Together by Enabling Input-Adaptive Inference TingKuei Hu, Tianlong Chen, Haotao Wang, Zhangyang Wang code -1
Neural Policy Gradient Methods: Global Optimality and Rates of Convergence Lingxiao Wang, Qi Cai, Zhuoran Yang, Zhaoran Wang code -1
Double Neural Counterfactual Regret Minimization Hui Li, Kailiang Hu, Shaohua Zhang, Yuan Qi, Le Song code -1
GraphAF: a Flow-based Autoregressive Model for Molecular Graph Generation Chence Shi, Minkai Xu, Zhaocheng Zhu, Weinan Zhang, Ming Zhang, Jian Tang code -1
The Gambler's Problem and Beyond Baoxiang Wang, Shuai Li, Jiajin Li, Siu On Chan code -1
Multilingual Alignment of Contextual Word Representations Steven Cao, Nikita Kitaev, Dan Klein code -1
The Curious Case of Neural Text Degeneration Ari Holtzman, Jan Buys, Li Du, Maxwell Forbes, Yejin Choi code -1
Graph Convolutional Reinforcement Learning Jiechuan Jiang, Chen Dun, Tiejun Huang, Zongqing Lu code -1
Meta-Learning Deep Energy-Based Memory Models Sergey Bartunov, Jack W. Rae, Simon Osindero, Timothy P. Lillicrap code -1
Exploratory Not Explanatory: Counterfactual Analysis of Saliency Maps for Deep Reinforcement Learning Akanksha Atrey, Kaleigh Clary, David D. Jensen code -1
Fast Neural Network Adaptation via Parameter Remapping and Architecture Search Jiemin Fang, Yuzhu Sun, Kangjian Peng, Qian Zhang, Yuan Li, Wenyu Liu, Xinggang Wang code -1
Guiding Program Synthesis by Learning to Generate Examples Larissa Laich, Pavol Bielik, Martin T. Vechev code -1
SNODE: Spectral Discretization of Neural ODEs for System Identification Alessio Quaglino, Marco Gallieri, Jonathan Masci, Jan Koutník code -1
Generalized Convolutional Forest Networks for Domain Generalization and Visual Recognition Jongbin Ryu, Gitaek Kwon, MingHsuan Yang, Jongwoo Lim code -1
Once-for-All: Train One Network and Specialize it for Efficient Deployment Han Cai, Chuang Gan, Tianzhe Wang, Zhekai Zhang, Song Han code -1
Multi-Agent Interactions Modeling with Correlated Policies Minghuan Liu, Ming Zhou, Weinan Zhang, Yuzheng Zhuang, Jun Wang, Wulong Liu, Yong Yu code -1
PCMC-Net: Feature-based Pairwise Choice Markov Chains Alix Lhéritier code -1
Implementing Inductive bias for different navigation tasks through diverse RNN attrractors Tie Xu, Omri Barak code -1
Query2box: Reasoning over Knowledge Graphs in Vector Space Using Box Embeddings Hongyu Ren, Weihua Hu, Jure Leskovec code -1
Rethinking the Hyperparameters for Fine-tuning Hao Li, Pratik Chaudhari, Hao Yang, Michael Lam, Avinash Ravichandran, Rahul Bhotika, Stefano Soatto code -1
Plug and Play Language Models: A Simple Approach to Controlled Text Generation Sumanth Dathathri, Andrea Madotto, Janice Lan, Jane Hung, Eric Frank, Piero Molino, Jason Yosinski, Rosanne Liu code -1
Provable Benefit of Orthogonal Initialization in Optimizing Deep Linear Networks Wei Hu, Lechao Xiao, Jeffrey Pennington code -1
RGBD-GAN: Unsupervised 3D Representation Learning From Natural Image Datasets via RGBD Image Synthesis Atsuhiro Noguchi, Tatsuya Harada code -1
Towards Verified Robustness under Text Deletion Interventions Johannes Welbl, PoSen Huang, Robert Stanforth, Sven Gowal, Krishnamurthy (Dj) Dvijotham, Martin Szummer, Pushmeet Kohli code -1
Jacobian Adversarially Regularized Networks for Robustness Alvin Chan, Yi Tay, YewSoon Ong, Jie Fu code -1
Thinking While Moving: Deep Reinforcement Learning with Concurrent Control Ted Xiao, Eric Jang, Dmitry Kalashnikov, Sergey Levine, Julian Ibarz, Karol Hausman, Alexander Herzog code -1
Evolutionary Population Curriculum for Scaling Multi-Agent Reinforcement Learning Qian Long, Zihan Zhou, Abhinav Gupta, Fei Fang, Yi Wu, Xiaolong Wang code -1
ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators Kevin Clark, MinhThang Luong, Quoc V. Le, Christopher D. Manning code -1
Environmental drivers of systematicity and generalization in a situated agent Felix Hill, Andrew K. Lampinen, Rosalia Schneider, Stephen Clark, Matthew M. Botvinick, James L. McClelland, Adam Santoro code -1
Abstract Diagrammatic Reasoning with Multiplex Graph Networks Duo Wang, Mateja Jamnik, Pietro Liò code -1
A Baseline for Few-Shot Image Classification Guneet Singh Dhillon, Pratik Chaudhari, Avinash Ravichandran, Stefano Soatto code -1
Learning to Retrieve Reasoning Paths over Wikipedia Graph for Question Answering Akari Asai, Kazuma Hashimoto, Hannaneh Hajishirzi, Richard Socher, Caiming Xiong code -1
Padé Activation Units: End-to-end Learning of Flexible Activation Functions in Deep Networks Alejandro Molina, Patrick Schramowski, Kristian Kersting code -1
A Framework for robustness Certification of Smoothed Classifiers using F-Divergences Krishnamurthy (Dj) Dvijotham, Jamie Hayes, Borja Balle, J. Zico Kolter, Chongli Qin, András György, Kai Xiao, Sven Gowal, Pushmeet Kohli code -1
Contrastive Representation Distillation Yonglong Tian, Dilip Krishnan, Phillip Isola code -1
Certified Defenses for Adversarial Patches Pingyeh Chiang, Renkun Ni, Ahmed Abdelkader, Chen Zhu, Christoph Studer, Tom Goldstein code -1
Sample Efficient Policy Gradient Methods with Recursive Variance Reduction Pan Xu, Felicia Gao, Quanquan Gu code -1
Deep Symbolic Superoptimization Without Human Knowledge Hui Shi, Yang Zhang, Xinyun Chen, Yuandong Tian, Jishen Zhao code -1
Explain Your Move: Understanding Agent Actions Using Specific and Relevant Feature Attribution Nikaash Puri, Sukriti Verma, Piyush Gupta, Dhruv Kayastha, Shripad Deshmukh, Balaji Krishnamurthy, Sameer Singh code -1
Universal Approximation with Certified Networks Maximilian Baader, Matthew Mirman, Martin T. Vechev code -1
Measuring and Improving the Use of Graph Information in Graph Neural Networks Yifan Hou, Jie Zhang, James Cheng, Kaili Ma, Richard T. B. Ma, Hongzhi Chen, MingChang Yang code -1
State-only Imitation with Transition Dynamics Mismatch Tanmay Gangwani, Jian Peng code -1
Adversarial AutoAugment Xinyu Zhang, Qiang Wang, Jian Zhang, Zhao Zhong code -1
Meta Dropout: Learning to Perturb Latent Features for Generalization Haebeom Lee, Taewook Nam, Eunho Yang, Sung Ju Hwang code -1
Rényi Fair Inference Sina Baharlouei, Maher Nouiehed, Ahmad Beirami, Meisam Razaviyayn code -1
Learning transport cost from subset correspondence Ruishan Liu, Akshay Balsubramani, James Zou code -1
BlockSwap: Fisher-guided Block Substitution for Network Compression on a Budget Jack Turner, Elliot J. Crowley, Michael F. P. O'Boyle, Amos J. Storkey, Gavin Gray code -1
Variance Reduction With Sparse Gradients Melih Elibol, Lihua Lei, Michael I. Jordan code -1
Abductive Commonsense Reasoning Chandra Bhagavatula, Ronan Le Bras, Chaitanya Malaviya, Keisuke Sakaguchi, Ari Holtzman, Hannah Rashkin, Doug Downey, Wentau Yih, Yejin Choi code -1
Discrepancy Ratio: Evaluating Model Performance When Even Experts Disagree on the Truth Igor Lovchinsky, Alon Daks, Israel Malkin, Pouya Samangouei, Ardavan Saeedi, Yang Liu, Swami Sankaranarayanan, Tomer Gafner, Ben Sternlieb, Patrick Maher, Nathan Silberman code -1
Weakly Supervised Disentanglement with Guarantees Rui Shu, Yining Chen, Abhishek Kumar, Stefano Ermon, Ben Poole code -1
Nesterov Accelerated Gradient and Scale Invariance for Adversarial Attacks Jiadong Lin, Chuanbiao Song, Kun He, Liwei Wang, John E. Hopcroft code -1
Fantastic Generalization Measures and Where to Find Them Yiding Jiang, Behnam Neyshabur, Hossein Mobahi, Dilip Krishnan, Samy Bengio code -1
Robustness Verification for Transformers Zhouxing Shi, Huan Zhang, KaiWei Chang, Minlie Huang, ChoJui Hsieh code -1
Network Randomization: A Simple Technique for Generalization in Deep Reinforcement Learning Kimin Lee, Kibok Lee, Jinwoo Shin, Honglak Lee code -1
Tensor Decompositions for Temporal Knowledge Base Completion Timothée Lacroix, Guillaume Obozinski, Nicolas Usunier code -1
On Universal Equivariant Set Networks Nimrod Segol, Yaron Lipman code -1
Provable robustness against all adversarial $l_p$-perturbations for $p\geq 1$ Francesco Croce, Matthias Hein code -1
Don't Use Large Mini-batches, Use Local SGD Tao Lin, Sebastian U. Stich, Kumar Kshitij Patel, Martin Jaggi code -1
Kernel of CycleGAN as a principal homogeneous space Nikita Moriakov, Jonas Adler, Jonas Teuwen code -1
Distributionally Robust Neural Networks Shiori Sagawa, Pang Wei Koh, Tatsunori B. Hashimoto, Percy Liang code -1
On Solving Minimax Optimization Locally: A Follow-the-Ridge Approach Yuanhao Wang, Guodong Zhang, Jimmy Ba code -1
A Neural Dirichlet Process Mixture Model for Task-Free Continual Learning Soochan Lee, Junsoo Ha, Dongsu Zhang, Gunhee Kim code -1
Hyper-SAGNN: a self-attention based graph neural network for hypergraphs Ruochi Zhang, Yuesong Zou, Jian Ma code -1
Neural Epitome Search for Architecture-Agnostic Network Compression Daquan Zhou, Xiaojie Jin, Qibin Hou, Kaixin Wang, Jianchao Yang, Jiashi Feng code -1
On the Equivalence between Positional Node Embeddings and Structural Graph Representations Balasubramaniam Srinivasan, Bruno Ribeiro code -1
Probability Calibration for Knowledge Graph Embedding Models Pedro Tabacof, Luca Costabello code -1
Why Not to Use Zero Imputation? Correcting Sparsity Bias in Training Neural Networks Joonyoung Yi, Juhyuk Lee, Kwang Joon Kim, Sung Ju Hwang, Eunho Yang code -1
DropEdge: Towards Deep Graph Convolutional Networks on Node Classification Yu Rong, Wenbing Huang, Tingyang Xu, Junzhou Huang code -1
Masked Based Unsupervised Content Transfer Ron Mokady, Sagie Benaim, Lior Wolf, Amit Bermano code -1
U-GAT-IT: Unsupervised Generative Attentional Networks with Adaptive Layer-Instance Normalization for Image-to-Image Translation Junho Kim, Minjae Kim, Hyeonwoo Kang, Kwanghee Lee code -1
Inductive and Unsupervised Representation Learning on Graph Structured Objects Lichen Wang, Bo Zong, Qianqian Ma, Wei Cheng, Jingchao Ni, Wenchao Yu, Yanchi Liu, Dongjin Song, Haifeng Chen, Yun Fu code -1
Batch-shaping for learning conditional channel gated networks Babak Ehteshami Bejnordi, Tijmen Blankevoort, Max Welling code -1
Learning Robust Representations via Multi-View Information Bottleneck Marco Federici, Anjan Dutta, Patrick Forré, Nate Kushman, Zeynep Akata code -1
Deep probabilistic subsampling for task-adaptive compressed sensing Iris A. M. Huijben, Bastiaan S. Veeling, Ruud J. G. van Sloun code -1
Robust anomaly detection and backdoor attack detection via differential privacy Min Du, Ruoxi Jia, Dawn Song code -1
Learning to Guide Random Search Ozan Sener, Vladlen Koltun code -1
Lagrangian Fluid Simulation with Continuous Convolutions Benjamin Ummenhofer, Lukas Prantl, Nils Thuerey, Vladlen Koltun code -1
Reinforced Genetic Algorithm Learning for Optimizing Computation Graphs Aditya Paliwal, Felix Gimeno, Vinod Nair, Yujia Li, Miles Lubin, Pushmeet Kohli, Oriol Vinyals code -1
Compressive Transformers for Long-Range Sequence Modelling Jack W. Rae, Anna Potapenko, Siddhant M. Jayakumar, Chloe Hillier, Timothy P. Lillicrap code -1
A Stochastic Derivative Free Optimization Method with Momentum Eduard Gorbunov, Adel Bibi, Ozan Sener, El Houcine Bergou, Peter Richtárik code -1
Understanding and Improving Information Transfer in Multi-Task Learning Sen Wu, Hongyang R. Zhang, Christopher Ré code -1
Learning To Explore Using Active Neural SLAM Devendra Singh Chaplot, Dhiraj Gandhi, Saurabh Gupta, Abhinav Gupta, Ruslan Salakhutdinov code -1
EMPIR: Ensembles of Mixed Precision Deep Networks for Increased Robustness Against Adversarial Attacks Sanchari Sen, Balaraman Ravindran, Anand Raghunathan code -1
Quantifying Point-Prediction Uncertainty in Neural Networks via Residual Estimation with an I/O Kernel Xin Qiu, Elliot Meyerson, Risto Miikkulainen code -1
B-Spline CNNs on Lie groups Erik J. Bekkers code -1
Neural Outlier Rejection for Self-Supervised Keypoint Learning Jiexiong Tang, Hanme Kim, Vitor Guizilini, Sudeep Pillai, Rares Ambrus code -1
Reducing Transformer Depth on Demand with Structured Dropout Angela Fan, Edouard Grave, Armand Joulin code -1
Cross-Lingual Ability of Multilingual BERT: An Empirical Study Karthikeyan K, Zihan Wang, Stephen Mayhew, Dan Roth code -1
SPACE: Unsupervised Object-Oriented Scene Representation via Spatial Attention and Decomposition Zhixuan Lin, YiFu Wu, Skand Vishwanath Peri, Weihao Sun, Gautam Singh, Fei Deng, Jindong Jiang, Sungjin Ahn code -1
RIDE: Rewarding Impact-Driven Exploration for Procedurally-Generated Environments Roberta Raileanu, Tim Rocktäschel code -1
Low-dimensional statistical manifold embedding of directed graphs Thorben Funke, Tian Guo, Alen Lancic, Nino AntulovFantulin code -1
Efficient Probabilistic Logic Reasoning with Graph Neural Networks Yuyu Zhang, Xinshi Chen, Yuan Yang, Arun Ramamurthy, Bo Li, Yuan Qi, Le Song code -1
GraphSAINT: Graph Sampling Based Inductive Learning Method Hanqing Zeng, Hongkuan Zhou, Ajitesh Srivastava, Rajgopal Kannan, Viktor K. Prasanna code -1
You Only Train Once: Loss-Conditional Training of Deep Networks Alexey Dosovitskiy, Josip Djolonga code -1
Projection-Based Constrained Policy Optimization TsungYen Yang, Justinian Rosca, Karthik Narasimhan, Peter J. Ramadge code -1
Infinite-Horizon Differentiable Model Predictive Control Sebastian East, Marco Gallieri, Jonathan Masci, Jan Koutník, Mark Cannon code -1
Combining Q-Learning and Search with Amortized Value Estimates Jessica B. Hamrick, Victor Bapst, Alvaro SanchezGonzalez, Tobias Pfaff, Theophane Weber, Lars Buesing, Peter W. Battaglia code -1
Training Generative Adversarial Networks from Incomplete Observations using Factorised Discriminators Daniel Stoller, Sebastian Ewert, Simon Dixon code -1
Decentralized Deep Learning with Arbitrary Communication Compression Anastasia Koloskova, Tao Lin, Sebastian U. Stich, Martin Jaggi code -1
Toward Evaluating Robustness of Deep Reinforcement Learning with Continuous Control TsuiWei Weng, Krishnamurthy (Dj) Dvijotham, Jonathan Uesato, Kai Xiao, Sven Gowal, Robert Stanforth, Pushmeet Kohli code -1
Gradient $\ell_1$ Regularization for Quantization Robustness Milad Alizadeh, Arash Behboodi, Mart van Baalen, Christos Louizos, Tijmen Blankevoort, Max Welling code -1
SpikeGrad: An ANN-equivalent Computation Model for Implementing Backpropagation with Spikes Johannes C. Thiele, Olivier Bichler, Antoine Dupret code -1
On the Relationship between Self-Attention and Convolutional Layers JeanBaptiste Cordonnier, Andreas Loukas, Martin Jaggi code -1
Learning-Augmented Data Stream Algorithms Tanqiu Jiang, Yi Li, Honghao Lin, Yisong Ruan, David P. Woodruff code -1
Structured Object-Aware Physics Prediction for Video Modeling and Planning Jannik Kossen, Karl Stelzner, Marcel Hussing, Claas Voelcker, Kristian Kersting code -1
Incorporating BERT into Neural Machine Translation Jinhua Zhu, Yingce Xia, Lijun Wu, Di He, Tao Qin, Wengang Zhou, Houqiang Li, TieYan Liu code -1
MMA Training: Direct Input Space Margin Maximization through Adversarial Training Gavin Weiguang Ding, Yash Sharma, Kry Yik Chau Lui, Ruitong Huang code -1
Infinite-horizon Off-Policy Policy Evaluation with Multiple Behavior Policies Xinyun Chen, Lu Wang, Yizhe Hang, Heng Ge, Hongyuan Zha code -1
vq-wav2vec: Self-Supervised Learning of Discrete Speech Representations Alexei Baevski, Steffen Schneider, Michael Auli code -1
Meta-learning curiosity algorithms Ferran Alet, Martin F. Schneider, Tomás LozanoPérez, Leslie Pack Kaelbling code -1
Making Efficient Use of Demonstrations to Solve Hard Exploration Problems Çaglar Gülçehre, Tom Le Paine, Bobak Shahriari, Misha Denil, Matt Hoffman, Hubert Soyer, Richard Tanburn, Steven Kapturowski, Neil C. Rabinowitz, Duncan Williams, Gabriel BarthMaron, Ziyu Wang, Nando de Freitas, Worlds Team code -1
VariBAD: A Very Good Method for Bayes-Adaptive Deep RL via Meta-Learning Luisa M. Zintgraf, Kyriacos Shiarlis, Maximilian Igl, Sebastian Schulze, Yarin Gal, Katja Hofmann, Shimon Whiteson code -1
Lookahead: A Far-sighted Alternative of Magnitude-based Pruning Sejun Park, Jaeho Lee, Sangwoo Mo, Jinwoo Shin code -1
Spike-based causal inference for weight alignment Jordan Guerguiev, Konrad P. Körding, Blake A. Richards code -1
Empirical Bayes Transductive Meta-Learning with Synthetic Gradients Shell Xu Hu, Pablo Garcia Moreno, Yang Xiao, Xi Shen, Guillaume Obozinski, Neil D. Lawrence, Andreas C. Damianou code -1
Keep Doing What Worked: Behavior Modelling Priors for Offline Reinforcement Learning Noah Y. Siegel, Jost Tobias Springenberg, Felix Berkenkamp, Abbas Abdolmaleki, Michael Neunert, Thomas Lampe, Roland Hafner, Nicolas Heess, Martin A. Riedmiller code -1
Understanding the Limitations of Conditional Generative Models Ethan Fetaya, JörnHenrik Jacobsen, Will Grathwohl, Richard S. Zemel code -1
Demystifying Inter-Class Disentanglement Aviv Gabbay, Yedid Hoshen code -1
Mixed-curvature Variational Autoencoders Ondrej Skopek, OctavianEugen Ganea, Gary Bécigneul code -1
BinaryDuo: Reducing Gradient Mismatch in Binary Activation Network by Coupling Binary Activations Hyungjun Kim, Kyungsu Kim, Jinseok Kim, JaeJoon Kim code -1
Model-based reinforcement learning for biological sequence design Christof Angermüller, David Dohan, David Belanger, Ramya Deshpande, Kevin Murphy, Lucy J. Colwell code -1
BayesOpt Adversarial Attack Binxin Ru, Adam D. Cobb, Arno Blaas, Yarin Gal code -1
Meta Reinforcement Learning with Autonomous Inference of Subtask Dependencies Sungryull Sohn, Hyunjae Woo, Jongwook Choi, Honglak Lee code -1
Hypermodels for Exploration Vikranth Dwaracherla, Xiuyuan Lu, Morteza Ibrahimi, Ian Osband, Zheng Wen, Benjamin Van Roy code -1
RaPP: Novelty Detection with Reconstruction along Projection Pathway Ki Hyun Kim, Sangwoo Shim, Yongsub Lim, Jongseob Jeon, Jeongwoo Choi, Byungchan Kim, Andre S. Yoon code -1
Dynamics-Aware Embeddings William F. Whitney, Rajat Agarwal, Kyunghyun Cho, Abhinav Gupta code -1
Functional Regularisation for Continual Learning with Gaussian Processes Michalis K. Titsias, Jonathan Schwarz, Alexander G. de G. Matthews, Razvan Pascanu, Yee Whye Teh code -1
You CAN Teach an Old Dog New Tricks! On Training Knowledge Graph Embeddings Daniel Ruffinelli, Samuel Broscheit, Rainer Gemulla code -1
AdvectiveNet: An Eulerian-Lagrangian Fluidic Reservoir for Point Cloud Processing Xingzhe He, Helen Lu Cao, Bo Zhu code -1
Never Give Up: Learning Directed Exploration Strategies Adrià Puigdomènech Badia, Pablo Sprechmann, Alex Vitvitskyi, Zhaohan Daniel Guo, Bilal Piot, Steven Kapturowski, Olivier Tieleman, Martín Arjovsky, Alexander Pritzel, Andrew Bolt, Charles Blundell code -1
Fair Resource Allocation in Federated Learning Tian Li, Maziar Sanjabi, Ahmad Beirami, Virginia Smith code -1
Smooth markets: A basic mechanism for organizing gradient-based learners David Balduzzi, Wojciech M. Czarnecki, Tom Anthony, Ian M. Gemp, Edward Hughes, Joel Z. Leibo, Georgios Piliouras, Thore Graepel code -1
StructBERT: Incorporating Language Structures into Pre-training for Deep Language Understanding Wei Wang, Bin Bi, Ming Yan, Chen Wu, Jiangnan Xia, Zuyi Bao, Liwei Peng, Luo Si code -1
Training binary neural networks with real-to-binary convolutions Brais Martínez, Jing Yang, Adrian Bulat, Georgios Tzimiropoulos code -1
Permutation Equivariant Models for Compositional Generalization in Language Jonathan Gordon, David LopezPaz, Marco Baroni, Diane Bouchacourt code -1
Continual learning with hypernetworks Johannes von Oswald, Christian Henning, João Sacramento, Benjamin F. Grewe code -1
Phase Transitions for the Information Bottleneck in Representation Learning Tailin Wu, Ian S. Fischer code -1
Variational Template Machine for Data-to-Text Generation Rong Ye, Wenxian Shi, Hao Zhou, Zhongyu Wei, Lei Li code -1
Memory-Based Graph Networks Amir Hosein Khas Ahmadi, Kaveh Hassani, Parsa Moradi, Leo Lee, Quaid Morris code -1
AugMix: A Simple Data Processing Method to Improve Robustness and Uncertainty Dan Hendrycks, Norman Mu, Ekin Dogus Cubuk, Barret Zoph, Justin Gilmer, Balaji Lakshminarayanan code -1
AtomNAS: Fine-Grained End-to-End Neural Architecture Search Jieru Mei, Yingwei Li, Xiaochen Lian, Xiaojie Jin, Linjie Yang, Alan L. Yuille, Jianchao Yang code -1
Residual Energy-Based Models for Text Generation Yuntian Deng, Anton Bakhtin, Myle Ott, Arthur Szlam, Marc'Aurelio Ranzato code -1
A closer look at the approximation capabilities of neural networks Kai Fong Ernest Chong code -1
Deep Audio Priors Emerge From Harmonic Convolutional Networks Zhoutong Zhang, Yunyun Wang, Chuang Gan, Jiajun Wu, Joshua B. Tenenbaum, Antonio Torralba, William T. Freeman code -1
Expected Information Maximization: Using the I-Projection for Mixture Density Estimation Philipp Becker, Oleg Arenz, Gerhard Neumann code -1
A Meta-Transfer Objective for Learning to Disentangle Causal Mechanisms Yoshua Bengio, Tristan Deleu, Nasim Rahaman, Nan Rosemary Ke, Sébastien Lachapelle, Olexa Bilaniuk, Anirudh Goyal, Christopher J. Pal code -1
On the interaction between supervision and self-play in emergent communication Ryan Lowe, Abhinav Gupta, Jakob N. Foerster, Douwe Kiela, Joelle Pineau code -1
Dynamic Model Pruning with Feedback Tao Lin, Sebastian U. Stich, Luis Barba, Daniil Dmitriev, Martin Jaggi code -1
Latent Normalizing Flows for Many-to-Many Cross-Domain Mappings Shweta Mahajan, Iryna Gurevych, Stefan Roth code -1
Transferring Optimality Across Data Distributions via Homotopy Methods Matilde Gargiani, Andrea Zanelli, Quoc TranDinh, Moritz Diehl, Frank Hutter code -1
Regularizing activations in neural networks via distribution matching with the Wasserstein metric Taejong Joo, Donggu Kang, Byunghoon Kim code -1
Mutual Information Gradient Estimation for Representation Learning Liangjian Wen, Yiji Zhou, Lirong He, Mingyuan Zhou, Zenglin Xu code -1
Lite Transformer with Long-Short Range Attention Zhanghao Wu, Zhijian Liu, Ji Lin, Yujun Lin, Song Han code -1
A Function Space View of Bounded Norm Infinite Width ReLU Nets: The Multivariate Case Greg Ongie, Rebecca Willett, Daniel Soudry, Nathan Srebro code -1
Adversarial Lipschitz Regularization Dávid Terjék code -1
Compositional Language Continual Learning Yuanpeng Li, Liang Zhao, Kenneth Church, Mohamed Elhoseiny code -1
End to End Trainable Active Contours via Differentiable Rendering Shir Gur, Tal Shaharabany, Lior Wolf code -1
Provable Filter Pruning for Efficient Neural Networks Lucas Liebenwein, Cenk Baykal, Harry Lang, Dan Feldman, Daniela Rus code -1
Effect of Activation Functions on the Training of Overparametrized Neural Nets Abhishek Panigrahi, Abhishek Shetty, Navin Goyal code -1
Lipschitz constant estimation of Neural Networks via sparse polynomial optimization Fabian Latorre Gómez, Paul Rolland, Volkan Cevher code -1
State Alignment-based Imitation Learning Fangchen Liu, Zhan Ling, Tongzhou Mu, Hao Su code -1
Learning to Group: A Bottom-Up Framework for 3D Part Discovery in Unseen Categories Tiange Luo, Kaichun Mo, Zhiao Huang, Jiarui Xu, Siyu Hu, Liwei Wang, Hao Su code -1
Discriminative Particle Filter Reinforcement Learning for Complex Partial observations Xiao Ma, Péter Karkus, David Hsu, Wee Sun Lee, Nan Ye code -1
Unrestricted Adversarial Examples via Semantic Manipulation Anand Bhattad, Min Jin Chong, Kaizhao Liang, Bo Li, David A. Forsyth code -1
Classification-Based Anomaly Detection for General Data Liron Bergman, Yedid Hoshen code -1
Scale-Equivariant Steerable Networks Ivan Sosnovik, Michal Szmaja, Arnold W. M. Smeulders code -1
On Generalization Error Bounds of Noisy Gradient Methods for Non-Convex Learning Jian Li, Xuanyuan Luo, Mingda Qiao code -1
Consistency Regularization for Generative Adversarial Networks Han Zhang, Zizhao Zhang, Augustus Odena, Honglak Lee code -1
Differentiable learning of numerical rules in knowledge graphs PoWei Wang, Daria Stepanova, Csaba Domokos, J. Zico Kolter code -1
Learning to Move with Affordance Maps William Qi, Ravi Teja Mullapudi, Saurabh Gupta, Deva Ramanan code -1
Neural tangent kernels, transportation mappings, and universal approximation Ziwei Ji, Matus Telgarsky, Ruicheng Xian code -1
SCALOR: Generative World Models with Scalable Object Representations Jindong Jiang, Sepehr Janghorbani, Gerard de Melo, Sungjin Ahn code -1
Prediction Poisoning: Towards Defenses Against DNN Model Stealing Attacks Tribhuvanesh Orekondy, Bernt Schiele, Mario Fritz code -1
Domain Adaptive Multibranch Networks Róger BermúdezChacón, Mathieu Salzmann, Pascal Fua code -1
DiffTaichi: Differentiable Programming for Physical Simulation Yuanming Hu, Luke Anderson, TzuMao Li, Qi Sun, Nathan Carr, Jonathan RaganKelley, Frédo Durand code -1
Pitfalls of In-Domain Uncertainty Estimation and Ensembling in Deep Learning Arsenii Ashukha, Alexander Lyzhov, Dmitry Molchanov, Dmitry P. Vetrov code -1
Episodic Reinforcement Learning with Associative Memory Guangxiang Zhu, Zichuan Lin, Guangwen Yang, Chongjie Zhang code -1
Sub-policy Adaptation for Hierarchical Reinforcement Learning Alexander C. Li, Carlos Florensa, Ignasi Clavera, Pieter Abbeel code -1
Critical initialisation in continuous approximations of binary neural networks George Stamatescu, Federica Gerace, Carlo Lucibello, Ian G. Fuss, Langford B. White code -1
Deep Orientation Uncertainty Learning based on a Bingham Loss Igor Gilitschenski, Roshni Sahoo, Wilko Schwarting, Alexander Amini, Sertac Karaman, Daniela Rus code -1
Co-Attentive Equivariant Neural Networks: Focusing Equivariance On Transformations Co-Occurring in Data David W. Romero, Mark Hoogendoorn code -1
Mixed Precision DNNs: All you need is a good parametrization Stefan Uhlich, Lukas Mauch, Fabien Cardinaux, Kazuki Yoshiyama, Javier Alonso García, Stephen Tiedemann, Thomas Kemp, Akira Nakamura code -1
Information Geometry of Orthogonal Initializations and Training Piotr Aleksander Sokól, Il Memming Park code -1
Extreme Classification via Adversarial Softmax Approximation Robert Bamler, Stephan Mandt code -1
Learning Nearly Decomposable Value Functions Via Communication Minimization Tonghan Wang, Jianhao Wang, Chongyi Zheng, Chongjie Zhang code -1
Robust Subspace Recovery Layer for Unsupervised Anomaly Detection ChiehHsin Lai, Dongmian Zou, Gilad Lerman code -1
Learning to Coordinate Manipulation Skills via Skill Behavior Diversification Youngwoon Lee, Jingyun Yang, Joseph J. Lim code -1
NAS-Bench-1Shot1: Benchmarking and Dissecting One-shot Neural Architecture Search Arber Zela, Julien Siems, Frank Hutter code -1
Conservative Uncertainty Estimation By Fitting Prior Networks Kamil Ciosek, Vincent Fortuin, Ryota Tomioka, Katja Hofmann, Richard E. Turner code -1
Understanding Generalization in Recurrent Neural Networks Zhuozhuo Tu, Fengxiang He, Dacheng Tao code -1
The Shape of Data: Intrinsic Distance for Data Distributions Anton Tsitsulin, Marina Munkhoeva, Davide Mottin, Panagiotis Karras, Alexander M. Bronstein, Ivan V. Oseledets, Emmanuel Müller code -1
How to 0wn the NAS in Your Spare Time Sanghyun Hong, Michael Davinroy, Yigitcan Kaya, Dana DachmanSoled, Tudor Dumitras code -1
Enabling Deep Spiking Neural Networks with Hybrid Conversion and Spike Timing Dependent Backpropagation Nitin Rathi, Gopalakrishnan Srinivasan, Priyadarshini Panda, Kaushik Roy code -1
Breaking Certified Defenses: Semantic Adversarial Examples with Spoofed robustness Certificates Amin Ghiasi, Ali Shafahi, Tom Goldstein code -1
Query-efficient Meta Attack to Deep Neural Networks Jiawei Du, Hu Zhang, Joey Tianyi Zhou, Yi Yang, Jiashi Feng code -1
Massively Multilingual Sparse Word Representations Gábor Berend code -1
Monotonic Multihead Attention Xutai Ma, Juan Miguel Pino, James Cross, Liezl Puzon, Jiatao Gu code -1
Gradients as Features for Deep Representation Learning Fangzhou Mu, Yingyu Liang, Yin Li code -1
Pay Attention to Features, Transfer Learn Faster CNNs Kafeng Wang, Xitong Gao, Yiren Zhao, Xingjian Li, Dejing Dou, ChengZhong Xu code -1
Program Guided Agent ShaoHua Sun, TeLin Wu, Joseph J. Lim code -1
Sparse Coding with Gated Learned ISTA Kailun Wu, Yiwen Guo, Ziang Li, Changshui Zhang code -1
Graph Neural Networks Exponentially Lose Expressive Power for Node Classification Kenta Oono, Taiji Suzuki code -1
Multi-Scale Representation Learning for Spatial Feature Distributions using Grid Cells Gengchen Mai, Krzysztof Janowicz, Bo Yan, Rui Zhu, Ling Cai, Ni Lao code -1
InfoGraph: Unsupervised and Semi-supervised Graph-Level Representation Learning via Mutual Information Maximization FanYun Sun, Jordan Hoffmann, Vikas Verma, Jian Tang code -1
On Robustness of Neural Ordinary Differential Equations Hanshu Yan, Jiawei Du, Vincent Y. F. Tan, Jiashi Feng code -1
Defending Against Physically Realizable Attacks on Image Classification Tong Wu, Liang Tong, Yevgeniy Vorobeychik code -1
Estimating Gradients for Discrete Random Variables by Sampling without Replacement Wouter Kool, Herke van Hoof, Max Welling code -1
Learning to Control PDEs with Differentiable Physics Philipp Holl, Nils Thuerey, Vladlen Koltun code -1
Intensity-Free Learning of Temporal Point Processes Oleksandr Shchur, Marin Bilos, Stephan Günnemann code -1
A Signal Propagation Perspective for Pruning Neural Networks at Initialization Namhoon Lee, Thalaiyasingam Ajanthan, Stephen Gould, Philip H. S. Torr code -1
Skip Connections Matter: On the Transferability of Adversarial Examples Generated with ResNets Dongxian Wu, Yisen Wang, ShuTao Xia, James Bailey, Xingjun Ma code -1
White Noise Analysis of Neural Networks Ali Borji, Sikun Lin code -1
Neural Machine Translation with Universal Visual Representation Zhuosheng Zhang, Kehai Chen, Rui Wang, Masao Utiyama, Eiichiro Sumita, Zuchao Li, Hai Zhao code -1
Tranquil Clouds: Neural Networks for Learning Temporally Coherent Features in Point Clouds Lukas Prantl, Nuttapong Chentanez, Stefan Jeschke, Nils Thuerey code -1
PC-DARTS: Partial Channel Connections for Memory-Efficient Architecture Search Yuhui Xu, Lingxi Xie, Xiaopeng Zhang, Xin Chen, GuoJun Qi, Qi Tian, Hongkai Xiong code -1
Online and stochastic optimization beyond Lipschitz continuity: A Riemannian approach Kimon Antonakopoulos, Elena Veronica Belmega, Panayotis Mertikopoulos code -1
Enhancing Adversarial Defense by k-Winners-Take-All Chang Xiao, Peilin Zhong, Changxi Zheng code -1
Encoding word order in complex embeddings Benyou Wang, Donghao Zhao, Christina Lioma, Qiuchi Li, Peng Zhang, Jakob Grue Simonsen code -1
DDSP: Differentiable Digital Signal Processing Jesse H. Engel, Lamtharn Hantrakul, Chenjie Gu, Adam Roberts code -1
Cross-Domain Few-Shot Classification via Learned Feature-Wise Transformation HungYu Tseng, HsinYing Lee, JiaBin Huang, MingHsuan Yang code -1
Ridge Regression: Structure, Cross-Validation, and Sketching Sifan Liu, Edgar Dobriban code -1
Finite Depth and Width Corrections to the Neural Tangent Kernel Boris Hanin, Mihai Nica code -1
Meta-Learning without Memorization Mingzhang Yin, George Tucker, Mingyuan Zhou, Sergey Levine, Chelsea Finn code -1
Influence-Based Multi-Agent Exploration Tonghan Wang, Jianhao Wang, Yi Wu, Chongjie Zhang code -1
Hoppity: Learning Graph Transformations to Detect and Fix Bugs in Programs Elizabeth Dinella, Hanjun Dai, Ziyang Li, Mayur Naik, Le Song, Ke Wang code -1
Sliced Cramer Synaptic Consolidation for Preserving Deeply Learned Representations Soheil Kolouri, Nicholas A. Ketz, Andrea Soltoggio, Praveen K. Pilly code -1
How much Position Information Do Convolutional Neural Networks Encode? Md. Amirul Islam, Sen Jia, Neil D. B. Bruce code -1
Hamiltonian Generative Networks Peter Toth, Danilo J. Rezende, Andrew Jaegle, Sébastien Racanière, Aleksandar Botev, Irina Higgins code -1
CoPhy: Counterfactual Learning of Physical Dynamics Fabien Baradel, Natalia Neverova, Julien Mille, Greg Mori, Christian Wolf code -1
Estimating counterfactual treatment outcomes over time through adversarially balanced representations Ioana Bica, Ahmed M. Alaa, James Jordon, Mihaela van der Schaar code -1
Gradientless Descent: High-Dimensional Zeroth-Order Optimization Daniel Golovin, John Karro, Greg Kochanski, Chansoo Lee, Xingyou Song, Qiuyi (Richard) Zhang code -1
Conditional Learning of Fair Representations Han Zhao, Amanda Coston, Tameem Adel, Geoffrey J. Gordon code -1
Inductive Matrix Completion Based on Graph Neural Networks Muhan Zhang, Yixin Chen code -1
Duration-of-Stay Storage Assignment under Uncertainty Michael Lingzhi Li, Elliott Wolf, Daniel Wintz code -1
Emergence of functional and structural properties of the head direction system by optimization of recurrent neural networks Christopher J. Cueva, Peter Y. Wang, Matthew Chin, XueXin Wei code -1
Deep neuroethology of a virtual rodent Josh Merel, Diego Aldarondo, Jesse Marshall, Yuval Tassa, Greg Wayne, Bence Olveczky code -1
Doubly Robust Bias Reduction in Infinite Horizon Off-Policy Estimation Ziyang Tang, Yihao Feng, Lihong Li, Dengyong Zhou, Qiang Liu code -1
Learning Compositional Koopman Operators for Model-Based Control Yunzhu Li, Hao He, Jiajun Wu, Dina Katabi, Antonio Torralba code -1
CLEVRER: Collision Events for Video Representation and Reasoning Kexin Yi, Chuang Gan, Yunzhu Li, Pushmeet Kohli, Jiajun Wu, Antonio Torralba, Joshua B. Tenenbaum code -1
The Logical Expressiveness of Graph Neural Networks Pablo Barceló, Egor V. Kostylev, Mikaël Monet, Jorge Pérez, Juan L. Reutter, Juan Pablo Silva code -1
The Break-Even Point on Optimization Trajectories of Deep Neural Networks Stanislaw Jastrzebski, Maciej Szymczak, Stanislav Fort, Devansh Arpit, Jacek Tabor, Kyunghyun Cho, Krzysztof J. Geras code -1
ALBERT: A Lite BERT for Self-supervised Learning of Language Representations Zhenzhong Lan, Mingda Chen, Sebastian Goodman, Kevin Gimpel, Piyush Sharma, Radu Soricut code -1
Disentangling neural mechanisms for perceptual grouping Junkyung Kim, Drew Linsley, Kalpit Thakkar, Thomas Serre code -1
Learning to Plan in High Dimensions via Neural Exploration-Exploitation Trees Binghong Chen, Bo Dai, Qinjie Lin, Guo Ye, Han Liu, Le Song code -1
Symplectic Recurrent Neural Networks Zhengdao Chen, Jianyu Zhang, Martín Arjovsky, Léon Bottou code -1
Asymptotics of Wide Networks from Feynman Diagrams Ethan Dyer, Guy GurAri code -1
Learning The Difference That Makes A Difference With Counterfactually-Augmented Data Divyansh Kaushik, Eduard H. Hovy, Zachary Chase Lipton code -1
Is a Good Representation Sufficient for Sample Efficient Reinforcement Learning? Simon S. Du, Sham M. Kakade, Ruosong Wang, Lin F. Yang code -1
Simplified Action Decoder for Deep Multi-Agent Reinforcement Learning Hengyuan Hu, Jakob N. Foerster code -1
Network Deconvolution Chengxi Ye, Matthew Evanusa, Hua He, Anton Mitrokhin, Tom Goldstein, James A. Yorke, Cornelia Fermüller, Yiannis Aloimonos code -1
Neural Symbolic Reader: Scalable Integration of Distributed and Symbolic Representations for Reading Comprehension Xinyun Chen, Chen Liang, Adams Wei Yu, Denny Zhou, Dawn Song, Quoc V. Le code -1
Real or Not Real, that is the Question Yuanbo Xiangli, Yubin Deng, Bo Dai, Chen Change Loy, Dahua Lin code -1
Dream to Control: Learning Behaviors by Latent Imagination Danijar Hafner, Timothy P. Lillicrap, Jimmy Ba, Mohammad Norouzi code -1
A Probabilistic Formulation of Unsupervised Text Style Transfer Junxian He, Xinyi Wang, Graham Neubig, Taylor BergKirkpatrick code -1
Emergent Tool Use From Multi-Agent Autocurricula Bowen Baker, Ingmar Kanitscheider, Todor M. Markov, Yi Wu, Glenn Powell, Bob McGrew, Igor Mordatch code -1
NAS-Bench-201: Extending the Scope of Reproducible Neural Architecture Search Xuanyi Dong, Yi Yang code -1
Strategies for Pre-training Graph Neural Networks Weihua Hu, Bowen Liu, Joseph Gomes, Marinka Zitnik, Percy Liang, Vijay S. Pande, Jure Leskovec code -1
Behaviour Suite for Reinforcement Learning Ian Osband, Yotam Doron, Matteo Hessel, John Aslanides, Eren Sezener, Andre Saraiva, Katrina McKinney, Tor Lattimore, Csaba Szepesvári, Satinder Singh, Benjamin Van Roy, Richard S. Sutton, David Silver, Hado van Hasselt code -1
FreeLB: Enhanced Adversarial Training for Natural Language Understanding Chen Zhu, Yu Cheng, Zhe Gan, Siqi Sun, Tom Goldstein, Jingjing Liu code -1
Kernelized Wasserstein Natural Gradient Michael Arbel, Arthur Gretton, Wuchen Li, Guido Montúfar code -1
And the Bit Goes Down: Revisiting the Quantization of Neural Networks Pierre Stock, Armand Joulin, Rémi Gribonval, Benjamin Graham, Hervé Jégou code -1
A Latent Morphology Model for Open-Vocabulary Neural Machine Translation Duygu Ataman, Wilker Aziz, Alexandra Birch code -1
Understanding Why Neural Networks Generalize Well Through GSNR of Parameters Jinlong Liu, Yunzhi Bai, Guoqing Jiang, Ting Chen, Huayan Wang code -1
Model Based Reinforcement Learning for Atari Lukasz Kaiser, Mohammad Babaeizadeh, Piotr Milos, Blazej Osinski, Roy H. Campbell, Konrad Czechowski, Dumitru Erhan, Chelsea Finn, Piotr Kozakowski, Sergey Levine, Afroz Mohiuddin, Ryan Sepassi, George Tucker, Henryk Michalewski code -1
Disagreement-Regularized Imitation Learning Kianté Brantley, Wen Sun, Mikael Henaff code -1
Stable Rank Normalization for Improved Generalization in Neural Networks and GANs Amartya Sanyal, Philip H. S. Torr, Puneet K. Dokania code -1
Measuring the Reliability of Reinforcement Learning Algorithms Stephanie C. Y. Chan, Samuel Fishman, Anoop Korattikara, John F. Canny, Sergio Guadarrama code -1
Sequential Latent Knowledge Selection for Knowledge-Grounded Dialogue Byeongchang Kim, Jaewoo Ahn, Gunhee Kim code -1
Neural Tangents: Fast and Easy Infinite Neural Networks in Python Roman Novak, Lechao Xiao, Jiri Hron, Jaehoon Lee, Alexander A. Alemi, Jascha SohlDickstein, Samuel S. Schoenholz code -1
Self-labelling via simultaneous clustering and representation learning Yuki Markus Asano, Christian Rupprecht, Andrea Vedaldi code -1
The intriguing role of module criticality in the generalization of deep networks Niladri S. Chatterji, Behnam Neyshabur, Hanie Sedghi code -1
Harnessing the Power of Infinitely Wide Deep Nets on Small-data Tasks Sanjeev Arora, Simon S. Du, Zhiyuan Li, Ruslan Salakhutdinov, Ruosong Wang, Dingli Yu code -1
Differentiation of Blackbox Combinatorial Solvers Marin Vlastelica Pogancic, Anselm Paulus, Vít Musil, Georg Martius, Michal Rolínek code -1
Scaling Autoregressive Video Models Dirk Weissenborn, Oscar Täckström, Jakob Uszkoreit code -1
The Ingredients of Real World Robotic Reinforcement Learning Henry Zhu, Justin Yu, Abhishek Gupta, Dhruv Shah, Kristian Hartikainen, Avi Singh, Vikash Kumar, Sergey Levine code -1
Meta-Learning Acquisition Functions for Transfer Learning in Bayesian Optimization Michael Volpp, Lukas P. Fröhlich, Kirsten Fischer, Andreas Doerr, Stefan Falkner, Frank Hutter, Christian Daniel code -1
Maximum Likelihood Constraint Inference for Inverse Reinforcement Learning Dexter R. R. Scobee, S. Shankar Sastry code -1
Spectral Embedding of Regularized Block Models Nathan de Lara, Thomas Bonald code -1
Towards Hierarchical Importance Attribution: Explaining Compositional Semantics for Neural Sequence Models Xisen Jin, Zhongyu Wei, Junyi Du, Xiangyang Xue, Xiang Ren code -1
word2ket: Space-efficient Word Embeddings inspired by Quantum Entanglement Aliakbar Panahi, Seyran Saeedi, Tomasz Arodz code -1
What Can Neural Networks Reason About? Keyulu Xu, Jingling Li, Mozhi Zhang, Simon S. Du, Kenichi Kawarabayashi, Stefanie Jegelka code -1
Training individually fair ML models with sensitive subspace robustness Mikhail Yurochkin, Amanda Bower, Yuekai Sun code -1
Learning from Rules Generalizing Labeled Exemplars Abhijeet Awasthi, Sabyasachi Ghosh, Rasna Goyal, Sunita Sarawagi code -1
Directional Message Passing for Molecular Graphs Johannes Klicpera, Janek Groß, Stephan Günnemann code -1
Explanation by Progressive Exaggeration Sumedha Singla, Brian Pollack, Junxiang Chen, Kayhan Batmanghelich code -1
Compression based bound for non-compressed network: unified generalization error analysis of large compressible deep neural network Taiji Suzuki, Hiroshi Abe, Tomoaki Nishimura code -1
At Stability's Edge: How to Adjust Hyperparameters to Preserve Minima Selection in Asynchronous Training of Neural Networks? Niv Giladi, Mor Shpigel Nacson, Elad Hoffer, Daniel Soudry code -1
Disentanglement by Nonlinear ICA with General Incompressible-flow Networks (GIN) Peter Sorrenson, Carsten Rother, Ullrich Köthe code -1
Kaleidoscope: An Efficient, Learnable Representation For All Structured Linear Maps Tri Dao, Nimit Sharad Sohoni, Albert Gu, Matthew Eichhorn, Amit Blonder, Megan Leszczynski, Atri Rudra, Christopher Ré code -1
Improving Generalization in Meta Reinforcement Learning using Learned Objectives Louis Kirsch, Sjoerd van Steenkiste, Jürgen Schmidhuber code -1
Drawing Early-Bird Tickets: Toward More Efficient Training of Deep Networks Haoran You, Chaojian Li, Pengfei Xu, Yonggan Fu, Yue Wang, Xiaohan Chen, Richard G. Baraniuk, Zhangyang Wang, Yingyan Lin code -1
Truth or backpropaganda? An empirical investigation of deep learning theory Micah Goldblum, Jonas Geiping, Avi Schwarzschild, Michael Moeller, Tom Goldstein code -1
Neural Arithmetic Units Andreas Madsen, Alexander Rosenberg Johansen code -1
DeepSphere: a graph-based spherical CNN Michaël Defferrard, Martino Milani, Frédérick Gusset, Nathanaël Perraudin code -1
SUMO: Unbiased Estimation of Log Marginal Probability for Latent Variable Models Yucen Luo, Alex Beatson, Mohammad Norouzi, Jun Zhu, David Duvenaud, Ryan P. Adams, Ricky T. Q. Chen code -1
Deep Learning For Symbolic Mathematics Guillaume Lample, François Charton code -1
Making Sense of Reinforcement Learning and Probabilistic Inference Brendan O'Donoghue, Ian Osband, Catalin Ionescu code -1
Unbiased Contrastive Divergence Algorithm for Training Energy-Based Latent Variable Models Yixuan Qiu, Lingsong Zhang, Xiao Wang code -1
A Mutual Information Maximization Perspective of Language Representation Learning Lingpeng Kong, Cyprien de Masson d'Autume, Lei Yu, Wang Ling, Zihang Dai, Dani Yogatama code -1
Energy-based models for atomic-resolution protein conformations Yilun Du, Joshua Meier, Jerry Ma, Rob Fergus, Alexander Rives code -1
Depth-Width Trade-offs for ReLU Networks via Sharkovsky's Theorem Vaggos Chatziafratis, Sai Ganesh Nagarajan, Ioannis Panageas, Xiao Wang code -1
Generalization of Two-layer Neural Networks: An Asymptotic Viewpoint Jimmy Ba, Murat A. Erdogdu, Taiji Suzuki, Denny Wu, Tianzong Zhang code -1
Reconstructing continuous distributions of 3D protein structure from cryo-EM images Ellen D. Zhong, Tristan Bepler, Joseph H. Davis, Bonnie Berger code -1
Progressive Learning and Disentanglement of Hierarchical Representations Zhiyuan Li, Jaideep Vitthal Murkute, Prashnna Kumar Gyawali, Linwei Wang code -1
An Exponential Learning Rate Schedule for Deep Learning Zhiyuan Li, Sanjeev Arora code -1
Geom-GCN: Geometric Graph Convolutional Networks Hongbin Pei, Bingzhe Wei, Kevin ChenChuan Chang, Yu Lei, Bo Yang code -1
CATER: A diagnostic dataset for Compositional Actions & TEmporal Reasoning Rohit Girdhar, Deva Ramanan code -1
BackPACK: Packing more into Backprop Felix Dangel, Frederik Kunstner, Philipp Hennig code -1
GenDICE: Generalized Offline Estimation of Stationary Values Ruiyi Zhang, Bo Dai, Lihong Li, Dale Schuurmans code -1
Principled Weight Initialization for Hypernetworks Oscar Chang, Lampros Flokas, Hod Lipson code -1
On the Convergence of FedAvg on Non-IID Data Xiang Li, Kaixuan Huang, Wenhao Yang, Shusen Wang, Zhihua Zhang code -1
Data-dependent Gaussian Prior Objective for Language Generation Zuchao Li, Rui Wang, Kehai Chen, Masao Utiyama, Eiichiro Sumita, Zhuosheng Zhang, Hai Zhao code -1
Contrastive Learning of Structured World Models Thomas N. Kipf, Elise van der Pol, Max Welling code -1
Neural Network Branching for Neural Network Verification Jingyue Lu, M. Pawan Kumar code -1
Why Gradient Clipping Accelerates Training: A Theoretical Justification for Adaptivity Jingzhao Zhang, Tianxing He, Suvrit Sra, Ali Jadbabaie code -1
Posterior sampling for multi-agent reinforcement learning: solving extensive games with imperfect information Yichi Zhou, Jialian Li, Jun Zhu code -1
Mogrifier LSTM Gábor Melis, Tomás Kociský, Phil Blunsom code -1
Learning Hierarchical Discrete Linguistic Units from Visually-Grounded Speech David Harwath, WeiNing Hsu, James R. Glass code -1
Mirror-Generative Neural Machine Translation Zaixiang Zheng, Hao Zhou, Shujian Huang, Lei Li, XinYu Dai, Jiajun Chen code -1
Cyclical Stochastic Gradient MCMC for Bayesian Deep Learning Ruqi Zhang, Chunyuan Li, Jianyi Zhang, Changyou Chen, Andrew Gordon Wilson code -1
Your classifier is secretly an energy based model and you should treat it like one Will Grathwohl, KuanChieh Wang, JörnHenrik Jacobsen, David Duvenaud, Mohammad Norouzi, Kevin Swersky code -1
Dynamics-Aware Unsupervised Discovery of Skills Archit Sharma, Shixiang Gu, Sergey Levine, Vikash Kumar, Karol Hausman code -1
Optimal Strategies Against Generative Attacks Roy Mor, Erez Peterfreund, Matan Gavish, Amir Globerson code -1
GraphZoom: A Multi-level Spectral Approach for Accurate and Scalable Graph Embedding Chenhui Deng, Zhiqiang Zhao, Yongyu Wang, Zhiru Zhang, Zhuo Feng code -1
Harnessing Structures for Value-Based Planning and Reinforcement Learning Yuzhe Yang, Guo Zhang, Zhi Xu, Dina Katabi code -1
Comparing Rewinding and Fine-tuning in Neural Network Pruning Alex Renda, Jonathan Frankle, Michael Carbin code -1
Meta-Q-Learning Rasool Fakoor, Pratik Chaudhari, Stefano Soatto, Alexander J. Smola code -1
Mathematical Reasoning in Latent Space Dennis Lee, Christian Szegedy, Markus N. Rabe, Sarah M. Loos, Kshitij Bansal code -1
A Theory of Usable Information under Computational Constraints Yilun Xu, Shengjia Zhao, Jiaming Song, Russell Stewart, Stefano Ermon code -1
Geometric Analysis of Nonconvex Optimization Landscapes for Overcomplete Learning Qing Qu, Yuexiang Zhai, Xiao Li, Yuqian Zhang, Zhihui Zhu code -1
Deep Batch Active Learning by Diverse, Uncertain Gradient Lower Bounds Jordan T. Ash, Chicheng Zhang, Akshay Krishnamurthy, John Langford, Alekh Agarwal code -1
Understanding and Robustifying Differentiable Architecture Search Arber Zela, Thomas Elsken, Tonmoy Saikia, Yassine Marrakchi, Thomas Brox, Frank Hutter code -1
A Closer Look at Deep Policy Gradients Andrew Ilyas, Logan Engstrom, Shibani Santurkar, Dimitris Tsipras, Firdaus Janoos, Larry Rudolph, Aleksander Madry code -1
Implementation Matters in Deep RL: A Case Study on PPO and TRPO Logan Engstrom, Andrew Ilyas, Shibani Santurkar, Dimitris Tsipras, Firdaus Janoos, Larry Rudolph, Aleksander Madry code -1
Fast Task Inference with Variational Intrinsic Successor Features Steven Hansen, Will Dabney, André Barreto, David WardeFarley, Tom Van de Wiele, Volodymyr Mnih code -1
Learning to Balance: Bayesian Meta-Learning for Imbalanced and Out-of-distribution Tasks Haebeom Lee, Hayeon Lee, Donghyun Na, Saehoon Kim, Minseop Park, Eunho Yang, Sung Ju Hwang code -1
RNA Secondary Structure Prediction By Learning Unrolled Algorithms Xinshi Chen, Yu Li, Ramzan Umarov, Xin Gao, Le Song code -1
Watch the Unobserved: A Simple Approach to Parallelizing Monte Carlo Tree Search Anji Liu, Jianshu Chen, Mingze Yu, Yu Zhai, Xuewen Zhou, Ji Liu code -1
Target-Embedding Autoencoders for Supervised Representation Learning Daniel Jarrett, Mihaela van der Schaar code -1
Reformer: The Efficient Transformer Nikita Kitaev, Lukasz Kaiser, Anselm Levskaya code -1
Rotation-invariant clustering of neuronal responses in primary visual cortex Ivan Ustyuzhaninov, Santiago A. Cadena, Emmanouil Froudarakis, Paul G. Fahey, Edgar Y. Walker, Erick Cobos, Jacob Reimer, Fabian H. Sinz, Andreas S. Tolias, Matthias Bethge, Alexander S. Ecker code -1
Causal Discovery with Reinforcement Learning Shengyu Zhu, Ignavier Ng, Zhitang Chen code -1
Intrinsically Motivated Discovery of Diverse Patterns in Self-Organizing Systems Chris Reinke, Mayalen Etcheverry, PierreYves Oudeyer code -1
Restricting the Flow: Information Bottlenecks for Attribution Karl Schulz, Leon Sixt, Federico Tombari, Tim Landgraf code -1
Building Deep Equivariant Capsule Networks Sai Raam Venkataraman, S. Balasubramanian, R. Raghunatha Sarma code -1
A Generalized Training Approach for Multiagent Learning Paul Muller, Shayegan Omidshafiei, Mark Rowland, Karl Tuyls, Julien Pérolat, Siqi Liu, Daniel Hennes, Luke Marris, Marc Lanctot, Edward Hughes, Zhe Wang, Guy Lever, Nicolas Heess, Thore Graepel, Rémi Munos code -1
High Fidelity Speech Synthesis with Adversarial Networks Mikolaj Binkowski, Jeff Donahue, Sander Dieleman, Aidan Clark, Erich Elsen, Norman Casagrande, Luis C. Cobo, Karen Simonyan code -1
SEED RL: Scalable and Efficient Deep-RL with Accelerated Central Inference Lasse Espeholt, Raphaël Marinier, Piotr Stanczyk, Ke Wang, Marcin Michalski code -1
Meta-Learning with Warped Gradient Descent Sebastian Flennerhag, Andrei A. Rusu, Razvan Pascanu, Francesco Visin, Hujun Yin, Raia Hadsell code -1
Convolutional Conditional Neural Processes Jonathan Gordon, Wessel P. Bruinsma, Andrew Y. K. Foong, James Requeima, Yann Dubois, Richard E. Turner code -1
Gradient Descent Maximizes the Margin of Homogeneous Neural Networks Kaifeng Lyu, Jian Li code -1
Adversarial Training and Provable Defenses: Bridging the Gap Mislav Balunovic, Martin T. Vechev code -1
Differentiable Reasoning over a Virtual Knowledge Base Bhuwan Dhingra, Manzil Zaheer, Vidhisha Balachandran, Graham Neubig, Ruslan Salakhutdinov, William W. Cohen code -1
Federated Learning with Matched Averaging Hongyi Wang, Mikhail Yurochkin, Yuekai Sun, Dimitris S. Papailiopoulos, Yasaman Khazaeni code -1