ICLR 2018 Paper List
6th International Conference on Learning Representations, ICLR 2018, Vancouver, BC, Canada, April 30 - May 3, 2018, Conference Track Proceedings.
On the Information Bottleneck Theory of Deep Learning.
Memory Architectures in Recurrent Neural Network Language Models.
Mixed Precision Training of Convolutional Neural Networks using Integer Operations.
Gaussian Process Behaviour in Wide Deep Neural Networks.
Variational Continual Learning.
Learning Sparse Neural Networks through L_0 Regularization.
Learning From Noisy Singly-labeled Data.
Sobolev GAN.
Training GANs with Optimism.
Variational Network Quantization.
Temporally Efficient Deep Learning with Spikes.
Deep Sensing: Active Sensing using Multi-directional Recurrent Neural Networks.
Multi-Mention Learning for Reading Comprehension with Neural Cascades.
Fix your classifier: the marginal value of training the last weight layer.
A New Method of Region Embedding for Text Classification.
A Compressed Sensing View of Unsupervised Text Embeddings, Bag-of-n-Grams, and LSTMs.
Divide-and-Conquer Reinforcement Learning.
Hierarchical and Interpretable Skill Acquisition in Multi-task Reinforcement Learning.
N2N learning: Network to Network Compression via Policy Gradient Reinforcement Learning.
Progressive Reinforcement Learning with Distillation for Multi-Skilled Motion Control.
Memory Augmented Control Networks.
Overcoming Catastrophic Interference using Conceptor-Aided Backpropagation.
Active Neural Localization.
Neural Map: Structured Memory for Deep Reinforcement Learning.
Eigenoption Discovery through the Deep Successor Representation.
On the regularization of Wasserstein GANs.
Robustness of Classifiers to Universal Perturbations: A Geometric Perspective.
Stochastic gradient descent performs variational inference, converges to limit cycles for deep networks.
Online Learning Rate Adaptation with Hypergradient Descent.
When is a Convolutional Filter Easy to Learn?
Policy Optimization by Genetic Distillation.
Guide Actor-Critic for Continuous Control.
Boosting the Actor with Dual Critic.
Adaptive Quantization of Neural Networks.
Residual Loss Prediction: Reinforcement Learning With No Incremental Feedback.
Alternating Multi-bit Quantization for Recurrent Neural Networks.
TreeQN and ATreeC: Differentiable Tree-Structured Models for Deep Reinforcement Learning.
Temporal Difference Models: Model-Free Deep RL for Model-Based Control.
DORA The Explorer: Directed Outreaching Reinforcement Action-Selection.
TD or not TD: Analyzing the Role of Temporal Differencing in Deep Reinforcement Learning.
mixup: Beyond Empirical Risk Minimization.
Deep Voice 3: Scaling Text-to-Speech with Convolutional Sequence Learning.
Non-Autoregressive Neural Machine Translation.
PixelNN: Example-based Image Synthesis.
Learning to Teach.
Auto-Encoding Sequential Monte Carlo.
Synthesizing realistic neural population activity patterns using Generative Adversarial Networks.
Parameter Space Noise for Exploration.
SMASH: One-Shot Model Architecture Search through HyperNetworks.
An image representation based convolutional network for DNA classification.
Improving the Universality and Learnability of Neural Programmer-Interpreters with Combinator Abstraction.
Expressive power of recurrent neural networks.
Towards Synthesizing Complex Programs From Input-Output Examples.
Deep Learning and Quantum Entanglement: Fundamental Connections with Implications to Network Design.
A Simple Neural Attentive Meta-Learner.
Learning Robust Rewards with Adversarial Inverse Reinforcement Learning.
Learning to Multi-Task by Active Sampling.
Gradient Estimators for Implicit Models.
Self-ensembling for visual domain adaptation.
Understanding Short-Horizon Bias in Stochastic Meta-Optimization.
WHAI: Weibull Hybrid Autoencoding Inference for Deep Topic Modeling.
Learning Sparse Latent Representations with the Deep Copula Information Bottleneck.
Boundary Seeking GANs.
Learning a Generative Model for Validity in Complex Discrete Structures.
Debiasing Evidence Approximations: On Importance-weighted Autoencoders and Jackknife Variational Inference.
On Unifying Deep Generative Models.
Backpropagation through the Void: Optimizing control variates for black-box gradient estimation.
Learning Awareness Models.
Understanding image motion with group representations.
Predicting Floor-Level for 911 Calls with Neural Networks and Smartphone Sensor Data.
Spatially Transformed Adversarial Examples.
Generating Natural Adversarial Examples.
Deep Gaussian Embedding of Graphs: Unsupervised Inductive Learning via Ranking.
Detecting Statistical Interactions from Neural Network Weights.
Learning how to explain neural networks: PatternNet and PatternAttribution.
Not-So-Random Features.
SpectralNet: Spectral Clustering using Deep Neural Networks.
Global Optimality Conditions for Deep Neural Networks.
Loss-aware Weight Quantization of Deep Networks.
Active Learning for Convolutional Neural Networks: A Core-Set Approach.
Scalable Private Learning with PATE.
Combining Symbolic Expressions and Black-box Function Evaluations in Neural Programs.
Reinforcement Learning on Web Interfaces using Workflow-Guided Exploration.
Hierarchical Representations for Efficient Architecture Search.
Beyond Shared Hierarchies: Deep Multitask Learning through Soft Layer Ordering.
Compositional Attention Networks for Machine Reasoning.
Dynamic Neural Program Embeddings for Program Repair.
The Role of Minimal Complexity Functions in Unsupervised Learning of Semantic Mappings.
Lifelong Learning with Dynamically Expandable Networks.
Go for a Walk and Arrive at the Answer: Reasoning Over Paths in Knowledge Bases using Reinforcement Learning.
Deep Active Learning for Named Entity Recognition.
Learning Intrinsic Sparse Structures within Long Short-Term Memory.
Neural Language Modeling by Jointly Learning Syntax and Lexicon.
FusionNet: Fusing via Fully-aware Attention with Application to Machine Comprehension.
Improving the Improved Training of Wasserstein GANs: A Consistency Term and Its Dual Effect.
Coulomb GANs: Provably Optimal Nash Equilibria via Potential Fields.
Activation Maximization Generative Adversarial Nets.
Training Generative Adversarial Networks via Primal-Dual Subgradient Methods: A Lagrangian Perspective on GAN.
Learning Wasserstein Embeddings.
CausalGAN: Learning Causal Implicit Generative Models with Adversarial Training.
Matrix capsules with EM routing.
Decision Boundary Analysis of Adversarial Examples.
Mitigating Adversarial Effects Through Randomization.
Cascade Adversarial Machine Learning Regularized with a Unified Embedding.
Can Neural Networks Understand Logical Entailment?
Consequentialist conditional cooperation in social dilemmas with imperfect information.
Leave no Trace: Learning to Reset for Safe and Autonomous Reinforcement Learning.
Reinforcement Learning Algorithm Selection.
Intrinsic Motivation and Automatic Curricula via Asymmetric Self-Play.
Distributed Fine-tuning of Language Models on Private Data.
Multi-Task Learning for Document Ranking and Query Suggestion.
Natural Language Inference over Interaction Space.
Learning General Purpose Distributed Sentence Representations via Large Scale Multi-task Learning.
All-but-the-Top: Simple and Effective Postprocessing for Word Representations.
Word translation without parallel data.
DCN+: Mixed Objective And Deep Residual Coattention for Question Answering.
Regularizing and Optimizing LSTM Language Models.
Sensitivity and Generalization in Neural Networks: an Empirical Study.
Implicit Causal Models for Genome-wide Association Studies.
A Bayesian Perspective on Generalization and Stochastic Gradient Descent.
Adaptive Dropout with Rademacher Complexity Regularization.
Many Paths to Equilibrium: GANs Do Not Need to Decrease a Divergence At Every Step.
The Implicit Bias of Gradient Descent on Separable Data.
On the importance of single directions for generalization.
A PAC-Bayesian Approach to Spectrally-Normalized Margin Bounds for Neural Networks.
SGD Learns Over-parameterized Networks that Provably Generalize on Linearly Separable Data.
Neumann Optimizer: A Practical Optimization Algorithm for Deep Neural Networks.
Proximal Backpropagation.
Kronecker-factored Curvature Approximations for Recurrent Neural Networks.
Don't Decay the Learning Rate, Increase the Batch Size.
Recasting Gradient-Based Meta-Learning as Hierarchical Bayes.
Monotonic Chunkwise Attention.
Learn to Pay Attention.
Training wide residual networks for deployment using a single bit for each weight.
Understanding Deep Neural Networks with Rectified Linear Units.
Towards Reverse-Engineering Black-Box Neural Networks.
Do GANs learn the distribution? Some Theory and Empirics.
FearNet: Brain-Inspired Model for Incremental Learning.
Wavelet Pooling for Convolutional Neural Networks.
Routing Networks: Adaptive Selection of Non-Linear Functions for Multi-Task Learning.
Bi-Directional Block Self-Attention for Fast and Memory-Efficient Sequence Modeling.
Skip Connections Eliminate Singularities.
Deep Complex Networks.
Learning to cluster in order to transfer across domains and tasks.
Generalizing Across Domains via Cross-Gradient Training.
A DIRT-T Approach to Unsupervised Domain Adaptation.
Meta-Learning for Semi-Supervised Few-Shot Classification.
A Framework for the Quantitative Evaluation of Disentangled Representations.
Semantically Decomposing the Latent Spaces of Generative Adversarial Networks.
Few-Shot Learning with Graph Neural Networks.
Learning a neural response metric for retinal prosthesis.
Emergence of grid-like representations by training recurrent neural networks to perform spatial localization.
Identifying Analogies Across Domains.
Hierarchical Density Order Embeddings.
SCAN: Learning Hierarchical Compositional Visual Concepts.
Compositional Obverter Communication Learning from Raw Visual Input.
Few-shot Autoregressive Density Estimation: Towards Learning to Learn Distributions.
Generative Models of Visually Grounded Imagination.
Relational Neural Expectation Maximization: Unsupervised Discovery of Objects and their Interactions.
Simulated+Unsupervised Learning With Adaptive Data Generation and Bidirectional Mappings.
Diffusion Convolutional Recurrent Neural Network: Data-Driven Traffic Forecasting.
Rethinking the Smaller-Norm-Less-Informative Assumption in Channel Pruning of Convolution Layers.
On the Expressive Power of Overlapping Architectures of Deep Learning.
Critical Percolation as a Framework to Analyze the Training of Deep Networks.
Generative networks as inverse problems with Scattering transforms.
Improving GAN Training via Binarized Representation Entropy (BRE) Regularization.
Quantitatively Evaluating GANs With Divergences Proposed for Training.
Deep Rewiring: Training very sparse deep networks.
Learning Discrete Weights Using the Local Reparameterization Trick.
Deep Autoencoding Gaussian Mixture Model for Unsupervised Anomaly Detection.
A Hierarchical Model for Device Placement.
Noisy Networks For Exploration.
Depthwise Separable Convolutions for Neural Machine Translation.
Attacking Binarized Neural Networks.
Parallelizing Linear Recurrent Neural Nets Over Sequence Length.
Can recurrent neural networks warp time?
Fraternal Dropout.
Ensemble Adversarial Training: Attacks and Defenses.
Defense-GAN: Protecting Classifiers Against Adversarial Attacks Using Generative Models.
Certified Defenses against Adversarial Examples.
PixelDefend: Leveraging Generative Models to Understand and Defend against Adversarial Examples.
Initialization matters: Orthogonal Predictive State Recurrent Neural Networks.
Memory-based Parameter Adaptation.
On the State of the Art of Evaluation in Neural Language Models.
Towards Neural Phrase-based Machine Translation.
Multi-level Residual Networks from Dynamical Systems View.
Neural Speed Reading via Skim-RNN.
Unsupervised Cipher Cracking Using Discrete GANs.
Simulating Action Dynamics with Neural Process Networks.
Communication Algorithms via Deep Learning.
Deep Learning for Physical Processes: Incorporating Prior Scientific Knowledge.
Towards Deep Learning Models Resistant to Adversarial Attacks.
HexaConv.
Evaluating the Robustness of Neural Networks: An Extreme Value Theory Approach.
i-RevNet: Deep Invertible Networks.
Learning to Count Objects in Natural Images for Visual Question Answering.
Semi-parametric topological memory for navigation.
Emergent Communication through Negotiation.
Residual Connections Encourage Iterative Inference.
Universal Agent for Disentangling Environments and Tasks.
Emergent Complexity via Multi-Agent Competition.
Interactive Grounded Language Acquisition and Generalization in a 2D World.
Interpretable Counting for Visual Question Answering.
Twin Networks: Matching the Future for Sequence Generation.
Modular Continual Learning in a Unified Visual Environment.
Skip RNN: Learning to Skip State Updates in Recurrent Neural Networks.
Countering Adversarial Images using Input Transformations.
Towards better understanding of gradient-based attribution methods for Deep Neural Networks.
Automatically Inferring Data Quality for Spatiotemporal Forecasting.
Towards Image Understanding from Deep Compression Without Decoding.
Stochastic Variational Video Prediction.
Trust-PCL: An Off-Policy Trust Region Method for Continuous Control.
Thermometer Encoding: One Hot Way To Resist Adversarial Examples.
GANITE: Estimation of Individualized Treatment Effects using Generative Adversarial Nets.
Sparse Persistent RNNs: Squeezing Large Recurrent Networks On-Chip.
Stochastic Activation Pruning for Robust Adversarial Defense.
Memorization Precedes Generation: Learning Unsupervised GANs with Memory Networks.
Measuring the Intrinsic Dimension of Objective Landscapes.
Unbiased Online Recurrent Optimization.
Decision-Based Adversarial Attacks: Reliable Attacks Against Black-Box Machine Learning Models.
On the Discrimination-Generalization Tradeoff in GANs.
Empirical Risk Landscape Analysis for Understanding Deep Neural Networks.
The power of deeper networks for expressing natural functions.
Learning Parametric Closed-Loop Policies for Markov Potential Games.
Critical Points of Linear Neural Networks: Analytical Forms and Landscape Properties.
Learning One-hidden-layer Neural Networks with Landscape Design.
Unsupervised Neural Machine Translation.
QANet: Combining Local Convolution with Global Self-Attention for Reading Comprehension.
Deep Gradient Compression: Reducing the Communication Bandwidth for Distributed Training.
Compressing Word Embeddings via Deep Compositional Code Learning.
A Deep Reinforced Model for Abstractive Summarization.
Unsupervised Machine Translation Using Monolingual Corpora Only.
Generating Wikipedia by Summarizing Long Sequences.
Mastering the Dungeon: Grounded Language Learning by Mechanical Turker Descent.
Learning Differentially Private Recurrent Language Models.
Large scale distributed neural network training through online distillation.
VoiceLoop: Voice Fitting and Synthesis via a Phonological Loop.
Training Confidence-calibrated Classifiers for Detecting Out-of-Distribution Samples.
Learning from Between-class Examples for Deep Sound Recognition.
Distributed Prioritized Experience Replay.
Apprentice: Using Knowledge Distillation Techniques To Improve Low-Precision Network Accuracy.
The High-Dimensional Geometry of Binary Neural Networks.
A Scalable Laplace Approximation for Neural Networks.
Kernel Implicit Variational Inference.
Flipout: Efficient Pseudo-Independent Weight Perturbations on Mini-Batches.
Variational Inference of Disentangled Latent Concepts from Unlabeled Observations.
Variational image compression with a scale hyperprior.
Action-dependent Control Variates for Policy Optimization via Stein Identity.
Variational Message Passing with Structured Inference Networks.
Model compression via distillation and quantization.
Learning to Share: simultaneous parameter tying and Sparsification in Deep Learning.
Learning Approximate Inference Networks for Structured Prediction.
Deep Learning as a Mixed Convex-Combinatorial Optimization Problem.
Smooth Loss Functions for Deep Top-k Classification.
Demystifying MMD GANs.
Adversarial Dropout Regularization.
Learning Latent Representations in Neural Networks for Clustering through Pseudo Supervision and Graph-based Activity Regularization.
NerveNet: Learning Structured Policy with Graph Neural Networks.
An efficient framework for learning sentence representations.
Emergent Translation in Multi-Agent Communication.
FastGCN: Fast Learning with Graph Convolutional Networks via Importance Sampling.
Emergent Communication in a Multi-Modal, Multi-Step Referential Game.
Unsupervised Representation Learning by Predicting Image Rotations.
cGANs with Projection Discriminator.
Viterbi-based Pruning for Sparse Matrix with Fixed and High Index Compression Ratio.
Parametrized Hierarchical Procedures for Neural Programming.
Hierarchical Subtask Discovery with Non-Negative Matrix Factorization.
Distributed Distributional Deterministic Policy Gradients.
SEARNN: Training RNNs with global-local losses.
The Reactor: A fast and sample-efficient Actor-Critic agent for Reinforcement Learning.
MGAN: Training Generative Adversarial Nets with Multiple Generators.
WRPN: Wide Reduced-Precision Networks.
Evidence Aggregation for Answer Re-Ranking in Open-Domain Question Answering.
Neural-Guided Deductive Search for Real-Time Program Synthesis from Examples.
Syntax-Directed Variational Autoencoder for Structured Data.
Deep Neural Networks as Gaussian Processes.
Meta Learning Shared Hierarchies.
Maximum a Posteriori Policy Optimisation.
Meta-Learning and Universality: Deep Representations and Gradient Descent can Approximate any Learning Algorithm.
Divide and Conquer Networks.
MaskGAN: Better Text Generation via Filling in the _______.
Latent Constraints: Learning to Generate Conditionally from Unconditional Generative Models.
Mixed Precision Training.
The Kanerva Machine: A Generative Distributed Memory.
Improving GANs Using Optimal Transport.
An Online Learning Approach to Generative Adversarial Networks.
Generalizing Hamiltonian Monte Carlo with Neural Networks.
Minimax Curriculum Learning: Machine Teaching with Desirable Difficulties and Scheduled Diversity.
Graph Attention Networks.
Stabilizing Adversarial Nets with Prediction Methods.
Enhancing The Reliability of Out-of-distribution Image Detection in Neural Networks.
Polar Transformer Networks.
Decoupling the Layers in Residual Networks.
Auto-Conditioned Recurrent Networks for Extended Complex Human Motion Synthesis.
Espresso: Efficient Forward Propagation for Binary Deep Neural Networks.
Efficient Sparse-Winograd Convolutional Neural Networks.
Leveraging Grammar and Reinforcement Learning for Neural Program Synthesis.
Hyperparameter optimization: a spectral approach.
Imitation Learning from Visual Data with Multiple Intentions.
Latent Space Oddity: on the Curvature of Deep Generative Models.
Fidelity-Weighted Learning.
Semantic Interpolation in Implicit Models.
Deep Bayesian Bandits Showdown: An Empirical Comparison of Bayesian Deep Networks for Thompson Sampling.
Multi-View Data Generation Without View Supervision.
Unsupervised Learning of Goal Spaces for Intrinsically Motivated Goal Exploration.
Learning an Embedding Space for Transferable Robot Skills.
Learning Latent Permutations with Gumbel-Sinkhorn Networks.
Deep Learning with Logged Bandit Feedback.
A Neural Representation of Sketch Drawings.
Model-Ensemble Trust-Region Policy Optimization.
Truncated horizon Policy Search: Combining Reinforcement Learning & Imitation Learning.
Large Scale Optimal Transport and Mapping Estimation.
Minimal-Entropy Correlation Alignment for Unsupervised Deep Domain Adaptation.
AmbientGAN: Generative models from lossy measurements.
Beyond Word Importance: Contextual Decomposition to Extract Interactions from LSTMs.
Zero-Shot Visual Imitation.
Variance Reduction for Policy Gradient with Action-Dependent Factorized Baselines.
Progressive Growing of GANs for Improved Quality, Stability, and Variation.
Neural Sketch Learning for Conditional Program Generation.
Boosting Dilated Convolutional Networks with Mixed Tensor Decompositions.
Continuous Adaptation via Meta-Learning in Nonstationary and Competitive Environments.
Breaking the Softmax Bottleneck: A High-Rank RNN Language Model.
Characterizing Adversarial Subspaces Using Local Intrinsic Dimensionality.
Learning to Represent Programs with Graphs.
Spectral Normalization for Generative Adversarial Networks.
Wasserstein Auto-Encoders.
Learning Deep Mean Field Games for Modeling Large Population Behavior.
Certifying Some Distributional Robustness with Principled Adversarial Training.
On the insufficiency of existing momentum schemes for Stochastic Optimization.
Ask the Right Questions: Active Question Reformulation with Reinforcement Learning.
Spherical CNNs.
Emergence of Linguistic Communication from Referential Games with Symbolic and Pixel Input.
Training and Inference with Integers in Deep Neural Networks.
Multi-Scale Dense Networks for Resource Efficient Image Classification.
Synthetic and Natural Noise Both Break Neural Machine Translation.
On the Convergence of Adam and Beyond.