
Neural Program Synthesis with a Differentiable Fixer
We present a new program synthesis approach that combines an encoder-dec...
SCELMo: Source Code Embeddings from Language Models
Continuous embeddings of tokens in computer programs have been used to s...
OptTyper: Probabilistic Type Inference by Optimising Logical and Natural Constraints
We present a new approach to the type inference problem for dynamic lang...
Big Code != Big Vocabulary: Open-Vocabulary Models for Source Code
Statistical language modeling techniques have successfully been applied ...
Towards Modular Algorithm Induction
We present a modular neural network architecture Main that learns algori...
Incremental Sampling Without Replacement for Sequence Models
Sampling is a fundamental technique, and sampling without replacement is...
Learning to Represent Programs with Property Signatures
We introduce the notion of property signatures, a representation for pro...
Learning to Fix Build Errors with Graph2Diff Neural Networks
Professional software developers spend a significant amount of time fixi...
Robust Variational Autoencoders for Outlier Detection in Mixed-Type Data
We focus on the problem of unsupervised cell outlier detection in mixed ...
How Often Do Single-Statement Bugs Occur? The ManySStuBs4J Dataset
Program repair is an important but difficult software engineering proble...
Learning Semantic Annotations for Tabular Data
The usefulness of tabular data such as web tables critically depends on ...
Maybe Deep Neural Networks are the Best Choice for Modeling Source Code
Statistical language modeling techniques have successfully been applied ...
Wrangling Messy CSV Files by Detecting Row and Type Patterns
It is well known that data scientists spend the majority of their time o...
ColNet: Embedding the Semantics of Web Tables for Column Type Prediction
Automatically annotating column types with knowledge base (KB) concepts ...
Probabilistic Programming with Densities in SlicStan: Efficient, Flexible and Deterministic
Stan is a probabilistic programming language that has been increasingly ...
Deep Learning to Detect Redundant Method Comments
Comments in software are critical for maintenance and reuse. But apart f...
Ratio Matching MMD Nets: Low dimensional projections for effective deep generative models
Deep generative models can learn to generate realistic-looking images on...
Variational Inference In Pachinko Allocation Machines
The Pachinko Allocation Machine (PAM) is a deep topic model that allows ...
Synthesis of Differentiable Functional Programs for Lifelong Learning
We present a neurosymbolic approach to the lifelong learning of algorith...
Interpreting Deep Classifier by Visual Distillation of Dark Knowledge
Interpreting black box classifiers, such as deep networks, allows an ana...
Popularity of arXiv.org within Computer Science
It may seem surprising that, out of all areas of science, computer scien...
A Survey of Machine Learning for Big Code and Naturalness
Research at the intersection of machine learning, programming languages,...
VEEGAN: Reducing Mode Collapse in GANs using Implicit Variational Learning
Deep generative models provide powerful tools for distributions over com...
Autoencoding Variational Inference For Topic Models
Topic models are one of the most popular methods for learning representa...
Learning Continuous Semantic Representations of Symbolic Expressions
Combining abstract, symbolic reasoning with continuous neural reasoning ...
Clustering with a Reject Option: Interactive Clustering as Bayesian Prior Elicitation
A good clustering can help a data analyst to explore and understand a da...
A Subsequence Interleaving Model for Sequential Pattern Mining
Recent sequential pattern mining methods have used the minimum descripti...
A Convolutional Attention Network for Extreme Summarization of Source Code
Attention mechanisms in neural networks have proved useful for problems ...
Latent Bayesian melding for integrating individual and population models
In many statistical problems, a more coarse-grained model may be suitabl...
A Bayesian Network Model for Interesting Itemsets
Mining itemsets that are the most interesting under a statistical model ...
Semi-Separable Hamiltonian Monte Carlo for Inference in Bayesian Hierarchical Models
Sampling from hierarchical Bayesian models is often difficult for MCMC m...
Piecewise Training for Undirected Models
For many large undirected models that arise in real-world applications, ...
Improved Dynamic Schedules for Belief Propagation
Belief propagation and its variants are popular methods for approximate ...
An Introduction to Conditional Random Fields
Often we wish to predict a large number of variables that depend on each...
Bayesian inference for queueing networks and modeling of internet services
Modern Internet services, such as those at Google, Yahoo!, and Amazon, h...
Machine learning, natural language processing, computer systems applications of machine learning. Graphical modelling, approximate inference.