April 25 2017
Looking for an automated way to assess the quality of your generated summaries? The authors of this paper generate a bunch of questions based off of a source text, and then ask those questions using the generated text as the database. If the information is found in both, then the generated text can be said to be a good representation of the source text. Seems reasonable to me. I might try it for this site.
A program that writes programs needs recursion is the conclusion of this new paper. The authors take several standard programming tasks and propose a model architecture that proves it has perfect generalizability with small amounts of training data.
Visual Q&A models are incredibly useful, but many of them are still fragile. This paper proposes an end-to-end solution that does away with parsers while reducing errors by almost 50%. Holy cow, batman.
April 13 2017
Breakthrough on the French Street Names dataset. This paper boasts a huge improvement over previous state-of-the-art models including Inception V3 and Inception Resnet V2. And since that wasn’t hard enough, they also tested it against Google Street View, cause it’s all like nbd.
What’s better than BEGAN? MAGAN! Or at least that’s what the authors of this paper are saying. They are probably right because a) they have a simpler setup and b) they use public datasets so their work is reproducible. Plus they are promising to release the code on Github in the near future.
New 3d point cloud dataset and benchmark with semantic labels. Should be a lot of use to people doing AR. They did a nice job with the benchmark too, and have an automated submission process with a public leaderboard.
April 12 2017
There’s a lot of research around VQA right now. This new paper gets a new state-of-the-art on VQA 1, an improvement of 0.4%. It’s by a small margin, but it’s also using a simpler architecture.
Autoencoding for networks. Struc2Vec gives you a new way to view and understand your networks.
New paper out of Stanford, Pennsylvania State and Google Brain delves into what happens when a ML model is attacked, and how to incorporate robustness. It’s an interesting read in an interesting space.
April 11 2017
Not just for pong, this paper simulates environments from 2d Atari games to 3d racing sims. It has DQN Scores on a bunch of games (and goes into high detail on how they got those scores), but they don’t explicitly compare those score with current state-of-the-art.
Building a recommendation engine? This paper gets a new state-of-the-art on ratings predictions. Show your users what they want to see.
April 07 2017
I am looking forward to the day when I can write a model that writes models for me. Maybe someone will turn it into an API and I won’t have to write anything. This paper gets us another step closer. Working with Python and other high-level languages, they have a new state of the art in code generation.
Telling the difference between various types of red sedans or crows is hard for humans, let alone neural nets. This paper does it better and cheaper.
Looking for ways to generate a FAQ from your docs? This paper points to a potential direction.
April 06 2017
The next time we do a Kaggle comptition we’ll try this technique. When working on an image segmentation problem, it’s those last couple pixels around the edges that make all the difference in your mIoU.
State Of The Art Results
- Apr 25 End to End Module Networks
- Apr 13 General Approach to Real World Text Extraction
- Apr 13 MAGAN, Better than BEGAN
- Apr 12 New SOTA for VQA 1.0
- Apr 11 Predicting Recomendations with TransNets
- Apr 7 Use Machine Learning to Write Your Code For You
- Apr 5 Build A Faster Image Search
- Apr 5 2D to 3D Depth in Noisy Environments
- Apr 4 New State of The Art on Keyphrase Boundary Classification
- Apr 4 New State of the Art In Semantic Role Labeling
- Apr 21 SREFI: Synthesis of Realistic Example Face Images
- Apr 15 Neural Paraphrase Identification of Questions with Noisy Pretraining
- Apr 13 3d Point Cloud Dataset and Benchmark
- Apr 10 Loss Max-Pooling for Semantic Image Segmentation
- Apr 9 BigHand2.2M Benchmark: Hand Pose Dataset and State of the Art Analysis
- Apr 6 A Low Altitude Geo-Referenced Drone Dataset
- Apr 3 Auto-Encode Your Way to Realistic Images
- Mar 21 Boost Your Cross-Media Retrieval Process with Twitter100k
- Mar 21 Counterfactual Fairness: Combat the Inherent Social Biases of Your Dataset
- Apr 24 Accelerated Nearest Neighbor Search with Quick ADC
- Apr 24 Consistency of community detection in multi-layer networks using spectral and matrix factorization methods
- Apr 24 A Saddle Point Approach to Structured Low-rank Matrix Learning in Large-scale Applications
- Apr 24 Detecting and Recognizing Human-Object Interactions
- Apr 24 A Trie-Structured Bayesian Model for Unsupervised Morphological Segmentation
- Apr 24 Accurate Optical Flow via Direct Cost Volume Processing
- Apr 24 Elite Bases Regression: A Real-time Algorithm for Symbolic Regression
- Apr 24 A Real-time Hand Gesture Recognition and Human-Computer Interaction System
- Apr 24 Measuring the Accuracy of Object Detectors and Trackers
- Apr 24 Joint Modeling of Text and Acoustic-Prosodic Cues for Neural Parsing
- Apr 24 Fast PET reconstruction using Multi-scale Fully Convolutional Neural Networks
- Apr 24 Supervised Adversarial Networks for Image Saliency Detection
- Apr 24 Automatic Liver Lesion Segmentation Using A Deep Convolutional Neural Network Method
- Apr 24 Learning from Comparisons and Choices
- Apr 24 Entropic Trace Estimates for Log Determinants
- Apr 24 Turing at SemEval-2017 Task 8: Sequential Approach to Rumour Stance Classification with Branch-LSTM
- Apr 24 What is the Essence of a Claim? Cross-Domain Claim Identification
- Apr 24 Reinforcement Learning Based Dynamic Selection of Auxiliary Objectives with Preserving of the Best Found Solution
- Apr 24 Stochastic Constraint Programming as Reinforcement Learning
- Apr 24 Monocular Visual Odometry with a Rolling Shutter Camera