Optimization as a Model for Few-Shot Learning

Ravi, Sachin; Larochelle, Hugo

articleInternational Conference on Learning RepresentationsApr 24, 2017Closed access

Optimization as a Model for Few-Shot Learning

Princeton University · Université de Sherbrooke

Abstract

Though deep neural networks have shown great success in the large data domain, they generally perform poorly on few-shot learning tasks, where a model has to quickly generalize after seeing very few examples from each class. The general belief is that gradient-based optimization in high capacity models requires many iterative steps over many examples to perform well. Here, we propose an LSTM-based meta-learner model to learn the exact optimization algorithm used to train another learner neural network in the few-shot regime. The parametrization of our model allows it to learn appropriate parameter updates specifically for the scenario where a set amount of updates will be made, while also learning a general…

Citation impact

2,440

total citations

FWCI: 130.73
Percentile: 100%
References: 20

Citations per year

Authors

2

Topics & keywords

Topics

Keywords

Computer science
Initialization
Artificial intelligence
Meta learning (computer science)
Convergence (economics)
Deep learning
Metric (unit)
Artificial neural network

No related works found for this paper.