Abstract
Though deep neural networks have shown great success in the large data domain, they generally perform poorly on few-shot learning tasks, where a model has to quickly generalize after seeing very few examples from each class. The general belief is that gradient-based optimization in high capacity models requires many iterative steps over many examples to perform well. Here, we propose an LSTM-based meta-learner model to learn the exact optimization algorithm used to train another learner neural network in the few-shot regime. The parametrization of our model allows it to learn appropriate parameter updates specifically for the scenario where a set amount of updates will be made, while also learning a general…
Citation impact
2,440
total citations
- FWCI
- 130.73
- Percentile
- 100%
- References
- 20
Citations per year
Authors
2Topics & keywords
Topics
Keywords
- Computer science
- Initialization
- Artificial intelligence
- Meta learning (computer science)
- Convergence (economics)
- Deep learning
- Metric (unit)
- Artificial neural network
No related works found for this paper.