End-to-End Learning of Driving Models from Large-Scale Video Datasets

Xu, Huazhe; Gao, Yang; Yu, Fisher; Darrell, Trevor

doi:10.1109/cvpr.2017.376

preprintJul 1, 2017Closed access

End-to-End Learning of Driving Models from Large-Scale Video Datasets

HXHuazhe Xu YGYang Gao FYFisher Yu TDTrevor Darrell

University of California, Berkeley

Indexed incrossref

Abstract

Robust perception-action models should be learned from training data with diverse visual appearances and realistic behaviors, yet current approaches to deep visuomotor policy learning have been generally limited to in-situ models learned from a single vehicle or simulation environment. We advocate learning a generic vehicle motion model from large scale crowd-sourced video data, and develop an end-to-end trainable architecture for learning to predict a distribution over future vehicle egomotion from instantaneous monocular camera observations and previous vehicle state. Our model incorporates a novel FCN-LSTM architecture, which can be learned from large-scale crowd-sourced vehicle action data, and leverages…

Citation impact

864

total citations

FWCI: 77.66
Percentile: 100%
References: 41

Citations per year

Authors

4

Topics & keywords

Topics

Keywords

Computer science
Artificial intelligence
Monocular
Machine learning
Segmentation
Scale (ratio)
Deep learning
Action (physics)

No related works found for this paper.

Funding

DA
Defense Advanced Research Projects Agency