articleJun 14, 2009Closed access

Feature hashing for large scale multitask learning

Yahoo (United States)

Indexed incrossref

Abstract

Empirical evidence suggests that hashing is an effective strategy for dimensionality reduction and practical nonparametric estimation. In this paper we provide exponential tail bounds for feature hashing and show that the interaction between random subspaces is negligible with high probability. We demonstrate the feasibility of this approach with experimental results for a new use case --- multitask learning with hundreds of thousands of tasks.

Citation impact

936
total citations
FWCI
27.57
Percentile
100%
References
17
Citations per year

Authors

5

Topics & keywords

Keywords
  • Computer science
  • Feature (linguistics)
  • Scale (ratio)
  • Hash function
  • Artificial intelligence
  • Feature learning
  • Feature hashing
  • Machine learning
No related works found for this paper.