articleJun 28, 2009Closed access
Beyond blacklists
University of California, San Diego
Indexed incrossref
Abstract
Malicious Web sites are a cornerstone of Internet criminal activities. As a result, there has been broad interest in developing systems to prevent the end user from visiting such sites. In this paper, we describe an approach to this problem based on automated URL classification, using statistical methods to discover the tell-tale lexical and host-based properties of malicious Web site URLs. These methods are able to learn highly predictive models by extracting and automatically analyzing tens of thousands of features potentially indicative of suspicious URLs. The resulting classifiers obtain 95-99% accuracy, detecting large numbers of malicious Web sites from their URLs, with only modest false positives.
Citation impact
768
total citations
- FWCI
- 70.63
- Percentile
- 100%
- References
- 23
Citations per year
Authors
4Topics & keywords
Topics
Keywords
- Computer science
UN Sustainable Development Goals
- Peace, Justice and strong institutions
No related works found for this paper.