Article · May 8, 2007 · Closed access

Cantina

University of Pittsburgh · Carnegie Mellon University

Indexed in Crossref

Abstract

Phishing is a significant problem involving fraudulent email and web sites that trick unsuspecting users into revealing private information. In this paper, we present the design, implementation, and evaluation of CANTINA, a novel, content-based approach to detecting phishing web sites, based on the TF-IDF information retrieval algorithm. We also discuss the design and evaluation of several heuristics we developed to reduce false positives. Our experiments show that CANTINA is good at detecting phishing sites, correctly labeling approximately 95% of phishing sites.
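The abstract names TF-IDF as the basis of the approach but does not spell out the computation. As a rough illustration only, the sketch below scores the terms of one page against a small reference corpus and returns the highest-ranked ones. The function names, the tokenizer, the smoothed IDF variant, and the toy corpus are all assumptions made for this example; they are not taken from the paper, and how CANTINA uses the resulting terms is not described in this record.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens (assumed tokenizer)."""
    return re.findall(r"[a-z0-9]+", text.lower())


def top_tfidf_terms(page_text, corpus, k=5):
    """Return the k terms of page_text with the highest TF-IDF score.

    corpus is a list of reference documents used only to estimate
    document frequencies. This is a generic TF-IDF sketch, not the
    paper's implementation.
    """
    tokens = tokenize(page_text)
    if not tokens:
        return []

    term_counts = Counter(tokens)
    doc_term_sets = [set(tokenize(doc)) for doc in corpus]
    n_docs = len(doc_term_sets)

    scores = {}
    for term, count in term_counts.items():
        # Document frequency: number of corpus documents containing the term.
        df = sum(1 for terms in doc_term_sets if term in terms)
        # Smoothed IDF (an assumption) so unseen terms do not divide by zero.
        idf = math.log((1 + n_docs) / (1 + df)) + 1.0
        tf = count / len(tokens)
        scores[term] = tf * idf

    # Highest score first; ties broken alphabetically for determinism.
    ranked = sorted(scores, key=lambda t: (-scores[t], t))
    return ranked[:k]


# Hypothetical toy data, for illustration only.
corpus = ["the bank is open", "the weather is nice", "the game is on"]
page = "examplebank login the examplebank account the"
top_terms = top_tfidf_terms(page, corpus, k=3)
```

Terms that are frequent on the page but rare in the corpus rank highest, which is why a common word like "the" falls below page-specific terms in the toy data.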

Citation impact

Total citations: 737
FWCI: 58.84
Percentile: 100%
References: 19

Authors

3

Topics & keywords

Keywords
  • Phishing
  • Computer science
  • Heuristics
  • False positive paradox
  • Information retrieval
  • Private information retrieval
  • World Wide Web
  • The Internet
UN Sustainable Development Goals
  • Peace, Justice and Strong Institutions

Funding