articleJan 1, 2003Closed access

Video Google: a text retrieval approach to object matching in videos

Oxford Research Group · University of Oxford

Indexed incrossref

Abstract

We describe an approach to object and scene retrieval which searches for and localizes all the occurrences of a user outlined object in a video. The object is represented by a set of viewpoint invariant region descriptors so that recognition can proceed successfully despite changes in viewpoint, illumination and partial occlusion. The temporal continuity of the video within a shot is used to track the regions in order to reject unstable regions and reduce the effects of noise in the descriptors. The analogy with text retrieval is in the implementation where matches on descriptors are pre-computed (using vector quantization), and inverted file systems and document rankings are used. The result is that retrieved…

Citation impact

6,432
total citations
FWCI
38.26
Percentile
100%
References
24
Citations per year

Authors

2

Topics & keywords

Keywords
  • Computer science
  • Artificial intelligence
  • Computer vision
  • Information retrieval
  • Object (grammar)
  • Image retrieval
  • Matching (statistics)
  • Invariant (physics)
No related works found for this paper.