Ask Your Question
0

detecting near-duplicate images

asked 2014-04-06 06:50:46 -0600

drevicko gravatar image

Does OpenCV have algorithms for duplicate image detection? By that, I mean finding in a collection of images sets of images that differ from some original by simple transformations such as compression (eg: to smaller pixel dimensions), minor cropping, changing to greyscale, minor colour balance changes etc..

Typical steps might involve:

  • making feature representations of images (eg: SURF, SIFT etc..)
  • matching features between pairs of images
  • identifying sets of matching features as indicating a duplicate

For the first two, there are feature extractors and matchers. I am less sure about how to achieve the 3rd. Perhaps finding an affine transformation with a confidence level?

edit retag flag offensive close merge delete

Comments

1

Looks more like a retrieval problem which is typically solved via Bag-of-(visual)-Words, s. also http://answers.opencv.org/question/8677/image-comparison-with-a-database/#8686

Guanta gravatar imageGuanta ( 2014-04-06 07:55:26 -0600 )edit

1 answer

Sort by ยป oldest newest most voted
0

answered 2014-08-08 07:49:19 -0600

I'm really not sure whether OpenCV has algorithms to scan identical files. I found this article will be of useful for you.

A duplicate program is specially designed to scan duplicate files in your hard disk, come up with results of numerous duplicate files. It'll prompt you which files to be deleted and which files you need. Accordingly, you select the files you would like to delete, and go ahead. It'll do rest task for you.

edit flag offensive delete link more

Question Tools

1 follower

Stats

Asked: 2014-04-06 06:50:46 -0600

Seen: 3,108 times

Last updated: Aug 08 '14

Related questions