Google Map-Similar image search principle-Java implementation

Last Update:2015-05-27 Source: Internet

Author: User

Developer on Alibaba Coud: Build your first app with APIs, SDKs, and tutorials on the Alibaba Cloud. Read more ＞

The former in Ruan Yi Feng's blog saw this "similar image search principle" blog, there is a kind of impulse to realize these principles.

Google "Similar image search": You can use an image to search all images on the internet that resemble it.

To open the Google Image search page:

Click Use to upload a Angelababy original image:

When you click Search, Google will find a similar image, with the higher the image similarity. Such as:

What is the principle of this technology? How does a computer know that two pictures are similar?

According to Dr. Neal Krawetz's explanation, the key technology for similar image search is called "Perceptual hashing Algorithm" (Perceptualhash algorithm), which generates a "fingerprint" (fingerprint) string for each image. Then compare the fingerprints of the different pictures. The closer the result is, the more similar the picture is.

The following is one of the simplest Java implementations:

Preprocessing: Reading pictures

[Java]View Plaincopyprint?

File inputfile = newFile (filename);
BufferedImage sourceimage = Imageio.read (inputfile); //Read picture file

The first step is to reduce the size.

Reduce the image to 8x8 's size, a total of 64 pixels. The role of this step is to remove the details of the picture, only the structure, shading and other basic information, discard the different sizes, proportions of the picture differences.

[Java]View Plaincopyprint?

int width= 8;
Intheight = 8;
Targetw,targeth indicates the target length and width, respectively.
int type= sourceimage.gettype (); //Picture type
Bufferedimagethumbimage = null;
Double sx= (double) width/sourceimage.getwidth ();
Double sy= (double) height/sourceimage.getheight ();

[Java]View Plaincopyprint?

Set the width and height of the image to the same length, whichever is shorter
if (b) {
if (SX > Sy) {
sx= Sy;
width= (int) (SX * SOURCEIMAGE.GETWIDTH ());
}Else {
sy= SX;
height= (int) (SY * sourceimage.getheight ());
}
}
Custom picture
if (type== bufferedimage.type_custom) { //handmade
COLORMODELCM = Sourceimage.getcolormodel ();
Writablerasterraster = Cm.createcompatiblewritableraster (width,height);
booleanalphapremultiplied = Cm.isalphapremultiplied ();
thumbimage= New BufferedImage (cm, raster, alphapremultiplied, null);
} Else {
//Known pictures, such as Jpg,png,gif
thumbimage= New BufferedImage (width, height, type);
}
Call drawing class drawing to reduce the size of the diagram
GRAPHICS2DG = Target.creategraphics ();
Smoother than Exlax:
G.setrenderinghint (renderinghints.key_rendering, renderinghints.value_render_quality);
G.drawrenderedimage (sourceimage,affinetransform.getscaleinstance (SX, SY));
G.dispose ();

The second step is to simplify the color.

Converts the zoomed-in image to a level 64 grayscale. That is, all pixels have a total of 64 colors.

[HTML]View Plaincopyprint?

int[]pixels = New int[width * height];
for (inti = 0; I < width; i++) {
for (int j = 0; J < height; j + +) {
pixels[i* height + j] = Rgbtogray (Thumbimage.getrgb (i, j));
}
}
/**
* Gray Value Calculation
* @param pixels color RGB value (red-green-blue Red green blue)
* @return int Gray value
*/
public static int Rgbtogray (int pixels) {
int _alpha = (pixels >>) & 0xFF;
int _red = (pixels >>) & 0xFF;
int _green = (pixels >> 8) & 0xFF;
int _blue = (pixels) & 0xFF;
return (int) (0.3 * _red + 0.59 * _green + 0.11 * _blue);
}

The third step is to calculate the average.

Calculates a grayscale average of all 64 pixels.

[Java]View Plaincopyprint?

int avgpixel= 0;
int m = 0;
for (int i =0; i < pixels.length; ++i) {
M +=pixels[i];
}
m = m/pixels.length;
Avgpixel = m;

Fourth step, compare the grayscale of the pixel.

The grayscale of each pixel is compared to the average. Greater than or equal to the average, recorded as 1, less than the average, recorded as 0.

[Java]View Plaincopyprint?

Int[] comps= new int[width * height];
for (inti = 0; i < comps.length; i++) {
if (Pixels[i] >= avgpixel) {
comps[i]= 1;
}Else {
comps[i]= 0;
}
}

Fifth step, calculate the hash value.

By combining the results of the previous step, you make up a 64-bit integer, which is the fingerprint of the image. The order of the combinations is not important, just make sure all the pictures are in the same order.

= = 8f373714acfcf4d0

[HTML]View Plaincopyprint?

Stringbufferhashcode = new StringBuffer ();
for (inti = 0; I < comps.length; i+= 4) {
intresult = comps[i] * (int) Math.pow (2, 3) + comps[i + 1] * (int) Math.pow (2, 2) + Comps[i + 2] * (int) Math.pow (2 , 1) + Comps[i + 2];
Hashcode.append (Binarytohex (result));//binary into 16 binary
}
Stringsourcehashcode = hashcode.tostring ();

After getting the fingerprint, you can compare different pictures and see how many of the 64 bits are not the same. In theory, this equates to the calculation of "Hamming distance" (hammingdistance). If the data bits are not more than 5, the two images are similar, and if they are greater than 10, they are two different pictures.

[Java]View Plaincopyprint?

int difference = 0;
int Len =sourcehashcode.length ();
for (inti = 0; i < len; i++) {
if (Sourcehashcode.charat (i)! = Hashcode.charat (i)) {
difference++;
}
}

You can put a few pictures together, and also calculate their Hamming distance comparison, you can see whether the two pictures are similar.

The advantages of this algorithm are simple and fast, not affected by the size of the picture, the disadvantage is that the contents of the picture can not be changed. If you add a few words to the picture, it will not be recognized. So, it's best to use thumbnails to find out the original image.

In practical applications, more powerful phash algorithms and sift algorithms are often used to identify the deformation of images. As long as the degree of deformation does not exceed 25%, they can match the original image. Although these algorithms are more complex, the principle is the same as the simple algorithm above, that is, to first convert the image into a hash string, and then compare.

Most of the above content directly from the Nanyi site copy, want to see the original children's shoes can go to the top link click to see.

Provide source download, source download link: http://download.csdn.net/detail/luohong722/3965112

Reference Links: Magic image processing algorithm, 11 similar image search engine recommendation, to map search will not be difficult, http://insidesearch.blogspot.com/2011/07/teaching-computers-to-see-image.html

Google Map-Similar image search principle-Java implementation

This article is an English version of an article which is originally in the Chinese language on aliyun.com and is provided for information purposes only. This website makes no representation or warranty of any kind, either expressed or implied, as to the accuracy, completeness ownership or reliability of the article or any translations thereof. If you have any concerns or complaints relating to the article, please send an email, providing a detailed description of the concern or complaint, to info-contact@alibabacloud.com. A staff member will contact you within 5 working days. Once verified, infringing content will be removed immediately.

A Free Trial That Lets You Build Big!

Start building with 50+ products and up to 12 months usage for Elastic Compute Service

Get Started for Free

Sales Support

1 on 1 presale consultation

Chat Contact Sales
After-Sales Support

24/7 Technical Support 6 Free Tickets per Quarter Faster Response

Open a Ticket
Alibaba Cloud offers highly flexible support services tailored to meet your exact needs.

Learn More

Google Map-Similar image search principle-Java implementation

Contact Us

What's Trending

Top 10 Tags

Top 10 Keywords

A Free Trial That Lets You Build Big!

Sales Support

After-Sales Support

Google Map-Similar image search principle-Java implementation

Contact Us

What's Trending

Top 10 Tags

Top 10 Keywords

Trending Topic

A Free Trial That Lets You Build Big!

Sales Support

After-Sales Support