1.robots.txt file
robots.txt file We've written about crawlers, we know that this file tells us which directories are forbidden to crawl. But most of the time we can judge the type of CMS by robots.txt file.
Such as:
From the WP path can be seen that this is the WordPress CMS
This is more obvious, just tell us it's pageadmin cms.
There are some robots.txt that are not clearly written. Let's see the weaving dream.
From the robots.txt can not directly see what the CMS, we will directly copy him to Baidu to inquire
So we found a dream CMS . 2. Search by Copyright information
Generally pull directly to the bottom to view the copyright information, some sites will show up, such as weaving dream of this 3. How to view the source code of the Web page
Some sites do not have robot.txt, also change the version information, this time the home page to view the source code may be found 4. Compare site MD5 values
Some CMS scanner is to use this principle, the first collection of a certain path of a CMS file MD5 value, require that this file is generally not modified by the user. Then visit this site under the same path if the file exists, and the existing words compare MD5 values. The same can be reported for the CMS type. The ability to test the dictionary.