We can judge whether it is a spider by http_user_agent, the spider of search engine has its own unique symbol, the following list takes part.
- function Is_crawler () {
- $userAgent = Strtolower ($_server[' http_user_agent ');
- $spiders = Array (
- ' Googlebot ',//Google crawler
- ' Baiduspider ',//Baidu Crawler
- ' Yahoo! slurp ',//Yahoo crawler
- ' Yodaobot ',//Youdao crawler
- ' MSNBot '//Bing Crawler
- More crawler Keywords
- );
- foreach ($spiders as $spider) {
- $spider = Strtolower ($spider);
- if (Strpos ($userAgent, $spider)!== false) {
- return true;
- }
- }
- return false;
- }
Copy Code The following PHP code comes with more spider logos
- function Iscrawler () {
- echo $agent = Strtolower ($_server[' http_user_agent ');
- if (!empty ($agent)) {
- $spiderSite = Array (
- "Tencenttraveler",
- "Baiduspider+",
- "Baidugame",
- "Googlebot",
- "MSNBot",
- "Sosospider+",
- "Sogou web Spider",
- "Ia_archiver",
- "Yahoo! slurp",
- "Youdaobot",
- "Yahoo slurp",
- "MSNBot",
- "Java (Often spam bot)",
- "Baiduspider",
- "Voila",
- "Yandex bot",
- "Bspider",
- "Twiceler",
- "Sogou Spider",
- "Speedy Spider",
- "Google AdSense",
- "Heritrix",
- "Python-urllib",
- "Alexa (IA archiver)",
- "Ask",
- "Exabot",
- "Custo",
- "Outfoxbot/yodaobot",
- "YaCy",
- "Surveybot",
- "Legs",
- "Lwp-trivial",
- "Nutch",
- "Stackrambler",
- "The Web Archive (IA archiver)",
- "Perl Tool",
- "Mj12bot",
- "Netcraft",
- "Msiecrawler",
- "WGet Tools",
- "Larbin",
- "Fish Search",
- );
- foreach ($spiderSite as $val) {
- $str = Strtolower ($val);
- if (Strpos ($agent, $STR)!== false) {
- return true;
- }
- }
- } else {
- return false;
- }
- }
- if (Iscrawler ()) {
- echo "Hello Spider essence!" ";
- }
- else{
- echo "You're not a Spider-Man!" ";
- }
Copy Code
|