Linux system using the wget command to download the entire station as a mirror site
2011-05-28 18:13:01 | 1 Reads | comments:0 Article | itokit
Under Linux, the complete command to use the wget command to capture the site as a mirror is to ignore the robots.txt restrictions in the root directory of the website. and can simulate a normal browser information to download the site.
C + + Code copy content to clipboard
- Wget-m-E robots=off-u "mozilla/5.0 (Windows; U Windows NT 5.1; ZH-CN; rv:1.9.1.6) gecko/20091201 firefox/3.5.6 " " http://www.example.com/"
wget Command Parameter comments:
C + + Code copy content to clipboard
- "-E Robots=off" let wget bullying ignore robots.txt protocol
- -U "mozilla/5.0 (Windows; U Windows NT 5.1; ZH-CN; rv:1.9.1.6) gecko/20091201 firefox/3.5.6 "Forged agent information
Linux system using the wget command to download the entire station as a mirror site