The author (watcher ms) read many Chinese-language documents while setting up and developing with nutch, but found them short on detail and containing errors. He therefore recorded his own hands-on process and corrected some of the errors in those articles. The detailed walkthrough shows a simple secondary development process and lowers the threshold for beginners. It cannot be guaranteed to be completely free of errors; if you find any problems, corrections are welcome.
This article is from the "watcher ms" blog. Please do not reprint it.
Directory:
1. Detailed introduction to the secondary development of nutch1.2 (1) [illustrated] ------ setting up the cygwin environment on the Windows platform
2. Detailed introduction to the secondary development of nutch1.2 (2) [illustrated] ------ setting up nutch1.2 on the Windows platform
3. Detailed introduction to the secondary development of nutch1.2 (3) [illustrated] ------ secondary development of nutch1.2 (interface modification)
4. Detailed introduction to the secondary development of nutch1.2 (4) [illustrated] ------ secondary development of nutch1.2 (Chinese word segmentation)
I. Development environment (using my own setup as an example):
Personal development machine: Windows Server 2003 + cygwin + eclipse3.2
II. Steps:
1. Download and install cygwin (http://cygwin.com/install.html)
<1>. Install cygwin
Run the downloaded setup.exe.
Click Next.
Select the first option, "Install from Internet".
Select the installation directory (whatever suits your setup), then click Next.
Select the directory in which to store the packages downloaded from the Internet, then click Next.
If you do not use a proxy to access the Internet, keep the default first option and click Next.
Select a cygwin download mirror. At the time of writing, the only mirror in China is the 163 one, so for users in China the first entry in the list is the preferred choice.
The next step is the most important one, and I want to emphasize it. When I first came into contact with nutch and installed cygwin on Windows, the online materials I consulted said that every package should be installed at this step, otherwise many errors would occur. At the time I believed this. To avoid errors later in development, I chose to download all the packages, and the download ran on the server for two days. Practice has since shown that the default selection is sufficient at this step (the entire installation then takes about 5 minutes), and there is no need to download all the packages. For details, see the accompanying screenshot.
Click Next and wait for cygwin to install.
<2>. Configure cygwin
After cygwin is installed, the most important step is to configure an environment variable for it.
Edit the Path system variable and append the absolute path of the bin folder under your cygwin installation directory to its value.
For example, mine is G:\cygwin\bin.
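To confirm that the Path edit took effect, a small script can check whether the cygwin bin directory appears among the Path entries. The following is only a sketch: the function name path_contains is made up for illustration, and G:\cygwin\bin is simply the example location from this article, so substitute your own directory. Open a new command prompt before running it, since consoles that were already open keep the old Path value.

```python
import ntpath
import os


def path_contains(path_value, directory):
    """Return True if `directory` is one of the entries in a Windows-style PATH string."""
    def normalise(entry):
        # Drop surrounding whitespace and quotes, unify slashes, ignore case
        # and remove any trailing separator so equivalent spellings compare equal.
        cleaned = entry.strip().strip('"')
        return ntpath.normcase(ntpath.normpath(cleaned))

    target = normalise(directory)
    # Windows separates PATH entries with semicolons; skip empty entries.
    entries = [e for e in path_value.split(";") if e.strip()]
    return any(normalise(e) == target for e in entries)


if __name__ == "__main__":
    # Assumed example directory taken from this article; change it to match your install.
    print(path_contains(os.environ.get("PATH", ""), r"G:\cygwin\bin"))
```

The comparison ignores case and slash direction because Windows treats G:\cygwin\bin and g:/cygwin/bin/ as the same directory.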
This completes the first step of setting up nutch development on the Windows platform: cygwin is now installed.
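As a final sanity check, you can ask the system where it finds a couple of cygwin tools. This is a minimal sketch rather than part of the original procedure: locate_tools is a hypothetical helper name, and bash and cygpath are assumed here as examples of programs that a default cygwin installation provides.

```python
import shutil


def locate_tools(names):
    """Map each executable name to its resolved path on PATH, or None if it is not found."""
    return {name: shutil.which(name) for name in names}


if __name__ == "__main__":
    # "bash" and "cygpath" are assumed examples of tools from a default cygwin install.
    for name, location in locate_tools(["bash", "cygpath"]).items():
        print(f"{name}: {location or 'not found on PATH'}")
```

If a tool is reported as not found, recheck the Path entry and make sure the script was started from a newly opened command prompt.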