At present, many social network research uses the foreign platform data, but the domestic Sina Weibo does not have the very good interface to facilitate the researcher to data to carry on the analysis. In order to quickly obtain the data in the micro-blog, developed a support for parallel micro-bo data crawling tool. The tool can capture the fan information, micro-Bo Zhengwen and other content of the users in the microblog in real time, and the tool uses keyword matching technology to match the micro-blog with the specified conditions, and to crawl the relevant content; The tool supports parallel crawling and can crawl multiple user information at the same time. Finally, the serial Micro-blog crawler tool is compared with its parallel version, and the tool is used to analyze the problem of the influenza in some micro-blog data. The experimental results show that the parallel crawler has better speedup and can obtain data quickly, and the data is real-time and accurate.
The content source of this page is from Internet, which doesn't represent Alibaba Cloud's opinion;
products and services mentioned on that page don't have any relationship with Alibaba Cloud. If the
content of the page makes you feel confusing, please write us an email, we will handle the problem
within 5 days after receiving your email.
If you find any instances of plagiarism from the community, please send an email to:
info-contact@alibabacloud.com
and provide relevant evidence. A staff member will contact you within 5 working days.