[Data set] Sina Weibo data set MICROBLOGPCU

Source: Internet
Author: User

Data set: Download

Absrtact: MICROBLOGPCU is collected from Sina Weibo, which can be used to study machine learning methods and social relationships.

This data set is used by the original author to explore spammers in Weibo (the person who sent the spam message), and their demo is here


Property information for the dataset:

Weibo_user.csv
-USER_ID: User ID
-user_name: User Nickname
-gender: Sex, Male,female,other
-class: Account Level
-message: Account registration location or other personal information
-post_num: Zip Code
Number of-follower_num:followers
Number of-followee_num:followee
-follow Ratio:followee_num/follower_num;
-is_spammer:manually Annotated label, 1 means spammer,0 represents non-spammer;


User_post.csv
-POST_ID: The ID of Weibo
-post_time: Release Time
-POSTER_ID: The ID of the publishing user
-repost_num: Number of forwards
-commnet_num: Number of reviews


Followe-followee.csv
-follower:the nickname of follower;
-follower_id:the user ID of follower;
-followee:the nickname of Followee;
-followee_id:the User ID of Followee;


post.csv and user_post.csv are similar, and the post in it is retrievalled by a certain key word related to a topic;


Take out some of the data to make a diagram

Install.packages (' igraph ') library (igraph) follower_followee<-read.csv (' Microblogpcu/follower_followee.csv ') Follower_followee_part<-follower_followee[1:1000,]gg<-graph.data.frame (Data.frame (er=follower_followee_ part$follower_id,ee=follower_followee_part$followee_id) plot (GG,     Vertex.label=na,      # #不显示标签     Edge.arrow.mode= '-',  # #不使用箭头     vertex.size = 5       # #设置结点圆的大小     )



Reference: Https://archive.ics.uci.edu/ml/datasets/microblogPCU


Reprint Please specify source: http://blog.csdn.net/zhyoulun/article/details/46442899

[Data set] Sina Weibo data set MICROBLOGPCU

Contact Us

The content source of this page is from Internet, which doesn't represent Alibaba Cloud's opinion; products and services mentioned on that page don't have any relationship with Alibaba Cloud. If the content of the page makes you feel confusing, please write us an email, we will handle the problem within 5 days after receiving your email.

If you find any instances of plagiarism from the community, please send an email to: info-contact@alibabacloud.com and provide relevant evidence. A staff member will contact you within 5 working days.

A Free Trial That Lets You Build Big!

Start building with 50+ products and up to 12 months usage for Elastic Compute Service

  • Sales Support

    1 on 1 presale consultation

  • After-Sales Support

    24/7 Technical Support 6 Free Tickets per Quarter Faster Response

  • Alibaba Cloud offers highly flexible support services tailored to meet your exact needs.