Research on the implementation and application of a cloud storage system for mass data
Nanjing University of Aiming
This paper investigates the storage and mining of mass data. To make the problem concrete, it takes the literature management needs of scientific researchers as an example and treats electronic documents on the network as the mass data source. On this basis, the paper builds a cloud storage system for mass literature data on top of cloud storage and cloud computing platforms, supporting both the management and the analysis of document data. Users first register with the system. A registered user can then upload literature (such as PDF files) to the cloud and manage the uploaded documents, for example by adding or deleting them. The system also provides literature information retrieval and cluster analysis functions.
Keywords: mass data; cloud; cloud storage; GlusterFS; Nutch; Hadoop; Mahout; text clustering
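The workflow summarized in the abstract (registration, upload, management, retrieval, and clustering) can be illustrated with a minimal in-memory sketch. It is not the system described in this paper: the class `LiteratureStore`, the helper functions, and the similarity threshold are all hypothetical names and values chosen for illustration. A plain dictionary stands in for the GlusterFS-backed storage, keyword matching stands in for Nutch-based retrieval, and a greedy single-pass cosine clustering stands in for the Mahout clustering algorithms. Document text is assumed to have been extracted from the uploaded files already.

```python
from collections import Counter, defaultdict
import math
import re


def tokenize(text):
    """Lowercase a string and split it into alphabetic word tokens."""
    return re.findall(r"[a-z]+", text.lower())


class LiteratureStore:
    """Hypothetical in-memory stand-in for the cloud-backed literature store."""

    def __init__(self):
        self.users = set()
        self.docs = defaultdict(dict)  # user -> {title: extracted text}

    def register(self, user):
        self.users.add(user)

    def upload(self, user, title, text):
        # Only registered users may store literature.
        if user not in self.users:
            raise PermissionError("user must register before uploading")
        self.docs[user][title] = text

    def delete(self, user, title):
        self.docs[user].pop(title, None)

    def search(self, user, query):
        """Return titles of the user's documents containing every query term."""
        terms = set(tokenize(query))
        return sorted(
            title for title, text in self.docs[user].items()
            if terms <= set(tokenize(text))
        )


def cosine(a, b):
    """Cosine similarity between two term-frequency Counters."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def cluster(docs, threshold=0.3):
    """Greedy single-pass clustering (an assumption, not Mahout's method).

    Each document joins the first cluster whose seed document is at least
    `threshold` similar; otherwise it starts a new cluster.
    """
    clusters = []  # list of (seed term vector, [titles])
    for title in sorted(docs):
        vec = Counter(tokenize(docs[title]))
        for seed, members in clusters:
            if cosine(seed, vec) >= threshold:
                members.append(title)
                break
        else:
            clusters.append((vec, [title]))
    return [members for _, members in clusters]
```

A short usage example under the same assumptions:

```python
store = LiteratureStore()
store.register("alice")
store.upload("alice", "a.pdf", "hadoop cluster storage nodes")
store.upload("alice", "b.pdf", "hadoop storage cluster scheduling")
store.upload("alice", "c.pdf", "protein folding dynamics")
titles = store.search("alice", "hadoop storage")
groups = cluster(store.docs["alice"])
```

In a deployment at the scale the paper targets, each of these in-memory pieces would be replaced by the corresponding distributed component; the sketch only fixes the order of operations and the shape of the data passed between them.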