The most important thing in big data mining is deciding what kind of knowledge to dig, which is a problem that needs serious consideration in the whole process of data collection, processing and mining.
Big data technology involves storage, search, transmission, computation, mining and many other aspects. Big Data mining is designed to dig out unknown and useful knowledge from big data. By digging, the value of big data can be reflected, so mining is important to big data.
Big Data Mining has two basic questions, namely "What to Mine" and "How to mine". The former decides what kind of information to extract from the data, what kind of laws are counted, and the latter decides how to extract and count the concrete. The former is a problem to be considered in the collection, processing and mining of data, the latter is often confined to mining. "How to dig" is usually the core of data mining research, but "digging what" is often more important in the application of data mining, because it determines the value of the mining results. In practical matters, it is more critical to decide whether to dig gold or silver, or to dig in copper, than to dig with a hoe, or to dig with a shovel.
Ling Jiu ljparser Network Search and mining system is a basic toolset for the development of web search, natural language comprehension and text mining, and the development platform is composed of multiple middleware, and the middleware API can be seamlessly integrated into various complex application systems of customers. .
Ling Jiu Ljparser network Search and mining systems focus on big Data acquisition and data integration :
1. collecting data is the first step of data mining, it is necessary to judge the record and what data is collected, which directly affects what kind of knowledge can be mined from the data. Paddle, there is no data in a certain aspect, it can not be related to mining. However, the storage and processing of data is a cost, the key to improve the efficiency of data mining is to only record and collect useful data. Therefore, the need for the content of the collection of data reasonable judgment, at this time, should try to imagine the scene of excavation, on the basis of which may be useful data all recorded, collected.
2. try to integrate data , the way to make your data more effective is to combine the relevant data for mining. Data integration helps to understand the full picture of things, discover unknown relationships, and increase the accuracy of predictions. Local data is only "Luo", and the overall data is "terrible big net".
the key to big data mining is deciding what to dig, which is more important than deciding how to dig. When collecting data, we should try to imagine the scene of excavation, record and collect the data as many as possible, and collect the data so as to integrate the data together; Before data mining, we should observe the data carefully to help judge what kind of knowledge to dig. Only in this way can the value of big data mining be reflected.
A platform for semantic mining of big data in Ling JIU Ljparser system