Martial Arts in many people are required to all kinds of weapons can be used, but everyone will have a weapon they are best at. Previously, these four kinds of software were like "swords and sticks" in big data. Weapons are only part of the matter or our own understanding of big data is equivalent to internal strength. After all, the two sides of the tournament, the weapon to win the part is very big, but not the decisive factor! Imagine, a person with a deep internal strength and a person who will only make the gun to a contest, perhaps the other side to pick the leaves can hurt the only person who will make the sword ....
All right! We are one by one to uncover these four kinds of "weapons" veil!
Say R first, not so much a language as a software. His more applications are in the use of data volumes in small and medium-sized companies. Personal feeling, it will also be the next more hot language in the country. From the big data point of view, what kind of data is most valuable, the first is the operator's data, but the bank data, and the data, e-commerce data .... And this data for all departments of the data, mostly in the city as a unit to store. It is equivalent to the data is divided into a small shard, which is conducive to the display of R. In doing data mining and visualization, my mentor said, in the country, it is best to allow customers to see the value of your data mining within two weeks. And to achieve such a goal, with R will have a good effect. Especially in the area of data presentation.
And for R Learning, it is necessary to have a certain code logic and call specifications. Because of the minority, it will continue to connect with other languages, it is equivalent to a Chinese speaking, a foreign language, intermediate translation is very important.
In other words, Python, some say, will use this language in the morning and evening because it has too many applications in the big Data age. It is based on Linux. This first facilitates the use of everyone, he can and any language and can call each other interface. This greatly facilitates the work of the OPS in the era of big data. So there's a problem: Do ops people have to master one or two development languages? The operation of the new period will be large area for automatic operation, passive maintenance as the active protection. This requires the operation and maintenance personnel in addition to the machine, to be able to simple server and related network equipment has a certain development and customization capabilities.
For Python, My Learning Plan is to start learning after the Devil of R has finished training. Strive to avoid learning more and are not refined the current image, first learn a language, and comprehend by analogy learn another language.
For SAS, let's put it in the first place! After all, this software is to charge, it has more built-in algorithms, some data related statistical effect is better. Applicable to some scientific research institutions for the collection of large amounts of data, statistical use. This software, which I used to install on my own virtual machine, runs up to memory. And his code, the overall feeling is similar to C. Big data is good to use with it, but the software is expensive. In the current domestic situation, it is not recommended to start a company.
Finally, again, SPSS, this IBM software. Some people say that it is the same as SAS, but this software, personally feel it is best to use it to figure out the data in Excel, or to show the leader and the customer when the data mining process is demonstrated to be used. But this software has not been used, just see the teacher chain good line, to run the data, it has higher requirements for raw data. So you can also knot all r and SAS after processing the original data, and then use SPSS to walk the process will be better.
Above, is the understanding of the four software that you know, in the field of big data, will be how much use of these four software. And how to use it depends on us personally.
My humble Caishuxueqian, if have the same fellow human, if have offended, also hope the liberal enlighten! Technology to learn and grow together!
This article is from the "Data Mining and Visualization" blog, reproduced please contact the author!
Four weapons--the relationship between big data and R,PYTHON,SAS,SPSS?