Data analysis software
工欲善其事, its prerequisite!
Data analysis, statistical analysis, data mining, business intelligence or need to learn a variety of analytical tools and skills, especially to master analysis software tools! I once said, Shen Teacher's learning method, is generally the first to learn the software to start, then to apply, and then learn the theory and principles, because it is the teacher, and then to teach others! A method without software will not be learned, because you can not do it, unless you will be programmed.
So what are the software analysis tools in the field of data analysis? How to choose? In fact, many areas or analytical methods have the corresponding software tools, as long as you want to find it should be able to locate!
Here I divide the software into four levels of Quadrant diagram to express!
First dimension: Data storage Layer--Data report Layer--data analysis layer--Data presentation layer
Second dimension: User-level--departmental--enterprise class-->bi
First, the storage tier:
We must be able to store data, at least for individuals should master a database technology, of course, not necessarily skilled operation, but at least to be able to understand the data storage and data of the basic structure and data types, such as data security, uniqueness, redundancy, table relationships, granularity, capacity, etc., It is best to understand the basic structure and reading of SQL query Language and so on!
- Access2003, ACCESS07, etc.: This is the most basic personal database, often used for personal or part of the basic data storage;
- MySQL database, this for departmental or Internet database application is necessary, this time key master database structure and SQL language data query ability;
- SQL Server 2005 or later, for small and medium enterprises, some large enterprises can also adopt SQL Server database, in fact, this time itself in addition to data storage, also includes data reports and data analysis, and even data mining tools in it;
- Db2,oracle database is a large database, mainly enterprise-level, especially large enterprises or the demand for data storage is necessary, the general large database companies provide a very good data integration application platform;
- Bi-level, in fact, this is not a database, but built on the basis of the previous database, this is mainly the database of enterprise application level, generally this time the database is called Data Warehouse, data Warehouse, built on the DW level is basically a business intelligence platform, Perhaps integrated with a variety of data analysis, reporting, analysis and presentation!
Second: report layer
When the enterprise stores the data, the first to resolve the report, not to analyze the problem, is to be able to see, see reports, a variety of reports! At home and abroad have specialized in providing report Analysis Services enterprises and software.
- Crystal Report, Bill reports, this is the world's most popular reporting tools, very standardized report design ideas, early business intelligence in fact, most people's understanding is the report system, without the help of IT technical personnel can obtain a variety of enterprise information-reports. And many database built-in reports are also embedded in the development version of CR Report!
- Tableau Software, this software is a very good software in recent years, of course, it is not a simple data reporting software, but a more visual data analysis software, because I often use it to report from the database and visual analysis, first staged in the report surface;
This software starts from 3.0, now has the 5.1 version, two years time already to the server and the Web way!
Of course, if the enterprise has tens of thousands of reports, need to be well managed, there is security, concurrent requests, etc., you need to have the server version;
Bo Yi Chih is specialized in providing sales and software services for the Crystal Report and Crystal Report Server editions;
Third: Data analysis layer
This layer actually has a lot of analysis tools, of course, our most commonly used is Excel, I often use statistical analysis and data mining tools;
- Excel software, the first version of the higher the more easy to use this is certain; Of course, for Excel many people just mastered the 5%excel function, Excel is very powerful, and even complete all the statistical analysis work! But I also often say, have the ability to play Excel as a statistical tool rather than specialized to learn statistical software;
- SPSS Software: The current version is 18, the name has also changed to PASW Statistics; I start from 3.0 DOS environment programming analysis, to the current version of the changes can be seen in the SPSS social Science statistics software package change, from the emphasis on medicine, chemistry, etc. began to pay more attention to business analysis, It has become a predictive analytics software.
- Clementine Software: Current version 13.0, data mining tools, I started from 6.0, to the 13 version, has been more and more to improve the more good modeling tools, now renamed PASW Modeler 13 Modeler. And with the SPSS statistical function has more integration, data processing is more flexible and useful.
- SAS software: SAS is more powerful than SPSS, SAS is platform-based, EM mining module platform integration, relatively speaking, SAS is more difficult to learn, but if the master SAS will be more valuable, such as discrete choice model, sampling problem, orthogonal experiment design and so on SAS more useful, in addition, SAS learning materials are more, also open, there will be a harvest!
Of course, I mainly use SPSS and Clementine, sometimes is the habit, of course, will be a software learning other is not very difficult!
- JMP Analytics: An analytic branch of SAS
- Xlstat:excel plug-in, you can complete most of the SPSS statistical analysis function
- Ucinet Social networking analysis software: SNA Social network analysis is a very popular and valuable analytical tool and method, especially from the perspective of relational analysis of social networks, relationship analysis is very important, we used to be attribute data analysis
If you need a trial version, you can contact Bo Yi, they can provide SPSS and Clementine software version of the consultation.
IV: Presentation layer
Recently I have been studying data visualization technology, on the one hand, because of the need of Excel, on the other hand, I was the first to buy Xcelsius, also wrote "Excel Advanced application and Data analysis" and "Data presentation Art--xcelsius". This field of software, especially some gadgets are very valuable!
- PowerPoint software: This does not have to say, most people use PPT to write reports;
- Visio, SmartDraw Software: These are very useful flowcharts, marketing charts, maps, etc., and from here can get a lot of parts;
- Swiff Chart Software: The software that makes the chart, generates flash;
- Color wheel software: color Matching software
- yed Software: Network Diagram, flowchart and graphics analysis software, similar to SNA analysis, I often used to design flowcharts, there is the analysis of optimization diagram;
- Netdraw Software: This is the social network analysis display software, mainly visual Network Diagram, read the Ucinet software;
- MindManager Software: Mind map, very good software, you can quickly build up the non-linear thinking, and Project organization management, report design ideas can be applied, directly generated PPT, of course, this software is very powerful, my students use it to make notes and meeting records;
- Xcelsius Software: Dashboard production and data visualization reporting tools, can directly read the database, in Excel modeling, the Internet display, the biggest feature or can be implemented in PPT dynamic report; This is a software tool I most want to use, very valuable!
Finally, it should be explained that the classification of my hierarchy is not software, just want to explain the application of software, in fact, each level of software are mutual integration, the pursuit of: platform, integration, intelligence, visualization, specialization, are all unique, the price is different, there are free, there are millions of; There is a server version of the, there are genuine, there are pirated!
Sometimes we use the database for report analysis, sometimes the report is analysis, sometimes analysis is to show, of course, sometimes show is analysis, analysis is a report, the report is data storage!
no best, only better, for you is the best!
In fact, there are many data analysis software:
- Amos software: Structural equation model SEM, empirical research and theoretical model of important analytical tools, people engaged in academic research, especially social science workers should master;
- Lisrel software: Structural equation model SEM, ditto!
- HLM software: layered linear model;
Original address: http://shenhaolaoshi.blog.sohu.com/148204624.html
Data analysis software