SPSS ClementineYesSPSSCompany AcquisitionIslThe obtained data mining tool. InGartnerOnly two vendors are listed as leaders in the evaluation of customer data mining tools:SASAndSPSS.SASObtained the highestAbility to executeRating, representingSASBest Performance in marketing, promotion, and cognition; andSPSSObtained the highestCompleteness of vision, IndicatingSPSSIt is far ahead in technological innovation.
Basic client interface
SPSS Clementine(Hereinafter referred toClementineThe service is automatically enabled after the installation. You must use SPSS predictive Enterprise Manager to manage the server.ClementineThere are no complex management tools. Generally, data mining personnel use the client to complete all the work. Below isClementineClient Interface.
Once you see the above interface, I believe that you have usedSSIS + SSAsIf you deploy the data mining model, you should have understood the six or seven points. Are you eager to try it? Don't worry. The highlights are still coming.^ _'
Project Area
As its name implies, it manages projects and provides two views. WhereCRISP-DM(Cross Industry Standard Process for Data Mining, Data Mining Cross-Industry Standard Process) isSPSS,DaimlerChrysler(Daimler Chrysler, Automotive Company ),NCR(That is, the ownerTeradata.ClementineBy organizingCRISP-DMTo complete the project. You can add a stream, node, output, and model to a project.
Toolbar
The toolbar includesETL, Data analysis, mining model tools, tools can be added to the data flow design area,SSISData Streams in are very similar.ClementineIn6Class tool.
Source Tool (Sources)
EquivalentSSISThe source component in the data stream,ClementineSupported data sources include databases, flat files,Excel, Dimension data,SASData and user input.
Record operation (Record Ops) And field operations (Field Ops)
EquivalentSSISData Stream Conversion component,Record OpsIs to convert data rows,Field OpsIs to convert columns, some typesSSISAsynchronous output conversion and synchronous output conversion (aboutSSISFor more information about asynchronous and synchronous output, see the following example:Http://www.cnblogs.com/esestt/archive/2007/06/03/769411.html).
Graphics (Graphs)
Used for visual data analysis.
Output (Output)
ClementineThe output is not justETLIn ProcessLoadThe output includes the output of the statistical analysis report on the data.
※InVer 11,OutputInETLThe data target tool is dividedExportIn the toolbar.
Model (Model)
ClementineContains a wide range of data mining models.
Data Flow Design Area
There is nothing to say about it. You can see the graph and the directed arrow indicates the data flow.ClementineThe project can have multiple data flow design zones, just as inPhotoshopYou can enable multiple design diagrams at the same time.
For example, I have two data streams:Stream1AndStream2. ThroughStreamsClick to switch the number of streams.
Management area
Management area includesStreams,Outputs,ModelsThree columns.StreamsAs mentioned above, it is used to manage data streams.
Outputs
Do not mix it with the output in the toolbar.OutputsIs the analysis result produced by graphics and output tools. For example, the following data source is connected to the matrix, data review, and histogram tool. After data flow is executed, this tool generates three outputs. In the management areaOutputsDouble-click the output to view the output graph or report.
Models
The trained model will appear in this column, which is like a real table (Truth TableIn this way, the trained model can be added to the data stream for prediction and scoring. In addition, the model can be exported to supportPmmlProtocolXMLFile,PmmlNo specification is given for all models, and many vendors arePmmlIn addition to the extended SPSS smartscore, Clementine can also export the standard pmml 3.1.