SPSS Clementine data mining (1)

Source: Internet
Author: User
Tags pmml ssis

SPSS ClementineYesSPSSCompany AcquisitionIslThe obtained data mining tool. InGartnerOnly two vendors are listed as leaders in the evaluation of customer data mining tools:SASAndSPSS.SASObtained the highestAbility to executeRating, representingSASBest Performance in marketing, promotion, and cognition; andSPSSObtained the highestCompleteness of vision, IndicatingSPSSIt is far ahead in technological innovation.

 

 

Basic client interface

 

SPSS Clementine(Hereinafter referred toClementineThe service is automatically enabled after the installation. You must use SPSS predictive Enterprise Manager to manage the server.ClementineThere are no complex management tools. Generally, data mining personnel use the client to complete all the work. Below isClementineClient Interface.

 

Once you see the above interface, I believe that you have usedSSIS + SSAsIf you deploy the data mining model, you should have understood the six or seven points. Are you eager to try it? Don't worry. The highlights are still coming.^ _'

 

Project Area

 

As its name implies, it manages projects and provides two views. WhereCRISP-DM(Cross Industry Standard Process for Data Mining, Data Mining Cross-Industry Standard Process) isSPSS,DaimlerChrysler(Daimler Chrysler, Automotive Company ),NCR(That is, the ownerTeradata.ClementineBy organizingCRISP-DMTo complete the project. You can add a stream, node, output, and model to a project.

 

Toolbar

 

The toolbar includesETL, Data analysis, mining model tools, tools can be added to the data flow design area,SSISData Streams in are very similar.ClementineIn6Class tool.

Source Tool (Sources)

EquivalentSSISThe source component in the data stream,ClementineSupported data sources include databases, flat files,Excel, Dimension data,SASData and user input.

Record operation (Record Ops) And field operations (Field Ops)

EquivalentSSISData Stream Conversion component,Record OpsIs to convert data rows,Field OpsIs to convert columns, some typesSSISAsynchronous output conversion and synchronous output conversion (aboutSSISFor more information about asynchronous and synchronous output, see the following example:Http://www.cnblogs.com/esestt/archive/2007/06/03/769411.html).

Graphics (Graphs)

Used for visual data analysis.

Output (Output)

ClementineThe output is not justETLIn ProcessLoadThe output includes the output of the statistical analysis report on the data.

※InVer 11,OutputInETLThe data target tool is dividedExportIn the toolbar.

Model (Model)

ClementineContains a wide range of data mining models.

Data Flow Design Area

 

There is nothing to say about it. You can see the graph and the directed arrow indicates the data flow.ClementineThe project can have multiple data flow design zones, just as inPhotoshopYou can enable multiple design diagrams at the same time.

For example, I have two data streams:Stream1AndStream2. ThroughStreamsClick to switch the number of streams.

 

Management area

 

Management area includesStreams,Outputs,ModelsThree columns.StreamsAs mentioned above, it is used to manage data streams.

Outputs

Do not mix it with the output in the toolbar.OutputsIs the analysis result produced by graphics and output tools. For example, the following data source is connected to the matrix, data review, and histogram tool. After data flow is executed, this tool generates three outputs. In the management areaOutputsDouble-click the output to view the output graph or report.

Models

The trained model will appear in this column, which is like a real table (Truth TableIn this way, the trained model can be added to the data stream for prediction and scoring. In addition, the model can be exported to supportPmmlProtocolXMLFile,PmmlNo specification is given for all models, and many vendors arePmmlIn addition to the extended SPSS smartscore, Clementine can also export the standard pmml 3.1.

Contact Us

The content source of this page is from Internet, which doesn't represent Alibaba Cloud's opinion; products and services mentioned on that page don't have any relationship with Alibaba Cloud. If the content of the page makes you feel confusing, please write us an email, we will handle the problem within 5 days after receiving your email.

If you find any instances of plagiarism from the community, please send an email to: info-contact@alibabacloud.com and provide relevant evidence. A staff member will contact you within 5 working days.

A Free Trial That Lets You Build Big!

Start building with 50+ products and up to 12 months usage for Elastic Compute Service

  • Sales Support

    1 on 1 presale consultation

  • After-Sales Support

    24/7 Technical Support 6 Free Tickets per Quarter Faster Response

  • Alibaba Cloud offers highly flexible support services tailored to meet your exact needs.