algorithm), GA (Genetic algorithm genetic algorithm)Feature Selection (Feature selection algorithm):Mutual information (Mutual information), Documentfrequence (document frequency), information Gain (information gain), chi-squared test (Chi-square test), Gini (Gini coefficient).Outlier Detection (anomaly detection algorithm):Statistic-based (based on statistics), distance-based (distance based), density-based (based on density), clustering-based (based on clustering).Learning to Rank (based on l
web|xml| data
Web-oriented data miningThere is a large amount of data information on the Web, and how to apply these data to complex applications has become a hot research topic in modern database technology. Data
PY, universal language, and more practical. With Python machine learning book There is a "machine learning combat" is very good, from the principle to the example to achieve, is simply God book. MATLAB is suitable for learning and research, the practical problem is that in the business sector can not find a job
" Tools for learning data
warehouse for people to test and evaluate. Another online weekly for ds* (DS Representative decision Support), October 7, 1997 began publishing, can submit a free subscription to dstrial@tgc.com application. Online, there is also a free forum DM email Club, people through e-mail to discuss DMKD hot issues.
As for DMKD books, you can find more than 10 copies in any computer bookstore, but mostly with commercial color. The author suggests that interested persons may read the
formed a more perfect experience accumulation of the application scene. There are many applications in data mining that need to be developed, even if it is possible to dig out valuable patterns. Like Recommender systems, computer vision, and NLP, these values are known to be more fortunate than others. Write the Book of course everything to write, is there somet
Http://itindex.net/blog/2015/01/09/1420751820000.htmlWeka:weka is a collection of machine learning algorithms that can be used for data mining tasks. The algorithm can be applied directly to a dataset or called from its own Java code. Weka contains data preprocessing, classification, regression, clustering, association
Structure Mining, and web usage record mining. The previous two researches have produced many achievements. For example, Web Structure Mining is a technology used by various search engines.AlgorithmIncluding hits and Google PageRank. The Research on Web application record mining is relatively small, or it may be becau
Project homepage:
Http://code.google.com/p/python-data-mining-platform/ (may need to flip)
Tutorial and other content have been added to the googlecode. You can view it in the Wiki.
Project Introduction (copied from the project homepage ):
This is a matrix that can be represented in CSV format or a Chinese document based on the source data.AlgorithmA platform to get results.
Algorithms can run
This article mainly introduces four knowledge points, which is also the content of my lecture.
1.PCA Dimension reduction operation;
PCA expansion pack of Sklearn in 2.Python;
3.Matplotlib subplot function to draw a child graph;
4. Through the Kmeans to the diabetes dataset clustering, and draw a child map.
Previous recommendation:The Python data Mining course. Introduction to installing Python and crawler"
: Published in 2012, corresponding to Mahout version 0.5, is currently mahout the latest book books. At present, only English version, but a bit, the inside vocabulary is basically a computer-based vocabulary, and map and source code, is suitable for reading.? IBM mahout Introduction: http://www.ibm.com/developerworks/cn/java/j-mahout/Note: Chinese version, update is time for 09, but inside for Mahout elaborated more comprehensive, recommended reading
---restore content starts---After reading the big talk data mining this book the first 36 pages, learned the knowledge.Data Mining (Mining) and Knowledge Discovery (KDD) in the database are aliases to each other.Examples of data
Defined
Data Mining is the nontrivial process of acquiring effective, novel, potentially useful, and ultimately understandable patterns from large amounts of data stored in databases, data warehouses, or other repositories.
What is the use of.
Data
Recently looking at a book called "Big Talk Data Mining", a simple summary summarizes some of the basic theoretical knowledge of data mining:1.Data Mining (also known in academia as
Yunshan's staff can fully develop external interfaces, Wu Yan put his main energy into data mining, continue to study how to apply algorithms in WEKA to your project. Half a month later, Wu Yan implemented algorithms such as naive Bayes, demo-tree, and association rule, and found application scenarios in the project, for example, Naive Bayes is suitable for Pred
Tags: Big Data System Architecture diagram database/* Copyright notice: Can be reproduced arbitrarily, please be sure to indicate the original source of the article and the author information . */Author: Zhang JunlinExcerpt from "Big Data Day know: Architecture and Algorithms" Chapter 14, book catalogue herefor the calculation of offline
Preface 1The first part of social network guidancePrologue 13The 1th Chapter explores Twitter: Exploring hot topics, discovering what people are talking about, etc. 151.1 Overview 15Reasons for 1.2 Twitter rage 161.3 Explore Twitter API 181.4 Analysis of 140 word tweets 331.5 Summary of this chapter 471.6 Recommended Exercises 481.7 Resources Online 482nd Chapter Mining Facebook: Analyzing fan pages, viewing friends, etc. 502.1 Overview 512.2 Explore
Machine learning, data mining, and other
In this book, we constantly mention "intelligence". What is "intelligence "? Are we talking about artificial intelligence? Or machine learning? What does it have to do with Data Mining and soft computing? In academia, the exact defini
This book provides a comprehensive overview of data mining, covering five topics: data, classification, correlation analysis, clustering, and anomaly detection. In addition to anomaly detection, each topic has two chapters. The previous chapter covers basic concepts, representative algorithms, and evaluation techniques
Data mining-how to make it better (as I expected)Because I didn't even make it well, I just thought about the problems I encountered and how to solve them!
Recently, this may be due to the high-dimensional reasons. Most of the theories and examples in the book are low-dimensional (less than 100). The theory is perfect, but all problems come out in practice, and
The content source of this page is from Internet, which doesn't represent Alibaba Cloud's opinion;
products and services mentioned on that page don't have any relationship with Alibaba Cloud. If the
content of the page makes you feel confusing, please write us an email, we will handle the problem
within 5 days after receiving your email.
If you find any instances of plagiarism from the community, please send an email to:
info-contact@alibabacloud.com
and provide relevant evidence. A staff member will contact you within 5 working days.