Senin, 16 April 2018

Top 10 Data Mining Algorithms, Explained

Top 10 Data Mining Algorithms, Explained

Image source: https://image.slidesharecdn.com/top10algorithmsdatamining-130716180141-phpapp02/95/top10-algorithms-data-mining-29-638.jpg?cb=1373997814

What are we waiting for? Lets get started!
Here are the algorithms:
1. C4.5
2. k-means
3. Support vector machines
4. Apriori
5. EM
6. PageRank
7. AdaBoost
8. kNN
9. Naive Bayes
10. CART
We also provide interesting resources at the end.
1. C4.5
What does it do? C4.5 constructs a classifier in the form of a decision tree. In order to do this, C4.5 is given a set of data representing things that are already classified.
Wait, whats a classifier? A classifier is a tool in data mining that takes a bunch of data representing things we want to classify and attempts to predict which class the new data belongs to.
Whats an example of this? Sure, suppose a dataset contains a bunch of patients. We know various things about each patient like age, pulse, blood pressure, VO2max, family history, etc. These are called attributes.
2. k-means
What does it do? k-means creates k groups from a set of objects so that the members of a group are more similar. Its a popular cluster analysis technique for exploring a dataset.
Hang on, whats cluster analysis? Cluster analysis is a family of algorithms designed to form groups such that the group members are more similar versus non-group members. Clusters and groups are synonymous in the world of cluster analysis.
Is there an example of this? Definitely, suppose we have a dataset of patients. In cluster analysis, these would be called observations. We know various things about each patient like age, pulse, blood pressure, VO2max, cholesterol, etc. This is a vector representing the patient.
3. Support vector machines
What does it do? Support vector machine (SVM) learns a hyperplane to classify data into 2 classes. At a high-level, SVM performs a similar task like C4.5 except SVM doesnt use decision trees at all.
Whoa, a hyper-what? A hyperplane is a function like the equation for a line, y = mx + b. In fact, for a simple classification task with just 2 features, the hyperplane can be a line.
4. Apriori
What does it do? The Apriori algorithm learns association rules and is applied to a database containing a large number of transactions.
What are association rules? Association rule learning is a data mining technique for learning correlations and relations among variables in a database.
Whats an example of Apriori? Lets say we have a database full of supermarket transactions. You can think of a database as a giant spreadsheet where each row is a customer transaction and every column represents a different grocery item.
5. EM
What does it do? In data mining, expectation-maximization (EM) is generally used as a clustering algorithm (like k-means) for knowledge discovery.
In statistics, the EM algorithm iterates and optimizes the likelihood of seeing observed data while estimating the parameters of a statistical model with unobserved variables.
6. PageRank
What does it do? PageRank is a link analysis algorithm designed to determine the relative importance of some object linked within a network of objects.
Yikes.. whats link analysis? Its a type of network analysis looking to explore the associations (a.k.a. links) among objects.
Heres an example: The most prevalent example of PageRank is Googles search engine. Although their search engine doesnt solely rely on PageRank, its one of the measures Google uses to determine a web pages importance.
7. AdaBoost
What does it do? AdaBoost is a boosting algorithm which constructs a classifier.
As you probably remember, a classifier takes a bunch of data and attempts to predict or classify which class a new data element belongs to.
But whats boosting? Boosting is an ensemble learning algorithm which takes multiple learning algorithms (e.g. decision trees) and combines them. The goal is to take an ensemble or group of weak learners and combine them to create a single strong learner.
Whats the difference between a strong and weak learner? A weak learner classifies with accuracy barely above chance. A popular example of a weak learner is the decision stump which is a one-level decision tree.
8. kNN
What does it do? kNN, or k-Nearest Neighbors, is a classification algorithm. However, it differs from the classifiers previously described because its a lazy learner.
Whats a lazy learner? A lazy learner doesnt do much during the training process other than store the training data. Only when new unlabeled data is input does this type of learner look to classify.
9. Naive Bayes
What does it do? Naive Bayes is not a single algorithm, but a family of classification algorithms that share one common assumption:
Every feature of the data being classified is independent of all other features given the class.
What does independent mean? 2 features are independent when the value of one feature has no effect on the value of another feature.
10. CART
What does it do? CART stands for classification and regression trees. It is a decision tree learning technique that outputs either classification or regression trees. Like C4.5, CART is a classifier.
Is a classification tree like a decision tree? A classification tree is a type of decision tree. The output of a classification tree is a class.
For example, given a patient dataset, you might attempt to predict whether the patient will get cancer. The class would either be will get cancer or wont get cancer.
Whats a regression tree? Unlike a classification tree which predicts a class, regression trees predict a numeric or continuous value e.g. a patients length of stay or the price of a smartphone.

8 Ways To Cosy Up Your Home This Chilly Season

Image source: http://www.modern-glam.com/wp-content/uploads/2018/01/fireplace8.jpg During summer season we get the addiction to head away do...