Data Mining: Introduction Essay examples

2235 Words Dec 24th, 2013 9 Pages
Data Mining: Introduction

Lecture Notes for Chapter 1 Introduction to Data Mining by Tan, Steinbach, Kumar

© Tan,Steinbach, Kumar

Introduction to Data Mining

4/18/2004

1

Why Mine Data? Commercial Viewpoint
Lots of data is being collected and warehoused – Web data, e-commerce – purchases at department/ grocery stores – Bank/Credit Card transactions Computers have become cheaper and more powerful Competitive Pressure is Strong – Provide better, customized services for an edge (e.g. in Customer Relationship Management)
© Tan,Steinbach, Kumar Introduction to Data Mining 4/18/2004 2

Why Mine Data? Scientific Viewpoint
Data collected and stored at enormous speeds (GB/hour) – remote sensors on a satellite – telescopes
…show more content…
– Certain names are more prevalent in certain US locations (O’Brien, O’Rurke, O’Reilly… in Boston area) – Group together similar documents returned by search engine according to their context (e.g. Amazon rainforest, Amazon.com,)
4/18/2004 6

Introduction to Data Mining

Origins of Data Mining
Draws ideas from machine learning/AI, pattern recognition, statistics, and database systems Traditional Techniques may be unsuitable due to Statistics/ Machine Learning/ – Enormity of data AI Pattern Recognition – High dimensionality of data Data Mining – Heterogeneous, distributed nature Database systems of data
© Tan,Steinbach, Kumar Introduction to Data Mining 4/18/2004 7

Data Mining Tasks
Prediction Methods – Use some variables to predict unknown or future values of other variables. Description Methods – Find human-interpretable patterns that describe the data.

From [Fayyad, et.al.] Advances in Knowledge Discovery and Data Mining, 1996 © Tan,Steinbach, Kumar Introduction to Data Mining 4/18/2004 8

Data Mining Tasks...
Classification [Predictive] Clustering [Descriptive] Association Rule Discovery [Descriptive] Sequential Pattern Discovery [Descriptive] Regression [Predictive] Deviation Detection [Predictive]

© Tan,Steinbach, Kumar

Introduction to Data Mining

4/18/2004

9

Classification: Definition
Given a collection of

Related Documents