Basic Classification Essay

Oct 1st, 2012
Data Mining Classification: Basic Concepts, Decision Trees, and Model Evaluation
Lecture Notes for Chapter 4
Introduction to Data Mining by Tan, Steinbach, Kumar

Introduction to Data Mining

4/18/2004

1

Classification: Definition
Given a collection of records (the training set):
– Each record contains a set of attributes; one of the attributes is the class.

Find a model for the class attribute as a function of the values of the other attributes.

Goal: previously unseen records should be assigned a class as accurately as possible.
– A test set is used to determine the accuracy of the model. Usually, the given data set is divided into training and test sets, with the training set used to build the model and the test set used to validate it.
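The workflow in this definition (divide the data, build a model on the training set, measure accuracy on the test set) can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the records, the attribute name `size`, and the helper names `split_records`, `fit_one_attribute_model`, and `accuracy` are all hypothetical and do not come from the slides, and the "model" is a deliberately simple majority-class lookup on one attribute, not the decision tree induction covered later in the chapter.

```python
import random
from collections import Counter, defaultdict


def split_records(records, test_fraction=0.3, seed=0):
    """Shuffle the records and divide them into a training set and a test set."""
    shuffled = records[:]
    random.Random(seed).shuffle(shuffled)
    n_test = max(1, int(len(shuffled) * test_fraction))
    return shuffled[n_test:], shuffled[:n_test]


def fit_one_attribute_model(train, attribute):
    """Map each value of one attribute to the majority class seen with it.

    Values never seen in training fall back to the overall majority class.
    """
    counts = defaultdict(Counter)
    for attributes, label in train:
        counts[attributes[attribute]][label] += 1
    default = Counter(label for _, label in train).most_common(1)[0][0]
    table = {value: c.most_common(1)[0][0] for value, c in counts.items()}
    return lambda attributes: table.get(attributes[attribute], default)


def accuracy(model, test):
    """Fraction of test records whose predicted class matches the true class."""
    correct = sum(1 for attributes, label in test if model(attributes) == label)
    return correct / len(test)


# Hypothetical toy records as (attributes, class) pairs; invented for this sketch.
records = [
    ({"size": "Large"}, "No"), ({"size": "Medium"}, "No"),
    ({"size": "Small"}, "Yes"), ({"size": "Medium"}, "No"),
    ({"size": "Large"}, "No"), ({"size": "Small"}, "Yes"),
    ({"size": "Large"}, "No"), ({"size": "Small"}, "Yes"),
    ({"size": "Medium"}, "Yes"), ({"size": "Small"}, "No"),
]

train, test = split_records(records)
model = fit_one_attribute_model(train, "size")
print(f"accuracy on {len(test)} unseen records: {accuracy(model, test):.2f}")
```

The key design point is that `accuracy` only ever sees records held out from `fit_one_attribute_model`, so the reported number estimates how the model behaves on previously unseen records rather than how well it memorized the training set.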


Tid  Attrib1  Attrib2  Attrib3
1    Yes      Large    125K
2    No       Medium   100K
3    No       Small    70K
4    Yes      Medium   120K
5    No       Large    95K
6    No       Medium   60K
7    Yes      Large    220K
8    No       Small    85K
9    No       Medium   75K
10   No       Small

