• Shuffle
    Toggle On
    Toggle Off
  • Alphabetize
    Toggle On
    Toggle Off
  • Front First
    Toggle On
    Toggle Off
  • Both Sides
    Toggle On
    Toggle Off
  • Read
    Toggle On
    Toggle Off
Reading...
Front

Card Range To Study

through

image

Play button

image

Play button

image

Progress

1/118

Click to flip

Use LEFT and RIGHT arrow keys to navigate between flashcards;

Use UP and DOWN arrow keys to flip the card;

H to show hint;

A reads text to speech;

118 Cards in this Set

  • Front
  • Back

Statistics

A science which deals with the collection ; tabulation or presentation; analysis; and interpretation of data

-Defining the Problem-Collecting and Assembling Relevant Information-Conducting an Original investigation of the problem-Classifying the data-presenting the data and analyzing -interpreting the results

6 Steos in Statistical Analysis

3800 BC

Babylonia

3000

China

1491 BC

Moses

Achenwall

-1719 - 1772


-first introduced the word statistics

Zimmermann and Sinclair

popularized the name statistics in their books

16th Century

European became interested in coolecting information

Girotamo Cardano

wrote the first known study of the principles of probability

Mathematicians requested by gamblers

-Pascal


-Leibnitz


-Fermat


-Bernoulli

17th Century

companies begin to thrive and were already compiling

De Moivre

discovered the equation for the normal distribution upon which many of the theories of inferential statistics have been based

Adolf Quetelet

-Belgian


-Father of Modern Statistics


-applied the theory of probability to anthropological measurements and expanded the same principle to the field of psychology and education

Sir Francis Galton

developed the use of percentiles and contributed the application if statistics to heredity

Karl Pearson

worked with Galton to develop the theory of regression and correlation

William S. Gosset

develop methods for decision-making derived from smaller sets of data

Sir Ronald Fisher

Fisher's Test


-used in the analysis of variance in inferential statisticd

Division of Statistics

-Descriptive Statistics


-Inferential Statistics

Descriptive Statistics

concerned with the gathering, classification and presentation of data and the collection of summarizing values to describe group characteristics

Tpes of Sumamrizing Values Most COmmonly USed

-Measure of Central Tendency or POsition or Location


-Measure of Variability

Measure of Central Tendency or POsition or Location

a single figure which is representative of the general level of magnitudes or values of the items in a set of dta


-a measure thst determines where the group tends to cluster or to the center


-a single value which best represents the entire group

Mean(Me)

the arithmetic average of all the scores in a distribution. It is the most stable, sensitive, consistent and reliable measuring instrument



Mode(Mo)

the most frequent score

Median(Md)

the central value that divides the ordered data collection into two equal parts


-is the value of the middle term after arranging the data in ascending or descending order

Measure of Variability

a measure which aids the statisticians in making comparisons


- a measures which describes or show how far above or below is a score from the mean

Variance

the same as the standard deviation except that the square root is not taken

Range

the distance between the highest and lowest score in a array of data

Inferential Statistics

methods used ti describe a population by studying a random sample of that population

Population

any specified group taken as the subjects of the study or research

Sample

only a portion or subset of a population

Types of Inferential Statistics

Parametric Statistics


Nonparametric Statistics

Parametric Statistics

COme from a population with normal distribution

Nonparametric Statistics

does not require any assumption regarding the population where the sample is taken

USes of Statistics

-To be able to read professional literature with understanding


-To know how to treat research data adequately and present results corectly


-To develop an ability to evaluate data and to suspend correct judgment


-Training in statistics is a protection against errors in judgement

Applications of Statistics

-Education


-Government


-Business and Economics


-Medicine and Science


-Psychology


-Sociology and Population Dynamics


-Sports


-Engineers/Economists/Managers


-Astronomy


-Zoology


-Biology

Statistical Data can be categorized as

-Primary


-Secondary

Primary

refers to the informatin which are gathered directly from an original source or to the infoemation based in direct or first hand experiences

Secondary

taken from published or unpublished papers which were previously gathered by other individuals or agencies

Methods used in the Collection of Data

-Direct/Interview Method


-Indirect / Questionaire Method


-Registration Method


-Observation Method


-Experiment Method

Direct or Interview MEthod

a method of person to person exchanged between the interviewer or interviewee

Indirect or Questionnaire Method

int his method, written responses are given to prepared questions. Questions may be mailed or hand carried

Registration Method

in this method, the gathering of information is enforced by certain laws

Observation Method

in this method, the investigator observes the behavior of persons of organizations and their outcomes. It is usually used when the subjects can't talk or write

Experiment Method

this is a method used when the objective is to determine the cause and effect relationship of certain phenomena under controlled conditions

VAriable

the characteristic of the population investigated


-considered as causes and effects in a research project or experiment

Variables' Functional relationship

dependent


-independemt

Independent Variable

the controlling factors

Dependent Variable

depending on the controlling factors

Types of Independent Variables

-Experimental


-Manipulated


-Controlled


-Moderator


-Intervening


-Extraneous

Experimental

variable being analyzed in an experiment or research

Manipulated

variable which is reduced, increased, removed or added in the study to ascertain its effect on or relationship to the dependent variable

Controlled

variable being held constant in order to prevent it from affecting the dependent variable

moderator

variables which are not the main focus of the study but are analyzed in terms of their impact on the dependent variable

intervening

not mentioned in the problem or hypothesis but affects the results

extraneous

always figures out in any study


-cannot be controlled


-error or chance variable

personal variable

-age, sex, income education, family size, adresses, health, hobbies etc.

environmental variable

location, geographical nature, organizational climate

Variables according to continuity

-continuous


-discrete/discontinuous

continuous

variables that can theoretically assume any value between two given values.

discrete/discontinuous

vaariables with definite values and cannot take the form of decimals

qualitative variable

takes the form of attributes

quantitative variables

they come in emasurements

Variables can be categorized according to scale of measurement

-nominal scale


-ordinal scale


-interval scale


-ratio scale

nominal scale

numbers sued in scale are frequently used in everyday life

ordinal scale

scale of measurement is possible when the researcher can detect differing degree of an attribute or property in objects

interval scale

possible when the measures can distinguish not only different amounts of the property of objects but can also discern equal differences between objects

ratio scale

assigned in measurement reflect ratios amount od the property measured

Presentation of Data

Textual Form


Tabular Form


Graphical Form

Textual Form

in this form, the data us presented in paragraph form

Tabular Form

in this form, the data is presented in rows and columns

Types of Tables

-General or Reference Table


-Summary or Text Table

General or Reference Table

used as a warehouse of information in order to present data in such a way that individual items may easily be found and is usually found in appendix

Summary or Text Table

-designed to guide the reader in anlyzing the data

Graphical Form

in this form, numerical data are presented with the use of graphs`

Kinds of Graphs

-Bar graphs


-Line graphs


-Circle Graphs


-Pictographs


-Map Graph



Bar graph

represents the magnitude of the quantities being compared

Line graph

show the relationship between two or more sets of quantities

Circle Graphs

represent quantities that make up a whole

Pictographs

uses picture

Map graph

-best way to present geographical data


-also known as cartogram

Frequency Distribution

most fundamental technique used by statistician for putting into useful order a disarray of collected data

Parts of frequency distribution

-Class Interval/ Class Limit


-Class boundaries


-Class Size


-Class Frequency


-Class MArk

Class Intervals/ Class limit

category defines by a lower limit and uan upper limit

Class boundaries

refers to the value midway between the upper limit of a certain interval and the lower limit of the next

Class Size

-also known as class width


-difference between two successive lower or upper limits

Class frequency

-refers to the number of observations belonging to a class interval or the number of values that fall in a given interval

Class MArk

average of upper and lower clalss limits


-midpoint of the class interval

i=Range/k


where


k = 1+3.3logn(rounded off to the next higher whole number)


n-> sample size

upper and lower limit


number + interval - 1``

class mark = upper limit + lower limit / 2


% rel. frequency/ total freq. * 100``

Graphical Devicees Used to represent frequency distribution

-Histogram


-Frequency Polygon


-Cumulative Frequency Distribution or Ogive(cfd)


-Relative Frequency Distribution

Histogram

freq. vs. class marks


-consist if rectangles

Frequency Polygon

frequency plotted against classmarks

cfd/ogive

tabular arrangement of data by class intervals whose frequency are cumulated

the "less than cfd"

whose sum of frequencies for each class interval is less than the upper class boundary

the "greater than" cfd

whose sum of freq. for each class interval us greater than the lower class boundary

Relative frequency Distribution

tabular arrangement of data showign the proportion in percent of each frequency to the total frequency

Grouped

number or cases/samples is more than 30

Ungrouped Data

if sample(n) is less than 30

Me = summation of x/n




summ of x = sum of all values in the distribution


n = number of values in the distributiom

Mean Formula (Ungrouped)

Median(Ungrouped)

arrange data in ascending order..


if odd Md = Middle most


if even Md = average of 2 middlemost valueus

Mode(Ungrouped)

the repeated data


11, 29, 29 mode = 29


it may not exist like: 1, 2, 3

Range(Ungrouped)

highest val - lowest val

Variance(Ungrouped)

V=summation(x-Me)^2/ N



Standard Deviation(Ungrouped)

SD = sqr(V)

Mean(Grouped)

Me = AM + i(summ(fd)/N)


AM = assumed mean


f = freq


d= deviation


i = class size


N = num of class`

Median(Grouped)

Md = LL[(N/2 - Fup)i] / f


Fup = cummulative freq corresponding to freq below the MD


f=freq of median


LL = lower true limit of the median class


N = num of classes


i= class size

Mode(Grouped)

Mo = Me - 3(Me - Md)

Range(grouped)

highest - lowest

Standard Deviation(Grouped)

SD = i[sqrt((summfd^2/N) - (summ of fd/N)^2]

Variance(Grouped

V= SD^2

Quartile(Q)

divides thes et of values into 4 equal parts

Quartile formula

Q = LL+((kapila na Q)N/4)-Fup/f)i

Decile(D)

divides the set of values into 10 equal parts

Decile Formula

D = LL + [(ikapila na decile(N)/10 - Fup / f)i

Percentile(P)

divides the set of values into 100 equal pasrts

Percentile(P0 formula

P = LL + (ikapila(N)/ 100 - Fup / f)i

Percentile Rank(PR)

Pr = 100 * [ts + (x-LL/i)f] / N


ts = the subtotal or sum of the frequencies up to, but not including, the frequencies of the class interval where the raw score in question is found


LL = the real lower limit of the class interval where the raw score in question is found


N = total number of cases


i= class size


x = raw score