Use LEFT and RIGHT arrow keys to navigate between flashcards;
Use UP and DOWN arrow keys to flip the card;
H to show hint;
A reads text to speech;
118 Cards in this Set
- Front
- Back
Statistics |
A science which deals with the collection ; tabulation or presentation; analysis; and interpretation of data |
|
-Defining the Problem-Collecting and Assembling Relevant Information-Conducting an Original investigation of the problem-Classifying the data-presenting the data and analyzing -interpreting the results |
6 Steos in Statistical Analysis |
|
3800 BC |
Babylonia |
|
3000 |
China |
|
1491 BC |
Moses |
|
Achenwall |
-1719 - 1772 -first introduced the word statistics |
|
Zimmermann and Sinclair |
popularized the name statistics in their books |
|
16th Century |
European became interested in coolecting information |
|
Girotamo Cardano |
wrote the first known study of the principles of probability |
|
Mathematicians requested by gamblers |
-Pascal -Leibnitz -Fermat -Bernoulli |
|
17th Century |
companies begin to thrive and were already compiling |
|
De Moivre |
discovered the equation for the normal distribution upon which many of the theories of inferential statistics have been based |
|
Adolf Quetelet |
-Belgian -Father of Modern Statistics -applied the theory of probability to anthropological measurements and expanded the same principle to the field of psychology and education |
|
Sir Francis Galton |
developed the use of percentiles and contributed the application if statistics to heredity |
|
Karl Pearson |
worked with Galton to develop the theory of regression and correlation |
|
William S. Gosset |
develop methods for decision-making derived from smaller sets of data |
|
Sir Ronald Fisher |
Fisher's Test -used in the analysis of variance in inferential statisticd |
|
Division of Statistics |
-Descriptive Statistics -Inferential Statistics |
|
Descriptive Statistics |
concerned with the gathering, classification and presentation of data and the collection of summarizing values to describe group characteristics |
|
Tpes of Sumamrizing Values Most COmmonly USed |
-Measure of Central Tendency or POsition or Location -Measure of Variability |
|
Measure of Central Tendency or POsition or Location |
a single figure which is representative of the general level of magnitudes or values of the items in a set of dta -a measure thst determines where the group tends to cluster or to the center -a single value which best represents the entire group |
|
Mean(Me) |
the arithmetic average of all the scores in a distribution. It is the most stable, sensitive, consistent and reliable measuring instrument |
|
Mode(Mo) |
the most frequent score |
|
Median(Md) |
the central value that divides the ordered data collection into two equal parts -is the value of the middle term after arranging the data in ascending or descending order |
|
Measure of Variability |
a measure which aids the statisticians in making comparisons - a measures which describes or show how far above or below is a score from the mean |
|
Variance |
the same as the standard deviation except that the square root is not taken |
|
Range |
the distance between the highest and lowest score in a array of data |
|
Inferential Statistics |
methods used ti describe a population by studying a random sample of that population |
|
Population |
any specified group taken as the subjects of the study or research |
|
Sample |
only a portion or subset of a population |
|
Types of Inferential Statistics |
Parametric Statistics Nonparametric Statistics |
|
Parametric Statistics |
COme from a population with normal distribution |
|
Nonparametric Statistics |
does not require any assumption regarding the population where the sample is taken |
|
USes of Statistics |
-To be able to read professional literature with understanding -To know how to treat research data adequately and present results corectly -To develop an ability to evaluate data and to suspend correct judgment -Training in statistics is a protection against errors in judgement |
|
Applications of Statistics |
-Education -Government -Business and Economics -Medicine and Science -Psychology -Sociology and Population Dynamics -Sports -Engineers/Economists/Managers -Astronomy -Zoology -Biology |
|
Statistical Data can be categorized as |
-Primary -Secondary |
|
Primary |
refers to the informatin which are gathered directly from an original source or to the infoemation based in direct or first hand experiences |
|
Secondary |
taken from published or unpublished papers which were previously gathered by other individuals or agencies |
|
Methods used in the Collection of Data |
-Direct/Interview Method -Indirect / Questionaire Method -Registration Method -Observation Method -Experiment Method |
|
Direct or Interview MEthod |
a method of person to person exchanged between the interviewer or interviewee |
|
Indirect or Questionnaire Method |
int his method, written responses are given to prepared questions. Questions may be mailed or hand carried |
|
Registration Method |
in this method, the gathering of information is enforced by certain laws |
|
Observation Method |
in this method, the investigator observes the behavior of persons of organizations and their outcomes. It is usually used when the subjects can't talk or write |
|
Experiment Method |
this is a method used when the objective is to determine the cause and effect relationship of certain phenomena under controlled conditions |
|
VAriable |
the characteristic of the population investigated -considered as causes and effects in a research project or experiment |
|
Variables' Functional relationship |
dependent -independemt |
|
Independent Variable |
the controlling factors |
|
Dependent Variable |
depending on the controlling factors |
|
Types of Independent Variables |
-Experimental -Manipulated -Controlled -Moderator -Intervening -Extraneous |
|
Experimental |
variable being analyzed in an experiment or research |
|
Manipulated |
variable which is reduced, increased, removed or added in the study to ascertain its effect on or relationship to the dependent variable |
|
Controlled |
variable being held constant in order to prevent it from affecting the dependent variable |
|
moderator |
variables which are not the main focus of the study but are analyzed in terms of their impact on the dependent variable |
|
intervening |
not mentioned in the problem or hypothesis but affects the results |
|
extraneous |
always figures out in any study -cannot be controlled -error or chance variable |
|
personal variable |
-age, sex, income education, family size, adresses, health, hobbies etc. |
|
environmental variable |
location, geographical nature, organizational climate |
|
Variables according to continuity |
-continuous -discrete/discontinuous |
|
continuous |
variables that can theoretically assume any value between two given values. |
|
discrete/discontinuous |
vaariables with definite values and cannot take the form of decimals |
|
qualitative variable |
takes the form of attributes |
|
quantitative variables |
they come in emasurements |
|
Variables can be categorized according to scale of measurement |
-nominal scale -ordinal scale -interval scale -ratio scale |
|
nominal scale |
numbers sued in scale are frequently used in everyday life |
|
ordinal scale |
scale of measurement is possible when the researcher can detect differing degree of an attribute or property in objects |
|
interval scale |
possible when the measures can distinguish not only different amounts of the property of objects but can also discern equal differences between objects |
|
ratio scale |
assigned in measurement reflect ratios amount od the property measured |
|
Presentation of Data |
Textual Form Tabular Form Graphical Form |
|
Textual Form |
in this form, the data us presented in paragraph form |
|
Tabular Form |
in this form, the data is presented in rows and columns |
|
Types of Tables |
-General or Reference Table -Summary or Text Table |
|
General or Reference Table |
used as a warehouse of information in order to present data in such a way that individual items may easily be found and is usually found in appendix |
|
Summary or Text Table |
-designed to guide the reader in anlyzing the data |
|
Graphical Form |
in this form, numerical data are presented with the use of graphs` |
|
Kinds of Graphs |
-Bar graphs -Line graphs -Circle Graphs -Pictographs -Map Graph |
|
Bar graph |
represents the magnitude of the quantities being compared |
|
Line graph |
show the relationship between two or more sets of quantities |
|
Circle Graphs |
represent quantities that make up a whole |
|
Pictographs |
uses picture |
|
Map graph |
-best way to present geographical data -also known as cartogram |
|
Frequency Distribution |
most fundamental technique used by statistician for putting into useful order a disarray of collected data |
|
Parts of frequency distribution |
-Class Interval/ Class Limit -Class boundaries -Class Size -Class Frequency -Class MArk |
|
Class Intervals/ Class limit |
category defines by a lower limit and uan upper limit |
|
Class boundaries |
refers to the value midway between the upper limit of a certain interval and the lower limit of the next |
|
Class Size |
-also known as class width -difference between two successive lower or upper limits |
|
Class frequency |
-refers to the number of observations belonging to a class interval or the number of values that fall in a given interval |
|
Class MArk |
average of upper and lower clalss limits -midpoint of the class interval |
|
i=Range/k where k = 1+3.3logn(rounded off to the next higher whole number) n-> sample size |
|
|
upper and lower limit number + interval - 1`` |
|
|
class mark = upper limit + lower limit / 2 % rel. frequency/ total freq. * 100`` |
|
|
Graphical Devicees Used to represent frequency distribution |
-Histogram -Frequency Polygon -Cumulative Frequency Distribution or Ogive(cfd) -Relative Frequency Distribution |
|
Histogram |
freq. vs. class marks -consist if rectangles |
|
Frequency Polygon |
frequency plotted against classmarks |
|
cfd/ogive |
tabular arrangement of data by class intervals whose frequency are cumulated |
|
the "less than cfd" |
whose sum of frequencies for each class interval is less than the upper class boundary |
|
the "greater than" cfd |
whose sum of freq. for each class interval us greater than the lower class boundary |
|
Relative frequency Distribution |
tabular arrangement of data showign the proportion in percent of each frequency to the total frequency |
|
Grouped |
number or cases/samples is more than 30 |
|
Ungrouped Data |
if sample(n) is less than 30 |
|
Me = summation of x/n summ of x = sum of all values in the distribution n = number of values in the distributiom |
Mean Formula (Ungrouped) |
|
Median(Ungrouped) |
arrange data in ascending order.. if odd Md = Middle most if even Md = average of 2 middlemost valueus |
|
Mode(Ungrouped) |
the repeated data 11, 29, 29 mode = 29 it may not exist like: 1, 2, 3 |
|
Range(Ungrouped) |
highest val - lowest val |
|
Variance(Ungrouped) |
V=summation(x-Me)^2/ N |
|
Standard Deviation(Ungrouped) |
SD = sqr(V) |
|
Mean(Grouped) |
Me = AM + i(summ(fd)/N) AM = assumed mean f = freq d= deviation i = class size N = num of class` |
|
Median(Grouped) |
Md = LL[(N/2 - Fup)i] / f Fup = cummulative freq corresponding to freq below the MD f=freq of median LL = lower true limit of the median class N = num of classes i= class size |
|
Mode(Grouped) |
Mo = Me - 3(Me - Md) |
|
Range(grouped) |
highest - lowest |
|
Standard Deviation(Grouped) |
SD = i[sqrt((summfd^2/N) - (summ of fd/N)^2] |
|
Variance(Grouped |
V= SD^2 |
|
Quartile(Q) |
divides thes et of values into 4 equal parts |
|
Quartile formula |
Q = LL+((kapila na Q)N/4)-Fup/f)i |
|
Decile(D) |
divides the set of values into 10 equal parts |
|
Decile Formula |
D = LL + [(ikapila na decile(N)/10 - Fup / f)i |
|
Percentile(P) |
divides the set of values into 100 equal pasrts |
|
Percentile(P0 formula |
P = LL + (ikapila(N)/ 100 - Fup / f)i |
|
Percentile Rank(PR) |
Pr = 100 * [ts + (x-LL/i)f] / N ts = the subtotal or sum of the frequencies up to, but not including, the frequencies of the class interval where the raw score in question is found LL = the real lower limit of the class interval where the raw score in question is found N = total number of cases i= class size x = raw score |