• Shuffle
    Toggle On
    Toggle Off
  • Alphabetize
    Toggle On
    Toggle Off
  • Front First
    Toggle On
    Toggle Off
  • Both Sides
    Toggle On
    Toggle Off
  • Read
    Toggle On
    Toggle Off
Reading...
Front

Card Range To Study

through

image

Play button

image

Play button

image

Progress

1/66

Click to flip

Use LEFT and RIGHT arrow keys to navigate between flashcards;

Use UP and DOWN arrow keys to flip the card;

H to show hint;

A reads text to speech;

66 Cards in this Set

  • Front
  • Back
Observed variance in y can be broken down into 2 parts. What are they?
1. Variance due to a linear relationship with x, which is equal to sy^(2) *r^(2)
2. Residual variance, or
s^(2)y
What is the difference between r and rho?
r is the correlation in a sample, while rho is associated with the same quanitiy in a population.
What are the 3 basic hypotheses that we can test with regards to rho (p)?
1. p = 0 in the population
2. p = a specific value in the population
3. p (sample 1) = p (sample 2), which would indicate that two samples came from the same population. Another way of expressing this hypothesis is that 2 populations have the same correlation, rho, between x and y.
If r^2 = .00016, what does this mean?

(not sure if this should be r or R)
If r^2 = .00016, then you can only explain .016% of the variance. This illustrates that what is statistically significant may not mean much if you can't explain much of the variance.
When performing hypothesis testing with rho (p), which circumstances require that you convert p to a fisher's z?
If rho equals a specific value in the population
OR
If you are testing the hypothesis that the rho from sample 1 is equal to the rho from sample 2.
What are the steps associated with calculating confidence intervals for rho?
1. Convert r to fisher's z
2. compute the standard deviation of z'
3. choose an alpha level and find the critical value of z that corresponds to that alpha level
4. construct a confidence interval around z'
5. transform confidence interval around z' back to r using table.
A narrow CI is preferred if you want...
a truly accurate estimate of the correlation.

(Conversely, wider CIs give you greater range w/in which to detect an effect).
When calculating a CI for rho, why do we have to transform the CI around z' back to r at the end?
Unless rho is zero, the distribution of the mean is not normal. The first 2 steps associated with calculating CIs allow us to use a normal distribution, necessitating a switch back at the end.
When calculating CIs for rho, why do we initially convert r to fisher's z' using the r to z table?
this transforms r to a normal distribution.
What is Beta?
population regression coeficient.
What is rho? How is it represented?
population correlation coeficient, looks like a p
Distinguish between Beta and b.
Beta = population regression coeficient

b = sample regression coeficient = slope
CIs are written 1-alpha. If alpha = .05, then what % CI are we calculating?
95%
What does it mean to have a 95% CI?
If you took many samples from the population and computed r and CI for each, 95% of the time, rho will be within the range specified by CI. In other words, 95% of the samples will contain rho.
Describe the 4 assumptions of regression.
LINH

Linear, Independence of residuals, Normal distribution of residuals, Homoscedasticity of jt distribution of residuals

1. There is a linear relationship between x and y

2. For every value of x, the residuals E = Y - Predicted Y are normally distributed with a mean of zero.

3. The joint distribution of residuals is homoscedastic. this means that the spread of the errors around the regression line is constant along the whole length of the line).

4. The residuals are independent of each other. (Error for one person isn't related to error for another person)
The normal error regression model reads Y = alpha + BetaX +E, where E~N(0, sigma squared) IID. What does the IID mean?
Errors are independent of each other and are identically distributed
The normal error regression model reads Y = alpha + BetaX +E, where E~N(0, sigma squared) IID. What does the E~N(0, sigma squared) part mean?
Errors are normally distributed with a mean of zero and a certain fized variance.
One of the assumptions of regression is that for every x, residuals are normally distributed with a mean of zero. Define the term residual. How are residuals represented?
residuals = errors

E = Y - predicted Y

(I think that this E might be an Epsilon).
One of the assumptions of regression is that the joint distribution of residuals is homoskedastic. What does this mean?
There will be equal variance for every value of x in terms of errors around that line.

Variance for each value of x is normally distributed.
For this value of x, the errors will be normally distributed. There will be man points closer to the line and fewer farther out.
This statement reflects which of the assumptions of regression?
For every x, residuals are normally distributed with a mean of zero.
You don't need to meet this assumption of regression if you are just calculating a value, but you must meet this assumption if you are doing a hypothesis test.
Assumption 4: Residuals are independent of each other; error for one person is not related to error for another person.
If power = .37, what does this mean?
If the null is false, we only have a 37% chance of finding that effect.
If we know 3 of the following factors associated with power, the 4th can be determined. What are these 4 factors?
PEAS

1. power of the test
2. effect size in the population
3. alpha level of the test
4. sample size
Why might you do a power analysis on a study that you've already done?
1. power analyses can help you revise plans for study
2. helps you understand the limits of your research (If I can't detect an effect with this sample size, then..)
What are the two common and important types of questions about power that often arise in research.
What is the power of our test?

What sample size do we need?
What 4 things influence r?
1. reliability of measures
2. restriction of range
3. combining samples
4. outliers
Reliability is one of the things that influences r. If you have low reliability, how is r affected?
low reliability, correlation is reduced.
Restricted range is one of the things that wil affect r. If you restrict range, what happens to r?
Restricted range typically lowers r, unless you have a curvilinear relationship (in which case restriction of range could increase r).
Will combining samples reduce or elvate r?
Either, depending on the range.
Outliers can influence r in three different ways. What are these ways that outliers can influence r?
Outliers might:
increase r, decrease r, or not affect it
How do you determine the order of a matrix?
list the number of rows (across) followed by number of columns (up and down).
If you want to direct soeone to a single number inside of a matrix, tell them..
the row and then the column
What types of matrices are the most common in regression?
1. square = have same number of rows and columns

2. vector = a column of numbers or a row of numbers
What are the differences between square and vector matrices?
1. square = have same number of rows and columns; you must have square matrices to divide.

2. vector = a column of numbers or a row of numbers
Have same number of rows and columns
square matrices
Are needed for division of matrices.
square matrices
represent a single column of numbers or a row of numbers
vector matrices
(row vector matrices, column vector matrices).
In order to add/subtract matrices, what must be true?
the natrices must be of the same order
In order to multiply matrices, what must be true?
The number of columns in the first matrix has to equal the number of rows in the second matrix. (ie, 2*3 and 3*4).
In order to divide matrices, what must be true?
Matrices must be square.
What is a scalar?
A scalar is a single number that is used when multiplying matrices.
can you use the communicative property when multiplying matrices?
no
A is post multiplied by D. How is this represented?
A*D
D is premultiplied by A
A*D
Describe the process of transposing a matrix.
When transposing a matrix, rows become columns and columns become rows.
Taking any matrix and multiplying by the transpose gives us...
a square matrix
Matrices cannot be divided directly. How, then, does division occur?
To divide matrices you must multiply the inverse of the divisor by the dividend.
Division of matrices requires multiplication of the inverse of the divisor by the dividend. Distinguish between the divisor and the dividend.
In 100/4, 4 is the divisor and 100 is the dividend.
Division of matrices requires that you have:
1. a square matrix
and
2. the inverse of the divisor matrix.
How are each of these steps achieved?
1. To get a square matrix, premultiply by transpose matrix.
2. We don't need to know how to compute the inverse of the divisor matrix
Any square matrix that has ones on the diagonal and zeros everywhere else.
identity matrix
What is the inverse of a divisor matrix and why is it useful?
Inverse matrix = the reciprocal of the original matrix.

Inverses are useful because if you multiply any matrix by it's inverse, you get an identity matrix. (An identity matrix is any square matrix that has ones on the diagonal and zeros everywhere else.
If you multiply your matrix by this type of matrix, you get your original matrix.
identity matrix
What are 2 important properties of identity matrices?
1. if you multiply any matrix by it's inverse, you get an identity matrix.
2. If you multiply your matrix by this type of matrix, you get your original matrix.
What is the rank of a matrix?
The rank of a matrix tells you the number of rows or columns that give us new info.

Redundant info will reduce rank. (One Example of redudant info includes one row being twice the value of another row).

The highest possible rank is 2.
For square matrices, you can only find the inverse if the matrix is...
full rank (ie, every row and column gives you new info).
A square matrix that doesn't give you any redundant info, contains all independent info.
full rank matrix
What is the difference between full rank and singular matrices?
Full rank: A square matrix in which all rows and columns contain independent info.

Singular matrix: a square matrix that is not of full rank. We cannot compute the inverse of a singular matrix and consequently cannot divide.
How can we determine if a matrix is of full rank (and therefore has an inverse)?
Find the determinant. If the determinant is zero, then the matrix is singular and has no inverse. If the determinant is any number other than zero (negative numbers are fine), then you have a full rank matrix.
In this type of matrix, rows and calumns provide independent information
full rank matrix
In this type of matrix, you can compute an inverse.
full rank matrix.
If you can compute an inverse of a matrix, this means that...
you can use the matrix to do division.
Describe the things that are true of a full rank matrix.
1. Rows and columns provide independent info.
2. You can compute an inverse, which means that you can use matrix to do division.
Describe how to interpret the regression constant (i.e., y intercept).
First, check the range of values for the x variable to see if it can truly go to zero.

*If x cannot be zero, then you need to say that the regression constant is a constant value that we need to add to the slope component in order to predict y more accurately.

If x can go to zero, plug in zero for x and make sure that the resulting value of y can fit into the range given for y.
y predicted = 10.32 +2.41x
where x = social support score
y=depression

In the above equation, how would we interpret the regression coeficient?
For every 1 point increase in social support (x), depression(y) increases by 2.41.
Zy(depression) =
0.41 Zx(social support)

In the above equation, how would we interpret the standardized regression coeficient?
For every one SD increase in social support, depression scores increase by .41 SDs, on average.
Zy(depression) =
0.41 Zx(social support)

In the above equation, what is the correlation between x and y? Why?
r = 0.41

For a single predictor, standardized b is equal to r. (This is not true for unstandardized b).