Use LEFT and RIGHT arrow keys to navigate between flashcards;
Use UP and DOWN arrow keys to flip the card;
H to show hint;
A reads text to speech;
38 Cards in this Set
- Front
- Back
What are the phases of Database Design?
|
Conceptual, Logical, Physical
|
|
Name the steps of conceptual design phase.
|
1. Identify entities
2. Identify relationships 3. Identify and associate attributes with entities and relationships 4.Determine attribute domains 5.Determine candidate, primary, and alternate key attributes 6.Specialize/generalize entities (optional) 7.Check the model for redundancy 8.Check that the model supports user transactions 9.Review the conceptual database design with users |
|
Explain superkey, candidate, primary, and alternate keys.
|
Superkey - a set of columns that uniquely identifies a record in a table.
Candidate key - superkey with only the minimum columns necessary for identification. Primary key - candidate key chosen for unique id of records. Alternate - not chosen candidate key. |
|
Explain strong/weak entities.
|
Strong have their own primary key. Weak depends on other entities for unique identification.
|
|
What is Significance level? What is its threshold.
|
Probability that the relationship is due to chance. p < 0.05
|
|
Explain the difference between patent and copyright.
|
Copyright - automatic, protects manifestation of idea, not the idea itself. Covers software.
Patent - protects the idea behind the invention, needs to be applied for, software and many mental proccesses are not patentable. |
|
What is Triangulation?
|
Data obtained using various research methods.
|
|
What is reliability of research?
|
Replicability. "Will we get the same result each time we measure?"
|
|
What is validity of research?
|
"Are we actually measuring what we claim to be measuring?"
|
|
Types of measurements :
|
Scale ( quantitative)
- in rank order, equal intervals, ratio Ordinal(qualitative) example:Likert scale - rank order, no interval and ratios Nominal(categorical) - for grouping Example: 1- single, 2 - married etc. |
|
Describe measures of central tendency.
|
Mean - average
Median - middle value Mode - most common value |
|
Describe measure of normality.
|
Kurtosis - peakedness, pos. values cluster around center, neg. flat distribution.
Skewness - symmetry. Pos. more low value (skewed tow. left), neg. more high values (right). Mean and St.D. should not be used if distribution is not normal. |
|
What is correlation? What test measures it?
|
Extent to which 2 variables co-vary.
Can be measured by Pearson's Product-Moment Correlation, parametric test. (r) |
|
Describe measures of dispersion.
|
Range - from min to max
Standard deviation - avg. difference from sample mean. In a normal distribution 68% of values lie within 1 SD of mean, 95% - 2 SD. |
|
Independent - dependent variables.
|
Independent variable believed to affect the dependent variable.
Example: coffee consumption(IV) --> insomnia(DV) |
|
When parametric tests can be used?
|
When:
Distributions are normal. Variances are equivalent. Measurements are on a continuos scale. |
|
What test is used to measure the Difference?
|
T - test.
0.01 - small effect 0.06 - moderate effect 0.14 - large effect |
|
What is a DBMS? Examples.
|
Database Management System. An application that allows users to interact with the database.
|
|
Describe a three-tier database model.
|
User interface layer. (the client)
Business logic and data processing layer ( the application server). DBMS (the database server) distributed over different machines. |
|
Describe the three levels of abstraction in ANSI-SPARC database architecture.
|
External level - Different User views.
Conceptual level - community view. Internal level - computer's view. |
|
What is database schema and database instance?
|
Schema - description of the database structure.
Instance - data in the database at a particular point in time. |
|
Explain data independence.
|
Immunity of schemas to changes in different level schemas.
|
|
Explain entity integrity.
|
It is a constraint that states that in a base table no column of a primary key can be null.
|
|
Explain referential integrity.
|
Constraint that states that foreign key values must match a candidate key value of some record in the home(parent) table or be wholly null.
|
|
Explain functional and transitive dependencies.
|
fd - X --> Y , for every X there is only 1 Y, but for every Y may be several X. td - if X --> Y --> Z then Z is transitively dependent on X.
|
|
What are the most common fact-finding techniques.
|
Examining documentation.
Interviewing. Observation. Secondary research. Questionnaires. |
|
What is the degree of a relationship? Provide examples.
|
Number of participating entities. Can be unary(recursive), binary, ternary etc.
|
|
Explain simple, composite, derived attributes.
|
Single component/ multiple component/ calculated from a different attribute.
|
|
What is an entity?
|
A set of objects with the same properties that are identified as having an independent existence.
|
|
Explain multiplicity, cardinality and participation.
|
Number of entity occurrences that relate to other entity occurrences through a particular relationship / number of possible relationships for each participating entity (maximum) / describes whether all or only some entity occurrences participate in a relationship (minimum).
|
|
Explain first, second and third normal forms.
|
1NF - no intersection of column and record may contain more than 1 value
2NF - applies only to tables with composite primary key, all columns must be determined by values in ALL the columns that make up primary key. 3NF - all non-primary key values must be determined ONLY by the primary key column(s) and no others. |
|
What is normalization?
|
Technique for producing a set of tables with minimal redundancy and therefore guards from update anomalies.
|
|
Explain Foreign key, parent, child.
|
Foreign key is a column within one table that is also the candidate key of another table. Child tables are those that contain foreign keys that link to the parent table.
|
|
Explain specialization and generalization.
|
Specialization is the process of maximizing the differences between members of an entity by identifying their distinguishing characteristics.
Generalization is the opposite (minimizing differences by finding common features) |
|
What constraints may apply on a superclass/subclass relationship?
|
Participation - Optional/Mandatory, determines if every occurrence of a superclass must be a member of a subclass.
Disjoint constraint - Or/And, possibility of being a member of multiple subclasses. |
|
What is the purpose of logical database design phase?
|
To describe a set of tables based on the ER model created in conceptual phase.
|
|
Steps of logical database design phase.
|
Create tables.
Check table structures using normalization. Check that the tables support user transactions. Check integrity constraints. Review the logical database design with users. |
|
When these tests are used?
ANOVA, Paired t-test, Chi-square, Independent t-test, Mann-Whitney, Wilcoxon Signed Ranks |
ANOVA - when there are more than two groups and variables
Mann-Whitney - non-parametric equivalent of independent t-test Paired t-test - parametric test to use on the same sample (i.e. before - after treatment) Wilcoxon Signed Ranks - non-parametric equivalent of paired t-test Independent t-test - parametric difference test on two different samples Chi-square is used for nominal or frequency data |