• Shuffle
    Toggle On
    Toggle Off
  • Alphabetize
    Toggle On
    Toggle Off
  • Front First
    Toggle On
    Toggle Off
  • Both Sides
    Toggle On
    Toggle Off
  • Read
    Toggle On
    Toggle Off
Reading...
Front

Card Range To Study

through

image

Play button

image

Play button

image

Progress

1/38

Click to flip

Use LEFT and RIGHT arrow keys to navigate between flashcards;

Use UP and DOWN arrow keys to flip the card;

H to show hint;

A reads text to speech;

38 Cards in this Set

  • Front
  • Back
What are the phases of Database Design?
Conceptual, Logical, Physical
Name the steps of conceptual design phase.
1. Identify entities
2. Identify relationships
3. Identify and associate attributes with entities and relationships
4.Determine attribute domains
5.Determine candidate, primary, and alternate key attributes
6.Specialize/generalize entities (optional)
7.Check the model for redundancy
8.Check that the model supports user transactions
9.Review the conceptual database design with users
Explain superkey, candidate, primary, and alternate keys.
Superkey - a set of columns that uniquely identifies a record in a table.
Candidate key - superkey with only the minimum columns necessary for identification.
Primary key - candidate key chosen for unique id of records.
Alternate - not chosen candidate key.
Explain strong/weak entities.
Strong have their own primary key. Weak depends on other entities for unique identification.
What is Significance level? What is its threshold.
Probability that the relationship is due to chance. p < 0.05
Explain the difference between patent and copyright.
Copyright - automatic, protects manifestation of idea, not the idea itself. Covers software.
Patent - protects the idea behind the invention, needs to be applied for, software and many mental proccesses are not patentable.
What is Triangulation?
Data obtained using various research methods.
What is reliability of research?
Replicability. "Will we get the same result each time we measure?"
What is validity of research?
"Are we actually measuring what we claim to be measuring?"
Types of measurements :
Scale ( quantitative)
- in rank order, equal intervals, ratio
Ordinal(qualitative)
example:Likert scale
- rank order, no interval and ratios
Nominal(categorical)
- for grouping
Example: 1- single, 2 - married etc.
Describe measures of central tendency.
Mean - average
Median - middle value
Mode - most common value
Describe measure of normality.
Kurtosis - peakedness, pos. values cluster around center, neg. flat distribution.
Skewness - symmetry. Pos. more low value (skewed tow. left), neg. more high values (right).
Mean and St.D. should not be used if distribution is not normal.
What is correlation? What test measures it?
Extent to which 2 variables co-vary.
Can be measured by Pearson's Product-Moment Correlation, parametric test. (r)
Describe measures of dispersion.
Range - from min to max
Standard deviation - avg. difference from sample mean. In a normal distribution 68% of values lie within 1 SD of mean, 95% - 2 SD.
Independent - dependent variables.
Independent variable believed to affect the dependent variable.
Example:
coffee consumption(IV) --> insomnia(DV)
When parametric tests can be used?
When:
Distributions are normal.
Variances are equivalent.
Measurements are on a continuos scale.
What test is used to measure the Difference?
T - test.
0.01 - small effect
0.06 - moderate effect
0.14 - large effect
What is a DBMS? Examples.
Database Management System. An application that allows users to interact with the database.
Describe a three-tier database model.
User interface layer. (the client)
Business logic and data processing layer ( the application server).
DBMS (the database server) distributed over different machines.
Describe the three levels of abstraction in ANSI-SPARC database architecture.
External level - Different User views.
Conceptual level - community view.
Internal level - computer's view.
What is database schema and database instance?
Schema - description of the database structure.
Instance - data in the database at a particular point in time.
Explain data independence.
Immunity of schemas to changes in different level schemas.
Explain entity integrity.
It is a constraint that states that in a base table no column of a primary key can be null.
Explain referential integrity.
Constraint that states that foreign key values must match a candidate key value of some record in the home(parent) table or be wholly null.
Explain functional and transitive dependencies.
fd - X --> Y , for every X there is only 1 Y, but for every Y may be several X. td - if X --> Y --> Z then Z is transitively dependent on X.
What are the most common fact-finding techniques.
Examining documentation.
Interviewing.
Observation.
Secondary research.
Questionnaires.
What is the degree of a relationship? Provide examples.
Number of participating entities. Can be unary(recursive), binary, ternary etc.
Explain simple, composite, derived attributes.
Single component/ multiple component/ calculated from a different attribute.
What is an entity?
A set of objects with the same properties that are identified as having an independent existence.
Explain multiplicity, cardinality and participation.
Number of entity occurrences that relate to other entity occurrences through a particular relationship / number of possible relationships for each participating entity (maximum) / describes whether all or only some entity occurrences participate in a relationship (minimum).
Explain first, second and third normal forms.
1NF - no intersection of column and record may contain more than 1 value
2NF - applies only to tables with composite primary key, all columns must be determined by values in ALL the columns that make up primary key.
3NF - all non-primary key values must be determined ONLY by the primary key column(s) and no others.
What is normalization?
Technique for producing a set of tables with minimal redundancy and therefore guards from update anomalies.
Explain Foreign key, parent, child.
Foreign key is a column within one table that is also the candidate key of another table. Child tables are those that contain foreign keys that link to the parent table.
Explain specialization and generalization.
Specialization is the process of maximizing the differences between members of an entity by identifying their distinguishing characteristics.
Generalization is the opposite (minimizing differences by finding common features)
What constraints may apply on a superclass/subclass relationship?
Participation - Optional/Mandatory, determines if every occurrence of a superclass must be a member of a subclass.
Disjoint constraint - Or/And, possibility of being a member of multiple subclasses.
What is the purpose of logical database design phase?
To describe a set of tables based on the ER model created in conceptual phase.
Steps of logical database design phase.
Create tables.
Check table structures using normalization.
Check that the tables support user transactions.
Check integrity constraints.
Review the logical database design with users.
When these tests are used?
ANOVA, Paired t-test, Chi-square, Independent t-test, Mann-Whitney, Wilcoxon Signed Ranks
ANOVA - when there are more than two groups and variables
Mann-Whitney - non-parametric equivalent of independent t-test
Paired t-test - parametric test to use on the same sample (i.e. before - after treatment)
Wilcoxon Signed Ranks - non-parametric equivalent of paired t-test
Independent t-test - parametric difference test on two different samples
Chi-square is used for nominal or frequency data