Use LEFT and RIGHT arrow keys to navigate between flashcards;
Use UP and DOWN arrow keys to flip the card;
H to show hint;
A reads text to speech;
35 Cards in this Set
- Front
- Back
___________________is information containing patterns, relationships, and trends
|
Business intelligence
|
|
One _____________is equal to 10^3 bytes
|
Kilobyte
|
|
A type written page is about 2 _______________
|
Kilobytes
|
|
A __________ conists of 10^15 bytes
|
petabyte
|
|
One _________is equal to 10^18 bytes
|
exabyte
|
|
_________systems integrate data from multiple sources, and they process that data by sorting, grouping, summing, averaging, and comparing.
|
Reporting
|
|
Reporting systems improve decision making by ______________________________________________
|
providing the right information to the right user at the right time
|
|
Decision tree analysis is a technique used by _______________ systems.
|
data-mining
|
|
___________systems improve decision making by using the discovered patterns and relationships to anticipate events or to predict future outcomes.
|
Data-mining
|
|
________________computes the correlation of items on past orders to determine items that are frequently purchased together.
|
Market-basket analysis
|
|
The advantage that ___________________have over the other systems is that these foster innovation, improve customer service, increase organizational responsiveness, and reduce costs.
|
knowledge management systems
|
|
Which of the following systems use if/then rules?
|
Expert systems
|
|
If patient_Temperature > 103, Then Initiate High_Fever-Procedure. This sort of a rule is most likely to be found in an ___________ system.
|
Expert
|
|
Problematic data are termed as ______________
|
dirty data
|
|
WhyMe@GuessWhoIAM.org is an example of _____________________
|
dirty data
|
|
Which of the following is a problem commonly associated with operational data that have been gathered over time?
|
Inconsistent
|
|
______________________refers to the degree of summarization or detail.
|
Data granularity
|
|
___________ data is highly summarized.
|
Coarse
|
|
Clickstream data is _____________
|
too fine
|
|
Generally, it is better to have data that is ___________ than data that is ___________
|
too fine, too coarse
|
|
The more attributes there are, the easier it is to build a model that fits the sample data but that is worthless as a predictor. Which of the following best explains this phenomenon?
|
The curse of dimensionality
|
|
A ________________ is a company that obtains data from public and private sources and stores, combines, and publishes it in sophisticated ways.
|
data aggregator
|
|
The purpose of a __________________is to extract and clean data from operational systems and other sources, and to store and catalog that data for processing by BI tools.
|
data warehouse
|
|
Which of the following is true for data warehouses?
|
Data are stored in a data warehouse database using a data warehouse DBMS
|
|
The facts data, such as its source, format, assumptions, constraints, and the like, are called______________________
|
metadata
|
|
A_______________is a data collection that is created to address the needs of a particular business
|
data mart
|
|
___________________are comparable to distributors in a supply chain because they take data from the data manufacturers, clean and process the data, and locate the data on the disks of its computers
|
Data warehouses
|
|
_______________is the application of statistical techniques to find patterns and relationships among data and to classify and predict.
|
Data mining
|
|
Which term is used as a synonym for data mining?
|
Knowledge discovery in databases
|
|
In _____________, statistical techniques identify groups of entities that have similar characteristics.
|
cluster analysis
|
|
_______________________is a common unsupervised data-mining technique.
|
Cluster analysis
|
|
With____________data mining, data miners develop a model prior to the analysis and apply statistical techniques to data to estimate parameters of the model.
|
supervised
|
|
Regression analysis is used in __________________
|
data-mining systems
|
|
___________________measures the impact of a set of variables on another variable.
|
Regression analysis
|
|
_____________________are a data-mining technique used to predict values and make classifications, such as "good prospect" or "poor prospect" customers
|
Neural networks
|