Use LEFT and RIGHT arrow keys to navigate between flashcards;
Use UP and DOWN arrow keys to flip the card;
H to show hint;
A reads text to speech;
79 Cards in this Set
- Front
- Back
Information granularity
|
refers to the extent of detail within the information (fine and detailed or course and abstract)
|
|
4 primary traits that help determine the value of information:
|
1 Information type: transactional and analytical
2. Information timeliness 3. Information quality 4. Information governance |
|
transactional information
|
encompasses all of the information contained within a single business process or unit of work, and its primary purpose is to support daily operational tasks.
|
|
Analytical information
|
encompasses all organizational info and its primary purpose is to support the performing of managerial analysis tasks. Useful when making important decisions such as whether an org. should build a new plant, hire additional salespersons, etc. Makes it possible to do many things that previously were difficult to accomplish. Ex: credit card frauds, companies can tell when a certain card has been stolen/where its used at.
|
|
Realtime information
|
means immediate, up to date information
|
|
real time systems
|
provide real time information in response to requests. Many companies use this to uncover key corporate transactional information.
|
|
One of the biggest pitfalls associated with real time information
|
continual change
|
|
Demand for real time info stems from:
|
organizations' need to make faster and more effective decisions, keep smaller inventories, operate more efficiently, and track performance more carefully.
|
|
True/ false: Business decisions are only as good as the quality of info used to make them.
|
true
|
|
Data inconsistency
|
occurs when the same data element has different values. ex: changing name after marriage, if only change in a few systems and not all then that is inconsistent
|
|
Data integrity issues
|
occur when a system produces incorrect, inconsistent, or duplicate data.
Causes managers to look at other sources for information when making decisions. |
|
5 characteristics common to high quality information
|
- accuracy
-completeness -consistency -timeliness -uniqueness |
|
4 primary reasons for low quality information are
|
-online customers intentionally enter inaccurate info to protect their privacy
-diffferent systems have different info entry standards and formats -data-entry personnel enter abbreviated info to save time or erroneous info by accident -third party and external info contains inconsistencies, inaccuracies, and errors. |
|
Erroneous decisions can
|
cost time, money, reputations, and jobs.
|
|
Consequences that occur due to using low-quality information to make decisions are
|
-inability to accurately track customers
-difficulty identifying the organizations most valuable customers -inability to identify selling opportunities -lost revenue opportunities from marketing to nonexistent customers -cost of sending nondeliverable mail -difficulty tracking revenue because of inaccurate invoices -inability to build strong relationships with customers |
|
high quality info can
|
- significantly improve the chances of making a good decision and directly incrase an orgs' bottom line.
-does not automatically guarantee that every decision made is going to be a good one, because people are not perfect. |
|
data governance
|
refers to the overall management of the availabiltiy, usability , integrity, and security of company data.
|
|
database
|
maintains info about various types of objects (inventory), events (transactions), people (employees), and places (warehouses).
|
|
database management system (DBMS)
|
creates, reads, updates, and deletes data in a database while controlling access and security. Managers send requests to the DBMS and the DBMS perfoms the manipulation of the data in database.
|
|
two primary tools for retrieving info from DBMS:
|
-query by example
-structured query language |
|
Query by example tool
|
that helps users graphically design the answer to a question against a database
|
|
structured query language
|
that asks users to write lines of code to anser questions against a database.
|
|
data element (data field)
|
is the smallest or basic unit of information. Can include customer name, address, email, shipping method, product name, quantity ordered, etc.
|
|
data models
|
logical data structures that detail the relationships among data elements using graphics or pictures.
|
|
metadata
|
provides details about data.
Ex: metadata about an image incudes size, resolution, and date created . |
|
data dictionary
|
compiles all of the metadata about th edata elements in the data model.
|
|
DBMS's 3 primary data models for organizing info:
|
heirarchical, network, and relational database
|
|
relational database model
|
stores information in the form of logically related two-dimensional tables.
|
|
relational database management system
|
allows users to create, read, update, and delete data in a realtional database.
|
|
entity (referred to as a table sometimes)
|
stores information about a eprson, place, thing, transaction, or event.
|
|
attributes(columns or fields)
|
are the data elements associated with an entity.
Ex: under the Entity Musician the attribute would be the musician name, photo, notes, etc. |
|
record
|
collection of related data elements.
|
|
primary key
|
field that uniquely identifies a given record in a table. Critical piece of relational database because they provide a way of distinguishing each record in a table.
|
|
foreign key
|
primary key of one table that appears as an attribute in another table and acts to provide a logical relationship between the two tables.
|
|
advantages of using relational database vs. spreadsheet/ word doc
|
-increased flexibility
-increased scalability and performance -reduced information redundancy -increased info intefity - increased info security |
|
physical view
|
of information deals with the physical storage of information on a storage device.
|
|
logical view
|
of information focuses on how individual users logicallyaccess information to meet their own particular business needs.
|
|
data redundancy
|
is the duplication of data or the storage of the same data in multiple places
|
|
one primary goal of a database is to
|
eliminate info redundancy by recording each piece of info in only one place in the database. Saves disk space, makes performing info updates easier, and improves info quality
|
|
Information integrity
|
a measure of the quality of information
|
|
Integrity constraints
|
rules that help ensure the quality of information. Database design needs to consider these constraints.
|
|
two types of integrity constraints
|
-relational
- business critical |
|
relational integrity constraints
|
rules that enforce basic and fundamental information-based constraints. Ex: this constraint would not allow someone to create an order for a nonexistent customer
|
|
Business-critical integrity constraints
|
business rules vital to an organization's success, and often require more insight and knowledge than relational integrity constraints. Ex: Grocery store not accepting returns of goods past 15 days because of spoilage dates.
|
|
True/ False : security risks are increaseing as more and more databases and DBMS systems are moving the data centers run in the cloud.
|
true
|
|
data-driven website
|
is an interactive website kept constantly updated and relevant to the needs of its customers using a database. Can help limit the amount of info displayed to customers based on unique search requirements
|
|
data driven website advantages:
|
- easy to manage content
-easy to store large amounts of data -easy to eliminate human error -costs less |
|
data warehouse
|
logical collection of information, gathered from many different operational databases that supports business analysis activities and decision-making tasks. Primary purpose of a data warehouse is to combine info, specifically strategic info, throughout an organization into a single repository in such a way that the people who need that info can make decisions and undertake business analysis.
|
|
Extraction, transformation, and loading
|
process that extracts information from internal and external databases, transforms it using a common set of enterprise definitions and loads it into a data warehouse. Data warehouse then sends it to data marts.
|
|
data marts
|
contains a subset of data warehouse information. functional focus.
|
|
cube
|
common term for the representation of multidimensional information.
|
|
information cleansing (scrubbing)
|
process that weeds out and fixes or discards inconsistent, incorrect, or incomplete information.
|
|
data quality audits
|
firms complete this to determine the accuracy and completeness of its data
|
|
data mining
|
the process of analyzing data to extract information not offered by the raw data alone. Companies use data mining techniques to compile a complete picture of their operations, all within a single view, allowing them to identify trends and improve forecasts.
|
|
data mining tools
|
use a variety of techniques to find patterns and realtionships in large volumes of information that predict future behavior and guide decision making.
|
|
Ex of data mining
|
Netflix uses this to analyze each customer's film viewing habits to provide recommendations for other customers with cinematch, its movie recommendation system.
|
|
structured data
|
data mining occurs on this that is already in a database or a spreadsheet.
|
|
unstructured data
|
do not exist in a fixed location and can include text documents, PDF's, voice messages, emails, and so on.
|
|
text mining
|
analyzes unstructured data to find trends and patterns in words and sentences.
|
|
web mining
|
analyzes unstructured data associated with websites to identify consumer behavior and website navigation.
|
|
3 common forms for mining structured and unstructured data
|
-cluster analysis
-association detection -statistical analysis |
|
cluster analysis
|
a technique used to divide information sets into mutually exclusive groups such that the members of each group are as close together as possible to one another and the different groups are as far apart as possible. Helps orgs identify customers with similar behavior traits, such as clusters of best customers or onetime customers. zip codes offer valuable insight into such things as income levels, demographics, lifestyles, etc.
|
|
unstructured data
|
do not exist in a fixed location and can include text documents, PDF's, voice messages, emails, and so on.
|
|
association detection
|
reveals the relationship between variables along with the nature and frequency of the relationships. Referred to as rule generators b/c they create rules to determine the lilihood of events occuring together, or following each other.
|
|
text mining
|
analyzes unstructured data to find trends and patterns in words and sentences.
|
|
Market basket anaysis
|
analyzes such items as websites and checkout scanner information to detect customers' buying behavior and predict future behavior by identifying affinities among customers' choices of products and services.
|
|
web mining
|
analyzes unstructured data associated with websites to identify consumer behavior and website navigation.
|
|
statistical analysis
|
performs such functions as information correlations, distributions, calculations, and variance analysis.
|
|
3 common forms for mining structured and unstructured data
|
-cluster analysis
-association detection -statistical analysis |
|
time series information
|
is timestamped info collected at a particular frequency. Ex: Web visits per hour, sales per month.
|
|
cluster analysis
|
a technique used to divide information sets into mutually exclusive groups such that the members of each group are as close together as possible to one another and the different groups are as far apart as possible. Helps orgs identify customers with similar behavior traits, such as clusters of best customers or onetime customers. zip codes offer valuable insight into such things as income levels, demographics, lifestyles, etc.
|
|
forecasts
|
are predictions based on time-series information.
|
|
association detection
|
reveals the relationship between variables along with the nature and frequency of the relationships. Referred to as rule generators b/c they create rules to determine the lilihood of events occuring together, or following each other.
|
|
Market basket anaysis
|
analyzes such items as websites and checkout scanner information to detect customers' buying behavior and predict future behavior by identifying affinities among customers' choices of products and services.
|
|
statistical analysis
|
performs such functions as information correlations, distributions, calculations, and variance analysis.
|
|
time series information
|
is timestamped info collected at a particular frequency. Ex: Web visits per hour, sales per month.
|
|
forecasts
|
are predictions based on time-series information.
|
|
True/ False
: organizations find themselves in the position of being data rich and information poor |
true
|
|
solution to being data rich and information poor is
|
business intelligence
|