• Shuffle
    Toggle On
    Toggle Off
  • Alphabetize
    Toggle On
    Toggle Off
  • Front First
    Toggle On
    Toggle Off
  • Both Sides
    Toggle On
    Toggle Off
  • Read
    Toggle On
    Toggle Off
Reading...
Front

Card Range To Study

through

image

Play button

image

Play button

image

Progress

1/95

Click to flip

Use LEFT and RIGHT arrow keys to navigate between flashcards;

Use UP and DOWN arrow keys to flip the card;

H to show hint;

A reads text to speech;

95 Cards in this Set

  • Front
  • Back
Name four types of Intelligent Storage Systems

1. Block-based storage system


2. File-based storage system


3. Object-based storage system


4. Unified storage system

What is an Intelligent Storage System?

A feature-rich RAID array that provides highly optimized I/O processing capabilities.
Features of an Intelligent Storage System

- Supports combination of HDD and SSD
- Service massive amount of IOPS


- Scale-out architecture


- Deduplication, compression, and encryption


- Automated storage tiering


- Virtual storage provisioning


- Multi-tenancy


-Supports APIs to integrate with SDDC and cloud


-Data protection

Large amount of memory

Cache
Two key components of an Intelligent Storage System


Controller


Storage

Types of Controller components within an ISS



Block-based


File-based


Object-based


Unified

Types of Storage components within an ISS


HDDs


SSDs


Combination of both



Four Components that make up a Storage Array (ISS) (Jeffy's Definition)


F.E. Ports


Cache


B.E. Ports


Disks

What is a controller?

A controller is a compute system that runs a purpose-built OS that is responsible for performing several key functions for the storage system.
Components of a Hard Disk Drive


Platter


Spindle


Read/Write Head


HDA--> Head Disk Assembly


Actuator Arm Assembly


Drive Controller Board

Characteristics of a Platter

- Data recorded on these platters is in binary code.


- Data is encoded by polarizing the magnetic area or domains of the disk.


- Data can be written to or read from both surfaces.


- The number of platters and the storage capacity of each platter determine the total capacity of the drive.



Head Disk Assembly (HDA)

A set of rotating platters is sealed in this case.
What is a Spindle?

- A spindle connects all the platters and is connected to a motor.
Function of the Read/Write Head


Read and write data from or to the platters.




Drives have two R/W heads per platter, one for each surface of the paltter.

Head Flying Height

A microscopic air gap that is maintained between the R/W heads and the platters.
What is a Head Crash?

- The magnetic coating on the platter is scratched and may cause damage to the R/W/ head.


- A head crash generally results in data loss.

Actuator Arm Assembly

R/W heads are mounted on the actuator arm assembly, which positions the R/W/ heads at the location on the platter where the data needs to be written or read.
What does the Drive Controller Board consist of?


- Controller Board consists of microprocessor, internal memory, circuitry, and firmware.




- The controller is a printed circuit board, mounted at the bottom of the disk drive.

Tracks per Inch (TPI)

The number of tracks per inch (TPI) on the platter (on track density) measure how tightly the tracks are packed on a platter.

Track is divided into


Sectors




A sector is the smallest, individually addressable united of storage.

How large is a sector typically?

512 bytes of user Data
A customer can never asked you the size of this

Track- the concentric ring on the platter around the spindle.
Cylinder, Head, and Sector (CHS) Number


Refers to specific physical address location on the disk.




EX. Student NAME

Logical Block Address (LBA)


Simplified the addressing by using a linear address to access physical blocks of data.




EX. Student 1

Calculation of Blocks


Sectors per track * heads * cylinders




Block Range 0- ^ that number

Time taken by a disk to complete an I/O request


Seek Time


Rotation latency


Data Transfer Rate

Disk Service Time Formula

Seek Time + Rotational Latency + Data Transfer Time
What is Seek Time?

Describe the time taken to position the R/W heads across the platter with a radial movement (moving along the radius of the platter)




(also called access time)

Full Stroke Seek Time


The Time taken by the R/W/ head to move across the entire width of the disk, from the innermost track to the outermost track.




(Measured in ms)

Average Seek Time

The average time taken by the
R/W head to move from one random track to another, normally listed as the time for one-third of a full stroke.




(Measured in ms)

Track-to-track Seek Time


It is the time taken by the R/W head to move between adjacent tracks.




(Measured in ms)

Short-Stroking


Results in lower usable capacity than the actual capacity of the drive.




EX. 500 GB disk set up to use the first 40%, now treated as a 200 GB Drive.

Average Rotational Latency Formula


(One-half of the time taken for a full rotation)




(1/2*1000)/ (X/60)

What is Rotational Latency?

The time taken by the platter to rotate and position the data under the R/W Head.
Data Transfer Rate

Average amount of data per unit time that the drive can deliver to the HBA.
Internal transfer rate

The speed at which data moves from a platter's service to the internal buffer (cache) of the disk

External Transfer Rate

The rate at which data can move through the interface to the HBA.
Average Response Time Formula

Service Time/ (1-Utilization)
What is a Queue?

The location where an I/P request waits before it is processed by the I/O controller and disk I/O controller processes I/Os waiting in the queue one by one.

Disks required to meet an application's capacity need (DC)




FORMULA


Dc= Total capacity required/ Capacity of a single disk.






Disks required to meet applications performance need (DP)




FORMULA


Dp= IOPS generated by an application at peak workload/ IPOS serviced by a single disk.


IOPS serviced by a disk (S) depends upon disk service time (Ts)




FORMULA


Ts= Seek Time + (0.5/ (Disk rpm/60)) + (Data Block size/ Data tranfers rate)

TS is time taken for an I/O to complete, therefore IPOS services by a disk (S) is equal to (1/TS)




FORMULA


S= 0.7 * (1/Ts)

Disk required for application

Max (DC, DP)
Components of a Solid State Drive (SSD)

I/O Interfact


Controller ( RAM Cache, Drive Controller, Non Volatile Memory)


Mass Storage (Disk)


What is a SSD?

Storage device that contain non-volatile flash memory
What is non-volatile RAM used for?

The non-volatile RAM (NVRAM) is used to store the SSD's operation softwareand data.
Write coalescing

A technique employed within RAM, the process of grouping write I/Os and writing them in a single internal operation versus many smaller-sized write operations.

Flash memory


Retain their contents when powered off.




The mass storage of an array of non-volatile flash memory chips.

Single Level Cell (SLC)
SLC- Type flash typically used in enterprise-rated SSSs for its increased memory speed and longevity.

Multi-Level Cell (MLC)

Slower than SLC, but the advantage of a greater capacity per chip.
Solid State Drive stores bits


Eight bits make up a byte




Page

Physical data object




Is the smallest object that can be read or written on a solid state drive.




Pages do NOT have a standard capacity





Solid State Drive Block


Made up of pages.




32, 64, 128 pages




Only entire blocks may be written or rased on a solid state memory chip.

Page States

Erased (empty)


Valid


Invalid




- In order to write any data to a page, its owning block location on the flash memory chip must be electrically erased.

Garage collection


The process of providing new erased Blocks




- When an invalid page needs to be erased before it can once again be written with new data.

Block States


Erased (Empty)


New


Used

Write Request- Invalidation of a page

If the drive receives a write request to a valid block page, the page must be changed. The current page containing the destination of the write is marked invalid. The block's state changes to "used because it contains invalid pages.
Delete Request - Invalidation of a page

A delete invalidates a page without resulting in a subsequent write




Access Type of SSDs


- SSDs perform random reads the best


- SSDs use all internal I/O in parallel for multi-threaded large blocks I/Os




Drive State of an SSD

New SSD or SSD with substantial unused capacity offers the best performance

SSD Workload duration

SSDs are best for workloads with short burst of activity
Why RAID?

RAID--> Redundant Array of Independent Disks




A technique that combines multiple disk drives into a logical unit (RAID set) and provides protection, performance, or both.



Benefits of RAID


- Provides data protection against drive failures


- Improves storage system performance by serving I/Os from multiple drives simultaneously

Key Function of RAID


- Management and control of drive aggregations


- Translation of I/O requests between logical and physical drives, and data regeneration in the event of drive failures.

Software RAID


Uses compute system based software to provide RAID functions and is implemented at the operating system level.




PROS- cost and simplicity benefits when compared to hardware RAID

Limitations of Software RAID


- Performance (Additional CPU cycles)


- Supported Features (Does not support all RAID levels)


- Upgrades to software ROAD or to the OS must be validate and this leads to inflexibility in the data-processing environment.



Key Functions of RAID Controllers


- Management and control of disk aggregations


- Translation of I/O between logical disk and physical disks


-Data regeneration in the event of disk failures

Controller Card RAID

Compute-system based hardware RAID Implementation which a specialized RAID controller is installed in the compute system, and disk drives are connected to it.
External RAID Controller


A storage-system based hardware RAID




- Acts as an interface between the compute system and the disks. It presented storage volume to compute system, and the compute system manages the volumes as physical drives.

Raid Array

An enclosure that contains a number of disk drives and supporting hardware to implement RAID

RAID Group

A subset of disks within a RAID array can be grouped to form logical associations called logical arrays known as a RAID Set/ RAID Group.
Striping

A technique of spreading data across multiple drives (more than one) in order to user the drives in parallel.

Strip Size

Describes the number of blocks in a strip, and is the maximum amount of data starts at the beginning of the strip.



Also called Stripe Depth


Strip Width

Refers to the number of data strips in a stripe.
Mirroring


A technique whereby the same data is stored on two different disk, yielding two copies of the data.






Parity

A method to protect striped data from disk drive failure without the cost of mirroring.

PROs of Mirroring


- Preferred for mission-critical applications that cannot afford the risk of any data loss.


- Improves read performance because read quests can be serviced by both disks.



PROs of Parity


-Ensures protection of data without maintaining a full set of duplicated data


-Parity calculation is a bitwise XOR operation.


- Considerably reduces the cost associated with data protection



RAID Level - Unprotected


No Raid, 1 Disk, "Because I'm Stupid"



RAID Level 0


Striping




- Good option for applications that need high I/O throughput




Minimum number of disks: 2*


EMC Prefers (3-16)


RAID Level 1


Mirroring




Minimum number of disks: 2




- Suitable for application that require high availability and cost is not a constraint




Write Penalty: 2


Available Storage Capacity 50%


RAID Level 3


Striped set with parallel access and dedicated parity disk




Minimum number of disks: 3* (2+1)


EMC Prefers: (5 or 9) (4+1, 8+1)




Available Storage Capacity: [(n-1)/n]*100

RAID Level 5


Striped set with independent disk access and a distributed parity




Minimum number of disks:




VMAX: 3+1 or 7+1


VNX: 3-16




Available Storage Capacity: [(n-1)/n]*100


RAID Level 6


Striped with independent disk access and dual distributed parity.




Minimum number of disks: 4 (2+2)


EMC Symme. (6+2, 14+2)




Available Storage Capacity: [(n-2)/n]*100

RAID Level 1/0

Nested RAID- known as RAID 10(Ten)




Mirroring THEN Striping




Minimum number of disks: 4




Write Penalty: 2


Available Storage Capacity 50%



VMAX


Local: Timefinder


Remote: SRDF

VNX


Local: Snap view


Remote: Mirrorview

Hot Sparing


A process that temporarily replaces a failed disk drive with a spare drive in a RAID array by taking the identity of the failed disk drive.




- IF parity is used, the data is rebuilt onto the hotspare from the parity and the data on surviving disk drives in the RAID set.


-IF mirroring is used, the data from the surviving mirror use used to copy the data onto the hot spare.

Block-level Access


The file system is created on a compute system, and data is accessed on a network at the block level. Raw disks or logical volumes are assigned to the compute system for creating the file system.

File-level Access


The file system is created on a separate file service or at the storage side, and the file-level request is sent over a network. Because data is access at the file level, this method has higher overhead, when compared to block.

Object-level Access

Data is accessed over a network in terms of self-contained objects with a unique object identifier.




The files system's user component resides on a the computer system and the storage component resides on the storage system.

Scale-Up


Architecture that provides the capability to scale the capacity and performance of a single storage system.


- This involves upgrading or adding controller and storage.

Scale-Out


Architecture that provides capability to maximize its capacity by simply adding nodes to the cluster.


- Nodes are quick to add when more performance and capacity are needed without downtime.


- Distribution of workload across all the nodes.