Use LEFT and RIGHT arrow keys to navigate between flashcards;
Use UP and DOWN arrow keys to flip the card;
H to show hint;
A reads text to speech;
211 Cards in this Set
- Front
- Back
Alanine
|
A
|
|
Arginine
|
R
|
|
Asparagine
|
N
|
|
Aspartic Acid
|
D
|
|
Cysteine
|
C
|
|
Glutamic acid
|
E
|
|
Glutamine
|
Q
|
|
Proline
|
P
|
|
Serine
|
S
|
|
Threonine
|
T
|
|
Tryptophan
|
W
|
|
Tyrosine
|
Y
|
|
Valine
|
V
|
|
Unknown
|
X
|
|
Glycine
|
G
|
|
Histidine
|
H
|
|
Isoleucine
|
I
|
|
Leucine
|
L
|
|
Lysine
|
K
|
|
Methionine
|
M
|
|
Phenylalanine
|
F
|
|
Purine
|
R
|
|
Pyrimidine
|
Y
|
|
G or A
|
R
|
|
T or C
|
Y
|
|
Amino
|
M
|
|
A or C
|
M
|
|
Keto
|
K
|
|
G or T
|
K
|
|
Strong
|
S
|
|
G or C
|
S
|
|
A or T
|
W
|
|
Weak
|
W
|
|
Not G
|
H
|
|
Not A
|
B
|
|
Not T
|
V
|
|
Not C
|
D
|
|
What was Sanger sequencing invented?
|
1977
|
|
What is the difference between a transcriptome and a proteome?
|
Transcriptome is the entire set of transcribed sequences produced by the genome whereas a proteome is the entire set of proteins encoded by the genome.
|
|
Two traits are homologous if...
|
they are derived from a common ancestor.
|
|
T/F. Homology can be measured.
|
F
|
|
What is homoplasy and is it common?
|
Independent origin of similar characters between species. Yes.
|
|
Wings and feathers are an example of which character state for birds?
|
apomorphy
|
|
Human speech is an example of which character state?
|
autapomorphy
|
|
Possession of vertebrae in zebras, orangutans and cheetahs is an example of which character state?
|
plesiomorphy
|
|
An example of __________ is the five toes seen on the hind legs of rats and apes. This character-state originated very early in Tetrapoda and occurs in other tetrapod groups, e.g. in lizards
|
symplesiomorphy
|
|
Which character state allows for phylogenies to be built?
|
synapomorphy
|
|
How many possible transversions are there?
|
8
|
|
How many possible transitions are there?
|
4
|
|
What do the 4 aliquots in Maxam & Gilbert sequencing detect?
|
G, A&G, C&T, C
|
|
What is the limitation for Maxam and Gilbert sequencing?
|
requires a cloning step (amplification and labeling)
|
|
T/F. Maxam and Gilbert sequencing is allowed to go to completion.
|
F
|
|
T/F. Sanger sequencing requires primers.
|
T
|
|
Which sequencing method uses dideoxyriboneucleotide triphosphates?
|
Sanger
|
|
What do Sanger and Maxam & Gilbert sequencing have in common?
|
radiolabels, gel electrophoresis, autoradiography
|
|
How does Sanger affect DNA replication?
|
ddNTPs stop DNA replication at different sites
|
|
Is it more efficient to use flurophores or radiolabelling for Sanger sequencing? Why?
|
fluorophores, only 1 lane needed, can use capillaries and a laser, fluorescence recorded in a graph and read by a computer
|
|
Why are Sanger samples bad at the end of a trace?
|
fluorophores are dying
|
|
Why are Sanger samples bad at the beginning of a trace?
|
primers show up in the trace
|
|
What is the main issue with Sanger sequencing? How can this be avoided?
|
stacking of nucleotides; try to sequence the other strand
|
|
T/F. Purines stack more than pyrimidines.
|
T
|
|
A probability of 1% error has a Phred score of ____.
|
20
|
|
A Phred score of 30 indicaties a probability of ____ error.
|
0.1%
|
|
Which type of sequencing inserts genome fragments into BACS?
|
hierarchical sequencing
|
|
What are some advantages of shotgun sequencing?
|
Can use computers instead of people; sequence in both directions to overcome stacking; assembly of all reads simultaneously
|
|
What are some disadvantages of shotgun sequencing?
|
mask repeats such as LINES, SINES, microsatellites etc
|
|
Contigs can be assembled into ___ on the chromosome during Sanger sequencing.
|
scaffolds
|
|
T/F. Contig is another word for scaffold.
|
F.
|
|
What is the role of FrameD?
|
makes guesses about the role of DNA
|
|
How does alpha-satellite data relate to sequencing?
|
we have lots of it, can be repeated up to 10 million times in a row in Drospophila therefore we don't sequence this none of the currently "complete" eukaryotic genomes are actually complete, often not found in prokaryotes
|
|
Where do alpha-satellite repeats often occur?
|
heterochromatin (centromeres and telomeres)
|
|
What began the revolution in sequencing leading to NGS?
|
pyrosequencing
|
|
Which aspect of DNA replication does pyrosequencing make use of?
|
The pyrophosphate released as the DNA strand grows
|
|
What turns pyrophosphates into ATP?
|
sulfurylase
|
|
Which gene causes fluoresces in the presence of ATP? Which sequencing technique makes use of this?
|
luciferase; 454
|
|
What is emPCR? Which sequencing method makes use of this?
|
adapters added to both ends of DNA frags, adapters attached to beads, oil added (emulsion mixture), water droplets contain only 1 bead and 1 gene, PCR occurs in each droplet, replicated DNA cannot escape due to oil, millions of PCRs in one tube, beads put on plate, 1 bead in each well, sulfurylase and luciferase added, nuc washed over, camera takes a picture, rinse plate, add next nuc. etc; 454
|
|
What is a problem with 454?
|
Hard to tell if there were 2 of the same nucleotide in a row
|
|
Which method replaced 454?
|
Illumina Solexa
|
|
Compare terminator nucs and ddNTPs?
|
Both interrupt DNA replicaiton hoewever terminator nucs are reversible.
|
|
Which sequencing method makes use of bridge amplification?
|
Illumina Solexa
|
|
Describe the illumina solexa technique.
|
DNA fragmented, adapters ligated to both ends of ds DNA frags, dense lawn of oligos on slide bind to adapters of ss frags, nucs and enzyme added,frags extended to create copies, frags clonally amplified,, DNA denatured (no more bridges), all 4 fluorescently labelled terminators are added, primers and polymerase, laser excitation, camera captures fluorescence from each cluster, rinse plate, add next terminator, etc.
|
|
What is a problem with illumina solexa?
|
analyzing data is a nightmare (look through sequential pictures)
|
|
T/F. Nucleotide runs create a problem for both 454 and illumina solexa.
|
F. Only a problem for 454
|
|
Which sequencing method has the following steps: library preparation, cluster generation, sequencing.
|
Illumina Solexa
|
|
Explain the SOLiD system.
|
DNA sheared to desired size, adapters are ligated to ends of desired frags, each fragment hybridized to a bead, emPCR, beads attached to glass slide, universal primer, di-base probes and ligase added, probe hybridizes to template and is ligated, fluorescence measured, dye cleaved off, leaving free 5' phosphate for further reactions, 7 cycles, primer reset (1 base shift)
|
|
Which sequencing detects 2 nucleotides at a time?
|
SOLiD
|
|
T/F. 454 has high throughput.
|
F
|
|
You want to sequence a short fragment. Would you use Sanger or 454. For a long fragment?
|
Sanger; 454
|
|
What has a higher read length, 454 or lllumina?
|
454
|
|
Which uses more lanes per cell, 454 or Illumina?
|
454
|
|
Is it best to sequence short fragments with 454 or Illumina?
|
Illumina
|
|
What is more accurate, SOLiD or Illumina?
|
SOLiD
|
|
T/F. Illumina and SOLiD have long read lengths.
|
F
|
|
Which sequencing technique makes use of the H+ released as the DNA strand grows?
|
Ion Torrent
|
|
Describe Ion Torrent Sequencing.
|
amplify on beads, flow nucs. one at a time on a semiconductive plate, measure electricity when H+ is released, no signal if the nucleotide is not a match.
|
|
T/F. Ion Torrent sequencing has similar limitations to 454.
|
F
|
|
Which sequencing technique follows a single polymerase as it sequences just one template?
|
Pacific Biosci
|
|
T/F. A limitation of Pacific Biosci sequencing is amplification.
|
F
|
|
What is a pro and a con of Pacific Biosci sequencing?
|
long reads; high error rate (single molecule)
|
|
Which sequencing techniques allows you to determine the sequence and the speed of sequencing?
|
Pacific Biosci
|
|
What are microarrays used for?
|
determining gene expression for many genes at once
|
|
Describe microarrays.
|
hybridixation of cDNA from a sample onto a slide spotted with probes, fluorescence intensity reflects level of gene expression
|
|
Which sequencing method sequences cDNA?
|
RNA-sequencing
|
|
Which sequencing method found genes that were not previously there?
|
RNA-sequencing
|
|
Which sequencing method found that some gene annotations were incorrect?
|
RNA-sequencing
|
|
T/F. Microarrays give absolute gene expression values.
|
F. only relative
|
|
Which sequencing method gives you absolute values of gene expression?
|
RNA-sequencing
|
|
Name the 3 nucleotide sequence dbs.
|
EMBL-EBI, NCBI, DDBJ
|
|
Which db is responsible for the accessibility of GenBank?
|
NCBI
|
|
What does NCBI stand for?
|
national center for biotechnology information
|
|
T/F. There has been an exponential growth in data due to sequencing centers.
|
T
|
|
T/F. NCBI is curated.
|
F ***
|
|
Which db has a collection of 3D structures of proteins and nucleic acids?
|
RCSB PDB
|
|
What does RCSB PDB stand for?
|
research collaboratory for structural bioinformatics protein databank
|
|
What does an RCSB PDB entry include?
|
summary of 3D structure. methods used to id structure, details on sequence, geometry details
|
|
Which db documents protein families, domains and sites?
|
InterPro
|
|
Name the db of protein domains, families and functional sites.
|
Prosite
|
|
What is the Gene Ontology Project?
|
aim to assign info about a gene product using a controlled vocab
|
|
T/F. The Gene Ontology Project uses curated evidence.
|
T
|
|
T/F. Swiss-Prot is curated.
|
T
|
|
What does OMIM stand for?
|
online medelian inheritance in man
|
|
Name a catalog of human genes and genetic disorders.
|
OMIM
|
|
PFAM??
|
????
|
|
What does KEGG stand for?
|
Kyoto Encyclopedia of Genes and Genomes
|
|
Which db contains info on chemical reactions?
|
KEGG
|
|
Which db contains info on pathways?
|
KEGG
|
|
Who developed Entrez?
|
NCBI
|
|
Which db allows the access of data domains, nucleotide and prot sequs, complete genomes and 3D structures?
|
Entrez
|
|
What is the goal of Entrez?
|
identify a representative, well-annotated mRNA sequence record
|
|
Which db contains seqs from GenBank, RefSeq and PDB?
|
Entrez
|
|
Why is it important to refine Entrez searches?
|
contains sequences from many other dbs (not all are curated) therefore quality varies, can be a high degree of redundancy
|
|
Which option would allow you to restrict your Entrez search to entries from RefSeq only?
|
"limits"
|
|
Which Entrez link allows you to find variations in your gene of interest?
|
SNP
|
|
What does it mean if the "structure" link is not present in an Entrez entry?
|
no 3D protein structure record associated
|
|
Which sequencing method is a cheaper but effective alternative to whole genome sequencing?
|
exome sequencing
|
|
What is the goal of exome sequencing?
|
selectively sequence coding regions of the genome
|
|
Which method combines exome sequencing and RNA sequencing?
|
RNA captureseq
|
|
Which sequencing technique uses tiling arrays?
|
exome sequencing and RNA CaptureSeq
|
|
Which technique is best at sequencing rare transcripts?
|
RNA CaptureSeq
|
|
Which chemical changes cytosines to uracils?
|
bisulfite
|
|
T/F. Methylated cysteines are easily de-aminnated.
|
F. they resist it
|
|
About how much does a disposable sequencer cost?
|
$900
|
|
Which sequencing technique generates the most yield per run?
|
Illumina
|
|
Which has the best accuracy, Illumina, ion torrent or pacbio? Which has the worst?
|
Illumina; Pacbio
|
|
Which has the highest read length, Illumina, ion torrent or pacbio? Which has the worst?
|
Pacbio; Illumina
|
|
What does EMBL-EBI stand for?
|
European Molecular Biology Laboratory - European Bioinformatics Institute
|
|
T/F. EMBL is redundant.
|
T?
|
|
Which db gives term info, ancestor chart, ancestor tablem child terms, protein annotation and statistices?
|
gene ontology
|
|
What does the P code of GO indicate? The I code?
|
part of; is a
|
|
T/F. Swissprot is curated.
|
T
|
|
Which db includes sequence data, references and taxonomic data?
|
Swissprot
|
|
Which has higher quality, PfamA or B?
|
A
|
|
Which db collects protein families based on protein domain and represents them by multiple alignments?
|
Pfam
|
|
Which db has HMM logos?
|
Pfam
|
|
T/F. GenBank displays homology information.
|
F
|
|
What was GenBank developed for?
|
a single nucleic acid sequence
|
|
T/F. The GenBank locus name is unique.
|
T
|
|
T/F. Accession numbers change as entries are updated.
|
F
|
|
Which part of a GenBank entry changes as an entry is updated?
|
GI number
|
|
What designates the end of an entry in GenBank?
|
//
|
|
T/F. Genbank format can display more than one gene in a single sequence.
|
T
|
|
What is the difference between the EMBL and GenBank formats?
|
EMBL is the same as GenBank format but with 2 letter codes (ex. AC = accession number)
|
|
What is the simplest format used?
|
FAST / Pearson
|
|
T/F. FASTA files store feature information.
|
F
|
|
In which file does a sequence start with a > ?
|
FASTA
|
|
Which format is adapted for high-throughput short reads?
|
FASTQ
|
|
What do FASTQ sequences start with?
|
@
|
|
Which format stores Phred scores?
|
FASTQ
|
|
Which format can store alignment information, has 2 different formats and can contain many sequences/taxa at once?
|
Phylip
|
|
Which format was made to be read by computers?
|
Abstract syntax notation (ASN1)
|
|
Which file stores 3D position of each aa, primary and secondary structures and crystallographic experiemnts?
|
PDB
|
|
L 9 10 11 12
|
go through fcs, go through practice probs
|
|
What does COSMIC stand for?
|
catalogue of somative mutations in cancer
|
|
What does BGI stand for?
|
Beijing Genomics Institute
|
|
What is the aim of the ENCODE project?
|
describe all functional elements encoded in human genome
|
|
T/F. Locus IDs change with revisions.
|
T
|
|
How many translation tables does NCBI deal with?
|
18
|
|
Which software program performs statistical phylogenetic inference and molecular evolutionary analyses?
|
PAML
|
|
If p <0.05, is the protein under positive or negative selection?
|
positive
|
|
Which is a useful db of homology genes?
|
HomoloGene
|
|
T/F. Homology implies similarity and vice versa.
|
F
|
|
How can homologous sites be identified?
|
through alignment
|
|
What is a caveat of dot plots? How can this be solved?
|
only 4 nucs therefore many dots; divide sequences into words (identity plots)
|
|
What are the benefits of using words in a dot plot?
|
can align distantly related species, can find repeat elements
|
|
Was there a deletion in sequence 1 or 2? \ .......................................................\
|
1
|
|
What is the highest value for a NW algorithm?
|
8
|
|
What does it mean if you skip a row or column while doing a NW algorithm?
|
gap
|
|
What is a global alignment?
|
try to align all of sequence 1 with all of sequence 2; short or highly similar sequences may be missed
|
|
Which algorithm allows the creation of local alignments?
|
Smith Waterman
|
|
What is the main diff. between SW and NW?
|
SW assigns negative weight to mismatches and/or gaps
|
|
What do local alignments try to maximize?
|
number of nucleotide matches for the substrings
|
|
Which type of alignment is more likely to identify a homologous region in catalytic domain?
|
global
|
|
What should have a greater penalty, a gao or a substitution?
|
gap
|
|
What does a gap correspond to?
|
an indel
|
|
Is it better to have staggered or consecutive gaps?
|
staggered
|
|
Does MM do a local or global alignment?
|
global
|
|
T/F. It is often useful to change cds DNA to its protein sequence first when doing alignments.
|
T
|
|
How can you test alignment significance?
|
permutation test (randomly arrage one seq, align them, repeat)
|
|
When should gaps not be penalized?
|
When 2 seqs have no obvious relationship at either end
|
|
Which alignment method is preferred when using two known to be homologous sequences?
|
global
|
|
T/F. An optimal alignment is statistically significant.
|
F
|
|
T/F. An alignment demonstrates honology.
|
F. not necessarily, demonstrates similarity though
|
|
What can multiple sequence alignments be used for?
|
contig assembly, phylogenetic reconstructions, pattern finding
|
|
Which methods to multiple sequence alignments use?
|
heuristic and probabilistic
|
|
For multiple alignments, computational time required grows as ____.
|
number of nucleotides x number of sequences grows
|
|
Which method uses pairwise alignments of all sequences and uses a distance matrix to contruct a phylogenetic tree.
|
Progressive global alignment
|
|
T/F. The tree has a strong influence on alignment.
|
F
|
|
Should you use a clustal tree for your studies
|
No
|
|
Which method aligns subgroups in a global alignment?
|
iterative global alignment
|
|
What is MUSCLE?
|
iterative global alignment
|
|
Does Clustal include gaps?
|
No
|