General Exam Review: DNA

I.                   Background

a.       1860s – Gregor Mendel, garden peas, hypothesized “genes” control genetic inheritance (work not accepted)

b.       Late 1800s – chromosomes & division (mitosis & meiosis) visualized by newly developed tools à scientists convinced hereditary material = chromosomes.

c.       Chromosomes found to be composed of protein + DNA. Which carries hereditary info? They believed protein – bc it’s more complex (20 subunits vs. 4).

d.       Late 1940's/ early 1950's – evidence that DNA carries hereditary info (Hershey & Chase, 1952, radioactive labeling – T2 phage grown in presence of 32P & 35S. Phages infected bacteria. DNA label was seen injected into the cell, protein label was not. Phage derived DNA caused infected cells to produce new phage particles). 

e.       1951 – Structure determined by Watson & Crick using X-ray crystallography data of Cricks & Rosalind Franklin to decipher DNA structure. Published "Molecular Structure of Nucleic Acids: A Structure for Deoxyribose Nucleic Acid" in the British journal Nature (April 25, 1953. volume 171:737-738.) Cited over 800 times J 

II.                Structure

a.       All living things store their hereditary info in form of double stranded DNA

b.       Nitrogenous Bases: Adenine, Thymine, Cytosine, Guanine (note: aromatic, lower affinity for water = found on inside of the helix)

                                                               i.      Purines: A, G à The 9 atoms that make up the fused rings (5 carbon, 4 nitrogen) are numbered 1-9. All ring atoms lie in the same plane.

                                                              ii.      Pyrimidines: C, T (or U) à The 6 atoms (4 carbon, 2 nitrogen) are numbered 1-6. All ring atoms lie in the same plane.

                                                            iii.      Binding: A binds T (or U), G binds

                                                            iv.      http://www.blc.arizona.edu/Molecular_Graphics/DNA_Structure/DNA_Tutorial.HTML#Purine



c.       Nucleotide =

               Dexyribose

              (no O on 2C)

             5-carbon sugar

                      +

                      Phosphate

                       +

             Nitrogenous base

 


d.       Polynucletide

                                                               i.      Sugars are linked to the next phosphate group = sugar-phosphate backbone.

                                                             ii.      Two chains, complementary, antiparallel, rotate on same axis -double helix.

                                                            iii.      Sugar: 2C or 3C projects out of the plane of atoms. Endo-confirmation = same direction as 5C, Exo-conformation = opposite direction as 5C.

                                                           iv.      Bonds in DNA

1.       Phosphoester bond between phosphate O & 5C of sugar

2.       Phosphoester bond between 3C of sugar & next NT

 

a.       Linkage is “3’5’phosphodieser bond” à gives directionality, 5’ hydroxyl group at one end, 3’ at other end.

b.       Energy needed for this bond provided by nucleoside (base+sugar) triphosphate (3Ps).

c.       Diphosphate is removed from a NT-triphosphates to add a NT.

d.       Note: 2Ps from diff PO4’s share an O = phosphodiester bond versus P–O bond in a PO4, which is a phosphomonoester bond.

 

3.       Covalent bonds between adjacent NTs of same strand

4.       H-bonds between adjacent NTs of complementary strand

5.       H-bonds between adjacent bases in same strand


 

 


                                                             v.      Nucleic acid (such as deoxyribonucleic acid) is chain of NTs.

                                                           vi.      DNA written 5' à 3'

                                                          vii.      One 360° helical turn = 10 base pairs (“Repeat”) = 3.4 nm or 34Å (“Pitch”)

                                                        viii.      Distance from one base to adjacent base in the helix = 3.4Å (“Rise”)

                                                           ix.      Bases are not exactly on central axis of the helix, this results in the major & minor grooves

                                                             x.      Hydration requirement

1.        Hydration very important for conformation and utility of nucleic acids

2.        Strength of these aqueous interactions is greater than for proteins due to their highly ionic character

3.        Hydration is greater and more strongly held around the phosphate groups that run along the inner edges of the major grooves.

4.        Hydrating water is held in a cooperative manner along the double helix in both the major and minor grooves, which aids both the zipping (annealing) and unzipping (unwinding) of the double helix.

5.        Altering hydration can change DNA structure.

 

e.       Gene is segment of DNA corresponding to a single protein (or single catalytic or structural RNA molecule)

                                                               i.      30,000 genes in human genome – allows efficient TXN. Expression of individual genes is regulated. Stretches of regulatory DNA are interspersed among coding segments. Noncoding regions bind to proteins that ctrl TXN.

                                                             ii.      477 genes in smallest known genome – Mycoplasma genitalium

                                                            iii.      200-300 genes minimum # for a viable cell in the environment today.

                                                           iv.      Cute analogy – DNA is made up of letters (NTs), these letters make words (seq for mRNA codons), these words make sentences (genes)

                                                             v.      Genes

1.       Function: DNA à mRNA à protein à mature/ modified protein

2.       Structure: Upstream regulation sites à promoter (TATA) TXN initiation site à 5' UTRà Exon à Intron à Exon à 3'UTR à Poly A add site.

3.       Splicing: 3 vital positions on RNA. 5' splice site, 3' splice site & branch point in the intron seq that forms the base of the excised lariot.  Endonuleolytic cleavage of introns and fusion of adjacent exons (performed by snRNAs+ 7 protein subunits = snRNPs = core of the splicosome that performs pre-mRNA splicing).

4.       Note: 3' end of intron = splice acceptor, 5' end of intron = splice donor

f.        Telomeres

                                                               i.      5-15 kb repeat of TTAGGG bound to proteins

                                                             ii.      Shortens 100 bp / division

                                                            iii.      Length maintained by telomerase: sees G at tip, elongates 5'à3'. It synthesizes a new copy of the repeat using an endogenous RNA template+primer. Telomerase reverse transcribes DNA. DNA POL completes synthesis of remaining lagging strand.

g.       Repetitive DNA

                                                               i.      Interspersed repeats (SINES-10% genome, LINES-10% genome)

                                                             ii.      Tandem repeats (VNTR, noncoding, often in heterochromatin)

 

III.             Organization

a.       Nuclear genome >30,000 genes, mitochondrial genome 37 genes

b.       35% genes, 65% extragenic

c.       Of the genes: 5% protein encoding (1-2% of entire genome), 95% non-coding

d.       Of extragenic: 40% mod to high copy repetitive, 60% low number repeat

e.       Chromosomes

                                                               i.      22 pairs homologous chromosomes & 2 sex chromosomes

                                                             ii.      Chromosome has centromere, telomere, & ori (classified by centromere location: acrocentric (near ends), submetacentric (midpt to end), metacentric (midpt)).  Kinetochore is present at centromere for spindle attachment

                                                            iii.      6 ft DNA compacted 10,000 fold in the chromosome

                                                           iv.      Packaging: Nucleosome (8 histones & 147 bp DNA form core)àfurther to chromatin fiber (30nm)àfurther to chromatid fibers.

                                                             v.      Nucleosome organization: 2-H2A+2-H2B (outside), 2-H3+2-H4 (inside), 147 bp DNA, 1.7 left handed turns, central 80 bp in contact with H3 & H4; H1 binds outside – completes 2 helical turns = chromatosome.

                                                           vi.      To digest it: (Remove nuclei of cell & expose to partially denaturing conditions).  Chromatin = beads on a string.  (Nuclease digest linker DNA) Nucleosome core particles.  (Disassociation w/ high [salt]) octameric histone core + piece of DNA.    Note: histone core has 142 H-bonds, hydrophobic interactions and salt linkages.  Sequence specific DNA binding proteins

f.        Chromatin compaction

                                                               i.      Active state: acetylated DNA in nucleosomes. TFN factors and RNA POL can access the DNA.

                                                             ii.      Compaction: DNA methyl-transferase methylates DNA. Histone deaceylase associates + methyl-CPF binding proteins & associated co-repressors. No aceylation, heavy methylation & tight winding = Transcriptional silencing.

 

IV.              Types/ Conformations of DNA

 

a.       A-DNA

                                                              i.      Pitch = 28.2 Å, 11 bp

                                                            ii.      Less common

                                                          iii.      Right handed

                                                          iv.      Base pairs closer to outer edge of helix

                                                            v.      More compact than B-DNA, # residues per turn (11, more fit in one rotation than B-DNA)

                                                          vi.      Narrow and deep major groove and very wide but shallow minor groove

b.      B-DNA

                                                              i.      Pitch = 34 Å, 10 bp

                                                            ii.      predominant natural DNA

                                                          iii.      Right handed

                                                          iv.      Base pairs closer to the central axis

                                                            v.      Wide and deep major groove and a narrow and deep minor groove and requires the greatest hydration. Partial dehydration converts it to A-DNA decreasing the free energy required for A-DNA deformation and twisting, which is usefully employed by encouraging supercoiling but eventually leads to denaturation.  Further dehydration will result in the least hydrated D-DNA (favored by excess counter-ions ions that shield the DNA phosphate charges), which has a very narrow minor groove with a string of alternating water and counter-ions distributed along its edge

c.       C-DNA

                                                              i.      Pitch = 31 Å, 9.3 bp

d.      D-DNA

                                                              i.      Pitch = 24.2 Å, 8 bp

e.      Z-DNA

                                                              i.      Pitch = 43 Å, 12 bp

                                                            ii.      Left handed, zig-zag path

                                                          iii.      Has even more residues per turn, most compact.

                                                          iv.      Rise is greatest bc its rigid and stretched out

f.        other conformations

                                                               i.      stem & loops

                                                             ii.      Octahedral structures

                                                            iii.      Superhelicies/ coiled coil

V.                 Function

a.       Nucleotides

                                                               i.      Energy (ATP, GTP – donates P in rxns)

                                                             ii.      Cofactors (NADH – donates an H in rxns)

                                                            iii.      Structural (DNA & RNA – radox partners)

                                                           iv.      Transfer RXNs (as ATP, charging AA to tRNA)

b.       DNA

                                                               i.      Replication

                                                             ii.      Transcription (Template strand 3' à 5', Coding strand 5' à 3' , mRNA) 5'à 3'c

                                                            iii.      DNA(neg) –protein (pos) binding

VI.              Prokaryotic vs. Eukaryotic Genomes

 

PRO                                                     EUK

Circular                                                                  Linear

No nucleosomes (has nucleoid loops)             nucleosomes à chromatid fiber

No introns                                                             Has introns

Gene dense                                                           65% extragenic

No repeats                                                             Tandem repeats & interspersed repeats

No telomere                                                           Telomere & telomerase

No splicing                                                            Splicing

Gene clustering (mult Ps on 1 gene)                 1 protein product on one gene

~1800 protein coding genes                               30,000 genes

1 replication site                                                   multiple replication sites

2-4 Mb genome                                                    >3,000 Mb genome

VII.           Applications in Science

a.       Cleavage of DNA at specific sites by restriction nucleases, which greatly facilitate the isolation and manipulation of individual genes

                                                              i.      Cleavage sites are usually palindromic & 4-8 NTs long. They can cleave blunt ends or cohesive/ staggered ends.. 1st few letters tell the organism the RE is from (ie. EcoRI is from E.coli). Bacterial genome is protected from RE activity by methylation at As & Cs.

                                                            ii.      DNA produced by splicing together two or more DNA frags = recombinant DNA.

                                                          iii.      Can also generate a collection of plasmids called a cDNA/ genomic DNA library by shotgun RE treatment to produce many gene frags of the genome or cDNA made by RT from mRNA. The rags are ss à ds DNA by DNA polymerase and are inserted in plasmid or virus vector and cloned. cDNA libarary is specific to type of cell used to prepare the library.

b.      DNA cloning either through the use of cloning vectors or PCR, where a single DNA molecule can be copied to generate many billions of identical molecules

c.       Nuclei acid hybridization, which makes it possible to find a specific sequence of DNA with good accuracy and sensitivity on the basis of its ability to bind a complementary NA sequence.

                                                              i.      Isolated DNA can be labeled with radioisotope 32P to make traceable probes.

d.      Rapid sequencing of all NTs in a purified DNA fragment, which makes it possible to ID genes and deduce the AA seq. of the proteins they encode.

 

Additional misc:

-          Oligonucleotide = A compound comprising a nucleotide linked to phosphoric acid. When polymerized, it gives rise to a nucleic acid.  A short nucleotide polymer. 
-          Nucleosie analog = A synthetic molecule that resembles a naturally occuring nucleoside, but that lacks a bond site needed to link it to an adjacent nucleotide. 
-          Note that nucleotides have only one phosphate group, whereas ATP has three, so ATP is not strictly speaking a nucleotide. AMP or adenosine mono-phosphate, is a nucleotide. AMP is also called adenosine 5'-phosphoric acid or adenylic acid.  AMP contains a ribose group as its sugar, while dAMP, or deoxy-AMP, contains a deoxyribose as its sugar. 
-          NTs present in RNA are AMP, GMP (guanylic acid, or guanosine 5'-phosphoric acid), CMP (cytidylic acid, or cytidine 5'-phosphoric acid) and UMP (uridylic acid, or uridine 5'-phosphoric acid).
-          NTs present in DNA are dAMP, dGMP, dCMP and dTMP (deoxythymidylic acid, or deoxythymidine 5'-phosphoric acid)
-          4 nucleosides of DNA are deoxyadenosine (dA), deoxyguanosine (dG), deoxycytosine (dC), & (deoxy) thymidine (dT, or T). 
-          Nucleotides are acidic, hence the name. 
-          The carbon atoms of the deoxyribose sugar are numbered 1', 2', 3', 4', and 5' to distinguish from the numbering of the atoms of the purine and pyrmidine rings. 
-          DMT (dimethoxytrityl protecting group) can be added, instead of a phosphate group, to block DNA synthesis & 3’ end tie up Carbon with triple bond to a nitrogen
-          The deoxyribose sugars are joined at both the 3'-hydroxyl and 5'-hydroxyl groups to phosphate groups in ester links, also known as "phosphodiester" bonds. 
-          Proteins that bind DNA (Topoisomerases, Histones, RNA POLY, Negative regulation – repressors (ie. lac repressor), Positive regulation – activators (ie. CFP), Attenuating proteins 

 

NT = nucleotide

TXN = transcription

AA = amino acid

RXN = reaction