Page 39 - Genetics_From_Genes_to_Genomes_6th_FULL_Part3
P. 39

9.4 Sequencing Genomes   333


                       Figure 9.9  Whole-genome shotgun sequencing. (a) Repeated elements longer than a sequence read would prevent the assembly
                       of shotgun sequences of the human genome because it is impossible to know which unique sequences (green or orange) flanking different
                       copies of a repeat belong together. (b) The paired-end sequencing method. A recombinant BAC clone is melted into single strands and
                       hybridized in one reaction with Primer 1 and in a second independent reaction with Primer 2. These primers correspond to the sequence of the
                       vector just flanking the human DNA insert on either side. The two primers hybridize with opposite strands of the recombinant DNA and are
                       oriented such that DNA polymerase will synthesize DNA corresponding to either end of the human DNA insert. [Note, as shown in part (c), that
                       the light green portion of the insert is in fact much longer than the dark green portions sequenced.] (c) Paired-end sequencing allows correct
                       assembly of genomes containing repeats because paired reads will include unique sequences on both sides of a repeat. In other words, the
                       unique sequence reads (green or orange boxes at right) align, so the repeat-containing sequence reads (green and black or orange and black
                       boxes at left) must align also.
                          (a)  Repeats prevent correct assembly of single shotgun sequence reads  (b)  Paired-end sequence reads from a BAC done
                                                 Correct assembly
                            Locus 1         Repeat
                          Sequence                                                              Vector
                             reads

                           Locus 2          Repeat                                                                5'
                          Sequence                                                     5'
                             reads                                                       Primer 1                   Primer 2
                                                                                                     Insert
                                                 Incorrect assembly
                                          Repeat
                          Sequence
                             reads
                                          Repeat                                     DNA synthesized   DNA synthesized
                                                                                     during sequencing
                                                                                                       during sequencing
                          Sequence                                                   reaction with     reaction with
                             reads                                                   Primer 1          Primer 2
                          (c)  Paired-end sequences direct correct assembly of unique sequences flanking repeats
                                            Locus 1                                           Locus 2
                                  Repeat                                           Repeat
                                  Sequence reads                                    Sequence reads



                                                                  BAC clone
                                                                    inserts











                       DNAs into sequencing reactions, and then loaded the re-  essential concepts
                       actions into the sequencing machines. This automation
                       allowed Celera to obtain relatively cheaply the millions of   •  The whole-genome shotgun sequencing strategy involves
                       DNA sequence reads required to provide about 10-fold    sequencing inserts of millions of clones selected
                       genomic equivalent coverage. The DNA sequencing ma-     randomly from libraries constructed with mechanically
                       chines fed their data into a centralized supercomputer,   sheared genomic DNA to ensure overlap between
                       whose complex software then assembled all these se-     fragments.
                       quences into the chromosomal strings. The whole- genome   •  Sequencing both ends of DNA inserts (paired-end
                       shotgun approach had such large relative efficiencies that   sequencing) provides information useful for genome
                       variants of this method have become the standard way to   assembly.
                       sequence genomes.
   34   35   36   37   38   39   40   41   42   43   44