WO2001016375A2 - High speed parallel molecular nucleic acid sequencing - Google Patents

High speed parallel molecular nucleic acid sequencing

Publication number
WO2001016375A2
Authority
WO
WIPO (PCT)
Prior art keywords
nucleic acid
polymerase
fluorophore
nucleotide
substrate
Prior art date
Application number
PCT/US2000/023736
Other languages
French (fr)
Other versions
WO2001016375A3 (en
Inventor
Thomas D. Schneider
Denise Rubens
Original Assignee
The Government Of The United States Of America, As Represented By The Secretary, Department Of Health And Human Services
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by The Government Of The United States Of America, As Represented By The Secretary, Department Of Health And Human Services
Priority to US10/070,053 (US6982146B1)
Priority to AU70868/00A (AU7086800A)
Publication of WO2001016375A2
Publication of WO2001016375A3
Priority to US11/204,367 (US20060292583A1)
Priority to US12/196,139 (US20090061447A1)
Priority to US12/886,686 (US8535881B2)

Classifications

    • G PHYSICS
    • G01 MEASURING; TESTING
    • G01N INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
    • G01N21/00 Investigating or analysing materials by the use of optical means, i.e. using sub-millimetre waves, infrared, visible or ultraviolet light
    • G01N21/62 Systems in which the material investigated is excited whereby it emits light or causes a change in wavelength of the incident light
    • G01N21/63 Systems in which the material investigated is excited whereby it emits light or causes a change in wavelength of the incident light optically excited
    • G01N21/64 Fluorescence; Phosphorescence
    • G01N21/645 Specially adapted constructive features of fluorimeters
    • G01N21/6456 Spatial resolved fluorescence measurements; Imaging
    • G01N21/6458 Fluorescence microscopy
    • C CHEMISTRY; METALLURGY
    • C12 BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12Q MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00 Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68 Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6869 Methods for sequencing
    • G PHYSICS
    • G01 MEASURING; TESTING
    • G01N INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
    • G01N21/00 Investigating or analysing materials by the use of optical means, i.e. using sub-millimetre waves, infrared, visible or ultraviolet light
    • G01N21/62 Systems in which the material investigated is excited whereby it emits light or causes a change in wavelength of the incident light
    • G01N21/63 Systems in which the material investigated is excited whereby it emits light or causes a change in wavelength of the incident light optically excited
    • G01N21/64 Fluorescence; Phosphorescence
    • G01N21/6428 Measuring fluorescence of fluorescent products of reactions or of fluorochrome labelled reactive substances, e.g. measuring quenching effects, using measuring "optrodes"
    • G PHYSICS
    • G01 MEASURING; TESTING
    • G01N INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
    • G01N21/00 Investigating or analysing materials by the use of optical means, i.e. using sub-millimetre waves, infrared, visible or ultraviolet light
    • G01N21/62 Systems in which the material investigated is excited whereby it emits light or causes a change in wavelength of the incident light
    • G01N21/63 Systems in which the material investigated is excited whereby it emits light or causes a change in wavelength of the incident light optically excited
    • G01N21/64 Fluorescence; Phosphorescence
    • G01N21/645 Specially adapted constructive features of fluorimeters
    • G01N21/648 Specially adapted constructive features of fluorimeters using evanescent coupling or surface plasmon coupling for the excitation of fluorescence

Definitions

  • This disclosure relates to an automated method for sequencing nucleic acids, such as DNA and RNA, which may be used for research and the diagnosis of disease in clinical applications.
  • the DNA template is incubated with a mixture containing all four deoxynucleoside 5'-triphosphates (dNTPs), one or more of which is labeled with ³²P, and a 2',3'-dideoxynucleoside triphosphate analog (ddNTP).
  • dNTPs deoxynucleoside 5'-triphosphates
  • ddNTP 2',3'-dideoxynucleoside triphosphate analog
  • Four separate incubation mixtures are prepared, each containing a different ddNTP analog (ddATP, ddCTP, ddGTP, or ddTTP).
  • the dideoxynucleotide analogs are incorporated normally into the growing complementary DNA strand by the DNA polymerase, through their 5' triphosphate groups.
  • each reaction mixture contains a population of DNA molecules having a common 5' terminus, but varying in length to a nucleotide base specific 3' terminus.
  • These four preparations with heterogeneous fragments each ending in either cytosine (C), guanine (G), adenine (A) or thymine (T) are separated in four parallel lanes on polyacrylamide gels. The sequence is determined after autoradiography, by determining the terminal nucleotide base at each incremental cleavage in the molecular weight of the electrophoresed fragments.
  • the Maxam-Gilbert method of DNA sequencing involves the chemical-specific cleavage of DNA.
  • radio-labeled DNA molecules are incubated in four separate reaction mixtures, each of which partially cleaves the DNA at one or two nucleotides of a specific identity (G, A+G, C or C+T).
  • the resulting DNA fragments are separated by polyacrylamide gel electrophoresis, with each of the four reactions fractionated in a separate lane of the gel.
  • the DNA sequence is determined after autoradiography, again by observing the macromolecular separation of the fragments in the four lanes of the gel.
  • the use of fluorescent nucleotides has eliminated the need for radioactive nucleotides, and provided a means to automate DNA sequencing.
  • Electrophoresis requires macroscopic separation, with the necessity of expensive reagents, long gel preparation time, tedious sample loading, and the dangers of exposure to the neurotoxin acrylamide. Macromolecular electrophoretic separation also exposes the technician to high voltage devices, requires prolonged electrophoresis time, produces gel artifacts, and requires calculations to adjust for dye mobilities. Furthermore, sequencing runs only allow for the sequencing of fewer than 1000 bases at a time, which can be a substantial drawback to the sequencing of long stretches of the genome.
  • Mills, for example, described the use of mass spectrometry to separate the DNA fragments as an alternative to electrophoresis (U.S. Patent Nos. 5,221,518 and 5,064,754).
  • mass spectrometry devices are expensive, and because the method depends on size separation, it has a size resolution limit.
  • Jett et al. use an exonuclease to sequentially shorten a DNA molecule that is being sequenced. After a complementary DNA strand is synthesized in the presence of fluorescent nucleotides, the exonuclease cleaves individual fluorescent nucleotides from the end of the synthesized DNA molecule. These nucleotides pass through a detector, and the fluorescent signal emitted by each nucleotide is recorded to determine the DNA sequence.
  • FRET fluorescence resonance energy transfer
  • the present disclosure provides an improved method and device for sequencing nucleic acids.
  • the method allows several nucleic acids to be sequenced simultaneously at the molecular level.
  • the method uses a donor and acceptor class of dyes. This method and device minimize shearing the sample nucleic acids to be sequenced, and can be readily automated.
  • a method of sequencing a sample nucleic acid molecule by exposing the sample nucleic acid molecule to an oligonucleotide primer and a polymerase in the presence of a mixture of nucleotides.
  • the polymerase carries a fluorophore, and each different type of nucleotide (e.g. A, T/U, C or G) carries a fluorophore which emits a signal that is distinguishable from a signal emitted by the fluorophore carried by each of the other types of nucleotides.
  • the fluorophore on the polymerase is a donor fluorophore and the fluorophores carried on the nucleotides are acceptor fluorophores.
  • the donor fluorophore can be excited by a source of electromagnetic radiation (such as a laser) that specifically excites the donor fluorophore and not the acceptor fluorophores. This excitation induces the donor to emit light at a wavelength that can transfer energy to excite only the acceptor fluorophores that are added to the complementary strand by the polymerase.
  • a signal characteristic of the specific nucleotide being added (e.g. A, T/U, C or G)
  • a series of sequential signals emitted by the added nucleotides is detected, and converted into the complement of the nucleic acid sample.
  • the unique emission signal for each nucleotide is generated by luminescence resonance energy transfer (LRET) or fluorescence resonance energy transfer (FRET).
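The conversion of a series of acceptor signals into a sequence can be sketched as follows. This is a minimal illustration under stated assumptions, not the disclosed implementation: the channel names (`ch_A`, `ch_C`, `ch_G`, `ch_T`) and both function names are hypothetical, and each detected signal is assumed to have already been assigned to one of four acceptor channels.

```python
# Hypothetical mapping from acceptor channel to the nucleotide carrying that fluorophore.
CHANNEL_TO_BASE = {"ch_A": "A", "ch_C": "C", "ch_G": "G", "ch_T": "T"}
COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}


def decode_signals(channels):
    """Return the newly synthesized strand (5'->3') in order of incorporation."""
    return "".join(CHANNEL_TO_BASE[channel] for channel in channels)


def template_from_complement(new_strand):
    """Reverse-complement the synthesized strand to read the sample strand 5'->3'."""
    return "".join(COMPLEMENT[base] for base in reversed(new_strand))


signals = ["ch_A", "ch_T", "ch_G", "ch_G", "ch_C"]
new_strand = decode_signals(signals)
template = template_from_complement(new_strand)
print(new_strand, template)
```

The detected signals report the complementary strand directly; the sample strand is recovered by reverse complementation.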
  • the nucleic acid is a DNA or RNA molecule
  • the polymerase is a DNA or RNA polymerase, if DNA is being sequenced, or reverse transcriptase if RNA is being sequenced.
  • the polymerase is a Klenow fragment of DNA polymerase I.
  • the polymerase is a GFP-polymerase.
  • the donor fluorophore is green fluorescent protein (GFP).
  • GFP green fluorescent protein
  • the donor fluorophore, such as GFP, is excited by a laser.
  • GFP can be excited by a luminescent molecule, for example aequorin.
  • the donor fluorophore is a luminescent molecule, for example aequorin or europium chelates.
  • the donor fluorophore does not require excitation by a source of electromagnetic radiation, because the luminescent donor fluorophore is naturally in an excited state.
  • the acceptor fluorophores are BODIPY, fluorescein, rhodamine green, and Oregon green or derivatives thereof.
  • the donor fluorophore and one of the acceptor fluorophores comprise a donor/acceptor fluorophore pair selected from the group consisting of the GFP mutant H9-40, tetramethylrhodamine, Lissamine™, Texas Red and naphthofluorescein.
  • the polymerase may be fixed to a substrate, for example by a linker molecule that includes a polymerase component and a substrate component.
  • the linker may be selected from the group consisting of streptavidin-biotin, histidine-Ni, S-tag-S-protein, and glutathione-glutathione-S-transferase (GST).
  • GST glutathione-S-transferase
  • a nucleic acid may be fixed to a substrate.
  • the oligonucleotide primer is fixed to a substrate, for example at its 5' end.
  • the sample nucleic acid to be sequenced is fixed to the substrate.
  • the sample nucleic acid to be sequenced is fixed to the substrate by its 5' end, 3' end or anywhere in between.
  • a plurality of polymerases, oligonucleotide primers, or sample nucleic acids are fixed directly or indirectly to the substrate in a predetermined pattern.
  • the polymerases can be deposited into channels which have been etched in an orderly array or by micropipetting droplets containing the polymerases onto a slide, for example either by manually pipetting or with an automated arrayer.
  • a plurality of sequencing reactions are performed substantially simultaneously, and the signals from the plurality of sequencing reactions detected.
  • the unique emission signals are detected with a detector, for example a charge-coupled device (CCD) camera, which can detect a sequence of signals from a predetermined position on the substrate and convert them into the nucleic acid sequence.
  • CCD charge-coupled device
  • the unique emission signals may be stored in a computer readable medium.
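Because each reaction sits at a predetermined position on the substrate, signals from many reactions can be separated by position before decoding. The sketch below is illustrative only: the `(time, position, base)` event format and the function name `group_by_position` are assumptions, not part of the disclosure.

```python
from collections import defaultdict


def group_by_position(events):
    """Group (time, position, base) events into one read per substrate position.

    Events are ordered by time so each read reflects the order of incorporation.
    """
    traces = defaultdict(list)
    for _time, position, base in sorted(events):
        traces[position].append(base)
    return {position: "".join(bases) for position, bases in traces.items()}


# Hypothetical events from two array positions, interleaved in time.
events = [
    (0.3, (0, 0), "C"),
    (0.1, (0, 0), "A"),
    (0.2, (0, 1), "G"),
    (0.4, (0, 1), "T"),
]
reads = group_by_position(events)
print(reads)
```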
  • GFP-polymerase contains an affinity tag that attaches the GFP-polymerase to the substrate.
  • the GFP-polymerase is attached to the substrate by a linker.
  • Other embodiments disclosed herein include a method of sequencing a sample nucleic acid by attaching a polymerase to a substrate, adding the sample nucleic acid with an annealed oligonucleotide to the polymerase, and allowing the sample nucleic acid to bind to the polymerase in the presence of nucleotides for incorporation into a complementary nucleic acid.
  • the polymerase and nucleotides are labeled with donor and acceptor fluorophores that emit a distinguishable signal when a particular type of nucleotide (e.g. A, T/U, C or G) is incorporated into the complementary nucleic acid.
  • a sequence of the distinguishable signals is detected as the nucleotides are sequentially added to the complementary nucleic acid, and the sequence of signals is converted into a corresponding nucleic acid sequence.
  • Also disclosed herein is a method of sequencing a sample nucleic acid by attaching a sample nucleic acid to a substrate, adding an oligonucleotide primer and allowing the oligonucleotide primer to anneal to the attached sample nucleic acid, adding a polymerase in the presence of nucleotides, and allowing the sample nucleic acid to bind to the polymerase in the presence of nucleotides for incorporation into a complementary nucleic acid.
  • the polymerase and nucleotides are labeled with donor and acceptor fluorophores that emit a distinguishable signal when a particular type of nucleotide (e.g. A, T/U, C or G) is incorporated into the complementary nucleic acid.
  • a sequence of the distinguishable signals is detected as the nucleotides are sequentially added to the complementary nucleic acid, and the sequence of signals is converted into a corresponding nucleic acid sequence.
  • the sample nucleic acid can be attached to a substrate, for example at its 5' or 3' end, or anywhere in between.
  • Another embodiment disclosed herein is a method of sequencing a sample nucleic acid by attaching an oligonucleotide primer to a substrate, adding a sample nucleic acid and allowing the oligonucleotide primer to anneal to the sample nucleic acid, adding a polymerase in the presence of nucleotides, and allowing the sample nucleic acid to bind to the polymerase in the presence of nucleotides for incorporation into a complementary nucleic acid.
  • the polymerase and nucleotides are labeled with donor and acceptor fluorophores that emit a distinguishable signal when a particular type of nucleotide (e.g. A, T/U, C or G) is incorporated into the complementary nucleic acid.
  • a sequence of the distinguishable signals is detected as the nucleotides are sequentially added to the complementary nucleic acid, and the sequence of signals is converted into a corresponding nucleic acid sequence.
  • the present disclosure also includes a device for sequencing a nucleic acid molecule, in which a polymerase (carrying a donor fluorophore), oligonucleotide primer, or sample nucleic acid is attached to a substrate.
  • the device also includes a viewing means to view the polymerase, and a detection means that detects a characteristic signal from an acceptor fluorophore carried by a corresponding nucleotide, as the nucleotide is added to the nucleic acid molecule by the polymerase.
  • An electromagnetic radiation source (such as light of a specified wavelength range) excites the donor fluorophore but not the acceptor fluorophore, so that a signal emitted by the donor fluorophore specifically excites the acceptor fluorophore as each nucleotide is added to the synthesized complementary strand by the polymerase.
  • the electromagnetic radiation source is optional if LRET is used.
  • a decoding means then converts a series of characteristic signals emitted by the acceptor fluorophores into a nucleic acid sequence that corresponds to the nucleic acid sequence of the complement.
  • the substrate may be a glass microscope slide or a three-dimensional matrix.
  • the electromagnetic radiation is from a laser that emits light of the particular wavelength
  • the viewing means includes a microscope objective.
  • the detection means of the device may include a CCD camera, and the decoding means (which converts the series of unique signals into a nucleic acid sequence) is a digital computer.
  • the device for sequencing a nucleic acid is a glass microscope slide to which an oligonucleotide primer, sample nucleic acid, or polymerase is attached, and the polymerase includes a GFP donor fluorophore.
  • a laser is positioned to stimulate the donor fluorophore at a specific wavelength, and the donor fluorophore emits a first signal that induces the acceptor fluorophore to emit a signal when the acceptor fluorophore is brought sufficiently close to the donor fluorophore during chain elongation.
  • the signal emitted by the acceptor fluorophore is unique to each type of nucleotide (e.g. A, T/U, C or G) so that the emitted signal indicates the nucleotide that is added to the complement.
  • a microscope objective is positioned to view the sequence of signals emitted by the individual acceptor fluorophore molecules as the nucleotides are added to the polymerase.
  • a spectrophotometer then converts the sequence of signals into a series of spectrographic signals that correspond to the series of signals emitted by the acceptor fluorophore.
  • a CCD camera detects the sequence of signals and a digital computer converts the sequence of signals into a nucleic acid sequence.
  • FIG. 1A is a schematic drawing showing the attachment of a polymerase to a substrate, and the polymerase associated with a template and primer strand.
  • FIG. 1B is a schematic drawing showing the attachment of an oligonucleotide primer to a substrate, and the polymerase associated with a template and primer strand.
  • FIG. 1C is a schematic drawing showing the attachment of a nucleic acid to be sequenced by its 3' end to a substrate, and the polymerase associated with a template and primer strand.
  • FIG. 1D is a schematic drawing showing the attachment of a nucleic acid to be sequenced by its 5' end to a substrate, and the polymerase associated with a template and primer strand.
  • FIG. 2 is a schematic drawing illustrating fluorescence resonance energy transfer (FRET) between a donor fluorophore on a polymerase and an acceptor fluorophore on a nucleotide. Note that a laser 26 which emits electromagnetic radiation 28 is not required for luminescence resonance energy transfer (LRET).
  • FIG. 3 is a schematic drawing illustrating a microscope and computer assembly that can be used to sequence nucleic acids using TDS. Note that a laser 26 which emits electromagnetic radiation 28 is not required for LRET.
  • Acceptor fluorophores will generally be compounds which absorb energy from the donor fluorophore in the range of about 400 to 900 nm, usually in the range of about 500 to 800 nm. Acceptor fluorophores in the disclosed embodiments have excitation spectra which overlap with the emission of the donor fluorophore, such that energy emitted by the donor can excite the acceptor. The acceptor fluorophores are capable of being attached to nucleotides.
  • Acceptor fluorophores will generally absorb light at a wavelength which is usually at least 10 nm higher, more usually at least 20 nm higher, than the maximum absorbance wavelength of the donor fluorophore, and will have a fluorescence emission maximum at a wavelength ranging from about 400 to 900 nm.
  • Acceptor fluorophores may be rhodamines, fluorescein derivatives, Green Fluorescent Protein (GFP), BODIPY (4,4-difluoro-4-bora-3a,4a-diaza-s-indacene) and cyanine dyes.
  • acceptor fluorescer moieties include 5-carboxyfluorescein (FAM), 2',7'-dimethoxy-4',5'-dichloro-6-carboxyfluorescein (JOE), N,N,N',N'-tetramethyl-6-carboxyrhodamine (TAMRA), 6-carboxy-X-rhodamine (ROX), BODIPY and cyanine dyes. Additional fluorophores which may be used in the herein disclosed method are listed below.
  • Affinity Tag A molecule, such as a protein, attached to the N- or C-terminus of a recombinant protein using genetic engineering methods, to aid in the purification of the recombinant protein.
  • affinity tags include, but are not limited to: histidine, S-tag, glutathione-S-transferase (GST) and streptavidin.
  • Affinity tags may also be used to attach a protein or nucleic acid to a substrate.
  • cDNA complementary DNA: A piece of DNA lacking internal, non-coding segments (introns) and regulatory sequences which determine transcription. cDNA can be synthesized in the laboratory by reverse transcription from messenger RNA extracted from cells.
  • Characteristic Signal The resulting signal emitted from a fluorescently-labeled nucleotide, which can be predicted by the fluorophore(s) attached to the nucleotide.
  • nucleic acids that are "complementary" can be perfectly or imperfectly complementary, as long as the desired property resulting from the complementarity is not lost, e.g., ability to hybridize.
  • donor fluorophores will generally be compounds which absorb in the range of about 300 to 900 nm, usually in the range of about 350 to 800 nm, and are capable of transferring energy to the acceptor fluorophore.
  • the donor fluorophore will have a strong molar absorbance coefficient at the desired excitation wavelength, for example greater than about 10³ M⁻¹ cm⁻¹.
  • a variety of compounds may be employed as donor fluorescer components, including fluorescein, GFP, phycoerythrin, BODIPY, DAPI (4',6-diamidino-2-phenylindole), Indo-1, coumarin, dansyl, and cyanine dyes.
  • Specific donor labels of interest include fluorescein, rhodamine, and cyanine dyes.
  • Other fluorophores that can be used in the method disclosed herein are provided below.
  • the donor fluorophore is a luminescent molecule, such as aequorin, as discussed below.
  • Electromagnetic Radiation A series of electromagnetic waves that are propagated by simultaneous periodic variations of electric and magnetic field intensity, and that includes radio waves, infrared, visible light, ultraviolet light, X-rays and gamma rays.
  • electromagnetic radiation can be emitted by a laser, which can possess properties of monochromaticity, directionality, coherence, polarization, and intensity.
  • Lasers are particularly useful sources of electromagnetic energy for the method disclosed herein, because lasers are capable of emitting light at a particular wavelength (or across a relatively narrow range of wavelengths), such that energy from the laser can excite a donor but not an acceptor fluorophore.
  • Emission Signal The wavelength of light generated from a fluorophore after the fluorophore absorbs an excitation wavelength of light.
  • Emission Spectrum The broad energy spectrum which results after a fluorophore is excited by a specific wavelength of light. Each fluorophore has its own unique emission spectrum. Therefore, when individual fluorophores are attached to nucleotides, the emission spectra from the fluorophores provide a means for distinguishing between the different nucleotides.
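One simple way to use distinct emission spectra to tell nucleotides apart is to assign each measured emission peak to the nearest reference maximum. The sketch below is an illustration only: the reference wavelengths, their assignment to bases, the tolerance, and the function name `call_base` are all hypothetical values chosen for the example, not taken from this disclosure.

```python
# Hypothetical emission maxima (nm) for the four acceptor labels; illustrative only.
EMISSION_MAX_NM = {"A": 525.0, "C": 555.0, "G": 580.0, "T": 605.0}


def call_base(peak_nm, tolerance_nm=10.0):
    """Return the base whose reference maximum is nearest to peak_nm.

    Returns None when the nearest reference is further away than tolerance_nm,
    so ambiguous or off-target signals are not forced into a base call.
    """
    base, reference = min(
        EMISSION_MAX_NM.items(), key=lambda item: abs(item[1] - peak_nm)
    )
    return base if abs(reference - peak_nm) <= tolerance_nm else None


print(call_base(527.0), call_base(603.5), call_base(540.0))
```

Returning `None` for a peak that falls between two references keeps an ambiguous signal from being reported as a confident call.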
  • Excitation Signal The wavelength of light necessary to raise a fluorophore to a state such that the fluorophore will emit a longer wavelength of light.
  • Fluorophore A chemical compound, which when excited by exposure to a particular wavelength of light, emits light (i.e., fluoresces), for example at a different wavelength.
  • luminescent molecules which are chemical compounds which do not require exposure to a particular wavelength of light to fluoresce; luminescent compounds naturally fluoresce. Therefore, the use of luminescent signals eliminates the need for an external source of electromagnetic radiation, such as a laser.
  • An example of a luminescent molecule includes, but is not limited to, aequorin (Tsien, 1998, Ann. Rev. Biochem. 67:509). Further description is provided below. Examples of fluorophores that may be used in the method disclosed herein are provided in
  • rhodamine and derivatives such as 6-carboxy-X-rhodamine (ROX), 6-carboxyrhodamine (R6G), lissamine rhodamine B sulfonyl chloride, rhodamine (Rhod), rhodamine B, rhodamine 123, rhodamine X isothiocyanate, sulforhodamine B, sulforhodamine 101 and sulfonyl chloride derivative of sulforhodamine 101 (Texas Red); N,N,N',N'-tetramethyl-6-carboxyrhodamine (TAMRA); tetramethyl rhodamine; tetramethyl rhodamine isothiocyanate (TRITC); riboflavin; rosolic acid and terbium chelate derivatives.
  • ROX 6-carboxy-X-rhodamine
  • fluorophores include thiol-reactive europium chelates which emit at approximately 617 nm (Heyduk and Heyduk, Analyt. Biochem. 248:216-27, 1997; J. Biol. Chem. 274:3315-22, 1999).
  • fluorophores include GFP, Lissamine™, diethylaminocoumarin, fluorescein chlorotriazinyl, naphthofluorescein, 4,7-dichlororhodamine and xanthene (as described in U.S. Patent No. 5,800,996 to Lee et al., herein incorporated by reference) and derivatives thereof.
  • fluorophores known to those skilled in the art may also be used, for example those available from Molecular Probes (Eugene, OR).
  • the fluorophores disclosed herein may be used as a donor fluorophore or as an acceptor fluorophore.
  • Particularly useful fluorophores have the ability to be attached to a polymerase or a nucleotide, are stable against photobleaching, and have high quantum efficiency.
  • the fluorophores on different sets of nucleotides (e.g. A, T/U, G, C)
  • Fluorescence resonance energy transfer A process in which an excited fluorophore (the donor) transfers its excited state energy to a light absorbing molecule (the acceptor). This energy transfer is non-radiative, and due primarily to a dipole-dipole interaction between the donor and acceptor fluorophores. This energy can be passed over a distance, for example a limited distance such as 10-100 Å. Limitation on the distance over which the energy can travel helps limit transfer to a desired target (such as between a donor fluorophore on a polymerase and a target acceptor fluorophore on a nucleotide, without collateral stimulation of other acceptor fluorophores).
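The steep distance dependence of the dipole-dipole transfer can be illustrated with the standard Förster relation, E = 1 / (1 + (r/R0)^6), which is a textbook result and is not stated in this disclosure. In the sketch below, the Förster radius of 50 Å and the function name `fret_efficiency` are assumptions chosen only for illustration.

```python
def fret_efficiency(r_angstrom, r0_angstrom=50.0):
    """Forster transfer efficiency for donor-acceptor separation r.

    r0_angstrom is the Forster radius (separation at 50% efficiency);
    the default of 50 angstroms is a hypothetical value for illustration.
    """
    return 1.0 / (1.0 + (r_angstrom / r0_angstrom) ** 6)


for r in (25.0, 50.0, 100.0):
    print(f"r = {r:5.1f} A  ->  E = {fret_efficiency(r):.3f}")
```

Under these assumptions the efficiency falls from about 98% at half the Förster radius to under 2% at twice that radius, which is why acceptors that are not held close to the donor contribute little signal.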
  • FRET pairs Sets of fluorophores that can engage in fluorescence resonance energy transfer (FRET). Examples of FRET pairs that can be used are listed below. However, one skilled in the art will recognize that numerous other combinations of fluorophores can be used.
  • FAM is most efficiently excited by light with a wavelength of 488 nm, emits light with a spectrum of 500 to 650 nm, and has an emission maximum of 525 nm.
  • FAM is a suitable donor fluorophore for use with JOE, TAMRA, and ROX (all of which have their excitation maximum at 514 nm, and will not be significantly stimulated by the light that stimulates FAM).
  • the GFP mutant H9-40 (Tsien, 1998, Ann. Rev. Biochem. 67:509), which is excited at 399 nm and emits at 511 nm, may serve as a suitable donor fluorophore for use with BODIPY, fluorescein, rhodamine green and Oregon green.
  • fluorophores tetramethylrhodamine, Lissamine™, Texas Red and naphthofluorescein can be used as acceptor fluorophores with this GFP mutant.
  • the fluorophore 3-(ε-carboxy-pentyl)-3'-ethyl-5,5'-dimethyloxacarbocyanine (CYA) is maximally excited at 488 nm and may therefore serve as a donor fluorophore for fluorescein or rhodamine derivatives (such as R6G, TAMRA, and ROX) which can be used as acceptor fluorophores (see Hung et al., Analytical Biochemistry, 243:15-27, 1996).
  • CYA and FAM are not examples of a good FRET pair, because both are excited maximally at the same wavelength (488 nm).
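The excitation-wavelength reasoning above can be sketched as a simple screen. The excitation maxima are the values quoted in this text; the 10 nm minimum shift follows the statement that acceptors usually absorb at least 10 nm above the donor's absorbance maximum. The function name `is_candidate_pair` is hypothetical, and the check covers only this one criterion: it does not test that the donor emission overlaps the acceptor excitation spectrum.

```python
# Excitation maxima (nm) as quoted in the text above.
EXCITATION_MAX_NM = {
    "GFP H9-40": 399,
    "FAM": 488,
    "CYA": 488,
    "JOE": 514,
    "TAMRA": 514,
    "ROX": 514,
}


def is_candidate_pair(donor, acceptor, min_shift_nm=10):
    """True if the acceptor's excitation maximum lies far enough above the donor's.

    This screens only for direct excitation of the acceptor by the donor's light source.
    """
    shift = EXCITATION_MAX_NM[acceptor] - EXCITATION_MAX_NM[donor]
    return shift >= min_shift_nm


print(is_candidate_pair("FAM", "JOE"))
print(is_candidate_pair("CYA", "FAM"))
```

A pair that passes this screen still needs spectral overlap between donor emission and acceptor excitation to be usable.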
  • Fusion Protein A protein comprising two amino acid sequences that are not found joined together in nature.
  • the term "GFP-polymerase fusion protein" refers to a protein that includes a first amino acid sequence and a second amino acid sequence, wherein the first amino acid sequence is a GFP molecule (mutant or wild-type) and the second amino acid sequence is a polymerase.
  • the link between the first and second domains of the fusion protein is typically, but not necessarily, a peptide linkage.
  • GFP-aequorin fusion protein refers to a protein that includes a first amino acid sequence and a second amino acid sequence, wherein the first amino acid sequence is a GFP molecule (mutant or wild-type) and the second amino acid sequence is an aequorin.
  • GFP-aequorin fusion proteins can be generated using the method of Baubet et al. (Proc. Natl. Acad. Sci. USA 97:7260-5, 2000, herein incorporated by reference). These fusion proteins may also be represented by the formula X-Y wherein X is a fluorophore, such as GFP, and Y is a polymerase protein.
  • an affinity tag sequence may be linked to the N- or C-terminus of the first protein.
  • T is the affinity tag
  • X is a protein, such as a fluorescent protein
  • Y is a polymerase protein.
  • GFP refers to both the wild-type protein, and spectrally shifted mutants thereof, for example as described in Tsien, 1998, Ann. Rev. Biochem. 67:509 and in U.S. Patent Nos.
  • GFP is excited using a laser.
  • GFP is excited using aequorin, for example using a GFP-aequorin fusion protein.
  • GFP-polymerase Recombinant fusion protein containing both a functional GFP molecule and a functional polymerase.
  • the GFP can be located at the N- or C-terminus of the polymerase. Alternatively, the GFP molecule can be located anywhere within the polymerase. Regardless of GFP position, it is important that the polymerase remain functional (i.e. able to catalyze the elongation of the complementary nucleic acid strand).
  • the GFP-polymerase may also contain an affinity tag to aid in its purification and/or attachment to a substrate (Tag-GFP-polymerase).
  • the GFP-polymerase may also contain a functional aequorin sequence, for example if the use of LRET is desired.
  • Linker Means by which to attach a polymerase or a nucleic acid to a substrate.
  • the linker ideally does not significantly interfere with binding to or incorporation by the polymerase.
  • the linker can be a covalent or non-covalent means of attachment.
  • the linker is a pair of molecules, having high affinity for one another, one molecule on the polymerase (such as an affinity tag), the other on the substrate.
  • high-affinity molecules include streptavidin and biotin, histidine and nickel (Ni), and GST and glutathione.
  • the linker is a straight-chain or branched amino- or mercapto-hydrocarbon with more than two carbon atoms in the unbranched chain. Examples include aminoalkyl, aminoalkenyl and aminoalkynyl groups.
  • the linker is an alkyl chain of 10-20 carbons in length, and may be attached through a Si-C direct bond or through an ester, Si-O-C, linkage (see U.S. Patent No. 5,661,028 to Foote, herein incorporated by reference).
  • Other linkers are provided in U.S. Patent No. 5,306,518 to Prober et al., column 19; U.S. Patent No. 4,711,955 to Ward et al., columns 8-9; and U.S. Patent No. 5,707,804 to Mathies et al., columns 6-7 (all herein incorporated by reference).
  • nucleic acids for example the oligonucleotide primer or the nucleic acid to be sequenced
  • methods for attaching a nucleic acid include, but are not limited to: synthesizing a 5' biotinylated nucleic acid and affixing it to a streptavidin coated substrate (Beaucage, Tetrahedron Letters 22:1859-62, 1981; Caruthers, Meth. Enzym.
  • Luminescence Resonance Energy Transfer (LRET): A process similar to FRET, except that the donor molecule is itself a luminescent molecule, or is excited by a luminescent molecule, instead of by a laser.
  • the luminescent molecule is naturally in an excited state; it does not require excitation by an external source of electromagnetic radiation, such as a laser. This will decrease the background fluorescence.
  • the luminescent molecule can be attached to a polymerase, for example GFP-polymerase, as a means to produce local excitation of the GFP donor fluorophore, without the need for an external source of electromagnetic radiation.
  • the luminescent molecule is the donor fluorophore.
  • the fluorescence emitted from the luminescent molecule excites the acceptor fluorophores.
  • luminescent molecules that can be used include, but are not limited to, aequorin.
  • the bioluminescence from aequorin, which peaks at 470 nm, can be used to excite a donor GFP fluorophore (Tsien, 1998, Ann. Rev. Biochem. 67:509; Baubet et al, 2000, Proc. Natl. Acad. Sci. U.S.A., 97:7260-5).
  • GFP then transfers its excitation energy by resonance to the acceptor fluorophores disclosed herein.
  • both aequorin and GFP can be attached to the polymerase.
  • nucleic acid refers to both DNA and RNA molecules.
  • a sample nucleic acid molecule is a nucleic acid to be sequenced, and can be obtained in purified form by any method known to those skilled in the art, for example as described in U.S. Patent No. 5,674,743 to Ulmer, herein incorporated by reference.
  • Nucleotides: The major nucleotides of DNA are deoxyadenosine 5'-triphosphate (dATP or A), deoxyguanosine 5'-triphosphate (dGTP or G), deoxycytidine 5'-triphosphate (dCTP or C) and deoxythymidine 5'-triphosphate (dTTP or T).
  • The major nucleotides of RNA are adenosine 5'-triphosphate (ATP or A), guanosine 5'-triphosphate (GTP or G), cytidine 5'-triphosphate (CTP or C) and uridine 5'-triphosphate (UTP or U).
  • the nucleotides disclosed herein also include nucleotides containing modified bases, modified sugar moieties and modified phosphate backbones, for example as described in U.S. Patent No. 5,866,336 to Nazarenko et al. (herein incorporated by reference).
  • modified base moieties which can be used to modify nucleotides at any position on their structure include, but are not limited to: 5-fluorouracil, 5-bromouracil, 5-chlorouracil, 5-iodouracil, hypoxanthine, xanthine, acetylcytosine, 5-(carboxyhydroxylmethyl)uracil, 5-carboxymethylaminomethyl-2-thiouridine, 5-carboxymethylaminomethyluracil, dihydrouracil, beta-D-galactosylqueosine, inosine, N6-isopentenyladenine, 1-methylguanine, 1-methylinosine, 2,2-dimethylguanine, 2-methyladenine, 2-methylguanine, 3-methylcytosine, 5-methylcytosine, N6-adenine, 7-methylguanine, 5-methylaminomethyluracil, methoxyaminomethyl-2-thiouracil, beta-D-mannosylqueosine,
  • modified sugar moieties which may be used to modify nucleotides at any position on its structure include, but are not limited to: arabinose, 2-fluoroarabinose, xylose, and hexose, or a modified component of the phosphate backbone, such as phosphorothioate, a phosphorodithioate, a phosphoramidothioate, a phosphoramidate, a phosphordiamidate, a methylphosphonate, an alkyl phosphotriester, or a formacetal or analog thereof.
  • Such modifications allow for incorporation of the nucleotide into a growing nucleic acid chain. That is, they do not result in the termination of nucleic acid synthesis.
  • nucleotide precursors are dependent on the nucleic acid to be sequenced. If the template is a single-stranded DNA molecule, deoxyribonucleotide precursors (dNTPs) are used in the presence of a DNA-directed DNA polymerase. Alternatively, ribonucleotide precursors (NTPs) are used in the presence of a DNA-directed RNA polymerase. However, if the nucleic acid to be sequenced is RNA, then dNTPs and an RNA-directed DNA polymerase are used.
  • a "type" of nucleotide refers to a set of nucleotides that share a common characteristic that is to be detected.
  • the types of nucleotides may be divided into four types: A, T, C and G (for DNA) or A, U, C and G (for RNA).
  • each type of nucleotide of the method disclosed herein will be labeled with a unique acceptor fluorophore, so as to be distinguishable from the other types by fluorescence spectroscopy or by other optical means.
  • fluorophores are known in the art and include those listed above.
  • the fluorescent label generally is not part of the 3'-OH group, so as to allow the polymerase to continue to add subsequent nucleotides.
  • a polynucleotide is a linear sequence of up to about 200 nucleotide bases in length, for example a polynucleotide (such as DNA or RNA) which is at least 6 nucleotides, for example at least 15, 50, 100 or even 200 nucleotides long.
  • ORF (open reading frame): A series of nucleotide triplets (codons) coding for amino acids without any termination codons. These sequences are usually translatable into a peptide.
  • Polymerase: The enzyme which catalyzes the elongation of the primer strand in the 5' to 3' direction along the nucleic acid template to be sequenced.
  • polymerases which may be used in the method disclosed herein include, but are not limited to: E. coli DNA polymerase I, specifically the Klenow fragment (which has 3' to 5' exonuclease activity), Taq polymerase, reverse transcriptase, E. coli RNA polymerase, and wheat germ RNA polymerase II.
  • The choice of polymerase is dependent on the nucleic acid to be sequenced. If the template is a single-stranded DNA molecule, a DNA-directed DNA or RNA polymerase may be used; if the template is a single-stranded RNA molecule, then a reverse transcriptase (i.e., an RNA-directed DNA polymerase) may be used.
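The selection rule stated above can be written as a small lookup. This is only an illustrative sketch: the function name `choose_reagents` and its argument names are hypothetical and are not part of the disclosed method.

```python
def choose_reagents(template: str, product: str = "DNA") -> dict:
    """Return the polymerase class and nucleotide precursors for a template.

    template: "DNA" or "RNA", the single-stranded nucleic acid to be sequenced
    product:  "DNA" or "RNA", the complementary strand to be synthesized
    """
    # Combinations taken from the text: DNA template with a DNA-directed DNA or
    # RNA polymerase; RNA template with a reverse transcriptase.
    table = {
        ("DNA", "DNA"): ("DNA-directed DNA polymerase", "dNTPs"),
        ("DNA", "RNA"): ("DNA-directed RNA polymerase", "NTPs"),
        ("RNA", "DNA"): ("RNA-directed DNA polymerase (reverse transcriptase)", "dNTPs"),
    }
    key = (template.upper(), product.upper())
    if key not in table:
        raise ValueError(f"no polymerase listed for template={template}, product={product}")
    polymerase, precursors = table[key]
    return {"polymerase": polymerase, "precursors": precursors}


print(choose_reagents("DNA"))
print(choose_reagents("DNA", product="RNA"))
print(choose_reagents("RNA"))
```

Combinations that the text does not list (for example an RNA template copied into RNA) raise an error rather than guessing.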
  • Polynucleotide: A linear nucleic acid sequence of any length. Therefore, a polynucleotide includes molecules which are 15, 50, 100 or 200 nucleotides long (oligonucleotides), and also molecules as long as a full-length cDNA.
  • Primer: Short nucleic acids, for example DNA oligonucleotides 10 nucleotides or more in length, which are annealed to a complementary target nucleic acid strand by nucleic acid hybridization to form a hybrid between the primer and the target nucleic acid strand, then extended along the target nucleic acid strand by a polymerase enzyme. Therefore, individual primers can be used for nucleic acid sequencing. In addition, primer pairs can be used for amplification of a nucleic acid sequence, e.g., by the polymerase chain reaction (PCR) or other nucleic-acid amplification methods known in the art. Primers comprise at least 10 nucleotides of the nucleic acid sequences to be sequenced.
  • primers having 15, 20, 30, 40, 50, 60, 70, 80, 90 or 100 consecutive nucleotides of the nucleic acid sequences to be sequenced.
  • Methods for preparing and using primers are described in, for example, Sambrook et al. (1989) Molecular Cloning: A Laboratory Manual, Cold Spring Harbor, New York; Ausubel et al (1987) Current Protocols in Molecular Biology, Greene Publ. Assoc. & Wiley-Intersciences.
  • the primer used may be DNA, RNA, or a mixture of both.
  • the primer used may be RNA or DNA.
  • a purified GFP-polymerase protein preparation is one in which the GFP-polymerase protein is more pure than the protein in its environment within a cell.
  • a preparation of a GFP-polymerase protein is purified such that the GFP-polymerase protein represents at least 50% of the total protein content of the preparation, but may be, for example 90 or even 98% of the total protein content.
  • Recombinant: A recombinant nucleic acid is one that has a sequence that is not naturally occurring or has a sequence that is made by an artificial combination of two otherwise separated segments of sequence. This artificial combination is often accomplished by chemical synthesis or, more commonly, by the artificial manipulation of isolated segments of nucleic acids, e.g., by genetic engineering techniques.
  • Reverse Transcriptase: A template-directed DNA polymerase that generally uses RNA as its template.
  • RNA polymerase: Catalyzes the polymerization of activated ribonucleotide precursors that are complementary to the DNA template.
  • Sequence of signals: The sequential series of emission signals, including light or spectra signals, that are emitted from fluorescently labeled nucleotides as they are added to the growing complementary nucleic acid strand.
  • Substrate: Material in the microscope field of view that the polymerase or nucleic acid is attached to.
  • the substrate is made of biocompatible material that is transparent to light, including glass and quartz.
  • the substrate may be a 3 cm long by 1 cm wide by 0.25 cm thick glass microscope slide.
  • the substrate can be a gel matrix, to allow sequencing in three-dimensions.
  • the substrate can be opaque.
  • the substrate can be treated before use.
  • glass microscope slides can be washed by ultrasonication in water for 30 minutes, soaked in 10% NaOH for 30 minutes, rinsed with distilled water and dried in an 80°C oven for 10 minutes or air-dried overnight.
  • Two dye sequencing: A method of sequencing nucleic acids using at least two sets of fluorophores, with one set on the nucleotides (a different acceptor dye for each class of nucleotides), and the other set on the polymerase (a donor dye). In particular embodiments, two sets of fluorophores are used.
  • a transformed cell is a cell into which has been introduced a nucleic acid molecule by molecular biology techniques.
  • transformation encompasses all techniques by which a nucleic acid molecule might be introduced into such a cell, including transfection with viral vectors, transformation with plasmid vectors, and introduction of naked DNA by electroporation, lipofection, and particle gun acceleration.
  • each fluorophore is unique. By attaching one or more individual fluorophores or other labels to each type of nucleotide, each different type of nucleotide (e.g. A, T/U, C or G) has its own individual or own combination of signals (such as fluorophores that emit at unique different wavelengths). Each nucleotide class will have a unique emission signal, that in the examples is based on the fluorophore(s) present on that class of nucleotide. This signal can be used to determine which type of nucleotide (e.g. A, T/U, C or G) has been added to a growing complementary strand of nucleic acid, and these signals in combination indicate the nucleic acid sequence.
  • Vector: A nucleic acid molecule as introduced into a host cell, thereby producing a transformed host cell.
  • a vector may include nucleic acid sequences that permit it to replicate in a host cell, such as an origin of replication.
  • a vector may also include one or more selectable marker genes and other genetic elements known in the art.
  • TDS Two Dye Sequencing
  • the donor fluorophore is on a polymerase
  • the acceptor fluorophore is on the nucleotides which are incorporated into the nucleic acid as a complementary strand is generated (FIGS. 1-3).
  • a polymerase 10 is attached to a substrate 12, such as a microscope slide, by a linker 14.
  • the nucleic acid 16 to be sequenced has an annealed oligonucleotide primer 18, and is bound by the anchored polymerase 10.
  • a mixture of nucleotides 20 is added.
  • the polymerase 10 then sequentially adds the appropriate nucleotide 20 to the complementary strand.
  • the substrate 12 can be mounted onto a microscope stage 34.
  • the sequencing reaction may take place in an aqueous environment 36, which may be sealed to prevent desiccation, for example by covering with a glass cover slip 38.
  • FIGS. 1B-1D show alternative embodiments in which a nucleic acid, for example an oligonucleotide primer 18 (FIG. 1B) or a nucleic acid to be sequenced 16 (FIGS. 1C and 1D) is attached to a substrate 12, such as a microscope slide, by a linker 14.
  • the nucleic acid to be sequenced can be attached by its 5' (FIG. 1D) or 3' end (FIG. 1C). In other embodiments, the nucleic acid to be sequenced can be attached to the substrate by any nucleotide within the nucleic acid.
  • a mixture of nucleotides 20 and polymerase 10 is added as described above.
  • FIG. 2 illustrates the fluorophores on both the polymerase 10 and the nucleotides 20.
  • the polymerase 10 is labeled with a donor fluorophore 22, such as green fluorescent protein (GFP).
  • the nucleotide 20 (A, T/U, C, or G) is labeled with at least one acceptor fluorophore 24. After attaching the fluorescent polymerase 10 to a substrate 12 in a microscope field of view, the fluorescent nucleotides 20 are added to the reaction chamber.
  • While each nucleotide 20 is added to the complementary strand, the fluorophore 22 on the polymerase 10, but not the fluorophore(s) 24 on the nucleotides 20, is continually excited using electromagnetic radiation, for example a coherent beam of light provided by a laser 26 which emits electromagnetic radiation 28 of a particular wavelength, or light within a narrow range of wavelengths.
  • the donor fluorophore 22 can be a luminescent molecule, or a luminescent molecule can be used to excite the donor fluorophore 22.
  • a source of electromagnetic radiation, such as a laser 26, is not required.
  • An example of a luminescent molecule is aequorin.
  • the laser 26 provides an excitation signal 28 that excites the donor fluorophore 22 on the polymerase 10, but not the acceptor fluorophore 24 on the incorporated or free nucleotides 20.
  • the emission signal 30 from the donor fluorophore 22 will excite the acceptor fluorophore 24 associated with the particular nucleotide being added to the sequence.
  • the acceptor fluorophore 24 then emits its own unique emission signal 32, which acts as an indicator of the corresponding type of nucleotide (uniquely associated with that fluorophore) that has been added to the sequence.
  • This transfer of energy from the donor fluorophore to the acceptor fluorophore is fluorescence resonance energy transfer (FRET).
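FRET efficiency falls off steeply with the distance between donor and acceptor, which is consistent with the statement above that the nucleotide being added, and not the free nucleotides, is excited through the donor. The sketch below uses the standard Förster relation E = 1 / (1 + (r/R0)^6). The Förster radius of 5 nm and the example distances are hypothetical values chosen for illustration; they are not taken from this disclosure.

```python
def fret_efficiency(r_nm: float, r0_nm: float) -> float:
    """Förster energy-transfer efficiency for donor-acceptor distance r_nm.

    r0_nm is the Förster radius, the distance at which efficiency is 50%.
    """
    if r_nm < 0 or r0_nm <= 0:
        raise ValueError("distance must be non-negative and R0 must be positive")
    return 1.0 / (1.0 + (r_nm / r0_nm) ** 6)


R0_NM = 5.0  # hypothetical Förster radius, for illustration only

for r in (2.5, 5.0, 10.0, 20.0):  # hypothetical donor-acceptor distances in nm
    print(f"r = {r:5.1f} nm  ->  E = {fret_efficiency(r, R0_NM):.4f}")
```

Because of the sixth-power dependence, doubling the distance from R0 to 2·R0 reduces the efficiency from 1/2 to 1/65.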
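  • when a luminescent molecule such as aequorin (instead of a laser 26) is used to excite the donor fluorophore (or is itself the donor fluorophore), the resulting emission signal 30 from the donor fluorophore 22 or luminescent molecule excites the acceptor fluorophore 24 associated with the particular nucleotide being added to the sequence.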
  • the acceptor fluorophore 24 then emits its own unique emission signal 32, which acts as an indicator of the corresponding type of nucleotide (uniquely associated with that fluorophore) that has been added to the sequence.
  • This transfer of energy is luminescence resonance energy transfer (LRET).
  • the unique emission signal 32 for each type of nucleotide 20 (A, T/U, C or G) is converted into a nucleic acid sequence as shown in FIG. 3.
  • the series of emission signals 32, emitted in the microscope field as each nucleotide is added to the sequence, is collected with a microscope objective lens 40, and a complete emission spectrum 42 for each nucleotide emission 32 is generated by a spectrophotometer 44.
  • the complete emission spectrum 42 is captured by a detection device, such as a CCD camera 46, for each nucleotide 20 as it is added to the nucleic acid strand 16 in the microscope field of view.
  • the CCD camera 46 collects the emission spectrum 42 for each added nucleotide, and converts the spectrum 42 into a charge 48.
  • the charge 48 for each nucleotide addition may be recorded by a computer 50, for converting the sequence of emission spectra into a nucleic acid sequence 52 for each nucleic acid in the microscope field of view using an algorithm 54, such as a least-squares fit between the signal spectrum 42 and the dye spectra for the fluors 24 on each class of nucleotides 20.
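One way to read the least-squares step is sketched below: each recorded spectrum is fitted to each reference dye spectrum with a single scale factor, and the nucleotide type whose dye leaves the smallest residual is called. This is a simplified illustration, not the algorithm 54 itself. The Gaussian reference bands, their peak wavelengths, the wavelength grid and the helper names are all hypothetical; real reference spectra would be measured for the dyes actually used.

```python
import math


def gaussian_spectrum(peak_nm, width_nm, wavelengths):
    """Synthetic unit-height emission band used as a stand-in reference spectrum."""
    return [math.exp(-0.5 * ((w - peak_nm) / width_nm) ** 2) for w in wavelengths]


def fit_residual(signal, reference):
    """Fit signal ~ a * reference by least squares; return (a, sum of squared residuals)."""
    a = sum(s * r for s, r in zip(signal, reference)) / sum(r * r for r in reference)
    residual = sum((s - a * r) ** 2 for s, r in zip(signal, reference))
    return a, residual


def call_base(signal, references):
    """Return the nucleotide type whose reference spectrum leaves the smallest residual."""
    return min(references, key=lambda base: fit_residual(signal, references[base])[1])


def call_sequence(signals, references):
    """Convert an ordered series of emission spectra into a base string."""
    return "".join(call_base(s, references) for s in signals)


# Hypothetical wavelength grid and emission peaks (nm); not measured dye spectra.
WAVELENGTHS = list(range(540, 721, 5))
REFERENCES = {
    base: gaussian_spectrum(peak, 15.0, WAVELENGTHS)
    for base, peak in {"A": 575, "C": 595, "G": 615, "T": 665}.items()
}


def noisy(base, scale):
    """Scaled reference plus a small deterministic ripple standing in for noise."""
    return [scale * v + 0.01 * math.sin(i) for i, v in enumerate(REFERENCES[base])]


signals = [noisy("G", 3.0), noisy("A", 2.0), noisy("T", 1.5), noisy("C", 2.5)]
print(call_sequence(signals, REFERENCES))
```

Fitting a free scale factor makes the call insensitive to overall signal intensity, so a dim and a bright emission from the same dye are assigned to the same nucleotide type.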
  • the donor fluorophore 22 carried by the polymerase 10 is GFP H9-40, and the nucleotides are labeled with acceptor fluorophores as follows: A is labeled with tetramethylrhodamine; T/U is labeled with naphthofluorescein; C is labeled with lissamine; G is labeled with Texas Red.
  • the emission spectrum of each of the acceptor fluorophores is monitored, and the spectrum of each of the fluorophores can be distinguished from each other, so that the addition of each different type of nucleotide can be detected.
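Given the dye assignment above, a series of detected acceptor dyes can be turned into a sequence with a simple lookup. The sketch below assumes a DNA product (so T rather than U) and assumes that each event has already been classified as one of the four dyes; the function names are hypothetical. Since nucleotides are added to the growing complementary strand, the template sequence is recovered here as the reverse complement of the strand that was read.

```python
# Dye-to-nucleotide assignment as listed in the text (DNA case, so T rather than U).
DYE_TO_BASE = {
    "tetramethylrhodamine": "A",
    "naphthofluorescein": "T",
    "lissamine": "C",
    "Texas Red": "G",
}

COMPLEMENT = {"A": "T", "T": "A", "C": "G", "G": "C"}


def synthesized_strand(dye_events):
    """Bases of the growing strand, 5' to 3', in the order the dyes were detected."""
    return "".join(DYE_TO_BASE[dye] for dye in dye_events)


def template_strand(new_strand):
    """Template sequence, 5' to 3', as the reverse complement of the new strand."""
    return "".join(COMPLEMENT[base] for base in reversed(new_strand))


events = ["tetramethylrhodamine", "Texas Red", "lissamine"]  # hypothetical detection order
new = synthesized_strand(events)
print(new, template_strand(new))
```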
  • the method allows for the sequencing of nucleic acids by monitoring the incorporation of individual nucleotides into individual DNA or RNA molecules on the molecular level, instead of sequencing by monitoring macromolecular events, such as a pattern on an electrophoresis gel, whose signal is representative of a large population of nucleic acid molecules.
  • Using this method in combination with a large field of view, it is possible that 1000 or more DNA molecules could be sequenced simultaneously, at sequencing speeds of 360 bases or more per hour.
  • Each DNA molecule to be copied/sequenced, and its associated polymerase/donor dye may correspond to a particular field of view, or a particular sensor for a position in which the polymerase mediated reaction is occurring. Therefore, using multiple such devices, molecular sequencing with the method can permit sequencing entire chromosomes or genomes within a day.
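The throughput figures quoted above can be combined in a short calculation. The sketch assumes the stated 1000 molecules per field of view and 360 bases per hour per molecule, a single pass over the target with no overlap or overhead, and a round target size of 3 × 10^9 bases; that target size is an illustrative assumption and does not come from the text.

```python
import math


def bases_per_hour(molecules, bases_per_molecule_per_hour):
    """Aggregate rate for one field of view sequencing many molecules in parallel."""
    return molecules * bases_per_molecule_per_hour


def devices_for_target(target_bases, device_rate_per_hour, hours=24):
    """Number of devices needed to read target_bases once within the given time."""
    return math.ceil(target_bases / (device_rate_per_hour * hours))


rate = bases_per_hour(1000, 360)        # figures quoted in the text
ASSUMED_TARGET_BASES = 3_000_000_000    # hypothetical genome size, illustration only

print(f"{rate} bases/hour per device, {rate * 24} bases/day per device")
print(devices_for_target(ASSUMED_TARGET_BASES, rate), "devices for one pass in a day")
```

Under these assumptions one device reads 360,000 bases per hour, so a few hundred devices running in parallel would cover the assumed target in a day.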
  • This example describes how to prepare polymerases containing at least one fluorophore or luminescent molecule.
  • the fluorophore or luminescent molecule may be a donor fluorophore.
  • Green fluorescent protein includes a chromophore formed by amino acids in the center of the GFP. GFP is photostable, making it a desirable fluorophore to use on the polymerase, because it is resistant to photobleaching during excitation. Wild-type GFP is excited at 393 nm or 476 nm to produce an emission at 508 nm. GFP mutants have alternative excitation and emission spectra.
  • the polymerase used for elongation of the primer strand can be attached to GFP to generate a fusion protein, GFP-polymerase, by recombinant techniques known to those skilled in the art. Methods for making fusion proteins are described in Sambrook et al (Molecular Cloning, A
  • Plasmids containing the wild-type or mutant GFP gene sequences and a multiple cloning site (MCS) into which the polymerase sequence can be inserted are available from Clontech (Palo Alto, CA). Briefly, both the polymerase DNA and the GFP plasmid are digested with the appropriate restriction enzyme(s) which allow for the insertion of the polymerase into the MCS of the GFP plasmid in the sense orientation. The resulting fragments are ligated and expressed in bacteria, such as E. coli.
  • the expressed recombinant GFP-polymerase is then purified using methods known by those skilled in the art.
  • the GFP molecule may be placed at the N- or C-terminus of the polymerase, or anywhere in between.
  • the resulting GFP-polymerases are tested to determine which has the optimal properties for sequencing. Such properties can include: ease of protein purification, amount of protein produced, amount of fluorescence signal emitted after excitation, minimal alteration of the fluorescent properties of the GFP.
  • affinity tags that can be genetically engineered at either the N- or C-terminus of recombinant proteins.
  • Such tags can be attached to the GFP-polymerase protein, to aid in its purification and subsequent attachment to a substrate (see Example 2).
  • affinity tags include histidine (His), streptavidin, S-tags, and glutathione-S-transferase (GST). Other tags known to those skilled in the art can also be used.
  • the affinity tags are placed at the N- or C-terminus of a protein.
  • Commercially available vectors contain one or multiple affinity tags.
  • vectors can be used directly, or if desired, the sequences encoding the tag can be amplified from the vectors using PCR, then ligated into a different vector such as the GFP-containing vectors described above.
  • to generate a Tag-GFP-polymerase recombinant fusion protein, vectors are constructed which contain sequences encoding the tag, GFP (wild-type or mutant), and the polymerase. The sequences are ordered to generate the desired Tag-GFP-polymerase recombinant fusion protein.
  • Such methods are well known to those skilled in the art (Sambrook et al, Molecular Cloning, A Laboratory Manual, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York, Chapter 17, 1989). This vector is expressed in bacteria such as E. coli, and the protein is purified.
  • the method of purification will depend on the affinity tag attached.
  • the bacterial lysate is applied to a column containing a resin having high affinity for the tag on the fusion protein. After applying the lysate and allowing the tagged-fusion protein to bind, unbound proteins are washed away, and the fusion protein is subsequently eluted.
  • a His-6 or His-10 moiety can be attached to GFP-polymerase by using pET vectors (Novagen, Madison, WI).
  • GFP-His (Park and Raines, Protein Sci. 6:2344-9, 1997) and protein-GFP-His recombinant proteins have been described previously (Prescott et al, FEBS Lett. 411:97-101, 1997, herein incorporated by reference).
  • the His-containing fusion proteins can be purified as described in Paborsky et al. (Anal.
  • the cell lysate is immobilized using affinity chromatography on Ni2+-NTA-Agarose (QIAGEN, Valencia, CA). After washing away unbound proteins, for example using a buffer containing 8 mM imidazole, 50 mM Tris HCl, pH 7.5, 150 mM NaCl, the bound recombinant protein is eluted using the same buffer containing a higher concentration of imidazole, for example 100-500 mM.
  • the S-tag system is based on the interaction of the 15 amino acid S-tag peptide with the S-protein derived from pancreatic ribonuclease A.
  • S-tag fusion proteins are available from Novagen (Madison, WI).
  • vectors pET29a-c and pET30a-c can be used.
  • the S-tag fusion protein is purified by incubating the cell lysate with S-protein agarose, which retains S-tag fusion proteins. After washing away unbound proteins, the fusion protein is released by incubation of the agarose beads with site-specific protease, which leaves behind the S-tag peptide.
  • the affinity tag streptavidin binds with very high affinity to D-biotin.
  • Vectors for generating streptavidin-fusion proteins, and methods for purifying these proteins, are described in Sano and Cantor (Biochem. Biophys. Res. Commun. 176:571-7, 1991, herein incorporated by reference).
  • the cell lysate is applied to a 2-iminobiotin agarose column, (other biotin-containing columns may be used), and after washing away unbound proteins, the fusion protein is eluted, for example with 6 M urea, 50 mM ammonium acetate (pH 4.0).
  • Plasmid expression vectors containing GST (pGEX) are disclosed in U.S. Patent No. 5,654,176 to Smith, herein incorporated by reference, and in Sharrocks (Gene, 138:105-8, 1994, herein incorporated by reference). pGEX vectors are available from Amersham Pharmacia Biotech (Piscataway, NJ). The cell lysate is incubated with glutathione-agarose beads and after washing, the fusion protein is eluted, for example, with 50 mM Tris-HCl (pH 8.0) containing 5 mM reduced glutathione.
  • the GST moiety can be released by specific proteolytic cleavage. If the GST-fusion protein is insoluble, it can be purified by affinity chromatography if the protein is solubilized in a solubilizing agent which does not disrupt binding to glutathione-agarose, such as 1% Triton X-100, 1% Tween 20, 10 mM dithiothreitol or 0.03% NaDodSO4. Other methods used to solubilize GST-fusion proteins are described by Frangioni and Neel (Anal. Biochem. 210:179-87, 1993, herein incorporated by reference).
  • Recombinant GFP-aequorin-polymerase can be generated using methods known to those skilled in the art, for example the method disclosed by Baubet et al. (Proc. Natl. Acad. Sci. USA 97:7260-5, 2000, herein incorporated by reference). Briefly, aequorin cDNA (for example Genbank Accession No. L29571), polymerase DNA, and a GFP plasmid are digested with the appropriate restriction enzyme(s) which allow for the insertion of the aequorin and polymerase into the MCS of a GFP plasmid in the sense orientation. The resulting fragments are ligated and expressed in bacteria, such as E. coli. The expressed recombinant GFP-aequorin-polymerase is then purified as described above. Affinity tags can also be added.
  • the ordering of the GFP, aequorin, and polymerase sequences can be optimized.
  • the resulting GFP-aequorin-polymerases are tested to determine which has the optimal properties for sequencing.
  • properties can include: ease of protein purification, amount of protein produced, amount of chemiluminescent signal emitted, amount of fluorescent signal emitted after excitation, minimal alteration of the fluorescent properties of the GFP and aequorin, and amount of polymerase activity
  • Amine-reactive fluorophores are frequently used to create fluorescently-labeled proteins.
  • amine-reactive probes examples include, but are not limited to: fluorescein,
  • thiol-reactive probes can be used to generate a fluorescently-labeled polymerase.
  • in proteins, thiol groups are present in cysteine residues. Reaction of fluors with thiols usually proceeds rapidly at or below room temperature (RT) in the physiological pH range (pH 6.5-8.0) to yield chemically stable thioethers.
  • examples of thiol-reactive probes that can be used include, but are not limited to: fluorescein, BODIPY, coumarin, rhodamine, Texas Red and their derivatives.
  • fluorescently-labeled polymerases have a high fluorescence yield, and retain the critical features of the polymerase, primarily the ability to synthesize a complementary strand of a nucleic acid molecule.
  • the polymerase may therefore have a less-than-maximal fluorescence yield to preserve the function of the polymerase.
  • unconjugated dye is removed, for example by gel filtration, dialysis or a combination of these methods.
  • This example describes methods that can be used to attach the fluorescent polymerase generated in Example 1, or a nucleic acid, to a substrate, such as a microscope slide or gel matrix.
  • nucleic acids for example the sample nucleic acid to be sequenced or an oligonucleotide primer
  • nucleic acids can be attached by their 5' or 3' end, or anywhere in between.
  • a 5' biotinylated primer can be synthesized (Beaucage, Tetrahedron Letters 22:1859-62, 1981; Caruthers, Meth. Enzym. 154:287-313, 1987), and affixed to a streptavidin coated substrate surface (Hultman, Nuc Acids Res. 17:4937-46, 1989).
  • the nucleic acid can be dried on amino-propyl-silanized (APS) glass, as described by Ha et al (Proc. Natl. Acad. Sci. USA. 93:6264-68, 1996), herein incorporated by reference.
  • a silyl moiety can be attached to a nucleic acid, which can be used to attach the nucleic acid directly to a glass substrate, for example using the methods disclosed by Kumar et al. (Nucleic Acids Res. 28:e71, 2000, herein incorporated by reference). Briefly, silane is conjugated to a nucleic acid using the following method.
  • Mercaptosilane [(3-Mercaptopropyl)-trimethoxysilane] is diluted to 5 mM stock solution with a reaction buffer such as sodium acetate (30 mM, pH 4.3) or sodium citrate (30 mM, pH 4).
  • 1 nmol nucleotides are reacted with 5 nmol mercaptosilane in 20 µl of the same buffer for 10-120 min at RT.
  • the reaction mixture is used directly or diluted with the reaction buffer to a desired concentration for immobilization on a substrate, such as a glass microscope slide.
  • 5'-acrylic-labeled oligonucleotides are conjugated to mercaptosilane using an identical procedure.
  • the 5'-thiol-labeled nucleotides are conjugated with aminosilane [(3-aminopropyl)-trimethoxysilane] in dimethylsulfoxide (DMSO) in the presence of heterobifunctional linkers N-succinimidyl-3-(2-pyridyldithiol)-propionate (SPDP) or succinimidyl-6-(iodoacetyl-amino)-hexanoate (SIAX).
  • Nucleotides (final concentration 5-50 µM) are combined with 2.5 nmol aminosilane (added from 5 mM solution in ethanol) and 2.5 nmol bifunctional reagents (added from 5 mM stock solution in DMSO) in 10 µl DMSO, and the reaction allowed to proceed for 1-2 hours at RT.
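The reagent amounts quoted above are given in nmol, while the stocks are given in mM, so a short unit conversion helps when pipetting. The sketch below relies only on the identity 1 mM = 1 nmol/µl; the function names are hypothetical, and the calculation assumes the stated reaction volume is the final volume.

```python
def stock_volume_ul(amount_nmol, stock_mM):
    """Volume of stock (µl) that contains amount_nmol, using 1 mM = 1 nmol/µl."""
    return amount_nmol / stock_mM


def final_conc_uM(amount_nmol, volume_ul):
    """Concentration (µM) of amount_nmol dissolved in volume_ul (nmol/µl = mM)."""
    return amount_nmol / volume_ul * 1000.0


# 5 nmol mercaptosilane from a 5 mM stock, reacted in 20 µl
print(stock_volume_ul(5, 5), "µl mercaptosilane stock")
print(final_conc_uM(5, 20), "µM mercaptosilane in the reaction")
print(final_conc_uM(1, 20), "µM nucleic acid in the reaction")

# 2.5 nmol aminosilane or bifunctional reagent, each from a 5 mM stock
print(stock_volume_ul(2.5, 5), "µl of each 5 mM stock")
```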
  • Acrylic-labeled oligonucleotides (50-500 pmol) are combined with 25 nmol acrylicsilane (γ-methacryloxy-propyl-trimethoxysilane) in 10 µl of 30 mM NaOAc, pH 4.3.
  • Ammonium persulfate (10% in H2O) and N,N,N',N'-tetramethylethylenediamine (TEMED) are added to final concentrations of 0.5 and 2%, respectively, and the mixture allowed to react for 30 minutes at RT.
  • After the conjugation reactions, the reaction mixture is referred to as silanized nucleic acid, and can be directly used for spotting onto a substrate.
  • Silanized nucleic acids can be spotted on the glass slides manually (120 nl/spot) or with an automated arrayer (Genetic Microsystem, Woburn, USA) (1 nl/spot).
  • Nucleic acids in aqueous solutions can be kept in a humidified chamber for 15 minutes at RT after spotting onto the glass slide, dried at 50°C for five minutes, dipped into boiling water for 30 seconds to remove non-covalently bound nucleic acids, and dried with nitrogen before hybridization.
  • Nucleotides in DMSO are left at RT for 15 minutes after spotting onto glass slides and dried at 50°C for 10 minutes. These slides are sequentially washed with DMSO (3 x 2 min), ethanol (3 x 2 min) and boiling water (2 min) and dried with nitrogen for later use.
  • the nucleotide to be hybridized is diluted to between 20 nM and 1 µM in 5x SSC (750 mM NaCl, 125 mM sodium citrate, pH 7) with 0.1% Tween-20.
  • Hybridization is done under coverslips in a humidifier at 37°C for 30 minutes to overnight.
  • Non-hybridized and nonspecific nucleotides are removed by washing with 5x SSC containing 0.1% Tween-20 (3 x 1 min) followed by 1x SSC containing 0.1% Tween-20 (2 x 15 min).
  • hybridization is carried out at 65°C for four hours in 3x SSC with 0.1% SDS and 1 µg/µl yeast tRNA. The slides are then washed with 1x SSC containing 0.1% SDS (3 x 2 min) and 0.1x SSC containing 0.1% SDS (3 x 5 min) at RT.
  • the slides can be dried with nitrogen gas. If repeated hybridization on the same substrate is desired, the substrate is boiled in water for one minute then dried with nitrogen gas before proceeding to the next hybridization reaction.
  • a terminal transferase can be used to "tail" the molecule.
  • the polymerase can be attached to the substrate.
  • the polymerase can be linked to a substrate by first generating a streptavidin-polymerase fusion protein using the methods described above in Example 1.
  • the polymerase-streptavidin protein is then affixed to a biotinylated substrate, for example as described by Mazzola and Fodor (Biophys. J. 68:1653-60, 1995) or Itakura et al. (Biochem. Biophys. Res. Commun. 196:1504-10, 1993).
  • 100 µl of N,N-bis[carboxymethyl]lysine (BCML) is added (10 mM BCML in 0.1 M NaPO4, pH 8) to each well and incubated overnight at RT.
  • the plate is subsequently washed with 200 µl of 0.05% Tween, blocked (3% BSA in 50 mM Tris HCl, pH 7.5, 150 mM NaCl, 0.05% Tween) and washed with a series of buffers.
  • the polymerases may be arranged on a two-dimensional substrate surface in an organized array. Polymerases may be spaced by micrometer distances as described by Müller et al. (Science 268:272-3, 1995, herein incorporated by reference). In addition, patterns of channels that are approximately 50 µm in width and approximately 10-20 µm in depth can be formed in the substrate using standard photolithographic procedures followed by chemical wet etching as described in U.S. Patent No. 5,661,028 to Foote (herein incorporated by reference). Much smaller channels can be generated using nanolithography techniques.
  • Dense periodic arrays of holes or chambers 20 nm across are fabricated into a silicon nitride coated substrate by the method of Park et al. (Science, 276:1401-4, 1997, herein incorporated by reference). In each chamber, a single sequencing reaction would take place.
  • the polymerase may also be attached to the substrate in an orderly array by micropipetting droplets containing the polymerase onto the surface of the substrate. The droplets are then covered, for example with a glass coverslip, to prevent evaporation.
  • the polymerase or nucleic acid may be embedded into a three-dimensional gel matrix.
  • the polymerase or nucleic acid is added to the liquid matrix, which is allowed to solidify, trapping the polymerases or nucleic acids within it.
  • Examples of this type of matrix include agarose and acrylamide, for example Ni2+-NTA-Agarose (QIAGEN, Valencia, CA).
  • This example describes how to prepare nucleotides containing at least one fluorophore, for example an acceptor fluorophore.
  • this example lists sources of commercially available fluorescent nucleotides that can be used in the present disclosure.
  • For the acceptor fluorophores, it is important that the frequency used to excite the donor fluorophore on the polymerase (Example 1) not overlap the excitation spectra of the acceptor fluorophores on the nucleotides.
  • Each nucleotide should possess at least one acceptor fluorophore having an excitation spectrum which overlaps the emission spectrum of the donor fluorophore attached to the polymerase (Example 1), such that the emission from the donor fluorophore excites the acceptor fluorophore.
  • NEN Life Science Products (Boston, MA) offers all four deoxynucleotides and ribonucleotide analogs with fluorophores attached. There are several different fluorophores available including fluorescein, Texas Red®, tetramethylrhodamine, coumarin, naphthofluorescein, cyanine-3, cyanine-5, and Lissamine™. In addition, Molecular Probes (Eugene, OR) sells deoxyuridinetriphosphate (dUTP) labeled with various fluorophores replacing the methyl group of thymine, synthesized by the method of U.S. Patent No. 5,047,519. Because these nucleotides have 3' hydroxyls, they can be used directly for sequencing.
  • nucleotides containing other acceptor fluorophores can be prepared.
  • the fluorophores are capable of being attached to the nucleotide, are stable against photobleaching, and have high quantum efficiency.
  • each type of nucleotide (e.g. A, T/U, C and G)
  • U will be substituted for T in this example.
  • the fluorophore ideally does not interfere excessively with the degree or fidelity of nucleotide incorporation. After the fluorophores are attached, the nucleotide is still able to undergo polymerization and complementary base pairing, and retains a free 3' hydroxyl end.
  • the fluorophore can either be directly or indirectly attached to the nucleotide.
  • the method described above for attaching fluorophores to polymerases can be used (Example 1).
  • the fluorophore may be attached indirectly to the nucleotide by a linker molecule.
  • For example, a streptavidin linkage may be used as the linker molecule.
  • the linker does not significantly interfere with binding to or incorporation by the polymerase.
  • the use of a linker would make the nucleotide bulky, allowing less FRET to previous bases. This may make it easier to distinguish nucleotides as they are added to the complementary nucleic acid strand.
  • the nucleotides can be cleaved from the DNA molecules after their incorporation, for example by attaching DNase to the end of the polymerase, producing free mononucleotides that could not be reused.
  • fluorescent molecules containing two attachment points can be used to orient the fluorophore on the polymerase (Corrie et al., 1999, Nature 400:425, herein incorporated by reference).
  • the linkage can be peptidase sensitive, allowing the fluorophore to be released after the emission signal is detected as a result of the acceptor fluorophore on the nucleotide being added to the complementary nucleic acid strand.
  • the use of a linker may allow the fluorophore orientation to be controlled, so that the optimal orientation for FRET can be determined.
  • An optimal orientation is one that generates the brightest emission signal from the acceptor fluorophore, without the nucleotide losing its ability to incorporate into the complementary nucleic acid strand.
  • linkers to separate a nucleotide from a fluorophore.
  • linkers may include a straight-chained alkylene, C1-C20, optionally containing within the chain double bonds, triple bonds, aryl groups or heteroatoms such as N, O or S.
  • Substituents on the diradical moiety can include C1-C6 alkyl, aryl, ester, ether, amine, amide or chloro groups.
  • the sequencing method described herein is asynchronous. Therefore, it can be difficult to distinguish multiple bases of the same type (i.e. poly T).
  • "dummy" nucleotides can be supplied, such as four dNMP or dNDPs that have a fifth fluorophore distinct from the four used to identify the nucleotides. Because these molecules do not contain three phosphate groups, they can enter the polymerase, but they cannot bond covalently. If included in a higher concentration relative to the nucleotide fluorophores, they can provide a specific signal indicating the transition between attachment of one base and the next.
  • the signal from the "counter" nucleotide would usually be received between each actual signal, and serve to indicate that a new actual nucleotide has been added (for example syncopating the addition of three Ts as: T-counter-T-counter-T-counter).
  • Repeat sequencing can be performed to confirm the result, and address the possibility that the counter may not be added or detected in some instances of each sequencing reaction.
  • the counter nucleotides can also provide a means of determining the number of bases in runs of bases. However, important information about sequences can be obtained even without the use of the counter nucleotides, such as a rough approximation of the sequence, or quick confirmation of a sequence obtained by other methods.
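The role of the counter signal can be illustrated with a minimal Python sketch. It assumes that each detection frame has already been classified as one of the four bases or as the counter; the symbol "x" for the counter and the function name `decode_with_counters` are hypothetical and chosen only for illustration.

```python
from itertools import groupby

COUNTER = "x"  # hypothetical symbol for the fifth ("counter") fluorophore


def decode_with_counters(frames):
    """Turn a per-frame signal stream into a base sequence.

    Consecutive identical frames are treated as one event, and counter
    events only separate base events; they are not part of the sequence.
    """
    bases = []
    # groupby merges each run of identical consecutive frames into one event
    for symbol, _run in groupby(frames):
        if symbol != COUNTER:
            bases.append(symbol)
    return "".join(bases)
```

Under these assumptions, `decode_with_counters("TTxTTTxTxAAx")` yields "TTTA": the three T events are kept apart by the intervening counter events even though each persisted for a different number of frames. If a counter is missed, two adjacent identical bases merge into one, which is why repeat sequencing is suggested above.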
  • Another approach to distinguishing multiple bases of the same type is to incubate the reaction at a low temperature, such as 0-30°C, for example 4°C or at RT. For this approach, polymerases are selected that are able to function properly at these lower temperatures. This temperature range allows for a narrower spectral line and hence higher coding complexity. If more than one fluorescent acceptor is present on each nucleotide, then the individual classes of nucleotides are coded. The lower temperature sharpens the spectrum, allowing more distinct spectra to be read. It is important to avoid freezing, which would interfere with the polymerization reaction.
  • Other approaches to distinguishing multiple bases of the same type include making the aqueous environment 36 more viscous, reducing the concentration of nucleotides, and using a polymerase containing one or more mutations which slow the polymerase.
  • the fluorescent nucleotides are first tested using a fluorescence spectrophotometer. For example, 5' biotin-labeled single-stranded nucleic acid is attached to magnetic streptavidin particles. A primer is annealed and the polymerase and one or more fluorescent nucleotides are added. After washing the beads, the nucleic acid is cleaved at a restriction site close to the bead. The fluorescence spectrophotometer is used to detect addition of the fluorescent nucleotides.
  • the test can also be performed by separation of the labeled nucleic acids on an agarose gel and detection under a UV lamp or using an ABI sequencing machine. Therefore, the contribution of previously incorporated bases to the current spectrum can be determined by accounting for the known spectra of nucleotides at various previous positions. Since the previous sequence is known, the predicted effect of the previous nucleotides can be removed from the current spectrum.
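Removing the predicted effect of earlier bases can be sketched as a simple subtraction. The following Python fragment is only an illustration under stated assumptions: spectra are lists of per-channel intensities, `reference` holds a calibrated spectrum for each base, and `carryover` holds hypothetical weights describing how strongly a base one, two, or more positions back still contributes. None of these names or the linear model come from the disclosure.

```python
def subtract_previous(observed, previous_bases, reference, carryover):
    """Remove the predicted contribution of earlier bases from a spectrum.

    observed       -- per-channel intensities for the current frame
    previous_bases -- earlier bases, most recently added first
    reference      -- dict mapping base -> calibrated per-channel spectrum
    carryover      -- assumed weights; carryover[k] applies to the base
                      k + 1 positions back
    """
    corrected = list(observed)
    # zip stops at the shorter input, so bases beyond the calibrated
    # carryover range are ignored
    for weight, base in zip(carryover, previous_bases):
        for channel, value in enumerate(reference[base]):
            corrected[channel] -= weight * value
    return corrected
```

With a two-channel reference `{"A": [1.0, 0.0], "T": [0.0, 1.0]}`, an observed spectrum `[1.0, 0.5]`, a previous base "T" and a carryover weight of 0.5, the corrected spectrum is `[1.0, 0.0]`, which matches the reference for A.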
  • Another method that can be used to distinguish the latest nucleotide added onto the growing nucleotide chain is to use polarized light to measure the rotation of single molecules.
  • the newly added base is fixed in orientation.
  • the location of the donor dipole is adjusted to match the most recently added acceptor fluorophore so that the most recent fluorophore generates the strongest FRET signal.
  • Harms et al. teach the use of polarized light to measure rotation of single molecules (Biophys. J. 77:2864-70, 1999, herein incorporated by reference).
  • Yet another method to distinguish the individual nucleotides is to label the nucleotides with more than one fluorophore.
  • two or more different fluorophores can be added to each nucleotide.
  • the combination of fluorophores generates an emission spectrum which is easier to distinguish than the emission spectrum from only one fluorophore on each nucleotide.
  • Multiple tags thereby allow each nucleotide to be coded by more than one spectrum, helping to reduce the ambiguity of strings of the same nucleotide.
  • Such a compressed sequence is still usable because it is unique. For example, if RNA from E. coli is sequenced and results in sequence 3 above, the location of the RNA can be determined. When the entire human genome is sequenced, this method can be used to count individual mRNA molecules directly. The first step is to compress the entire human genomic sequence. Then, the NCBI Basic Local Alignment Search Tool (BLAST), or other program is used to search this compressed human genomic sequence using the results obtained from the sequencing methods of the present disclosure. This method does not require macroscopic handling for high-throughput analysis, and is highly useful for studying gene expression.
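The search against a compressed reference can be sketched in Python. This is a minimal illustration that assumes "compressed" means each run of identical bases is collapsed to a single base, and it uses an exact substring search as a stand-in for a BLAST search; the function names are hypothetical.

```python
from itertools import groupby


def compress(seq):
    """Collapse every run of identical bases to a single base."""
    return "".join(base for base, _run in groupby(seq))


def find_compressed(read, reference):
    """Return start positions of the compressed read in the compressed reference."""
    c_read, c_ref = compress(read), compress(reference)
    if not c_read:
        return []
    hits = []
    start = c_ref.find(c_read)
    while start != -1:
        hits.append(start)
        start = c_ref.find(c_read, start + 1)
    return hits
```

For example, `compress("AAATTGGGA")` gives "ATGA", so a read in which run lengths were miscounted still maps to the same compressed location as the true sequence.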
  • BLAST: Basic Local Alignment Search Tool (Altschul et al., J. Mol. Biol. 215:403-10, 1990).
  • NCBI: National Center for Biotechnology Information.
  • BLAST programs include tblastn and tblastx. Additional information can be found at the NCBI web site.
  • EXAMPLE 4: Microscope Set-Up. This example describes microscope systems that can be used to sequence nucleic acids using the method disclosed herein.
  • TIR fluorescence microscopy can be used, for example using the methods and device described by Pierce et al. (Nature, 388:338, 1997; Methods Cell Biol. 58:49, 1999); Funatsu et al. (Nature, 374:555, 1995); Weiss (Science, 283:1676, 1999) and Schutt et al. (U.S. Patent No. 5,017,009).
  • TIR is an optical phenomenon that occurs when light travelling through a high refractive index material strikes the interface with a second material of lower refractive index at an angle of incidence greater than the critical angle (measured from the normal to the interface). In this situation, all light is reflected back from that interface, except for a microscopic evanescent wave which propagates into the second material for only a short distance.
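The geometry of TIR can be made concrete with a short Python calculation. It uses the standard relations for the critical angle, arcsin(n_low / n_high), and for the evanescent intensity penetration depth, wavelength / (4π · sqrt(n_high² sin²θ − n_low²)), with angles measured from the normal to the interface. The refractive indices (1.52 for glass, 1.33 for water), the 488 nm wavelength, and the function names are illustrative assumptions, not values specified for the disclosed device.

```python
import math


def critical_angle_deg(n_high, n_low):
    """Critical angle (degrees from the normal) for light going from n_high to n_low."""
    return math.degrees(math.asin(n_low / n_high))


def penetration_depth_nm(wavelength_nm, n_high, n_low, incidence_deg):
    """Depth at which the evanescent intensity falls to 1/e of its surface value."""
    s = n_high * math.sin(math.radians(incidence_deg))
    if s <= n_low:
        raise ValueError("incidence angle does not exceed the critical angle; no TIR")
    return wavelength_nm / (4 * math.pi * math.sqrt(s * s - n_low * n_low))
```

With these assumed indices the critical angle is about 61°, and at 65° incidence the evanescent field of a 488 nm beam reaches on the order of 100 nm into the aqueous medium, which is why only fluorophores near the surface are excited.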
  • the first material is a glass substrate and the second material is water or another aqueous medium in which an assay is being conducted.
  • the fluorescent molecules can be energized, and fluorescence detected which then emanates into the overlying solution.
  • the advantage of TIR is that it produces a superior signal-to-noise ratio, and reduces the photobleaching of the fluorescent molecules since only a thin layer of the sample is exposed.
  • a confocal microscopy system can be used.
  • An example of such a confocal laser is the Leica Confocal Spectrophotometer TCS-SP (Leica,
  • the confocal laser would only illuminate sequencing polymerases, leaving the remainder of the reservoir dark. To accomplish this, one can first scan the entire volume available for polymerases, then program the microscope to only expose those small regions containing functioning polymerases. Another advantage of confocal microscopy is that sequencing reactions could occur in three dimensions. Confocal microscopy excludes planes that are not of interest, allowing one to increase the total number of sequences taken. This would allow more sequencing reactions to be performed and detected per field of view.
  • Another means that can be used to reduce photobleaching is to incubate the sample in a solution containing an oxygen scavenger system, for example as described by Kitamura et al. (Nature, 397:129, 1999); Okada and Hirokawa (Science, 283: 1152, 1999); Harada et al. (J. Mol. Biol. 216:49, 1990).
  • solutions include: 1% glucose, 0.05 mg/ml glucose oxidase and 0.1 mg/ml catalase; and 0.5% 2-mercaptoethanol, 4.5 mg/ml glucose, 216 µg/ml glucose oxidase, 36 µg/ml catalase, 2 mM ATP in buffer.
  • Near-field scanning optical microscopy (NSOM)
  • an aperture having a diameter that is smaller than an optical wavelength is positioned in close proximity (i.e., within less than one wavelength) to the surface of a specimen and scanned over the surface.
  • Light may be either emitted or collected by such an aperture in the end of a probe.
  • Mechanical or piezoelectric means are provided for moving the probe relative to the sample.
  • Light that has interacted with the sample is collected and detected by, for example, a spectrophotometer, and then a CCD camera.
  • the strength of the detected light signal is typically stored, in the form of digital data, as a function of the probe position relative to the sample. The stored data can be converted into a nucleic acid sequence.
  • NSOM allows optical measurements with sub-wavelength resolution, can measure FRET, and works well in solution (Ha et al., Proc. Natl. Acad. Sci. USA 93:6264-8, 1996).
  • Standard microscopes can be converted to a near-field optical microscope using a device sold by Nanonics Ltd. (Malha, Jerusalem, Israel).
  • the advantage of NSOM is that high resolution of the sample can be obtained.
  • Because the probe scans the surface of the substrate, the number of sequencing reactions that can be monitored at any one time decreases. To help compensate for this decrease, the rate of nucleotide addition can be decreased by increasing the viscosity of the solution or decreasing the temperature.
  • Kairos Scientific provides a Fluorescence Imaging MicroSpectrophotometer (FIMS). This microscope generates a fluorescence emission spectrum for every pixel in the field of view. Therefore, a unique emission spectrum is generated for each nucleotide as it is added to the complementary nucleic acid strand.
  • the method allows for single molecule detection (SMD), for example using the system disclosed by Fang and Tan (Anal. Chem. 1999, 71:3101-5, herein incorporated by reference).
  • an optical fiber is used to probe into a fluorophore solution (i.e. the aqueous environment 36 of FIG. 3), or at a solid surface (i.e. the substrate 12 shown in FIG. 3).
  • the optical fiber has total internal reflection, allowing fluorescent molecules close to the surface to be excited by the evanescent wave.
  • the fluorescent signals generated by the fluorophores are detected by an intensified charge-coupled device (ICCD)-based microscope system.
  • Optical fibers can be purchased from Newport Corp. (Irvine, CA).
  • SMD can be performed using the method disclosed by Unger et al. (BioTechniques, 1999, 27:1008-14, herein incorporated by reference). Briefly, using a standard fluorescent microscope with mercury lamp excitation and a CCD camera, single fluorescent molecules can be observed in air and in aqueous solution, if the molecules are sufficiently separated by dilution.
  • electromagnetic radiation can be emitted by a laser.
  • the choice of laser used will depend on the specific donor fluorophore used.
  • the wavelength of the laser light is selected to excite the donor fluorophore.
  • wild-type GFP and FITC can be excited by an argon laser at 488 nm.
  • blue laser diodes which emit at 400 nm (Nichia Chemical Industries Ltd.) or 404 nm (Power Technology Inc., Little Rock, AR) can be used.
  • Other sources of electromagnetic radiation known by those skilled in the art can also be used, for example HeNe lasers and mercury lamps.
  • a fluid handling system is optional. For simplicity, one may prefer to add all of the necessary reagents, then seal the chamber with a glass coverslip or a drop of oil to prevent desiccation. Alternatively, a slow flow of nucleotide containing solution can be provided to replenish the nucleotides and to remove the products (diphosphate). Such a system would increase nucleotide use, but would maintain steady state conditions, which may increase the length of sequencing runs.
  • a computer chip that performs the liquid handling can be built that sits on the stage of a fluorescent microscope.
  • Micromachine and microfluidic devices and methods for the dispensing of nanoliter size liquid samples have been previously described (Service, Science 282:399-401, 1998; Burns et al., Science 282:484-7, 1998).
  • a detector acts as the primary tool to capture the emission spectra generated by the spectrophotometer.
  • a CCD camera can be used as the detector to capture the image.
  • the emission spectra generated by the spectrophotometer are collected by the CCD camera, which converts this input into a charge.
  • the charge is converted into a signal by the CCD output.
  • the resulting signal is digitized, as a characteristic signal associated with each type of nucleotide (e.g. A, T/U, C or G), and the digital data is captured into memory, such as the hard-drive of a computer.
  • the sum of the captured data is then processed into a nucleotide sequence.
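One simple way to turn digitized spectra into base calls is to compare each captured spectrum with a reference spectrum for each nucleotide type and pick the closest. The Python sketch below assumes such calibrated reference spectra exist and uses squared Euclidean distance; the nearest-reference rule and the function names are illustrative assumptions rather than the processing specified in the disclosure.

```python
def classify(spectrum, reference):
    """Return the base whose reference spectrum is closest to the measured one."""
    def squared_distance(a, b):
        return sum((x - y) ** 2 for x, y in zip(a, b))

    return min(reference, key=lambda base: squared_distance(spectrum, reference[base]))


def call_sequence(spectra, reference):
    """Convert a series of captured spectra into a nucleotide sequence."""
    return "".join(classify(spectrum, reference) for spectrum in spectra)
```

In practice the reference spectra would be measured for the specific acceptor fluorophores in use, and a confidence threshold could be added so that ambiguous frames are flagged instead of called.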
  • CCD cameras are commercially available from many sources including Kodak (Rochester, NY).
  • a monochrome CCD containing filters or other means of obtaining a spectrum may be used. This would require that the spectrum be swept. To reduce background noise, any of the CCD cameras may be cooled.
  • the rate at which sequencing of the nucleic acids occurs can be controlled by many factors. Faster rates can be obtained by increasing the temperature (using a heat stable polymerase) or by running the reactions under high pressure, as in HPLC. The reaction rate can be slowed by making the solution more viscous, by lowering the reaction temperature, or by having fewer reactive nucleotides available. The rate of polymerization may be controlled in this manner so as not to exceed the rate of the CCD integration and computer recording time. The fluorescent signal can then be more reliably read by the CCD and interpreted by the computer.
  • the method is performed in a closed chamber device that produces sequencing signals, which enter the computer directly.
  • the method sequences nucleic acids by monitoring the incorporation of individual nucleotides into individual nucleic acid molecules on the molecular level, instead of sequencing nucleic acids by monitoring macromolecular events, such as a pattern on an electrophoresis gel, that are representative of a large population of nucleic acid molecules.
  • photomultiplier tubes or an intensified charge-coupled device (ICCD) can be used.
  • the methods disclosed herein can be performed in the general context of computer-executable instructions of a computer program that runs on a personal computer.
  • program modules include routines, programs, components, data structures, etc. that perform particular tasks or implement particular abstract data types.
  • the method may be practiced with other computer system configurations, including hand-held devices, multi-processor systems, microprocessor-based or programmable consumer electronics, minicomputers, mainframe computers, and the like.
  • the methods disclosed herein may also be practiced in distributed computing environments where tasks are performed by remote processing devices that are linked through a communications network.
  • program modules may be located in both local and remote memory storage devices.
  • the present implementation platform of the methods disclosed herein is a system implemented on a Sun computer having at least one megabyte of main memory and a one gigabyte hard disk drive, with Unix as the user interface.
  • the application software is written in Pascal or other computer language.
  • This example describes methods for sequencing nucleic acids from different sources.
  • One application of the disclosed method is the sequencing of a plasmid. After introducing random nicks into the plasmid, the DNA is added onto a substrate containing fixed fluorescent polymerases (Examples 1 and 2). The entire plasmid is then sequenced from many points. The computer keeps track of all the sequences and automatically assembles them into a complete plasmid sequence.
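The automatic assembly step can be illustrated with a greedy overlap merge in Python. This is a deliberately simplified sketch: it assumes error-free reads that all come from the same strand, ignores the circularity of the plasmid, and uses a hypothetical minimum overlap parameter `min_len`. It is not the assembly procedure of the disclosure.

```python
def overlap(a, b, min_len):
    """Length of the longest suffix of a that equals a prefix of b (0 if below min_len)."""
    for n in range(min(len(a), len(b)), min_len - 1, -1):
        if a.endswith(b[:n]):
            return n
    return 0


def greedy_assemble(reads, min_len=3):
    """Repeatedly merge the pair of reads with the longest overlap."""
    reads = list(dict.fromkeys(reads))  # drop exact duplicates, keep order
    while len(reads) > 1:
        best_n, best_i, best_j = 0, None, None
        for i, a in enumerate(reads):
            for j, b in enumerate(reads):
                if i != j:
                    n = overlap(a, b, min_len)
                    if n > best_n:
                        best_n, best_i, best_j = n, i, j
        if best_n == 0:
            break  # no remaining overlaps; return the separate contigs
        merged = reads[best_i] + reads[best_j][best_n:]
        reads = [r for k, r in enumerate(reads) if k not in (best_i, best_j)]
        reads.append(merged)
    return reads
```

For instance, the reads "ATGCGT", "CGTACC" and "ACCTTG" merge into the single contig "ATGCGTACCTTG". Real reads would also require handling of sequencing errors and of reads from the complementary strand.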
  • Another use is for sequencing a randomized region of a nucleic acid.
  • the primer used is specific for a position just outside the randomized region.
  • the randomized nucleic acid is placed onto the field of fixed polymerases. This method allows one to obtain the entire results of a randomization experiment in parallel, thereby saving time and money.
  • the source of specimen obtained from a subject may include peripheral blood, urine, saliva, tissue biopsy, fine needle aspirates, surgical specimen, amniocentesis samples and autopsy material.
  • the sample is attached to a substrate, such as a glass slide. Care must be taken to preserve the nucleic acids present in the sample.
  • the nucleic acids could be isolated from the sample, and then subjected to TDS.
  • the presence of viral and/or bacterial pathogens can be detected by the presence of the viral and/or bacterial nucleic acid sequences.
  • the methods disclosed herein allow for nucleic acid sequencing in situ, by adding a primer, the GFP-polymerase (or GFP-aequorin-polymerase) and the four nucleotides to a thin tissue slice.

Abstract

A method and device are disclosed for high speed, automated sequencing of nucleic acid molecules. A nucleic acid molecule to be sequenced is exposed to a polymerase in the presence of nucleotides which are to be incorporated into a complementary nucleic acid strand. The polymerase carries a donor fluorophore, and each type of nucleotide (e.g. A, T/U, C and G) carries a distinguishable acceptor fluorophore characteristic of the particular type of nucleotide. As the polymerase incorporates individual nucleotides into a complementary strand, a laser continuously irradiates the donor fluorophore, at a wavelength that causes it to emit an emission signal (but the laser wavelength does not stimulate the acceptor fluorophores). In particular embodiments, no laser is needed if the donor fluorophore is a luminescent molecule or is stimulated by one. The emission signal from the donor fluorophore on the polymerase is capable of stimulating any of the acceptor fluorophores, so that as a nucleotide is added by the polymerase, the acceptor fluorophore emits a signal associated with the type of nucleotide added to the complementary strand. The series of emission signals from the acceptor fluorophores is detected, and correlated with a sequence of nucleotides that corresponds to the sequence of emission signals.

Description

HIGH SPEED PARALLEL MOLECULAR NUCLEIC ACID SEQUENCING
FIELD
This disclosure relates to an automated method for sequencing nucleic acids, such as DNA and RNA, which may be used for research and the diagnosis of disease in clinical applications.
BACKGROUND
Approaches to DNA sequencing over the past twenty years have varied widely. The use of enzymes and chemicals is making it possible to sequence the human genome. However, this effort takes enormous resources.
Until recently, there were only two general sequencing methods available: the Maxam-Gilbert chemical degradation method (Maxam and Gilbert, 1977, Proc. Natl. Acad. Sci., USA 74:560) and the Sanger dideoxy chain termination method (Sanger et al., 1977, Proc. Natl. Acad. Sci., USA 74:5463). Using the dideoxy chain termination DNA sequencing method, DNA molecules of differing lengths are generated by enzymatic extension of a synthetic primer, using DNA polymerase and a mixture of deoxy- and dideoxynucleoside triphosphates. To perform this reaction, the DNA template is incubated with a mixture containing all four deoxynucleoside 5'-triphosphates (dNTPs), one or more of which is labeled with 32P, and a 2',3'-dideoxynucleoside triphosphate analog (ddNTP). Four separate incubation mixtures are prepared, each containing a different ddNTP analog (ddATP, ddCTP, ddGTP, or ddTTP). The dideoxynucleotide analog is incorporated normally into the growing complementary DNA strand by the DNA polymerase, through its 5' triphosphate group.
However, because of the absence of a 3'-OH group on the ddNTP, a phosphodiester bond cannot be formed with the next incoming dNTP. This results in termination of the growing complementary DNA chain. Therefore, at the end of the incubation period, each reaction mixture contains a population of DNA molecules having a common 5' terminus, but varying in length to a nucleotide base specific 3' terminus. These four preparations, with heterogeneous fragments each ending in either cytosine (C), guanine (G), adenine (A) or thymine (T), are separated in four parallel lanes on polyacrylamide gels. The sequence is determined after autoradiography, by determining the terminal nucleotide base at each incremental step in the molecular weight of the electrophoresed fragments.
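The logic of reading a dideoxy gel can be sketched in a few lines of Python. The sketch assumes idealized data: each lane is given as the list of fragment lengths (in nucleotides added to the primer) terminated by that lane's ddNTP, with no missing or ambiguous bands. The function name and data layout are hypothetical.

```python
def read_sanger_lanes(lanes):
    """Reconstruct the synthesized strand (5' to 3') from four ddNTP lanes.

    lanes -- dict mapping a base to the fragment lengths seen in its lane
    """
    base_at_length = {}
    for base, lengths in lanes.items():
        for length in lengths:
            if length in base_at_length:
                raise ValueError(f"band of length {length} appears in two lanes")
            base_at_length[length] = base
    # Shortest fragment first: each step up in length adds one terminal base
    return "".join(base_at_length[n] for n in sorted(base_at_length))
```

For example, lanes `{"A": [1, 4], "C": [2], "G": [3], "T": [5]}` read as "ACGAT", which is the sequence of the newly synthesized strand and therefore the complement of the template.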
The Maxam-Gilbert method of DNA sequencing involves the chemical-specific cleavage of DNA. In this method, radio-labeled DNA molecules are incubated in four separate reaction mixtures, each of which partially cleaves the DNA at one or two nucleotides of a specific identity (G, A+G, C or C+T). The resulting DNA fragments are separated by polyacrylamide gel electrophoresis, with each of the four reactions fractionated in a separate lane of the gel. The DNA sequence is determined after autoradiography, again by observing the macromolecular separation of the fragments in the four lanes of the gel.
The use of fluorescent nucleotides has eliminated the need for radioactive nucleotides, and provided a means to automate DNA sequencing. As fluorescent DNA fragments on an electrophoresis gel pass by a detector, the sequential fluorescent signals (which correspond to a fragment ending in a particular nucleotide) are automatically converted into the DNA sequence, eliminating the additional step of exposing the gel to film. Improvements on this general concept have been the subject of several U.S. patents, including U.S. Patent No. 5,124,247 to Ansorge, U.S. Patent No. 5,242,796 to Prober et al., U.S. Patent No. 5,306,618 to Prober et al., U.S. Patent No. 5,360,523 to Middendorf et al., U.S. Patent No. 5,556,790 to Pettit, and U.S. Patent No. 5,821,058 to Smith et al. However, the methods disclosed in these patents still require the inconvenient step of separating the generated DNA fragments by size, using electrophoresis.
There are several disadvantages associated with using electrophoresis for nucleic acid sequencing. Electrophoresis requires macroscopic separation, with the necessity of expensive reagents, long gel preparation time, tedious sample loading, and the dangers of exposure to the neurotoxin acrylamide. Macromolecular electrophoretic separation also exposes the technician to high voltage devices, requires prolonged electrophoresis time, produces gel artifacts, and requires calculations to adjust for dye mobilities. Furthermore, sequencing runs only allow for the sequencing of fewer than 1000 bases at a time, which can be a substantial drawback to the sequencing of long stretches of the genome.
Given the practical drawbacks of electrophoresis, attempts have been made to eliminate this step. Mills, for example, described the use of mass spectrometry to separate the DNA fragments as an alternative to electrophoresis (U.S. Patent Nos. 5,221,518 and 5,064,754). However, mass spectrometry devices are expensive, and because the method depends on size separation, it has a size resolution limit.
Others have attempted to separate nucleic acid sequences by size using capillary electrophoresis (Karger, Nucl. Acids Res. 19:4955-62, 1991). In this method, fused silica capillaries filled with polyacrylamide gel are used as an alternative to slab gel electrophoresis. However, this method is limited by the separation process and requires very high detection sensitivity and wavelength selectivity due to the small sample size.
Melamede (U.S. Patent No. 4,863,849) and Cheeseman (U.S. Patent No. 5,302,509) describe DNA sequencing methods which require a complex external liquid pumping system to add and remove necessary reagents. In these "open" systems, which contain the polymerase and the DNA to be sequenced, fluorescent nucleotides are pumped into a reaction chamber and added to the DNA molecule. After the incorporation of a single nucleotide, unincorporated fluorescent dNTPs are removed, leaving behind the DNA and its newly incorporated fluorescent nucleotide. This incorporated nucleotide is detected, its signal converted into a DNA sequence, and the process is repeated until the sequencing is complete. Although these methods can eliminate the electrophoresis step, the addition of nucleotides must be monitored one at a time as they are added to a population of DNA molecules, by continually pumping materials in and out of the reaction chamber.
In another automated process, Jett et al. (U.S. Patent Nos. 4,962,037 and 5,405,747) use an exonuclease to sequentially shorten a DNA molecule that is being sequenced. After a complementary DNA strand is synthesized in the presence of fluorescent nucleotides, the exonuclease cleaves individual fluorescent nucleotides from the end of the synthesized DNA molecule. These nucleotides pass through a detector, and the fluorescent signal emitted by each nucleotide is recorded to determine the DNA sequence.
In the methods of Melamede (U.S. Patent No. 4,863,849) and Cheeseman (U.S. Patent No. 5,302,509) described above, the addition or release of nucleotides from several DNA molecules is monitored simultaneously. This is sequencing at the macromolecular level, as opposed to sequencing at the molecular level, which involves monitoring the addition or release of nucleotides from a single DNA molecule. A disadvantage of macromolecular sequencing methods is that even though all of the DNA molecules start with identical nucleotides, they may quickly evolve into a mixed population. When using the macromolecular methods, some chains may more efficiently incorporate nucleotides than others, and some DNA may be degraded more slowly or rapidly than others. To solve this synchronization problem, Jett et al. (U.S. Patent No. 4,962,037) and Ulmer
(U.S. Patent No. 5,674,743) developed molecular level sequencing systems in which a single fluorescently labeled DNA base is sequentially cleaved from a DNA molecule. The fluorescent signal from each cleaved dNTP is used to determine the DNA sequence. One drawback to these methods, however, is that the DNA molecule which is being sequenced must be held in a stream, which often results in shearing of the DNA, especially at higher flow rates. The sheared DNA molecule cannot be accurately sequenced. In addition, only one DNA molecule can be sequenced at a time by this method.
The development of fluorescence resonance energy transfer (FRET) labels for DNA sequencing has been described by Ju (U.S. Patent No. 5,814,454) and Mathies et al. (U.S. Patent No. 5,707,804). During FRET, exciting the donor dye with light of a first wavelength releases light of a second wavelength, which in turn excites the acceptor dye(s) to emit light of a third wavelength, which is then detected. These patents disclose the attachment of FRET labels to oligonucleotide primers for sequencing DNA molecules. A drawback of these methods is that there is still a need for size separation (for example using electrophoresis) prior to determining the DNA sequence. Therefore, there remains a need for a method of sequencing nucleic acids at the molecular scale, that does not require the use of electrophoresis or complex liquid pumping systems, and does not result in the shearing of nucleic acids. In addition, methods that are automated would be particularly useful.
SUMMARY OF THE DISCLOSURE
The present disclosure provides an improved method and device for sequencing nucleic acids. The method allows several nucleic acids to be sequenced simultaneously at the molecular level. In particular examples, the method uses a donor and acceptor class of dyes. This method and device minimize shearing of the sample nucleic acids to be sequenced, and can be readily automated.
Herein disclosed is a method of sequencing a sample nucleic acid molecule by exposing the sample nucleic acid molecule to an oligonucleotide primer and a polymerase in the presence of a mixture of nucleotides. The polymerase carries a fluorophore, and each different type of nucleotide (e.g. A, T/U, C or G) carries a fluorophore which emits a signal that is distinguishable from a signal emitted by the fluorophore carried by each of the other types of nucleotides. In particular embodiments the fluorophore on the polymerase is a donor fluorophore and the fluorophores carried on the nucleotides are acceptor fluorophores. The donor fluorophore can be excited by a source of electromagnetic radiation (such as a laser) that specifically excites the donor fluorophore and not the acceptor fluorophores. This excitation induces the donor to emit light at a wavelength that can transfer energy to excite only the acceptor fluorophores that are added to the complementary strand by the polymerase. As the donor fluorophore excites the acceptor, a signal characteristic of the specific nucleotide being added (e.g. A, T/U, C or G) is emitted by the acceptor fluorophore. A series of sequential signals emitted by the added nucleotides is detected, and converted into the complement of the nucleic acid sample. In particular embodiments, the unique emission signal for each nucleotide is generated by luminescence resonance energy transfer (LRET) or fluorescence resonance energy transfer (FRET).
In other embodiments, the nucleic acid is a DNA or RNA molecule, and correspondingly, the polymerase is a DNA or RNA polymerase, if DNA is being sequenced, or reverse transcriptase if RNA is being sequenced. In a further embodiment, the polymerase is a Klenow fragment of DNA polymerase I. In particular embodiments, the polymerase is a GFP-polymerase. In another embodiment, the donor fluorophore is green fluorescent protein (GFP). In particular embodiments, the donor fluorophore, such as GFP, is excited by a laser. In other embodiments, GFP can be excited by a luminescent molecule, for example aequorin.
Alternatively, the donor fluorophore is a luminescent molecule, for example aequorin or europium chelates. In this embodiment, the donor fluorophore does not require excitation by a source of electromagnetic radiation, because the luminescent donor fluorophore is naturally in an excited state.
In yet another embodiment, the acceptor fluorophores are BODIPY, fluorescein, rhodamine green, and Oregon green or derivatives thereof. In particular, the donor fluorophore and one of the acceptor fluorophores comprise a donor/acceptor fluorophore pair selected from the group consisting of the GFP mutant H9-40, tetramethylrhodamine, Lissamine™, Texas Red and naphthofluorescein.
Also disclosed herein are embodiments in which the polymerase may be fixed to a substrate, for example by a linker molecule that includes a polymerase component and a substrate component. The linker may be selected from the group consisting of streptavidin-biotin, histidine-Ni, S-tag-S-protein, and glutathione-glutathione-S-transferase (GST). In another embodiment, a nucleic acid may be fixed to a substrate. In particular embodiments the oligonucleotide primer is fixed to a substrate, for example at its 5' end. In yet other embodiments, the sample nucleic acid to be sequenced is fixed to the substrate. In particular embodiments, the sample nucleic acid to be sequenced is fixed to the substrate by its 5' end, 3' end or anywhere in between. In another embodiment, a plurality of polymerases, oligonucleotide primers, or sample nucleic acids are fixed directly or indirectly to the substrate in a predetermined pattern. For example, the polymerases can be deposited into channels which have been etched in an orderly array or by micropipetting droplets containing the polymerases onto a slide, for example either by manually pipetting or with an automated arrayer. In other embodiments, a plurality of sequencing reactions are performed substantially simultaneously, and the signals from the plurality of sequencing reactions detected.
Many different sequencing reactions can be performed substantially simultaneously on a single substrate, in which case signals are detected from each of the sequencing reactions. The unique emission signals are detected with a detector, for example a charge-coupled device (CCD) camera, which can detect a sequence of signals from a predetermined position on the substrate and convert them into the nucleic acid sequence. The unique emission signals may be stored in a computer readable medium.
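The following minimal sketch illustrates one way that signals from many simultaneous reactions might be separated by their predetermined position on the substrate before decoding. It is an illustration only: the event format `(time, position, signal)`, the function name `group_signals_by_position`, and the signal labels are hypothetical and are not part of the disclosure.

```python
from collections import defaultdict


def group_signals_by_position(events):
    """Group detector events into one time-ordered signal series per position.

    `events` is an iterable of (time, position, signal) tuples. This tuple
    layout is a hypothetical stand-in for whatever a CCD readout provides.
    """
    series = defaultdict(list)
    # Sort by detection time so that each position's signals stay in the
    # order in which the nucleotides were added.
    for _time, position, signal in sorted(events, key=lambda event: event[0]):
        series[position].append(signal)
    return dict(series)


# Hypothetical events from two positions on the substrate, arriving interleaved.
events = [
    (0.2, (0, 1), "sig_C"),
    (0.1, (0, 0), "sig_A"),
    (0.3, (0, 0), "sig_G"),
    (0.4, (0, 1), "sig_T"),
]
per_position = group_signals_by_position(events)
```

Each entry of `per_position` can then be decoded independently, which reflects the parallel nature of the method.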
Also disclosed is a substrate to which is attached a GFP-polymerase. In another embodiment, GFP-polymerase contains an affinity tag that attaches the GFP-polymerase to the substrate. In yet another embodiment, the GFP-polymerase is attached to the substrate by a linker.
Other embodiments disclosed herein include a method of sequencing a sample nucleic acid by attaching a polymerase to a substrate, adding the sample nucleic acid with an annealed oligonucleotide to the polymerase, and allowing the sample nucleic acid to bind to the polymerase in the presence of nucleotides for incorporation into a complementary nucleic acid. The polymerase and nucleotides are labeled with donor and acceptor fluorophores that emit a distinguishable signal when a particular type of nucleotide (e.g. A, T/U, C or G) is incorporated into the complementary nucleic acid. A sequence of the distinguishable signals is detected as the nucleotides are sequentially added to the complementary nucleic acid, and the sequence of signals is converted into a corresponding nucleic acid sequence.
Also disclosed herein is a method of sequencing a sample nucleic acid by attaching a sample nucleic acid to a substrate, adding an oligonucleotide primer and allowing the oligonucleotide primer to anneal to the attached sample nucleic acid, adding a polymerase in the presence of nucleotides, and allowing the sample nucleic acid to bind to the polymerase in the presence of nucleotides for incorporation into a complementary nucleic acid. The polymerase and nucleotides are labeled with donor and acceptor fluorophores that emit a distinguishable signal when a particular type of nucleotide (e.g. A, T/U, C or G) is incorporated into the complementary nucleic acid. A sequence of the distinguishable signals is detected as the nucleotides are sequentially added to the complementary nucleic acid, and the sequence of signals is converted into a corresponding nucleic acid sequence.
The sample nucleic acid can be attached to a substrate, for example at its 5' or 3' end, or anywhere in between.
Another embodiment disclosed herein is a method of sequencing a sample nucleic acid by attaching an oligonucleotide primer to a substrate, adding a sample nucleic acid and allowing the oligonucleotide primer to anneal to the sample nucleic acid, adding a polymerase in the presence of nucleotides, and allowing the sample nucleic acid to bind to the polymerase in the presence of nucleotides for incorporation into a complementary nucleic acid. The polymerase and nucleotides are labeled with donor and acceptor fluorophores that emit a distinguishable signal when a particular type of nucleotide (e.g. A, T/U, C or G) is incorporated into the complementary nucleic acid. A sequence of the distinguishable signals is detected as the nucleotides are sequentially added to the complementary nucleic acid, and the sequence of signals is converted into a corresponding nucleic acid sequence.
The present disclosure also includes a device for sequencing a nucleic acid molecule, in which a polymerase (carrying a donor fluorophore), oligonucleotide primer, or sample nucleic acid is attached to a substrate. The device also includes a viewing means to view the polymerase, and a detection means that detects a characteristic signal from an acceptor fluorophore carried by a corresponding nucleotide, as the nucleotide is added to the nucleic acid molecule by the polymerase. An electromagnetic radiation source (such as light of a specified wavelength range) excites the donor fluorophore but not the acceptor fluorophore, so that a signal emitted by the donor fluorophore specifically excites the acceptor fluorophore as each nucleotide is added to the synthesized complementary strand by the polymerase. The electromagnetic radiation source is optional if LRET is used. A decoding means then converts a series of characteristic signals emitted by the acceptor fluorophores into a nucleic acid sequence that corresponds to the nucleic acid sequence of the complement.
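As an illustration of what a decoding means might do, the following minimal sketch converts a series of detected acceptor signals into the sequence of the synthesized complement and, from that, the sequence of the sample strand. It assumes DNA only. The channel names in `CHANNEL_TO_BASE` and the function name `decode_signals` are hypothetical; the disclosure does not prescribe a particular mapping or implementation.

```python
# Watson-Crick pairing for DNA (RNA would use U in place of T).
COMPLEMENT = {"A": "T", "T": "A", "C": "G", "G": "C"}

# Hypothetical mapping from a detected emission channel to the nucleotide
# that carries the corresponding acceptor fluorophore.
CHANNEL_TO_BASE = {"ch1": "A", "ch2": "C", "ch3": "G", "ch4": "T"}


def decode_signals(channels):
    """Return (complement, sample) sequences, both written 5' to 3'.

    `channels` lists the detected signals in the order of incorporation,
    so the complement is read directly and the sample strand is its
    reverse complement.
    """
    complement = "".join(CHANNEL_TO_BASE[channel] for channel in channels)
    sample = "".join(COMPLEMENT[base] for base in reversed(complement))
    return complement, sample


complement, sample = decode_signals(["ch1", "ch2", "ch3", "ch3"])
```

In this example the synthesized complement reads `ACGG` and the inferred sample strand reads `CCGT`.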
In particular embodiments, the substrate may be a glass microscope slide or a three-dimensional matrix. In addition, the electromagnetic radiation is from a laser that emits light of the particular wavelength, and the viewing means includes a microscope objective. The detection means of the device may include a CCD camera, and the decoding means (which converts the series of unique signals into a nucleic acid sequence) is a digital computer.
In yet another embodiment, the device for sequencing a nucleic acid is a glass microscope slide to which an oligonucleotide primer, sample nucleic acid, or polymerase is attached, and the polymerase includes a GFP donor fluorophore. A laser is positioned to stimulate the donor fluorophore at a specific wavelength, and the donor fluorophore emits a first signal that induces the acceptor fluorophore to emit a signal when the acceptor fluorophore is brought sufficiently close to the donor fluorophore during chain elongation. The signal emitted by the acceptor fluorophore is unique to each type of nucleotide (e.g. A, T/U, C or G), so that the emitted signal indicates the nucleotide that is added to the complement. A microscope objective is positioned to view the sequence of signals emitted by the individual acceptor fluorophore molecules as the nucleotides are added to the polymerase. A spectrophotometer then converts the sequence of signals into a series of spectrographic signals that correspond to the series of signals emitted by the acceptor fluorophore. A CCD camera detects the sequence of signals and a digital computer converts the sequence of signals into a nucleic acid sequence.
The foregoing and other objects, features, and advantages of the disclosed method will become more apparent from the following detailed description of several embodiments which proceeds with reference to the accompanying figures.
BRIEF DESCRIPTION OF THE FIGURES
FIG. 1A is a schematic drawing showing the attachment of a polymerase to a substrate, and the polymerase associated with a template and primer strand.
FIG. 1B is a schematic drawing showing the attachment of an oligonucleotide primer to a substrate, and the polymerase associated with a template and primer strand.
FIG. 1C is a schematic drawing showing the attachment of a nucleic acid to be sequenced by its 3' end, a substrate, and the polymerase associated with a template and primer strand.
FIG. 1D is a schematic drawing showing the attachment of a nucleic acid to be sequenced by its 5' end, a substrate, and the polymerase associated with a template and primer strand.
FIG. 2 is a schematic drawing illustrating fluorescence resonance energy transfer (FRET) between a donor fluorophore on a polymerase and an acceptor fluorophore on a nucleotide. Note that a laser 26 which emits electromagnetic radiation 28 is not required for luminescence resonance energy transfer (LRET).
FIG. 3 is a schematic drawing illustrating a microscope and computer assembly that can be used to sequence nucleic acids using TDS. Note that a laser 26 which emits electromagnetic radiation 28 is not required for LRET.
DETAILED DESCRIPTION OF SEVERAL EMBODIMENTS
Abbreviations and Definitions
The following definitions and methods are provided to better define the materials and methods disclosed herein, and to guide those of ordinary skill in the art in the practice of the materials and methods disclosed herein. As used herein (including the appended claims), the singular forms "a" or "an" or "the" include plural referents unless the context clearly dictates otherwise. Thus, for example, reference to "a protein" includes a plurality of such proteins and reference to "the affinity tag" includes reference to one or more affinity tags and equivalents thereof known to those skilled in the art, and so forth.
RT: Room temperature
Acceptor fluorophore: Acceptor fluorophores will generally be compounds which absorb energy from the donor fluorophore in the range of about 400 to 900 nm, usually in the range of about 500 to 800 nm. Acceptor fluorophores in the disclosed embodiments have excitation spectra which overlap with the emission of the donor fluorophore, such that energy emitted by the donor can excite the acceptor. The acceptor fluorophores are capable of being attached to nucleotides.
Acceptor fluorophores will generally absorb light at a wavelength which is usually at least 10 nm higher, more usually at least 20 nm higher, than the maximum absorbance wavelength of the donor fluorophore, and will have a fluorescence emission maximum at a wavelength ranging from about 400 to 900 nm. Acceptor fluorophores may be rhodamines, fluorescein derivatives, Green Fluorescent Protein (GFP), BODIPY (4,4-difluoro-4-bora-3a,4a-diaza-s-indacene) and cyanine dyes. Specific acceptor fluorescer moieties include 5-carboxyfluorescein (FAM), 2'7'-dimethoxy-4'5'-dichloro-6-carboxyfluorescein (JOE), N,N,N',N'-tetramethyl-6-carboxyrhodamine (TAMRA), 6-carboxy-X-rhodamine (ROX), BODIPY and cyanine dyes. Additional fluorophores which may be used in the herein disclosed method are listed below.
Affinity Tag: A molecule, such as a protein, attached to the N- or C-terminus of a recombinant protein using genetic engineering methods, to aid in the purification of the recombinant protein. Examples of affinity tags include, but are not limited to: histidine, S-tag, glutathione-S-transferase (GST) and streptavidin. Affinity tags may also be used to attach a protein or nucleic acid to a substrate.
cDNA (complementary DNA): A piece of DNA lacking internal, non-coding segments (introns) and regulatory sequences which determine transcription. cDNA can be synthesized in the laboratory by reverse transcription from messenger RNA extracted from cells.
Characteristic Signal: The resulting signal emitted from a fluorescently-labeled nucleotide, which can be predicted by the fluorophore(s) attached to the nucleotide.
Complementary: As referred to herein, nucleic acids that are "complementary" can be perfectly or imperfectly complementary, as long as the desired property resulting from the complementarity is not lost, e.g., ability to hybridize.
Donor Fluorophore: Donor fluorophores will generally be compounds which absorb in the range of about 300 to 900 nm, usually in the range of about 350 to 800 nm, and are capable of transferring energy to the acceptor fluorophore. The donor fluorophore will have a strong molar absorbance coefficient at the desired excitation wavelength, for example greater than about 10³ M⁻¹ cm⁻¹. A variety of compounds may be employed as donor fluorescer components, including fluorescein, GFP, phycoerythrin, BODIPY, DAPI (4',6-diamidino-2-phenylindole), Indo-1, coumarin, dansyl, and cyanine dyes. Specific donor labels of interest include fluorescein, rhodamine, and cyanine dyes. Other fluorophores that can be used in the method disclosed herein are provided below.
In other embodiments, the donor fluorophore is a luminescent molecule, such as aequorin, as discussed below.
Electromagnetic Radiation: A series of electromagnetic waves that are propagated by simultaneous periodic variations of electric and magnetic field intensity, and that includes radio waves, infrared, visible light, ultraviolet light, X-rays and gamma rays. In particular embodiments, electromagnetic radiation can be emitted by a laser, which can possess properties of monochromaticity, directionality, coherence, polarization, and intensity. Lasers are particularly useful sources of electromagnetic energy for the method disclosed herein, because lasers are capable of emitting light at a particular wavelength (or across a relatively narrow range of wavelengths), such that energy from the laser can excite a donor but not an acceptor fluorophore.
Emission Signal: The wavelength of light generated from a fluorophore after the fluorophore absorbs an excitation wavelength of light.
Emission Spectrum: The broad energy spectrum which results after a fluorophore is excited by a specific wavelength of light. Each fluorophore has its own unique emission spectrum. Therefore, when individual fluorophores are attached to nucleotides, the emission spectra from the fluorophores provide a means for distinguishing between the different nucleotides.
Excitation Signal: The wavelength of light necessary to raise a fluorophore to a state such that the fluorophore will emit a longer wavelength of light.
Fluorophore: A chemical compound, which when excited by exposure to a particular wavelength of light, emits light (i.e., fluoresces), for example at a different wavelength.
Also encompassed by the term "fluorophore" are luminescent molecules, which are chemical compounds which do not require exposure to a particular wavelength of light to fluoresce; luminescent compounds naturally fluoresce. Therefore, the use of luminescent signals eliminates the need for an external source of electromagnetic radiation, such as a laser. An example of a luminescent molecule includes, but is not limited to, aequorin (Tsien, 1998, Ann. Rev. Biochem. 67:509). Further description is provided below.
Examples of fluorophores that may be used in the method disclosed herein are provided in U.S. Patent No. 5,866,336 to Nazarenko et al.: 4-acetamido-4'-isothiocyanatostilbene-2,2'-disulfonic acid; acridine and derivatives such as acridine and acridine isothiocyanate; 5-(2'-aminoethyl)aminonaphthalene-1-sulfonic acid (EDANS); 4-amino-N-[3-vinylsulfonyl)phenyl]naphthalimide-3,5 disulfonate (Lucifer Yellow VS); N-(4-anilino-1-naphthyl)maleimide; anthranilamide; Brilliant Yellow; coumarin and derivatives such as coumarin, 7-amino-4-methylcoumarin (AMC, Coumarin 120), 7-amino-4-trifluoromethylcoumarin (Coumarin 151); cyanosine; 4',6-diamidino-2-phenylindole (DAPI); 5',5"-dibromopyrogallol-sulfonephthalein (Bromopyrogallol Red); 7-diethylamino-3-(4'-isothiocyanatophenyl)-4-methylcoumarin; diethylenetriamine pentaacetate; 4,4'-diisothiocyanatodihydro-stilbene-2,2'-disulfonic acid; 4,4'-diisothiocyanatostilbene-2,2'-disulfonic acid; 5-[dimethylamino]naphthalene-1-sulfonyl chloride (DNS, dansyl chloride); 4-(4'-dimethylaminophenylazo)benzoic acid (DABCYL); 4-dimethylaminophenylazophenyl-4'-isothiocyanate (DABITC); eosin and derivatives such as eosin and eosin isothiocyanate; erythrosin and derivatives such as erythrosin B and erythrosin isothiocyanate; ethidium; fluorescein and derivatives such as 5-carboxyfluorescein (FAM), 5-(4,6-dichlorotriazin-2-yl)aminofluorescein (DTAF), 2'7'-dimethoxy-4'5'-dichloro-6-carboxyfluorescein (JOE), fluorescein, fluorescein isothiocyanate (FITC), and QFITC (XRITC); fluorescamine; IR144; IR1446; Malachite Green isothiocyanate; 4-methylumbelliferone; ortho cresolphthalein; nitrotyrosine; pararosaniline; Phenol Red; B-phycoerythrin; o-phthaldialdehyde; pyrene and derivatives such as pyrene, pyrene butyrate and succinimidyl 1-pyrene butyrate; Reactive Red 4 (Cibacron® Brilliant Red 3B-A); rhodamine and derivatives such as 6-carboxy-X-rhodamine (ROX), 6-carboxyrhodamine (R6G), lissamine rhodamine B sulfonyl chloride, rhodamine (Rhod), rhodamine B, rhodamine 123, rhodamine X isothiocyanate, sulforhodamine B, sulforhodamine 101 and sulfonyl chloride derivative of sulforhodamine 101 (Texas Red); N,N,N',N'-tetramethyl-6-carboxyrhodamine (TAMRA); tetramethyl rhodamine; tetramethyl rhodamine isothiocyanate (TRITC); riboflavin; rosolic acid and terbium chelate derivatives.
Other suitable fluorophores include thiol-reactive europium chelates which emit at approximately 617 nm (Heyduk and Heyduk, Analyt. Biochem. 248:216-27, 1997; J. Biol. Chem. 274:3315-22, 1999).
Other suitable fluorophores include GFP, Lissamine™, diethylaminocoumarin, fluorescein chlorotriazinyl, naphthofluorescein, 4,7-dichlororhodamine and xanthene (as described in U.S. Patent No. 5,800,996 to Lee et al., herein incorporated by reference) and derivatives thereof. Other fluorophores known to those skilled in the art may also be used, for example those available from Molecular Probes (Eugene, OR).
The fluorophores disclosed herein may be used as a donor fluorophore or as an acceptor fluorophore. Particularly useful fluorophores have the ability to be attached to a polymerase or a nucleotide, are stable against photobleaching, and have high quantum efficiency. In addition, the fluorophores on different sets of nucleotides (e.g. A, T/U, G, C) are advantageously selected to have distinguishable emission spectra, such that emission from one fluorophore (such as A) is distinguishable from the fluorophore carried by another nucleotide (such as T).
Fluorescence resonance energy transfer (FRET): A process in which an excited fluorophore (the donor) transfers its excited state energy to a light absorbing molecule (the acceptor). This energy transfer is non-radiative, and due primarily to a dipole-dipole interaction between the donor and acceptor fluorophores. This energy can be passed over a distance, for example a limited distance such as 10-100 Å. Limitation on the distance over which the energy can travel helps limit transfer to a desired target (such as between a donor fluorophore on a polymerase and a target acceptor fluorophore on a nucleotide, without collateral stimulation of other acceptor fluorophores).
FRET pairs: Sets of fluorophores that can engage in fluorescence resonance energy transfer (FRET). Examples of FRET pairs that can be used are listed below. However, one skilled in the art will recognize that numerous other combinations of fluorophores can be used.
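The distance limitation noted in the FRET definition can be illustrated with the standard Förster relation, E = 1 / (1 + (r/R0)^6), where r is the donor-acceptor distance and R0 is the distance at which transfer is 50% efficient. This relation is general background and is not recited in the disclosure; the R0 value of 50 Å used below is a hypothetical figure chosen only for illustration.

```python
def fret_efficiency(distance_angstrom, forster_radius_angstrom):
    """Förster transfer efficiency E = 1 / (1 + (r / R0)**6)."""
    if distance_angstrom < 0 or forster_radius_angstrom <= 0:
        raise ValueError("distance must be >= 0 and Forster radius must be > 0")
    ratio = distance_angstrom / forster_radius_angstrom
    return 1.0 / (1.0 + ratio ** 6)


# Hypothetical Förster radius; real values depend on the specific dye pair.
R0 = 50.0

near = fret_efficiency(10.0, R0)      # lower end of the 10-100 Å range
at_r0 = fret_efficiency(50.0, R0)     # exactly half-efficient by definition
far = fret_efficiency(100.0, R0)      # upper end of the range
```

Because efficiency falls with the sixth power of distance, an acceptor at twice R0 receives under 2% of the donor's energy, which is consistent with the statement that transfer is largely confined to the nucleotide held at the polymerase.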
FAM is most efficiently excited by light with a wavelength of 488 nm, emits light with a spectrum of 500 to 650 nm, and has an emission maximum of 525 nm. FAM is a suitable donor fluorophore for use with JOE, TAMRA, and ROX (all of which have their excitation maximum at 514 nm, and will not be significantly stimulated by the light that stimulates FAM).
The GFP mutant H9-40 (Tsien, 1998, Ann. Rev. Biochem. 67:509), which is excited at 399 nm and emits at 511 nm, may serve as a suitable donor fluorophore for use with BODIPY, fluorescein, rhodamine green and Oregon green. In addition, the fluorophores tetramethylrhodamine, Lissamine™, Texas Red and naphthofluorescein can be used as acceptor fluorophores with this GFP mutant.
The fluorophore 3-(ε-carboxy-pentyl)-3'-ethyl-5,5'-dimethyloxacarbocyanine (CYA) is maximally excited at 488 nm and may therefore serve as a donor fluorophore for fluorescein or rhodamine derivatives (such as R6G, TAMRA, and ROX) which can be used as acceptor fluorophores (see Hung et al., Analytical Biochemistry, 243:15-27, 1996). However, CYA and FAM are not examples of a good FRET pair, because both are excited maximally at the same wavelength (488 nm).
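The pairing criteria in the examples above can be expressed as a simple screen. The sketch below is a simplification under stated assumptions: the `Dye` class and `is_candidate_pair` function are hypothetical names, the wavelengths are those given in this text (including the 514 nm excitation maximum stated for JOE), the 10 nm margin comes from the acceptor fluorophore definition, and a donor whose emission range is not given in the text is passed on the excitation criterion alone.

```python
from dataclasses import dataclass
from typing import Optional, Tuple


@dataclass(frozen=True)
class Dye:
    name: str
    excitation_max_nm: float
    # (low, high) emission range in nm, or None where the text gives no value.
    emission_range_nm: Optional[Tuple[float, float]] = None


def is_candidate_pair(donor, acceptor, min_separation_nm=10.0):
    """Screen a donor/acceptor pair using two criteria drawn from the text."""
    # Criterion 1: the acceptor should absorb at least `min_separation_nm`
    # above the donor, so light that excites the donor spares the acceptor.
    if acceptor.excitation_max_nm - donor.excitation_max_nm < min_separation_nm:
        return False
    # Criterion 2: where the donor emission range is known, the acceptor
    # excitation maximum should fall inside it.
    if donor.emission_range_nm is not None:
        low, high = donor.emission_range_nm
        return low <= acceptor.excitation_max_nm <= high
    return True


fam = Dye("FAM", 488.0, (500.0, 650.0))
joe = Dye("JOE", 514.0)
cya = Dye("CYA", 488.0)

fam_joe_ok = is_candidate_pair(fam, joe)   # FAM as donor, JOE as acceptor
cya_fam_ok = is_candidate_pair(cya, fam)   # both excited maximally at 488 nm
```

A screen of this kind only narrows the candidates; actual suitability would still be confirmed by spectrophotometry.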
One of ordinary skill in the art can easily determine, using art-known techniques of spectrophotometry, which fluorophores will make suitable donor-acceptor FRET pairs.
Fusion Protein: A protein comprising two amino acid sequences that are not found joined together in nature. The term "GFP-polymerase fusion protein" refers to a protein that includes a first amino acid sequence and a second amino acid sequence, wherein the first amino acid sequence is a GFP molecule (mutant or wild-type) and the second amino acid sequence is a polymerase. The link between the first and second domains of the fusion protein is typically, but not necessarily, a peptide linkage. Similarly, the term "GFP-aequorin fusion protein" refers to a protein that includes a first amino acid sequence and a second amino acid sequence, wherein the first amino acid sequence is a GFP molecule (mutant or wild-type) and the second amino acid sequence is an aequorin. GFP-aequorin fusion proteins can be generated using the method of Baubet et al. (Proc. Natl. Acad. Sci. USA 97:7260-5, 2000, herein incorporated by reference). These fusion proteins may also be represented by the formula X-Y wherein X is a fluorophore, such as GFP, and Y is a polymerase protein. In a further embodiment of the fusion proteins disclosed, an affinity tag sequence may be linked to the N- or C-terminus of the first protein. Such a three part protein can thus be represented as T-X-Y wherein T is the affinity tag, X is a protein, such as a fluorescent protein and Y is a polymerase protein.
Green fluorescent protein (GFP): The source of fluorescent light emission in Aequorea victoria. As used herein, GFP refers to both the wild-type protein, and spectrally shifted mutants thereof, for example as described in Tsien, 1998, Ann. Rev. Biochem. 67:509 and in U.S. Patent Nos. 5,777,079 and 5,625,048 to Tsien and Heim, herein incorporated by reference. In particular embodiments, GFP is excited using a laser. In other embodiments, GFP is excited using aequorin, for example using a GFP-aequorin fusion protein.
GFP-polymerase: Recombinant fusion protein containing both a functional GFP molecule and a functional polymerase. The GFP can be located at the N- or C-terminus of the polymerase. Alternatively, the GFP molecule can be located anywhere within the polymerase. Regardless of GFP position, it is important that the polymerase remain functional (i.e. able to catalyze the elongation of the complementary nucleic acid strand). The GFP-polymerase may also contain an affinity tag to aid in its purification and/or attachment to a substrate (Tag-GFP-polymerase). Furthermore, the GFP-polymerase may also contain a functional aequorin sequence, for example if the use of LRET is desired.
Linker: Means by which to attach a polymerase or a nucleic acid to a substrate. The linker ideally does not significantly interfere with binding to or incorporation by the polymerase. The linker can be a covalent or non-covalent means of attachment. In one embodiment, the linker is a pair of molecules, having high affinity for one another, one molecule on the polymerase (such as an affinity tag), the other on the substrate. Such high-affinity molecules include streptavidin and biotin, histidine and nickel (Ni), and GST and glutathione. When the polymerase and substrate are brought into contact, they bind to one another due to the interaction of the high-affinity molecules.
In another embodiment, the linker is a straight-chain or branched amino- or mercapto-hydrocarbon with more than two carbon atoms in the unbranched chain. Examples include aminoalkyl, aminoalkenyl and aminoalkynyl groups. Alternatively, the linker is an alkyl chain of 10-20 carbons in length, and may be attached through a Si-C direct bond or through an ester, Si-O-C, linkage (see U.S. Patent No. 5,661,028 to Foote, herein incorporated by reference). Other linkers are provided in U.S. Patent No. 5,306,518 to Prober et al., column 19; U.S. Patent No. 4,711,955 to Ward et al., columns 8-9; and U.S. Patent No. 5,707,804 to Mathies et al., columns 6-7 (all herein incorporated by reference).
Several methods for attaching nucleic acids to a substrate are available. For example, methods for attaching the oligonucleotide primer to the substrate via a linker are disclosed in U.S. Patent No. 5,302,509 to Cheeseman, herein incorporated by reference. Other methods for attaching a nucleic acid (for example the oligonucleotide primer or the nucleic acid to be sequenced) to the substrate include, but are not limited to: synthesizing a 5' biotinylated nucleic acid and affixing it to a streptavidin coated substrate (Beaucage, Tetrahedron Letters 22:1859-62, 1981; Caruthers, Meth. Enzym. 154:287-313, 1987; Hultman, Nucl. Acids Res. 17:4937-46, 1989); drying the nucleic acid on amino-propyl-silanized (APS) glass (Ha et al., Proc. Natl. Acad. Sci. USA 93:6264-68, 1996); and cross-linking the nucleic acid to an unmodified substrate by conjugating an active silyl moiety onto a nucleic acid (Kumar et al., Nucleic Acids Res. 28:e71, 2000).
Luminescence Resonance Energy Transfer (LRET): A process similar to FRET, except that the donor molecule is itself a luminescent molecule, or is excited by a luminescent molecule, instead of a laser. The luminescent molecule is naturally in an excited state; it does not require excitation by an external source of electromagnetic radiation, such as a laser. This will decrease the background fluorescence. In particular embodiments, the luminescent molecule can be attached to a polymerase, for example GFP-polymerase, as a means to produce local excitation of the GFP donor fluorophore, without the need for an external source of electromagnetic radiation. In other embodiments, the luminescent molecule is the donor fluorophore. In this embodiment, the fluorescence emitted from the luminescent molecule excites the acceptor fluorophores. An example of a luminescent molecule that can be used includes, but is not limited to, aequorin. The bioluminescence from aequorin, which peaks at 470 nm, can be used to excite a donor GFP fluorophore (Tsien, 1998, Ann. Rev. Biochem. 67:509; Baubet et al., 2000, Proc. Natl. Acad. Sci. U.S.A., 97:7260-5). GFP transfers its resonance to the acceptor fluorophores disclosed herein. In this example, both aequorin and GFP can be attached to the polymerase.
Nucleic Acid: As used herein, nucleic acid refers to both DNA and RNA molecules. A sample nucleic acid molecule is a nucleic acid to be sequenced, and can be obtained in purified form by any method known to those skilled in the art, for example as described in U.S. Patent No. 5,674,743 to Ulmer, herein incorporated by reference.
Nucleotides: The major nucleotides of DNA are deoxyadenosine 5'-triphosphate (dATP or A), deoxyguanosine 5'-triphosphate (dGTP or G), deoxycytidine 5'-triphosphate (dCTP or C) and deoxythymidine 5'-triphosphate (dTTP or T). The major nucleotides of RNA are adenosine 5'-triphosphate (ATP or A), guanosine 5'-triphosphate (GTP or G), cytidine 5'-triphosphate (CTP or C) and uridine 5'-triphosphate (UTP or U). The nucleotides disclosed herein also include nucleotides containing modified bases, modified sugar moieties and modified phosphate backbones, for example as described in U.S. Patent No. 5,866,336 to Nazarenko et al. (herein incorporated by reference).
Examples of modified base moieties which can be used to modify nucleotides at any position on their structure include, but are not limited to: 5-fluorouracil, 5-bromouracil, 5-chlorouracil, 5-iodouracil, hypoxanthine, xanthine, acetylcytosine, 5-(carboxyhydroxylmethyl) uracil, 5-carboxymethylaminomethyl-2-thiouridine, 5-carboxymethylaminomethyluracil, dihydrouracil, beta-D-galactosylqueosine, inosine, N6-isopentenyladenine, 1-methylguanine, 1-methylinosine, 2,2-dimethylguanine, 2-methyladenine, 2-methylguanine, 3-methylcytosine, 5-methylcytosine, N6-adenine, 7-methylguanine, 5-methylaminomethyluracil, methoxyaminomethyl-2-thiouracil, beta-D-mannosylqueosine, 5'-methoxycarboxymethyluracil, 5-methoxyuracil, 2-methylthio-N6-isopentenyladenine, uracil-5-oxyacetic acid, pseudouracil, queosine, 2-thiocytosine, 5-methyl-2-thiouracil, 2-thiouracil, 4-thiouracil, 5-methyluracil, uracil-5-oxyacetic acid methylester, uracil-5-oxyacetic acid, 5-methyl-2-thiouracil, 3-(3-amino-3-N-2-carboxypropyl) uracil, and 2,6-diaminopurine. Examples of modified sugar moieties which may be used to modify nucleotides at any position on their structure include, but are not limited to: arabinose, 2-fluoroarabinose, xylose, and hexose, or a modified component of the phosphate backbone, such as phosphorothioate, a phosphorodithioate, a phosphoramidothioate, a phosphoramidate, a phosphordiamidate, a methylphosphonate, an alkyl phosphotriester, or a formacetal or analog thereof. Such modifications, however, allow for incorporation of the nucleotide into a growing nucleic acid chain. That is, they do not result in the termination of nucleic acid synthesis.
The choice of nucleotide precursors is dependent on the nucleic acid to be sequenced. If the template is a single-stranded DNA molecule, deoxyribonucleotide precursors (dNTPs) are used in the presence of a DNA-directed DNA polymerase. Alternatively, ribonucleotide precursors (NTPs) are used in the presence of a DNA-directed RNA polymerase. However, if the nucleic acid to be sequenced is RNA, then dNTPs and an RNA-directed DNA polymerase are used.
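The selection rule in the preceding paragraph can be summarized as a small lookup. The following Python sketch is illustrative only; the function name `choose_reagents` and its string arguments are hypothetical and are not part of the disclosed method.

```python
from typing import Tuple


def choose_reagents(template: str, product: str = "DNA") -> Tuple[str, str]:
    """Return (nucleotide precursors, polymerase class) for a template.

    `template` is the nucleic acid to be sequenced ("DNA" or "RNA");
    `product` is the kind of complementary strand to be synthesized.
    Both parameter names are assumptions made for this illustration.
    """
    template = template.upper()
    product = product.upper()
    if template == "DNA" and product == "DNA":
        return ("dNTPs", "DNA-directed DNA polymerase")
    if template == "DNA" and product == "RNA":
        return ("NTPs", "DNA-directed RNA polymerase")
    if template == "RNA" and product == "DNA":
        return ("dNTPs", "RNA-directed DNA polymerase")
    raise ValueError("combination not covered by the text: %s -> %s" % (template, product))


if __name__ == "__main__":
    for tmpl, prod in (("DNA", "DNA"), ("DNA", "RNA"), ("RNA", "DNA")):
        print(tmpl, "->", prod, ":", choose_reagents(tmpl, prod))
```

Combinations that the paragraph does not describe (for example an RNA template copied into RNA) raise an error rather than guess.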
A "type" of nucleotide refers to a set of nucleotides that share a common characteristic that is to be detected. For example, the types of nucleotides may be divided into four types: A, T, C and G (for DNA) or A, U, C and G (for RNA). In this example, each type of nucleotide of the method disclosed herein will be labeled with a unique acceptor fluorophore, so as to be distinguishable from the other types by fluorescence spectroscopy or by other optical means. Such fluorophores are known in the art and include those listed above. The fluorescent label generally is not part of the 3'-OH group, so as to allow the polymerase to continue to add subsequent nucleotides.
Oligonucleotide: A linear polynucleotide sequence of up to about 200 nucleotide bases in length, for example a polynucleotide (such as DNA or RNA) which is at least 6 nucleotides, for example at least 15, 50, 100 or even 200 nucleotides long.
ORF (open reading frame): A series of nucleotide triplets (codons) coding for amino acids without any termination codons. These sequences are usually translatable into a peptide.
Polymerase: The enzyme which catalyzes the elongation of the primer strand, in the 5' to 3' direction along the nucleic acid template to be sequenced. Examples of polymerases which may be used in the method disclosed herein include, but are not limited to: the E. coli DNA polymerase I, specifically the Klenow fragment which has 3' to 5' exonuclease activity, Taq polymerase, reverse transcriptase, E. coli RNA polymerase, and wheat germ RNA polymerase II.
The choice of polymerase is dependent on the nucleic acid to be sequenced. If the template is a single-stranded DNA molecule, a DNA-directed DNA or RNA polymerase may be used; if the template is a single-stranded RNA molecule, then a reverse transcriptase (i.e., an RNA-directed DNA polymerase) may be used.
Polynucleotide: A linear nucleic acid sequence of any length. Therefore, a polynucleotide includes molecules which are 15, 50, 100 or 200 nucleotides long (oligonucleotides) and also nucleic acids as long as a full-length cDNA.
Primer: Short nucleic acids, for example DNA oligonucleotides 10 nucleotides or more in length, which are annealed to a complementary target nucleic acid strand by nucleic acid hybridization to form a hybrid between the primer and the target nucleic acid strand, then extended along the target nucleic acid strand by a polymerase enzyme. Therefore, individual primers can be used for nucleic acid sequencing. In addition, primer pairs can be used for amplification of a nucleic acid sequence, e.g., by the polymerase chain reaction (PCR) or other nucleic-acid amplification methods known in the art. Primers comprise at least 10 nucleotides of the nucleic acid sequences to be sequenced. In order to enhance specificity, longer primers may also be employed, such as primers having 15, 20, 30, 40, 50, 60, 70, 80, 90 or 100 consecutive nucleotides of the nucleic acid sequences to be sequenced. Methods for preparing and using primers are described in, for example, Sambrook et al. (1989) Molecular Cloning: A Laboratory Manual, Cold Spring Harbor, New York; Ausubel et al (1987) Current Protocols in Molecular Biology, Greene Publ. Assoc. & Wiley-Intersciences. If the nucleic acid to be sequenced is DNA, the primer used may be DNA, RNA, or a mixture of both. If the nucleic acid to be sequenced is RNA, the primer used may be RNA or DNA.
Purified: The term purified does not imply absolute purity; rather, it is intended as a relative term. Thus, for example, a purified GFP-polymerase protein preparation is one in which the GFP-polymerase protein is more pure than the protein in its environment within a cell. Preferably, a preparation of a GFP-polymerase protein is purified such that the GFP-polymerase protein represents at least 50% of the total protein content of the preparation, but may be, for example 90 or even 98% of the total protein content.
Recombinant: A recombinant nucleic acid is one that has a sequence that is not naturally occurring or has a sequence that is made by an artificial combination of two otherwise separated segments of sequence. This artificial combination is often accomplished by chemical synthesis or, more commonly, by the artificial manipulation of isolated segments of nucleic acids, e.g., by genetic engineering techniques.
Reverse Transcriptase: A template-directed DNA polymerase that generally uses RNA as its template.
RNA polymerase: Catalyzes the polymerization of activated ribonucleotide precursors that are complementary to the DNA template.
Sequence of signals: The sequential series of emission signals, including light or spectra signals, that are emitted from fluorescently labeled nucleotides as they are added to the growing complementary nucleic acid strand.
Substrate: Material in the microscope field of view that the polymerase or nucleic acid is attached to. In particular embodiments, the substrate is made of biocompatible material that is transparent to light, including glass and quartz. For example, the substrate may be a 3 cm long by 1 cm wide by 0.25 cm thick glass microscope slide. In another embodiment, the substrate can be a gel matrix, to allow sequencing in three-dimensions. In yet another embodiment, for example when LRET is used, the substrate can be opaque.
The substrate can be treated before use. For example, glass microscope slides can be washed by ultrasonication in water for 30 minutes, soaked in 10% NaOH for 30 minutes, rinsed with distilled water and dried in an 80°C oven for 10 minutes or air-dried overnight.
Two dye sequencing (TDS): A method of sequencing nucleic acids using at least two sets of fluorophores, with one set on the nucleotides (a different acceptor dye for each class of nucleotides), and the other set on the polymerase (a donor dye). In particular embodiments, two sets of fluorophores are used.
Transformed: A transformed cell is a cell into which has been introduced a nucleic acid molecule by molecular biology techniques. As used herein, the term transformation encompasses all techniques by which a nucleic acid molecule might be introduced into such a cell, including transfection with viral vectors, transformation with plasmid vectors, and introduction of naked DNA by electroporation, lipofection, and particle gun acceleration.
Unique Emission Signal: The emission spectrum for each fluorophore is unique. By attaching one or more individual fluorophores or other labels to each type of nucleotide, each different type of nucleotide (e.g. A, T/U, C or G) has its own individual or own combination of signals (such as fluorophores that emit at unique different wavelengths). Each nucleotide class will have a unique emission signal, that in the examples is based on the fluorophore(s) present on that class of nucleotide. This signal can be used to determine which type of nucleotide (e.g. A, T/U, C or G) has been added to a growing complementary strand of nucleic acid, and these signals in combination indicate the nucleic acid sequence. In addition to the different wavelengths of light emitted as a signal, different types of signals can include different intensities of light and different intensities emitted at a particular wavelength, in other words, a spectrum consisting of different intensities emitted at different wavelengths.
Vector: A nucleic acid molecule as introduced into a host cell, thereby producing a transformed host cell. A vector may include nucleic acid sequences that permit it to replicate in a host cell, such as an origin of replication. A vector may also include one or more selectable marker genes and other genetic elements known in the art.
DETAILED EMBODIMENT
Disclosed herein is a new method for sequencing nucleic acids, and one disclosed embodiment is called Two Dye Sequencing (TDS), because it depends on at least two classes of fluorophores, a donor and an acceptor. The donor fluorophore is on a polymerase, and the acceptor fluorophore is on the nucleotides which are incorporated into the nucleic acid as a complementary strand is generated (FIGS. 1-3). In one embodiment, as shown in FIG. 1A, a polymerase 10 is attached to a substrate 12, such as a microscope slide, by a linker 14. The nucleic acid 16 to be sequenced has an annealed oligonucleotide primer 18, and is bound by the anchored polymerase 10. To start the sequencing reaction, a mixture of nucleotides 20 is added. The polymerase 10 then sequentially adds the appropriate nucleotide 20 to the complementary strand. As shown in FIG. 3, the substrate 12 can be mounted onto a microscope stage 34. The sequencing reaction may take place in an aqueous environment 36, which may be sealed to prevent desiccation, for example by covering with a glass cover slip 38.
FIGS. 1B-1D show alternative embodiments in which a nucleic acid, for example an oligonucleotide primer 18 (FIG. 1B) or a nucleic acid to be sequenced 16 (FIGS. 1C and 1D) is attached to a substrate 12, such as a microscope slide, by a linker 14. The nucleic acid to be sequenced can be attached by its 5' (FIG. 1D) or 3' end (FIG. 1C). In other embodiments, the nucleic acid to be sequenced can be attached to the substrate by any nucleotide within the nucleic acid. To start the sequencing reaction, a mixture of nucleotides 20 and polymerase 10 is added as described above.
FIG. 2 illustrates the fluorophores on both the polymerase 10 and the nucleotides 20. The polymerase 10 is labeled with a donor fluorophore 22, such as green fluorescent protein (GFP). The nucleotide 20 (A, T/U, C, or G) is labeled with at least one acceptor fluorophore 24. After attaching the fluorescent polymerase 10 to a substrate 12 in a microscope field of view, the fluorescent nucleotides 20 are added to the reaction chamber. While each nucleotide 20 is added to the complementary strand, the fluorophore 22 on the polymerase 10, but not the fluorophore(s) 24 on the nucleotides 20, is continually excited using electromagnetic radiation, for example a coherent beam of light provided by a laser 26 which emits electromagnetic radiation 28 of a particular wavelength, or light within a narrow range of wavelengths. Alternatively, the donor fluorophore 22 can be a luminescent molecule, or a luminescent molecule can be used to excite the donor fluorophore 22. In these embodiments, a source of electromagnetic radiation, such as a laser 26, is not required. An example of a luminescent molecule is aequorin.
The laser 26 provides an excitation signal 28 that excites the donor fluorophore 22 on the polymerase 10, but not the acceptor fluorophore 24 on the incorporated or free nucleotides 20. Upon addition of a fluorescent nucleotide 20 to the complementary strand, the emission signal 30 from the donor fluorophore 22 will excite the acceptor fluorophore 24 associated with the particular nucleotide being added to the sequence. The acceptor fluorophore 24 then emits its own unique emission signal 32, which acts as an indicator of the corresponding type of nucleotide (uniquely associated with that fluorophore) that has been added to the sequence. This transfer of energy from the donor fluorophore to the acceptor fluorophore is fluorescence resonance energy transfer (FRET). Alternatively, if a luminescent molecule such as aequorin (instead of a laser 26) is used to excite the donor fluorophore (or is the donor fluorophore), the resulting emission signal 30 from the donor fluorophore 22 (or luminescent molecule) will excite the acceptor fluorophore 24 associated with the particular nucleotide being added to the sequence, without the need for a source of electromagnetic radiation 26. The acceptor fluorophore 24 then emits its own unique emission signal 32, which acts as an indicator of the corresponding type of nucleotide (uniquely associated with that fluorophore) that has been added to the sequence. This transfer of energy is luminescence resonance energy transfer (LRET).
The unique emission signal 32 for each type of nucleotide 20 (A, T/U, C or G) is converted into a nucleic acid sequence as shown in FIG. 3. The series of emission signals 32, emitted in the microscope field as each nucleotide is added to the sequence, is collected with a microscope objective lens 40, and a complete emission spectrum 42 for each nucleotide emission 32 is generated by a spectrophotometer 44. The complete emission spectrum 42 is captured by a detection device, such as CCD-camera 46 for each nucleotide 20 as it is added to the nucleic acid strand 16 in the microscope field of view. The CCD camera 46 collects the emission spectrum 42 for each added nucleotide, and converts the spectrum 42 into a charge 48. The charge 48 for each nucleotide addition may be recorded by a computer 50, for converting the sequence of emission spectrums into a nucleic acid sequence 52 for each nucleic acid in the microscope field of view using an algorithm 54, such as a least-squares fit between the signal spectrum 42 and the dye spectra for the fluors 24 on each class of nucleotides 20.
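As an illustration of the conversion step performed by the computer 50, the following Python sketch assigns each recorded emission spectrum to the nucleotide type whose reference dye spectrum fits it best in the least-squares sense, and concatenates the calls into a sequence. It is a simplified sketch, not the disclosed algorithm 54: the function names, the four-channel spectra and the numerical values in `REFERENCES` are all invented for the example.

```python
from typing import Dict, List, Sequence


def scaled_residual(signal: Sequence[float], reference: Sequence[float]) -> float:
    """Sum of squared residuals after fitting signal ~ k * reference.

    The scale factor k is chosen by least squares, so overall brightness
    of the signal does not affect which reference fits best.
    """
    denom = sum(r * r for r in reference)
    k = sum(s * r for s, r in zip(signal, reference)) / denom if denom else 0.0
    return sum((s - k * r) ** 2 for s, r in zip(signal, reference))


def call_bases(spectra: List[Sequence[float]],
               references: Dict[str, Sequence[float]]) -> str:
    """Convert a series of emission spectra into a nucleotide sequence."""
    calls = []
    for signal in spectra:
        best = min(references, key=lambda base: scaled_residual(signal, references[base]))
        calls.append(best)
    return "".join(calls)


# Hypothetical reference spectra (intensity in four wavelength channels).
REFERENCES = {
    "A": [1.0, 0.2, 0.0, 0.0],
    "C": [0.0, 1.0, 0.2, 0.0],
    "G": [0.0, 0.0, 1.0, 0.2],
    "T": [0.2, 0.0, 0.0, 1.0],
}

if __name__ == "__main__":
    recorded = [
        [0.9, 0.25, 0.05, 0.0],   # noisy, A-like
        [0.0, 0.1, 2.1, 0.4],     # brighter, G-like
        [0.3, 0.0, 0.1, 1.8],     # T-like
    ]
    print(call_bases(recorded, REFERENCES))
```

In practice the reference spectra would be measured for the actual acceptor fluorophores, and a quality threshold on the residual could be used to reject ambiguous events.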
Although many different algorithms could be used to convert the emission spectrums into a nucleic acid sequence, this specific example illustrates one approach. Four fluorescent spectra (Anm, Cnm, Gnm and T/Unm) are generated from macroscopic measurements. From the sample, an unknown noisy spectrum (Snm) is generated. The unknown spectrum is assumed to be the sum of the four known spectra with only four weights, a, c, g and t/u, representing the relative proportions of the bases. So at 520 nm through 524 nm, this results in five equations:
A520*a + C520*c + G520*g + T520*t = S520
A521*a + C521*c + G521*g + T521*t = S521
A522*a + C522*c + G522*g + T522*t = S522
A523*a + C523*c + G523*g + T523*t = S523
A524*a + C524*c + G524*g + T524*t = S524
Filling in the known values, a, c, g, and t/u are solved using a least squares linear regression.
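The overdetermined system above (five wavelengths from 520 nm to 524 nm, four unknown weights) can be solved with the normal equations. The Python sketch below uses only the standard library; the function names and all spectral values are assumptions made up for illustration, and a production implementation would more likely call a numerical library routine.

```python
from typing import List, Sequence


def solve_linear(matrix: List[List[float]], rhs: List[float]) -> List[float]:
    """Solve a square linear system by Gaussian elimination with partial pivoting."""
    n = len(rhs)
    aug = [row[:] + [rhs[i]] for i, row in enumerate(matrix)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        if abs(aug[pivot][col]) < 1e-12:
            raise ValueError("reference spectra are not linearly independent")
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for r in range(col + 1, n):
            factor = aug[r][col] / aug[col][col]
            for c in range(col, n + 1):
                aug[r][c] -= factor * aug[col][c]
    solution = [0.0] * n
    for i in range(n - 1, -1, -1):
        tail = sum(aug[i][j] * solution[j] for j in range(i + 1, n))
        solution[i] = (aug[i][n] - tail) / aug[i][i]
    return solution


def unmix(design: List[Sequence[float]], observed: Sequence[float]) -> List[float]:
    """Least-squares weights (a, c, g, t) via the normal equations.

    Each row of `design` holds [A_nm, C_nm, G_nm, T_nm] at one wavelength;
    `observed` holds S_nm at the same wavelengths.
    """
    k = len(design[0])
    gram = [[sum(row[i] * row[j] for row in design) for j in range(k)] for i in range(k)]
    moment = [sum(row[i] * s for row, s in zip(design, observed)) for i in range(k)]
    return solve_linear(gram, moment)


# Hypothetical reference intensities at 520, 521, 522, 523 and 524 nm.
DESIGN = [
    [0.9, 0.1, 0.0, 0.2],
    [0.7, 0.3, 0.1, 0.1],
    [0.4, 0.8, 0.2, 0.0],
    [0.1, 0.5, 0.9, 0.1],
    [0.0, 0.2, 0.6, 0.8],
]

if __name__ == "__main__":
    observed = [0.0, 0.1, 0.2, 0.9, 0.6]  # equals the G column, so g should be about 1
    weights = unmix(DESIGN, observed)
    base = "ACGT"[max(range(4), key=lambda i: weights[i])]
    print([round(w, 6) for w in weights], base)
```

The base with the largest fitted weight is taken as the nucleotide that was added; with noisy data the other weights will be small but nonzero.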
In this particular example, the donor fluorophore 22 carried by the polymerase 10 is GFP H9-40, and the nucleotides are labeled with acceptor fluorophores as follows: A is labeled with BODIPY; T is labeled with fluorescein; C is labeled with rhodamine; G is labeled with Oregon green. In another example, the donor fluorophore 22 carried by the polymerase 10 is H9-40, and the nucleotides are labeled with acceptor fluorophores as follows: A is labeled with tetramethylrhodamine; T/U is labeled with naphthofluorescein; C is labeled with lissamine; G is labeled with Texas Red. The emission spectrum of each of the acceptor fluorophores is monitored, and the spectrum of each of the fluorophores can be distinguished from each other, so that the addition of each different type of nucleotide can be detected.
Therefore, the method allows for the sequencing of nucleic acids by monitoring the incorporation of individual nucleotides into individual DNA or RNA molecules on the molecular level, instead of sequencing by monitoring macromolecular events, such as a pattern on an electrophoresis gel, whose signal is representative of a large population of nucleic acid molecules. Using this method in combination with a large field of view, it is possible that 1000 or more DNA molecules could be sequenced simultaneously, at sequencing speeds of 360 bases or more per hour. Each DNA molecule to be copied/sequenced, and its associated polymerase/donor dye, may correspond to a particular field of view, or a particular sensor for a position in which the polymerase mediated reaction is occurring. Therefore, using multiple such devices, molecular sequencing with the method can permit sequencing entire chromosomes or genomes within a day.
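The throughput figures in the preceding paragraph can be checked with simple arithmetic. The Python sketch below takes the 1000 molecules and 360 bases per hour from the text; the genome size of 3 x 10^9 bases and single-pass coverage are assumptions introduced only for the example, and the function names are hypothetical.

```python
import math


def bases_per_day(molecules: int, bases_per_hour: float, hours: float = 24.0) -> float:
    """Total bases read per device per day, assuming continuous operation."""
    return molecules * bases_per_hour * hours


def devices_needed(genome_bases: float, per_device_per_day: float,
                   coverage: float = 1.0) -> int:
    """Number of devices required to read a genome within one day."""
    return math.ceil(genome_bases * coverage / per_device_per_day)


if __name__ == "__main__":
    per_device = bases_per_day(molecules=1000, bases_per_hour=360)
    print(per_device)                        # 8,640,000 bases per device per day
    # Assumed genome size (not stated in the text): 3e9 bases, 1x coverage.
    print(devices_needed(3_000_000_000, per_device))
```

Under these assumptions one device reads about 8.6 million bases per day, so several hundred devices operating in parallel would be needed to cover a genome of that size in a day.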
More details about particular aspects of this method are given in the following examples.
EXAMPLE 1
Preparation of Fluorescent or Luminescent Polymerases
This example describes how to prepare polymerases containing at least one fluorophore or luminescent molecule. The fluorophore or luminescent molecule may be a donor fluorophore.
Recombinant GFP-polymerase
Green fluorescent protein (GFP) includes a chromophore formed by amino acids in the center of the GFP. GFP is photostable, making it a desirable fluorophore to use on the polymerase, because it is resistant to photobleaching during excitation. Wild-type GFP is excited at 393 nm or 476 nm to produce an emission at 508 nm. GFP mutants have alternative excitation and emission spectra. One GFP mutant, H9-40 (Tsien, 1998, Ann. Rev. Biochem. 67:509; U.S. Patent Nos. 5,625,048 and 5,777,079 to Tsien and Heim, herein incorporated by reference), has only a single absorption at 398 nm and emits at 511 nm. A red-shifted GFP mutant RSGFP4 (Delagrave et al., Biotechnology 13:151-4, 1995) has an excitation at 490 nm and emission at 505 nm. The blue-shifted GFP mutant BFP5 absorbs at 385 nm and emits at 450 nm (Mitra et al., Gene, 173:13-7, 1996).
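The excitation and emission values listed above can be kept in a small table to help match a donor to an available excitation source. In the Python sketch below the wavelengths are those given in the text, while the function name, the laser wavelengths and the tolerance are hypothetical choices made for illustration.

```python
from typing import Dict, List, Tuple

# variant -> (excitation peaks in nm, emission peak in nm), as stated in the text
GFP_VARIANTS: Dict[str, Tuple[Tuple[int, ...], int]] = {
    "wild-type GFP": ((393, 476), 508),
    "H9-40": ((398,), 511),
    "RSGFP4": ((490,), 505),
    "BFP5": ((385,), 450),
}


def donors_for_laser(laser_nm: float, tolerance_nm: float) -> List[str]:
    """Variants with an excitation peak within tolerance_nm of the laser line.

    Results are ordered from the closest match to the farthest.
    """
    matches = []
    for name, (excitations, _emission) in GFP_VARIANTS.items():
        offset = min(abs(laser_nm - ex) for ex in excitations)
        if offset <= tolerance_nm:
            matches.append((offset, name))
    return [name for _offset, name in sorted(matches)]


if __name__ == "__main__":
    print(donors_for_laser(488, 15))  # assumed 488 nm source, 15 nm tolerance
    print(donors_for_laser(400, 5))   # assumed 400 nm source, 5 nm tolerance
```

A fuller treatment would compare whole excitation spectra and also check that the chosen wavelength does not directly excite the acceptor fluorophores.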
The polymerase used for elongation of the primer strand can be attached to GFP to generate a fusion protein, GFP-polymerase, by recombinant techniques known to those skilled in the art. Methods for making fusion proteins are described in Sambrook et al. (Molecular Cloning, A Laboratory Manual, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York, Chapter 17, 1989), herein incorporated by reference. Plasmids containing the wild-type or mutant GFP gene sequences and a multiple cloning site (MCS) into which the polymerase sequence can be inserted (i.e. pGFP), are available from Clontech (Palo Alto, CA). Briefly, both the polymerase DNA and the GFP plasmid are digested with the appropriate restriction enzyme(s) which allow for the insertion of the polymerase into the MCS of the GFP plasmid in the sense orientation. The resulting fragments are ligated and expressed in bacteria, such as E. coli. The expressed recombinant GFP-polymerase is then purified using methods known by those skilled in the art. The GFP molecule may be placed at the N- or C-terminus of the polymerase, or anywhere in between. The resulting GFP-polymerases are tested to determine which has the optimal properties for sequencing. Such properties can include: ease of protein purification, amount of protein produced, amount of fluorescence signal emitted after excitation, and minimal alteration of the fluorescent properties of the GFP.
The purification of recombinant fusion proteins has been made significantly easier by the use of affinity tags that can be genetically engineered at either the N- or C-terminus of recombinant proteins. Such tags can be attached to the GFP-polymerase protein, to aid in its purification and subsequent attachment to a substrate (see Example 2). Examples of affinity tags include histidine (His), streptavidin, S-tags, and glutathione-S-transferase (GST). Other tags known to those skilled in the art can also be used. In general, the affinity tags are placed at the N- or C-terminus of a protein. Commercially available vectors contain one or multiple affinity tags. These vectors can be used directly, or if desired, the sequences encoding the tag can be amplified from the vectors using PCR, then ligated into a different vector such as the GFP-containing vectors described above. To prepare a Tag-GFP-polymerase recombinant fusion protein, vectors are constructed which contain sequences encoding the tag, GFP (wild-type or mutant), and the polymerase. The sequences are ordered to generate the desired Tag-GFP-polymerase recombinant fusion protein. Such methods are well known to those skilled in the art (Sambrook et al., Molecular Cloning, A Laboratory Manual, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York, Chapter 17, 1989). This vector is expressed in bacteria such as E. coli, and the protein purified. The method of purification will depend on the affinity tag attached. Typically, the bacterial lysate is applied to a column containing a resin having high affinity for the tag on the fusion protein. After applying the lysate and allowing the tagged-fusion protein to bind, unbound proteins are washed away, and the fusion protein is subsequently eluted.
One of the most widely used tags is six or ten consecutive histidine (His) residues, which has high affinity for metal ions. A His-6 or His-10 moiety can be attached to GFP-polymerase by using pET vectors (Novagen, Madison, WI). The generation of GFP-His (Park and Raines, Protein Sci. 6:2344-9, 1997) and protein-GFP-His recombinant proteins has been described previously (Prescott et al., FEBS Lett. 411:97-101, 1997, herein incorporated by reference). The His-containing fusion proteins can be purified as described in Paborsky et al. (Anal. Biochem., 234:60-5, 1996), herein incorporated by reference. Briefly, the cell lysate is immobilized using affinity chromatography on Ni2+-NTA-Agarose (QIAGEN, Valencia, CA). After washing away unbound proteins, for example using a buffer containing 8 mM imidazole, 50 mM Tris-HCl, pH 7.5, 150 mM NaCl, the bound recombinant protein is eluted using the same buffer containing a higher concentration of imidazole, for example 100-500 mM.
The S-tag system is based on the interaction of the 15 amino acid S-tag peptide with the S-protein derived from pancreatic ribonuclease A. Several vectors for generating S-tag fusion proteins, as well as kits for the purification of S-tagged proteins, are available from Novagen (Madison, WI). For example, vectors pET29a-c and pET30a-c can be used. The S-tag fusion protein is purified by incubating the cell lysate with S-protein agarose, which retains S-tag fusion proteins. After washing away unbound proteins, the fusion protein is released by incubation of the agarose beads with a site-specific protease, which leaves behind the S-tag peptide.
The affinity tag streptavidin binds with very high affinity to D-biotin. Vectors for generating streptavidin-fusion proteins, and methods for purifying these proteins, are described in Sano and Cantor (Biochem. Biophys. Res. Commun. 176:571-7, 1991, herein incorporated by reference). To purify the fusion protein, the cell lysate is applied to a 2-iminobiotin agarose column (other biotin-containing columns may be used), and after washing away unbound proteins, the fusion protein is eluted, for example with 6 M urea, 50 mM ammonium acetate (pH 4.0).
The enzyme glutathione-S-transferase (GST) has high affinity for glutathione. Plasmid expression vectors containing GST (pGEX) are disclosed in U.S. Patent No. 5,654,176 to Smith, herein incorporated by reference, and in Sharrocks (Gene, 138:105-8, 1994, herein incorporated by reference). pGEX vectors are available from Amersham Pharmacia Biotech (Piscataway, NJ). The cell lysate is incubated with glutathione-agarose beads and after washing, the fusion protein is eluted, for example, with 50 mM Tris-HCl (pH 8.0) containing 5 mM reduced glutathione. After purification of the GST-GFP-polymerase fusion protein, the GST moiety can be released by specific proteolytic cleavage. If the GST-fusion protein is insoluble, it can be purified by affinity chromatography if the protein is solubilized in a solubilizing agent which does not disrupt binding to glutathione-agarose, such as 1% Triton X-100, 1% Tween 20, 10 mM dithiothreitol or 0.03% NaDodSO4. Other methods used to solubilize GST-fusion proteins are described by Frangioni and Neel (Anal. Biochem. 210:179-87, 1993, herein incorporated by reference).
Recombinant GFP-aequorin-polymerase
Recombinant GFP-aequorin-polymerase can be generated using methods known to those skilled in the art, for example the method disclosed by Baubet et al. (Proc. Natl. Acad. Sci. USA 97:7260-5, 2000, herein incorporated by reference). Briefly, aequorin cDNA (for example Genbank Accession No. L29571), polymerase DNA, and a GFP plasmid are digested with the appropriate restriction enzyme(s) which allow for the insertion of the aequorin and polymerase into the MCS of a GFP plasmid in the sense orientation. The resulting fragments are ligated and expressed in bacteria, such as E. coli. The expressed recombinant GFP-aequorin-polymerase is then purified as described above. Affinity tags can also be added.
The ordering of the GFP, aequorin, and polymerase sequences can be optimized. The resulting GFP-aequorin-polymerases are tested to determine which has the optimal properties for sequencing. Such properties can include: ease of protein purification, amount of protein produced, amount of chemiluminescent signal emitted, amount of fluorescent signal emitted after excitation, minimal alteration of the fluorescent properties of the GFP and aequorin, and amount of polymerase activity.
Attachment of fluorophores to a polymerase
As an alternative to generating a GFP-polymerase fusion protein, other donor fluorophores can be used by directly or indirectly attaching them to the polymerase.
Amine-reactive fluorophores are frequently used to create fluorescently-labeled proteins. Examples of amine-reactive probes that can be used include, but are not limited to: fluorescein, BODIPY, rhodamine, Texas Red and their derivatives. Such dyes will attach to lysine residues within the polymerase, as well as to the free amine at the N-terminus. Reaction of amine-reactive fluorophores usually proceeds at pH values in the range of pH 7-10.
Alternatively, thiol-reactive probes can be used to generate a fluorescently-labeled polymerase. In proteins, thiol groups are present in cysteine residues. Reaction of fluors with thiols usually proceeds rapidly at or below room temperature (RT) in the physiological pH range (pH 6.5-8.0) to yield chemically stable thioethers. Examples of thiol-reactive probes that can be used include, but are not limited to: fluorescein, BODIPY, coumarin, rhodamine, Texas Red and their derivatives. Other functional groups on the protein, including alcohols (serine, threonine, and tyrosine residues), carboxylic acids and glutamine, can be used to conjugate other fluorescent probes to the polymerase. Another fluorophore which can be attached to the polymerase is 4-[N-[(iodoacetoxy)ethyl]-N-methylamino]-7-nitrobenz-2-oxa-1,3-diazole (IANBD), as described by Allen and Benkovic (Biochemistry, 1989, 28:9586).
Methods for labeling proteins with reactive dyes are well known to those skilled in the art. In addition, the manufacturers of such fluorescent dyes, such as Molecular Probes (Eugene, OR), provide instructions for carrying out such reactions.
In particular embodiments, fluorescently-labeled polymerases have a high fluorescence yield, and retain the critical features of the polymerase, primarily the ability to synthesize a complementary strand of a nucleic acid molecule. The polymerase may therefore have a less-than-maximal fluorescence yield to preserve the function of the polymerase. Following conjugation of the fluorophore to the polymerase, unconjugated dye is removed, for example by gel filtration, dialysis or a combination of these methods.
EXAMPLE 2
Attachment of the Polymerase or Nucleic Acid to a Substrate
This example describes methods that can be used to attach the fluorescent polymerase generated in Example 1, or a nucleic acid, to a substrate, such as a microscope slide or gel matrix. During the sequencing reaction, the sample nucleic acid to be sequenced, the oligonucleotide primer, or the polymerase, is attached to a substrate in the microscope field of view.
Attachment of Nucleic Acids
Several methods for attaching nucleic acids (for example the sample nucleic acid to be sequenced or an oligonucleotide primer) to a substrate are available. In particular embodiments, nucleic acids can be attached by their 5' or 3' end, or anywhere in between. For example, a 5' biotinylated primer can be synthesized (Beaucage, Tetrahedron Letters 22:1859-62, 1981; Caruthers, Meth. Enzym. 154:287-313, 1987), and affixed to a streptavidin coated substrate surface (Hultman, Nucl. Acids Res. 17:4937-46, 1989). In another embodiment, the nucleic acid can be dried on amino-propyl-silanized (APS) glass, as described by Ha et al. (Proc. Natl. Acad. Sci. USA 93:6264-68, 1996), herein incorporated by reference. In yet other embodiments, a silyl moiety can be attached to a nucleic acid, which can be used to attach the nucleic acid directly to a glass substrate, for example using the methods disclosed by Kumar et al. (Nucleic Acids Res. 28:e71, 2000, herein incorporated by reference). Briefly, silane is conjugated to a nucleic acid using the following method.
Mercaptosilane [(3-Mercaptopropyl)-trimethoxysilane] is diluted to 5 mM stock solution with a reaction buffer such as sodium acetate (30 mM, pH 4.3) or sodium citrate (30 mM, pH 4). For conjugation of 5'-thiol-labeled nucleotides with mercaptosilane, 1 nmol nucleotides are reacted with 5 nmol mercaptosilane in 20 μl of the same buffer for 10-120 min at RT. The reaction mixture is used directly or diluted with the reaction buffer to a desired concentration for immobilization on a substrate, such as a glass microscope slide. 5'-acrylic-labeled oligonucleotides are conjugated to mercaptosilane using an identical procedure.
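The quantities in the mercaptosilane conjugation above follow from simple unit arithmetic. The Python sketch below uses the values stated in the text (5 mM stock, 5 nmol mercaptosilane, 1 nmol nucleotides, 20 μl reaction); the function names are hypothetical and serve only this illustration.

```python
def stock_volume_ul(amount_nmol: float, stock_mM: float) -> float:
    """Volume of stock (in microliters) that delivers amount_nmol.

    1 mM equals 1 nmol per microliter, so volume = amount / concentration.
    """
    return amount_nmol / stock_mM


def final_conc_uM(amount_nmol: float, volume_ul: float) -> float:
    """Final concentration (in micromolar) of amount_nmol in volume_ul."""
    return amount_nmol * 1000.0 / volume_ul


if __name__ == "__main__":
    print(stock_volume_ul(5, 5))     # microliters of 5 mM mercaptosilane stock
    print(final_conc_uM(5, 20))      # mercaptosilane concentration in the 20 ul reaction
    print(final_conc_uM(1, 20))      # nucleotide concentration in the 20 ul reaction
```

With these values, 1 μl of stock supplies the 5 nmol of mercaptosilane, giving a five-fold molar excess of silane over nucleotides in the 20 μl reaction.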
The 5'-thiol-labeled nucleotides are conjugated with aminosilane [(3-aminopropyl)-trimethoxysilane] in dimethylsulfoxide (DMSO) in the presence of heterobifunctional linkers N-succinimidyl-3-(2-pyridyldithiol)-propionate (SPDP) or succinimidyl-6-(iodoacetyl-amino)-hexanoate (SIAX). Nucleotides (final concentration 5-50 μM) are combined with 2.5 nmol aminosilane (added from 5 mM solution in ethanol) and 2.5 nmol bifunctional reagents (added from 5 mM stock solution in DMSO) in 10 μl DMSO, and the reaction allowed to proceed for 1-2 hours at RT.
Acrylic-labeled oligonucleotides (50-500 pmol) are combined with 25 nmol acrylicsilane (γ-methacryloxy-propyl-trimethoxysilane) in 10 μl of 30 mM NaOAc, pH 4.3. Ammonium persulfate (10% in H2O) and N,N,N',N'-tetramethylethylenediamine (TEMED) are added to final concentration of 0.5 and 2%, respectively, and the mixture allowed to react for 30 minutes at RT.
After the conjugation reactions, the reaction mixture is referred to as silanized nucleic acid, and can be directly used for spotting onto a substrate. Silanized nucleic acids can be spotted on the glass slides manually (120 nl/spot) or with an automated arrayer (Genetic Microsystem, Woburn, USA) (1 nl/spot). Nucleic acids in aqueous solutions can be kept in a humidified chamber for 15 minutes at RT after spotting onto the glass slide, dried at 50°C for five minutes, dipped into boiling water for 30 seconds to remove non-covalently bound nucleic acids, and dried with nitrogen before hybridization. Nucleotides in DMSO are left at RT for 15 minutes after spotting onto glass slides and dried at 50°C for 10 minutes. These slides are sequentially washed with DMSO (3 x 2 min), ethanol (3 x 2 min) and boiling water (2 min) and dried with nitrogen for later use.
To hybridize a complementary nucleotide to the nucleotide attached to the substrate, such as an oligonucleotide primer, the nucleotide to be hybridized is diluted to between 20 nM and 1 μM in 5x SSC (750 mM NaCl, 125 mM sodium citrate, pH 7) with 0.1% Tween-20. Hybridization is done under coverslips in a humidifier at 37°C for 30 minutes to overnight. Non-hybridized and nonspecific nucleotides are removed by washing with 5x SSC containing 0.1% Tween-20 (3 x 1 min) followed by 1x SSC containing 0.1% Tween-20 (2 x 15 min).
If a longer nucleic acid molecule is to be hybridized, such as a sample nucleic acid, hybridization is carried out at 65°C for four hours in 3x SSC with 0.1% SDS and 1 μg/μl yeast tRNA. The slides are then washed with 1x SSC containing 0.1% SDS (3 x 2 min) and 0.1x SSC containing 0.1% SDS (3 x 5 min) at RT.
After washing, the slides can be dried with nitrogen gas. If repeated hybridization on the same substrate is desired, the substrate is boiled in water for one minute then dried with nitrogen gas before proceeding to the next hybridization reaction. To attach a nucleic acid by the 3' end, a terminal transferase can be used to "tail" the molecule.
Attachment of Polymerase
In other embodiments the polymerase can be attached to the substrate. The polymerase can be linked to a substrate by first generating a streptavidin-polymerase fusion protein using the methods described above in Example 1. The polymerase-streptavidin protein is then affixed to a biotinylated substrate, for example as described by Mazzola and Fodor (Biophys. J. 68:1653-60, 1995) or Itakura et al. (Biochem. Biophys. Res. Commun. 196:1504-10, 1993).
Other methods of attaching the polymerase to a substrate are well known to those skilled in the art. For example, the microscopic tip of an atomic force microscope may be used to chemically alter the surface of a substrate (Travis, Science 268:30-1, 1995). Alternatively, if the protein contains 6-10 consecutive histidine residues, it will bind to a nickel-coated substrate. For example, Paborsky et al. (Anal. Biochem. 234:60-5, 1996, herein incorporated by reference) describe a method for attaching nickel to a plastic substrate. To charge microtiter polystyrene plates, 100 μl of N,N-bis[carboxymethyl]lysine (BCML) is added (10 mM BCML in 0.1 M NaPO4, pH 8) to each well and incubated overnight at RT. The plate is subsequently washed with 200 μl of 0.05% Tween, blocked (3% BSA in 50 mM Tris HCl, pH 7.5, 150 mM NaCl, 0.05% Tween) and washed with a series of buffers: first, 50 mM Tris HCl, pH 7.5, 500 mM imidazole, 0.05% Tween; second, 0.05% Tween; third, 100 mM EDTA, pH 8.0; and last, 0.05% Tween. The plate is next incubated with 10 mM NiSO4 for 20 minutes at RT. The plate is finally washed with 0.05% Tween and then 50 mM Tris HCl, 500 mM NaCl, pH 7.5.
Random attachment of the fluorescent polymerase to a substrate should be sufficient at low polymerase concentrations. To allow for the tightest packing of sequencing signals in the field of view, the polymerases may be arranged on a two-dimensional substrate surface in an organized array. Polymerases may be spaced by micrometer distances as described by Müller et al. (Science 268:272-3, 1995, herein incorporated by reference). In addition, patterns of channels that are approximately 50 μm in width and approximately 10-20 μm in depth can be formed in the substrate using standard photolithographic procedures followed by chemical wet etching as described in U.S. Patent No. 5,661,028 to Foote (herein incorporated by reference). Much smaller channels can be generated using nanolithography techniques. Dense periodic arrays of holes or chambers 20 nm across are fabricated into a silicon nitride coated substrate by the method of Park et al. (Science, 276:1401-4, 1997, herein incorporated by reference). In each chamber, a single sequencing reaction would take place. The polymerase may also be attached to the substrate in an orderly array by micropipetting droplets containing the polymerase onto the surface of the substrate. The droplets are then covered, for example with a glass coverslip, to prevent evaporation.
Embedding Polymerase and Nucleic acid in a Gel Matrix
As an alternative to attaching the polymerase or nucleic acid to a two-dimensional surface, the polymerase or nucleic acid may be embedded into a three-dimensional gel matrix. The polymerase or nucleic acid is added to the liquid matrix, which is allowed to solidify, trapping the polymerases or nucleic acids within it. Examples of this type of matrix include agarose and acrylamide, for example Ni2+-NTA-Agarose (QIAGEN, Valencia, CA).
EXAMPLE 3
Preparation of Fluorescent Nucleotides
This example describes how to prepare nucleotides containing at least one fluorophore, for example an acceptor fluorophore. In addition, this example lists sources of commercially available fluorescent nucleotides that can be used in the present disclosure. When choosing acceptor fluorophores, it is important that the frequency used to excite the donor fluorophore on the polymerase (Example 1) not overlap the excitation spectra of the acceptor fluorophores on the nucleotides. Each nucleotide should possess at least one acceptor fluorophore having an excitation spectrum which overlaps the emission spectrum of the donor fluorophore attached to the polymerase (Example 1), such that the emission from the donor fluorophore excites the acceptor fluorophore. NEN Life Science Products (Boston, MA) offers all four deoxynucleotides and ribonucleotide analogs with fluorophores attached. There are several different fluorophores available including fluorescein, Texas Red®, tetramethylrhodamine, coumarin, naphthofluorescein, cyanine-3, cyanine-5, and Lissamine™. In addition, Molecular Probes (Eugene, OR) sells deoxyuridinetriphosphate (dUTP) labeled with various fluorophores replacing the methyl group of thymine, synthesized by the method of U.S. Patent No. 5,047,519. Because these nucleotides have 3' hydroxyls, they can be used directly for sequencing.
Alternatively, nucleotides containing other acceptor fluorophores can be prepared. The fluorophores are capable of being attached to the nucleotide, are stable against photobleaching, and have high quantum efficiency. In addition, each type of nucleotide (e.g. A, T/U, C and G) will have a unique fluorophore (or a unique combination of fluorophores) attached, such that each type of nucleotide will have a distinct emission signal (such as an emission spectrum) from the other types of nucleotides. Hence a deoxynucleotide A will give a different emission signal from a nucleotide T, G or C; a nucleotide T will give a different emission signal from a nucleotide A, G or C; a nucleotide C will give a different emission signal from a nucleotide A, T or G; and a nucleotide G will give a different emission signal from a nucleotide A, T, or C. In the case of RNA, U will be substituted for T in this example.
The fluorophore ideally does not interfere excessively with the degree or fidelity of nucleotide incorporation. After attaching the fluorophores, the nucleotide is still able to undergo polymerization, complementary base pairing, and retain a free 3' hydroxyl end.
The fluorophore can either be directly or indirectly attached to the nucleotide. When attaching the fluorophore directly to the nucleotide, the method described above for attaching fluorophores to polymerases can be used (Example 1). Alternatively, the fluorophore may be attached indirectly to the nucleotide by a linker molecule. For example, a streptavidin linkage may be used. The linker does not significantly interfere with binding to or incorporation by the polymerase. The use of a linker would make the nucleotide bulky, allowing less FRET to previous bases. This may make it easier to distinguish nucleotides as they are added to the complementary nucleic acid strand. Alternatively, the nucleotides can be cleaved from the DNA molecules after their incorporation, for example by attaching DNase to the end of the polymerase, producing free mononucleotides that could not be reused. In addition, fluorescent molecules containing two attachment points can be used to orient the fluorophore on the polymerase (Corrie et al., 1999, Nature 400:425, herein incorporated by reference).
The linkage can be peptidase sensitive, allowing the fluorophore to be released after the emission signal is detected as a result of the acceptor fluorophore on the nucleotide being added to the complementary nucleic acid strand. The use of a linker may allow the fluorophore orientation to be controlled, so that the optimal orientation for FRET can be determined. An optimal orientation is one that generates the brightest emission signal from the acceptor fluorophore, without the nucleotide losing its ability to incorporate into the complementary nucleic acid strand. U.S. Patent Nos. 5,047,519 and 5,151,507 to Hobbs et al. (herein incorporated by reference) teach the use of linkers to separate a nucleotide from a fluorophore. Examples of linkers may include a straight-chained alkylene, C1-C20, optionally containing within the chain double bonds, triple bonds, aryl groups or heteroatoms such as N, O or S. Substituents on the diradical moiety can include C1-C6 alkyl, aryl, ester, ether, amine, amide or chloro groups.
Unlike the 3' blocked methods of nucleic acid sequencing, the sequencing method described herein is asynchronous. Therefore, it can be difficult to distinguish multiple bases of the same type (i.e. poly T). To solve this problem, "dummy" nucleotides can be supplied, such as four dNMP or dNDPs that have a fifth fluorophore distinct from the four used to identify the nucleotides. Because these molecules do not contain three phosphate groups, they can enter the polymerase, but they cannot bond covalently. If included in a higher concentration relative to the nucleotide fluorophores, they can provide a specific signal indicating the transition between attachment of one base and the next. Hence the signal from the "counter" nucleotide would usually be received between each actual signal, and serve to indicate that a new actual nucleotide has been added (for example syncopating the addition of three Ts as: T-counter-T-counter-T-counter). Repeat sequencing can be performed to confirm the result, and address the possibility that the counter may not be added or detected in some instances of each sequencing reaction. The counter nucleotides can also provide a means of determining the number of bases in runs of bases. However, important information about sequences can be obtained even without the use of the counter nucleotides, such as a rough approximation of the sequence, or quick confirmation of a sequence obtained by other methods.
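The role of the counter signal can be illustrated with a minimal sketch. It assumes the detector output has already been reduced to a time-ordered list of single-character readings, with a hypothetical symbol "x" standing for the fifth (counter) fluorophore; neither the symbol nor the function name comes from the disclosure.

```python
from itertools import groupby

COUNTER = "x"  # hypothetical symbol for the counter ("dummy") nucleotide signal


def decode_with_counter(events):
    """Convert a time-ordered list of detected signals into a base sequence.

    A base may be observed over several consecutive readings, so identical
    neighbouring readings are first merged into one event. A counter event
    lying between two identical base events keeps them as two separate
    incorporations instead of one.
    """
    merged = [symbol for symbol, _ in groupby(events)]
    return "".join(symbol for symbol in merged if symbol != COUNTER)


# Three Ts separated by counter events are read as three bases.
print(decode_with_counter(list("TTxTxTTxAAx")))  # TTTA
# Without counter events the same run collapses to a single T.
print(decode_with_counter(list("TTTTAA")))       # TA
```

A missed counter event in this sketch shortens a run by one base, which is why the text suggests repeat sequencing to confirm the result.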
Another approach to distinguishing multiple bases of the same type is to incubate the reaction at a low temperature, such as 0-30°C, for example 4°C or at RT. Polymerases are selected that are able to function properly at these lower temperatures. This temperature range allows for a more narrow spectral line and hence higher coding complexity. If more than one fluorescent acceptor is present on each nucleotide, then the individual classes of nucleotides are coded. The lower temperature sharpens the spectrum, allowing more distinct spectra to be read. It is important to avoid freezing, which would interfere with the polymerization reaction. Other approaches to distinguishing multiple bases of the same type include making the aqueous environment 36 more viscous, reducing the concentration of nucleotides, and using a polymerase containing one or more mutations which slow the polymerase.
To help assure that the selected fluorescent nucleotides can be incorporated into a nucleic acid by a polymerase, the fluorescent nucleotides are first tested using a fluorescence spectrophotometer. For example, 5' biotin-labeled single-stranded nucleic acid is attached to magnetic streptavidin particles. A primer is annealed and the polymerase and one or more fluorescent nucleotides are added. After washing the beads, the nucleic acid is cleaved at a restriction site close to the bead. The fluorescence spectrophotometer is used to detect addition of the fluorescent nucleotides. The test can also be performed by separation of the labeled nucleic acids on an agarose gel and detection under UV lamp or using an ABI sequencing machine. Therefore, the contribution of previously incorporated bases to the current spectrum can be determined by accounting for known spectrum of nucleotides at various previous positions. Since the previous sequence is known, the predicted effect of the previous nucleotides can be removed from the current spectrum.
Another method that can be used to distinguish the latest nucleotide added onto the growing nucleotide chain is to use polarized light to measure the rotation of single molecules. In this embodiment, the newly added base is fixed in orientation. The location of the donor dipole is adjusted to match the most recently added acceptor fluorophore so that the most recent fluorophore generates the strongest FRET signal. Harms et al. teach the use of polarized light to measure rotation of single molecules (Biophys. J. 77:2864-70, 1999, herein incorporated by reference). Yet another method to distinguish the individual nucleotides is to label the nucleotides with more than one fluorophore. For example, two or more different fluorophores can be added to each nucleotide. The combination of fluorophores generates an emission spectrum which is easier to distinguish than the emission spectrum from only one fluorophore on each nucleotide. Multiple tags thereby allow each nucleotide to be coded by more than one spectrum, helping to reduce the ambiguity of strings of the same nucleotide.
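Removing the predicted effect of earlier bases from the current spectrum can be sketched as a simple subtraction. The sketch assumes that spectra are lists of intensities over the same wavelength bins, that the contributions add linearly, and that a weight is known for each previous position; the linear model, the weights and all names here are illustrative assumptions, not values from the disclosure.

```python
def remove_previous_contribution(observed, previous_bases, reference, weights):
    """Subtract the predicted signal of earlier bases from an observed spectrum.

    observed       -- intensities per wavelength bin for the current reading
    previous_bases -- bases already called, oldest first
    reference      -- dict mapping each base to its known spectrum
    weights        -- assumed residual weight per previous position,
                      nearest (most recent) position first
    """
    corrected = list(observed)
    # Pair the nearest previous base with the first weight, and so on back.
    for weight, base in zip(weights, reversed(previous_bases)):
        for i, value in enumerate(reference[base]):
            corrected[i] -= weight * value
    return corrected


# Hypothetical two-bin reference spectra and weights, for illustration only.
reference = {"A": [1.0, 0.0], "C": [0.0, 1.0]}
observed = [1.25, 0.5]  # a true A signal plus residue from earlier "A" then "C"
print(remove_previous_contribution(observed, "AC", reference, [0.5, 0.25]))
# [1.0, 0.0]
```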
Distinguishing multiple nucleotides of the same type may be difficult because the signal is not synchronized. The resulting sequence, if recorded, would be "compressed". For example, the first 30 bases of an E. coli sequence:
(1) agcttttcattctgactgcaacgggcaata
would be compressed by removing strings of similar bases:
(2) agct cat ctgactgca cg ca ta
resulting in:
(3) agctcatctgactgcacgcata
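The compression of sequence (1) into sequence (3) amounts to collapsing each run of identical bases into a single base. A minimal sketch follows; the function name is an illustrative choice.

```python
from itertools import groupby


def compress_runs(sequence):
    """Collapse every run of identical bases into one base."""
    return "".join(base for base, _ in groupby(sequence))


print(compress_runs("agcttttcattctgactgcaacgggcaata"))
# agctcatctgactgcacgcata
```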
Such a compressed sequence is still usable because it is unique. For example, if RNA from E. coli is sequenced and results in sequence 3 above, the location of the RNA can be determined. When the entire human genome is sequenced, this method can be used to count individual mRNA molecules directly. The first step is to compress the entire human genomic sequence. Then, the NCBI Basic Local Alignment Search Tool (BLAST), or other program is used to search this compressed human genomic sequence using the results obtained from the sequencing methods of the present disclosure. This method does not require macroscopic handling for high-throughput analysis, and is highly useful for studying gene expression.
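Locating a compressed read in a compressed reference can be sketched as follows. This is a simplified stand-in that uses exact substring matching in place of BLAST, and it records where each compressed base began in the uncompressed reference so that a hit can be reported in original coordinates. The function names are illustrative assumptions.

```python
from itertools import groupby


def compress_with_positions(sequence):
    """Compress runs and remember the 0-based start of each run."""
    compressed, starts = [], []
    position = 0
    for base, run in groupby(sequence):
        compressed.append(base)
        starts.append(position)
        position += len(list(run))
    return "".join(compressed), starts


def locate(read, reference):
    """Return original-coordinate start positions of a compressed read.

    Exact matching is used here for simplicity; a real search would
    tolerate errors, as an alignment tool such as BLAST does.
    """
    compressed_ref, starts = compress_with_positions(reference)
    hits = []
    index = compressed_ref.find(read)
    while index != -1:
        hits.append(starts[index])
        index = compressed_ref.find(read, index + 1)
    return hits


reference = "agcttttcattctgactgcaacgggcaata"
print(locate("cgca", reference))  # [21]
```

Counting how many reads map to each location would then give a direct tally of individual molecules, in the spirit of the mRNA counting described above.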
BLAST (Altschul et al., J. Mol. Biol. 215:403-10, 1990) is available from several sources, including the National Center for Biotechnology Information (NCBI, National Library of Medicine, Building 38A, Room 8N805, Bethesda, MD 20894) and on the Internet, for use in connection with the sequence analysis programs blastp, blastn, blastx, tblastn and tblastx. Additional information can be found at the NCBI web site.
EXAMPLE 4
Microscope Set-Up
This example describes microscope systems that can be used to sequence nucleic acids using the method disclosed herein.
Microscopes
Total internal reflectance (TIR) fluorescence microscopy can be used, for example using the methods and device described by Pierce et al. (Nature, 388:338, 1997; Methods Cell Biol. 58:49, 1999); Funatsu et al. (Nature, 374:555, 1995); Weiss (Science, 283:1676, 1999) and Schutt et al. (U.S. Patent No. 5,017,009). TIR is an optical phenomenon that occurs when light travelling through a high refractive index material strikes an interface with a second material having a lower refractive index at a glancing angle shallower than a critical angle, that is, at an angle of incidence (measured from the normal to the interface) greater than the critical angle. In this situation, all light is reflected back from that interface, except for a microscopic evanescent wave which propagates into the second material for only a short distance.
In TIR fluorescence microscopy, the first material is a glass substrate and the second material is water or another aqueous medium in which an assay is being conducted. When fluorescently labeled materials approach the interface, within the field of the evanescent wave, the fluorescent molecules can be energized, and fluorescence detected which then emanates into the overlying solution. The advantage of TIR is that it produces a superior signal-to-noise ratio, and reduces the photobleaching of the fluorescent molecules since only a thin layer of the sample is exposed.
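How thin the illuminated layer is can be estimated from standard optics. The sketch below takes the angle of incidence as measured from the normal to the interface, so that total internal reflection requires incidence beyond the critical angle arcsin(n_low/n_high). The evanescent intensity then falls to 1/e over a depth of λ / (4π·sqrt(n_high²·sin²θ − n_low²)). The refractive indices (1.52 for glass, 1.33 for water) and the 65° incidence angle are typical illustrative values, not figures from the disclosure.

```python
import math


def critical_angle_deg(n_high, n_low):
    """Critical angle in degrees, measured from the normal to the interface."""
    return math.degrees(math.asin(n_low / n_high))


def penetration_depth_nm(wavelength_nm, incidence_deg, n_high, n_low):
    """1/e intensity depth of the evanescent wave in the low-index medium."""
    theta = math.radians(incidence_deg)
    term = (n_high * math.sin(theta)) ** 2 - n_low ** 2
    if term <= 0:
        raise ValueError("incidence angle is not beyond the critical angle")
    return wavelength_nm / (4 * math.pi * math.sqrt(term))


# Assumed values: glass substrate (n = 1.52), aqueous medium (n = 1.33).
print(round(critical_angle_deg(1.52, 1.33), 1))            # about 61 degrees
print(round(penetration_depth_nm(488, 65.0, 1.52, 1.33)))  # roughly 108 nm
```

Under these assumptions only fluorophores within about a tenth of a micrometer of the glass are excited, which is consistent with the reduced background and photobleaching described above.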
To reduce photobleaching of the fluorophores, a confocal microscopy system can be used. An example of such a confocal laser is the Leica Confocal Spectrophotometer TCS-SP (Leica, Germany). The confocal laser would only illuminate sequencing polymerases, leaving the remainder of the reservoir dark. To accomplish this, one can first scan the entire volume available for polymerases, then program the microscope to only expose those small regions containing functioning polymerases. Another advantage of confocal microscopy is that sequencing reactions could occur in three dimensions. Confocal microscopy excludes planes that are not of interest, allowing one to increase the total number of sequences taken. This would allow more sequencing reactions to be performed and detected per field of view.
Another means that can be used to reduce photobleaching is to incubate the sample in a solution containing an oxygen scavenger system, for example as described by Kitamura et al. (Nature, 397:129, 1999); Okada and Hirokawa (Science, 283:1152, 1999); Harada et al. (J. Mol. Biol. 216:49, 1990). Examples of solutions include: 1% glucose, 0.05 mg/ml glucose oxidase and 0.1 mg/ml catalase; and 0.5% 2-mercaptoethanol, 4.5 mg/ml glucose, 216 μg/ml glucose oxidase, 36 μg/ml catalase, 2 mM ATP in buffer.
Near-field scanning optical microscopy (NSOM) may also be used for the sequencing method disclosed herein. Several methods and devices for NSOM have been described in the prior art (U.S. Patent No. 5,105,305 and PCT Publication WO 97/30366). In NSOM, an aperture having a diameter that is smaller than an optical wavelength is positioned in close proximity (i.e., within less than one wavelength) to the surface of a specimen and scanned over the surface. Light may be either emitted or collected by such an aperture in the end of a probe. Mechanical or piezoelectric means are provided for moving the probe relative to the sample. Light that has interacted with the sample is collected and detected by, for example, a spectrophotometer, and then a CCD camera. The strength of the detected light signal is typically stored, in the form of digital data, as a function of the probe position relative to the sample. The stored data can be converted into a nucleic acid sequence.
NSOM allows optical measurements with sub-wavelength resolution, can measure FRET, and works well in solution (Ha et al, Proc. Natl. Acad. Sci. USA 93:6264-8, 1996). Standard microscopes can be converted to a near-field optical microscope using a device sold by Nanonics Ltd. (Malha, Jerusalem, Israel). The advantage of NSOM is that high resolution of the sample can be obtained. However, since the probe scans the surface of the substrate, the number of sequencing reactions that can be monitored at any one time decreases. To help compensate for this decrease, the rate of nucleotide addition can be decreased by increasing the viscosity of the solution or decreasing the temperature. Kairos Scientific provides a Fluorescence Imaging MicroSpectrophotometer (FIMS). This microscope generates a fluorescence emission spectrum for every pixel in the field of view. Therefore, a unique emission spectrum is generated for each nucleotide as it is added to the complementary nucleic acid strand.
In other embodiments, the method allows for single molecule detection (SMD), for example using the system disclosed by Fang and Tan (Anal. Chem. 1999, 71:3101-5, herein incorporated by reference). Briefly, in this system an optical fiber is used to probe into a fluorophore solution (i.e. the aqueous environment 36 of FIG. 3), or at a solid surface (i.e. the substrate 12 shown in FIG. 3). The optical fiber has total internal reflection, allowing fluorescent molecules close to the surface to be excited by the evanescent wave. The fluorescent signals generated by the fluorophores are detected by an intensified charge-coupled device (ICCD)-based microscope system. Optical fibers can be purchased from Newport Corp. (Irvine, CA).
In yet other embodiments, SMD can be performed using the method disclosed by Unger et al. (BioTechniques, 1999, 27:1008-14, herein incorporated by reference). Briefly, using a standard fluorescent microscope with mercury lamp excitation and a CCD camera, single fluorescent molecules can be observed in air and in aqueous solution, if the molecules are sufficiently separated by dilution.
Sources of Electromagnetic Radiation
In particular embodiments, electromagnetic radiation can be emitted by a laser. The choice of laser used will depend on the specific donor fluorophore used. The wavelength of the laser light is selected to excite the donor fluorophore. For example, wild-type GFP and FITC can be excited by an argon laser at 488 nm. To excite the H9-40 GFP mutant, blue laser diodes which emit at 400 nm (Nichia Chemical Industries Ltd.) or 404 nm (Power Technology Inc., Little Rock, AR) can be used. Other sources of electromagnetic radiation known by those skilled in the art can also be used, for example HeNe lasers and mercury lamps.
Fluidics
The use of a fluid handling system is optional. For simplicity, one may prefer to add all of the necessary reagents, then seal the chamber with a glass coverslip or a drop of oil to prevent desiccation. Alternatively, a slow flow of nucleotide containing solution can be provided to replenish the nucleotides and to remove the products (diphosphate). Such a system would increase nucleotide use, but would maintain steady state conditions, which may increase the length of sequencing runs.
A computer chip that performs the liquid handling can be built that sits on the stage of a fluorescent microscope. Micromachine and microfluidic devices and methods for the dispensing of nanoliter size liquid samples have been previously described (Service, Science 282:399-401, 1998; Burns et al., Science 282:484-7, 1998).
Detectors
A detector acts as the primary tool to capture the emission spectrums generated by the spectrophotometer.
A CCD camera can be used as the detector to capture the image. The emission spectrums generated by the spectrophotometer are collected by the CCD camera, which converts this input into a charge. The charge is converted into a signal by the CCD output. The resulting signal is digitized, as a characteristic signal associated with each type of nucleotide (e.g. A, T/U, C or G), and the digital data is captured into memory, such as the hard-drive of a computer. The sum of the captured data is then processed into a nucleotide sequence. CCD cameras are commercially available from many sources including Kodak (Rochester, NY).
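The step of turning digitized signals into a nucleotide sequence can be sketched as matching each recorded spectrum to the closest known reference spectrum. This is one simple approach (nearest neighbour by squared distance), offered as an assumption rather than the disclosed processing; the three-bin reference spectra below are invented placeholders.

```python
def call_base(spectrum, reference):
    """Return the base whose reference spectrum is closest to the reading."""
    def distance(a, b):
        return sum((x - y) ** 2 for x, y in zip(a, b))
    return min(reference, key=lambda base: distance(spectrum, reference[base]))


def call_sequence(spectra, reference):
    """Call one base per recorded spectrum, in order of acquisition."""
    return "".join(call_base(spectrum, reference) for spectrum in spectra)


# Placeholder reference spectra (intensity in three wavelength bins).
reference = {
    "A": [1.0, 0.0, 0.0],
    "C": [0.0, 1.0, 0.0],
    "G": [0.0, 0.0, 1.0],
    "T": [0.5, 0.5, 0.0],
}
readings = [[0.9, 0.1, 0.0], [0.1, 0.0, 0.8], [0.4, 0.6, 0.1]]
print(call_sequence(readings, reference))  # AGT
```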
With color CCD cameras containing more than 1000 by 1000 pixel fields (for example the Kodak Professional DCS 520 Digital Camera), or even 4096 by 4096 pixel fields (for example the Kodak 16.8i, KAF 16800), it is possible to sequence as many as 1000 nucleic acids in parallel, at a rate of 360 bases per hour. Therefore, molecular sequencing with the TDS method has the potential to sequence entire chromosomes or genomes within a day. If the polymerases are placed in a regular hexagonal array, about 17 pixels would be available for each polymerase.
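The throughput implied by these figures can be worked out directly. The sketch below uses only the two numbers stated above (1000 parallel reactions and 360 bases per hour per reaction); the function names are illustrative.

```python
def bases_per_day(parallel_reactions, bases_per_hour):
    """Total bases read in 24 hours across all parallel reactions."""
    return parallel_reactions * bases_per_hour * 24


def seconds_per_base(bases_per_hour):
    """Average time available to record each incorporation event."""
    return 3600 / bases_per_hour


print(bases_per_day(1000, 360))  # 8640000
print(seconds_per_base(360))     # 10.0
```

At 360 bases per hour each reaction adds one base about every 10 seconds on average, which is the time budget within which the CCD integration and computer recording would need to fit.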
Alternatively, a monochrome CCD containing filters or other means of obtaining a spectrum may be used. This would require that the spectrum be swept. To reduce background noise, any of the CCD cameras may be cooled.
The rate at which sequencing of the nucleic acids occurs can be controlled by many factors. Faster rates can be obtained by increasing the temperature (using a heat stable polymerase) or by running the reactions under high pressure, as in HPLC. The reaction rate can be slowed by making the solution more viscous, by lowering the reaction temperature, or by having fewer reactive nucleotides available. The rate of polymerization may be controlled in this manner not to exceed the rate of the CCD integration and computer recording time. Therefore, the rate of polymerization is controlled in this manner such that the fluorescent signal can be more reliably read by the CCD and interpreted by the computer.
In a disclosed embodiment, the method is performed in a closed chamber device that produces sequencing signals, which enter the computer directly. The method sequences nucleic acids by monitoring the incorporation of individual nucleotides into individual nucleic acid molecules on the molecular level, instead of sequencing nucleic acids by monitoring macromolecular events, such as a pattern on an electrophoresis gel, that is representative of a large population of nucleic acid molecules. Once the reaction has started, no further liquid handling is necessary (but can be added if desired). Therefore, the machine has no macroscopic moving parts during operation, which can facilitate rapid sequencing.
As an alternative to a CCD camera, photomultiplier tubes or an intensified charge-coupled device (ICCD) can be used.
EXAMPLE 5
Computer System
The methods disclosed herein can be performed in the general context of computer-executable instructions of a computer program that runs on a personal computer. Generally, program modules include routines, programs, components, data structures, etc. that perform particular tasks or implement particular abstract data types. Moreover, those skilled in the art will appreciate that the method may be practiced with other computer system configurations, including hand-held devices, multi-processor systems, microprocessor-based or programmable consumer electronics, minicomputers, mainframe computers, and the like. The methods disclosed herein may also be practiced in distributed computing environments where tasks are performed by remote processing devices that are linked through a communications network. In a distributed computing environment, program modules may be located in both local and remote memory storage devices.
The present implementation platform of the methods disclosed herein is a system implemented on a Sun computer having at least one megabyte of main memory and a one gigabyte hard disk drive, with Unix as the user interface. The application software is written in Pascal or other computer language.
EXAMPLE 6
Sequencing of Nucleic Acids
This example describes methods for sequencing nucleic acids from different sources.
Many different sequences can be determined in parallel. One application of the disclosed method is the sequencing of a plasmid. After introducing random nicks into the plasmid, the DNA is added onto a substrate containing fixed fluorescent polymerases (Examples 1 and 2). The entire plasmid is then sequenced from many points. The computer keeps track of all the sequences and automatically assembles them into a complete plasmid sequence.
Another use is for sequencing a randomized region of a nucleic acid. The primer used is specific for a position just outside the randomized region. The randomized nucleic acid is placed onto the field of fixed polymerases. This method allows one to obtain the entire results of a randomization experiment in parallel, thereby saving time and money.
Nick translation can be used to randomly sequence an unknown nucleic acid.
EXAMPLE 7
Clinical Applications
This example describes how the methods disclosed herein can be used for the analysis of pathology specimens. The source of specimen obtained from a subject may include peripheral blood, urine, saliva, tissue biopsy, fine needle aspirates, surgical specimen, amniocentesis samples and autopsy material. The sample is attached to a substrate, such as a glass slide. Care must be taken to preserve the nucleic acids present in the sample. Alternatively, the nucleic acids could be isolated from the sample, and then subjected to TDS. For example, one can use the present method to sequence whole bacterial chromosomes and human genes containing mutations. Using techniques described previously, the presence of viral and/or bacterial pathogens can be detected by the presence of the viral and/or bacterial nucleic acid sequences. In addition, the methods disclosed herein allow for nucleic acid sequencing in situ, by adding a primer, the GFP-polymerase (or GFP-aequorin-polymerase) and the four nucleotides, to a thin tissue slice.
In view of the many possible embodiments to which the principles of the present disclosure may be applied, it should be recognized that the illustrated embodiments are only particular examples and should not be taken as a limitation on the scope of the disclosure. Rather, the scope of the disclosure is defined by the following claims. We therefore claim as our invention all that comes within the scope and spirit of these claims.

Claims

We claim:
1. A method of sequencing a sample nucleic acid molecule, comprising: exposing the sample nucleic acid molecule to an oligonucleotide primer and a polymerase in the presence of a mixture of nucleotides, wherein the polymerase and the nucleotides each comprise a fluorophore which emits a signal corresponding to addition of a particular nucleotide as each nucleotide is incorporated into a synthesized nucleic acid molecule which is complementary to the sample nucleic acid molecule; and detecting the signal as each nucleotide is incorporated into the synthesized nucleic acid molecule.
2. The method of claim 1, wherein the nucleic acid is DNA and the polymerase is a DNA or RNA polymerase.
3. The method of claim 1, wherein the nucleic acid is RNA and the polymerase is reverse transcriptase.
4. The method of claim 1, wherein the polymerase is a Klenow fragment of DNA polymerase I.
5. The method of claim 1, wherein an emission signal from the fluorophore of the polymerase excites the fluorophore of one of the nucleotides, generating a unique emission signal for each nucleotide as the nucleotide is added to the synthesized nucleic acid molecule and wherein a sequence of the emission signals is detected and converted into a nucleic acid sequence.
6. The method of claim 5, wherein the unique emission signal is converted into a signal for a specific nucleotide in a nucleic acid sequence.
7. The method of claim 5, wherein the unique emission signal is generated by the group consisting of luminescence resonance energy transfer (LRET) and fluorescent resonance energy transfer (FRET).
8. The method of claim 1, wherein the fluorophore of the polymerase is a donor fluorophore and the fluorophore of each nucleotide is an acceptor fluorophore.
9. The method of claim 8, wherein each of the acceptor fluorophores is stimulated by an emission from the donor fluorophore, but each of the acceptor fluorophores emits a unique emission signal.
10. The method of claim 9 further comprising exciting the donor fluorophore to emit an excitation signal which stimulates the acceptor fluorophore to emit the unique signal corresponding to addition of a particular nucleotide.
11. The method of claim 10, wherein the donor fluorophore is green fluorescent protein (GFP).
12. The method of claim 10, wherein the acceptor fluorophores are BODIPY, fluorescein, rhodamine green, and Oregon green or derivatives thereof.
13. The method of claim 9, wherein the donor fluorophore is excited by a luminescent molecule.
14. The method of claim 13, wherein the donor fluorophore is GFP and the luminescent molecule is aequorin.
15. The method of claim 9, wherein the donor fluorophore is a luminescent molecule.
16. The method of claim 15, wherein the luminescent molecule is aequorin.
17. The method of claim 1, wherein the polymerase is a GFP-polymerase.
18. The method of claim 8, wherein the donor fluorophore and one of the acceptor fluorophores comprise a FRET pair selected from the group consisting of GFP mutant H9 and its derivatives, H9-40, tetramethylrhodamine, Lissamine™, Texas Red and naphthofluorescein.
19. The method of claim 1, further comprising fixing the polymerase to a substrate.
20. The method of claim 19, wherein the polymerase is fixed to the substrate by a linker molecule comprising a polymerase component and a substrate component.
21. The method of claim 20, wherein the linker is selected from the group consisting of streptavidin-biotin, histidine-Ni, S-tag-S-protein, and glutathione-glutathione-S-transferase (GST).
22. The method of claim 1, further comprising fixing the sample nucleic acid molecule or the oligonucleotide primer to a substrate.
23. The method of claim 1, further comprising performing a plurality of sequencing reactions substantially simultaneously, and detecting the signals from the plurality of sequencing reactions.
24. The method of claim 23, wherein a plurality of polymerases, sample nucleic acid molecules, or oligonucleotide primers are fixed directly or indirectly to the substrate in a predetermined pattern, and detecting the signal further comprises correlating the signal with a nucleic acid molecule corresponding to a predetermined position within that pattern.
25. The method of claim 24, wherein the polymerases, sample nucleic acid molecules, or oligonucleotide primers are fixed to the substrate in the predetermined pattern in channels which have been etched in an orderly array.
26. The method of claim 24, wherein the polymerases, sample nucleic acid molecules, or oligonucleotide primers are fixed to the substrate in the predetermined pattern by micropipetting droplets onto a substrate.
27. The method of claim 24, wherein the micropipetting droplets onto a substrate is performed manually or with an automated arrayer.
28. The method of claim 5, wherein the unique emission signals are detected with a charge-coupled device (CCD) camera and converted into the nucleic acid sequence.
29. The method of claim 5, wherein the unique emission signals are stored in a computer readable medium.
30. A substrate to which is attached a GFP-polymerase.
31. The substrate of claim 30, wherein the GFP-polymerase contains an affinity tag that attaches the GFP-polymerase to the substrate.
32. The substrate of claim 31, wherein the GFP-polymerase is attached to the substrate by a linker.
33. The substrate of claim 30, wherein the GFP-polymerase contains aequorin.
34. A method of sequencing a sample nucleic acid, comprising: attaching a polymerase to a substrate; allowing a sample nucleic acid and an annealed oligonucleotide to bind to the polymerase in the presence of nucleotides for incorporation into a complementary nucleic acid, wherein the polymerase and nucleotides are cooperatively labeled with donor and acceptor fluorophores that emit a unique signal when a particular nucleotide is incorporated into the complementary nucleic acid; detecting a sequential series of the unique signals as the nucleotides are sequentially added to the complementary nucleic acid; and converting the series of the unique signals into a nucleic acid sequence.
35. A method of sequencing a sample nucleic acid, comprising: attaching a sample nucleic acid to a substrate; adding an oligonucleotide primer; allowing the oligonucleotide primer to anneal to the attached sample nucleic acid; adding a polymerase in the presence of nucleotides for incorporation into a complementary nucleic acid wherein the polymerase and nucleotides are cooperatively labeled with donor and acceptor fluorophores that emit a unique signal when a particular nucleotide is incorporated into the complementary nucleic acid; allowing the polymerase to bind to the nucleic acid; detecting a sequential series of the unique signals as the nucleotides are sequentially added to the complementary nucleic acid; and converting the series of the unique signals into a nucleic acid sequence.
36. A method of sequencing a sample nucleic acid, comprising: attaching an oligonucleotide primer to a substrate; adding a sample nucleic acid to be sequenced; allowing the oligonucleotide primer to anneal to the sample nucleic acid; adding a polymerase in the presence of nucleotides for incorporation into a complementary nucleic acid wherein the polymerase and nucleotides are cooperatively labeled with donor and acceptor fluorophores that emit a unique signal when a particular nucleotide is incorporated into the complementary nucleic acid; allowing the polymerase to bind to the nucleic acid; detecting a sequential series of the unique signals as the nucleotides are sequentially added to the complementary nucleic acid; and converting the series of the unique signals into a nucleic acid sequence.
37. A device for sequencing a nucleic acid molecule comprising: a substrate to which a polymerase, oligonucleotide primer, or sample nucleic acid is attached wherein the polymerase includes a donor fluorophore; a viewing means for viewing the polymerase; a detection means for detecting a characteristic signal from an acceptor fluorophore carried by a corresponding nucleotide, as the nucleotide is added to the nucleic acid molecule by the polymerase; an electromagnetic radiation source that excites the donor fluorophore but not the acceptor fluorophore; and a decoding means for converting a series of characteristic signals into a nucleic acid sequence.
38. The device of claim 37, wherein the substrate comprises a glass microscope slide.
39. The device of claim 37, wherein the electromagnetic radiation source comprises a laser.
40. The device of claim 37, wherein the viewing means comprises a microscope objective.
41. The device of claim 37, wherein the detection means comprises a CCD camera.
42. The device of claim 37, wherein the decoding means for converting the unique signal into a nucleic acid sequence comprises a digital computer.
43. The device of claim 37, wherein the substrate comprises a three-dimensional matrix.
44. A device for sequencing a nucleic acid molecule comprising: a glass microscope slide to which an oligonucleotide primer, sample nucleic acid, or polymerase is attached, wherein the polymerase includes a donor fluorophore; a laser positioned to stimulate the donor fluorophore with laser light at a first wavelength range which induces the donor fluorophore to emit a signal at a second wavelength range that stimulates an acceptor fluorophore but not the donor fluorophore, and the signal emitted by the acceptor fluorophore is unique to each type of nucleotide, further wherein the first wavelength does not stimulate the acceptor fluorophore to emit the signal characteristic of the nucleotide; a microscope objective positioned for viewing a sequence of signals emitted by the acceptor fluorophores as nucleotides are added to a sequence by the polymerase, wherein the sequence of signals corresponds to a nucleic acid sequence; a spectrophotometer that converts the sequence of signals into a series of spectrographic signals of the acceptor fluorophore; a CCD camera for detecting the sequence of signals; and a digital computer which converts the sequence of signals into the nucleic acid sequence.
45. A device for sequencing a nucleic acid molecule comprising: a glass microscope slide to which a polymerase is attached, wherein the polymerase includes a donor fluorophore; a laser positioned to stimulate the donor fluorophore with laser light at a first wavelength range which induces the donor fluorophore to emit a signal at a second wavelength range that stimulates an acceptor fluorophore but not the donor fluorophore, and the signal emitted by the acceptor fluorophore is unique to each type of nucleotide, further wherein the first wavelength does not stimulate the acceptor fluorophore to emit the signal characteristic of the nucleotide; a microscope objective positioned for viewing a sequence of signals emitted by the acceptor fluorophores as nucleotides are added to a sequence by the polymerase, wherein the sequence of signals corresponds to a nucleic acid sequence; a spectrophotometer that converts the sequence of signals into a series of spectrographic signals of the acceptor fluorophore; a CCD camera for detecting the sequence of signals; and a digital computer which converts the sequence of signals into the nucleic acid sequence.
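
The decoding step recited in the claims above (converting a series of unique emission signals into a nucleic acid sequence, for example by a digital computer) might be sketched as follows. This is a minimal illustration under stated assumptions, not the claimed implementation: the emission peak wavelengths assigned to each nucleotide, the tolerance, and all function names are hypothetical placeholders and are not taken from the disclosure.

```python
# Hypothetical emission peaks (nm) for four acceptor-labeled nucleotides.
# Placeholder values chosen for illustration; not from the disclosure.
ACCEPTOR_PEAKS_NM = {"A": 520.0, "C": 550.0, "G": 580.0, "T": 610.0}
COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A", "N": "N"}


def call_base(peak_nm, tolerance_nm=10.0):
    """Return the base whose assumed acceptor peak is nearest to peak_nm,
    or "N" when no assumed peak lies within tolerance_nm."""
    base, reference = min(
        ACCEPTOR_PEAKS_NM.items(), key=lambda item: abs(item[1] - peak_nm)
    )
    return base if abs(reference - peak_nm) <= tolerance_nm else "N"


def decode_signals(peaks_nm, tolerance_nm=10.0):
    """Convert a time-ordered series of emission peaks into the
    synthesized strand, written 5'->3'."""
    return "".join(call_base(peak, tolerance_nm) for peak in peaks_nm)


def template_from_synthesized(synthesized):
    """Reverse-complement the synthesized strand to obtain the sample
    (template) strand, written 5'->3'."""
    return "".join(COMPLEMENT[base] for base in reversed(synthesized))


if __name__ == "__main__":
    observed = [521.3, 608.0, 552.1, 700.0, 579.5]  # made-up detector readings
    synthesized = decode_signals(observed)
    print(synthesized, template_from_synthesized(synthesized))
```

A nearest-peak rule with a tolerance is only one possible decoding scheme; readings that match no assumed acceptor are reported as N rather than forced onto a base.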
PCT/US2000/023736 1999-08-30 2000-08-29 High speed parallel molecular nucleic acid sequencing WO2001016375A2 (en)

Priority Applications (5)

Application Number Priority Date Filing Date Title
US10/070,053 US6982146B1 (en) 1999-08-30 2000-08-29 High speed parallel molecular nucleic acid sequencing
AU70868/00A AU7086800A (en) 1999-08-30 2000-08-29 High speed parallel molecular nucleic acid sequencing
US11/204,367 US20060292583A1 (en) 1999-08-30 2005-08-12 High speed parallel molecular nucleic acid sequencing
US12/196,139 US20090061447A1 (en) 1999-08-30 2008-08-21 High speed parallel molecular nucleic acid sequencing
US12/886,686 US8535881B2 (en) 1999-08-30 2010-09-21 High speed parallel molecular nucleic acid sequencing

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US15158099P 1999-08-30 1999-08-30
US60/151,580 1999-08-30

Related Child Applications (2)

Application Number Title Priority Date Filing Date
US10070053 A-371-Of-International 2000-08-29
US11/204,367 Division US20060292583A1 (en) 1999-08-30 2005-08-12 High speed parallel molecular nucleic acid sequencing

Publications (2)

Publication Number Publication Date
WO2001016375A2 true WO2001016375A2 (en) 2001-03-08
WO2001016375A3 WO2001016375A3 (en) 2001-10-04

Family

ID=22539397

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2000/023736 WO2001016375A2 (en) 1999-08-30 2000-08-29 High speed parallel molecular nucleic acid sequencing

Country Status (2)

Country Link
AU (1) AU7086800A (en)
WO (1) WO2001016375A2 (en)

Cited By (78)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2002088381A2 (en) * 2001-04-27 2002-11-07 Genovoxx Gmbh Method for determining gene expression
WO2002095070A2 (en) * 2001-05-18 2002-11-28 Medical Biosystems Litd. Polynucleotide sequencing method
EP1430156A2 (en) * 2001-09-24 2004-06-23 Intel Corporation Nucleic acid sequencing by raman monitoring of uptake of precursors during molecular replication
US6927779B2 (en) * 2002-05-13 2005-08-09 Large Scale Biology Corporation Web-based well plate information retrieval and display system
WO2006013110A1 (en) * 2004-08-06 2006-02-09 Rudolf Rigler Parallel high throughput single molecule sequencing process
US7033764B2 (en) 1999-05-19 2006-04-25 Cornell Research Foundation, Inc. Method for sequencing nucleic acid molecules
WO2006044078A2 (en) 2004-09-17 2006-04-27 Pacific Biosciences Of California, Inc. Apparatus and method for analysis of molecules
US7170050B2 (en) 2004-09-17 2007-01-30 Pacific Biosciences Of California, Inc. Apparatus and methods for optical analysis of molecules
WO2007070572A2 (en) * 2005-12-12 2007-06-21 The Government Of The United States Of America, As Represented By The Secretary Of The Department Of Health And Human Services Probe for nucleic acid sequencing and methods of use
US7405281B2 (en) 2005-09-29 2008-07-29 Pacific Biosciences Of California, Inc. Fluorescent nucleotide analogs and uses therefor
US7427673B2 (en) 2001-12-04 2008-09-23 Illumina Cambridge Limited Labelled nucleotides
US7541444B2 (en) 2002-08-23 2009-06-02 Illumina Cambridge Limited Modified nucleotides
US7563574B2 (en) 2006-03-31 2009-07-21 Pacific Biosciences Of California, Inc. Methods, systems and compositions for monitoring enzyme activity and applications thereof
EP2100971A2 (en) 2000-07-07 2009-09-16 Visigen Biotechnologies, Inc. Real-time sequence determination
US7592435B2 (en) 2005-08-19 2009-09-22 Illumina Cambridge Limited Modified nucleosides and nucleotides and uses thereof
US7626704B2 (en) 2006-02-13 2009-12-01 Pacific Biosciences Of California, Inc. Methods and systems for simultaneous real-time monitoring of optical signals from multiple sources
EP2126072A1 (en) * 2007-02-21 2009-12-02 Life Technologies Corporation Materials and methods for single molecule nucleic acid sequencing
US7630073B2 (en) 2006-02-13 2009-12-08 Pacific Biosciences Of California Methods and systems for simultaneous real-time monitoring of optical signals from multiple sources
US7645596B2 (en) 1998-05-01 2010-01-12 Arizona Board Of Regents Method of determining the nucleotide sequence of oligonucleotides and DNA molecules
US7650351B2 (en) 1999-11-05 2010-01-19 Herzenberg Leonard A Internet-linked system for directory protocol based data storage, retrieval and analysis
US7666593B2 (en) 2005-08-26 2010-02-23 Helicos Biosciences Corporation Single molecule sequencing of captured nucleic acids
US7692783B2 (en) 2006-02-13 2010-04-06 Pacific Biosciences Of California Methods and systems for simultaneous real-time monitoring of optical signals from multiple sources
US7763423B2 (en) 2005-09-30 2010-07-27 Pacific Biosciences Of California, Inc. Substrates having low density reactive groups for monitoring enzyme activity
US7772384B2 (en) 2001-12-04 2010-08-10 Illumina Cambridge Limited Labelled nucleotides
US7805081B2 (en) 2005-08-11 2010-09-28 Pacific Biosciences Of California, Inc. Methods and systems for monitoring multiple optical signals from a single source
WO2010108638A1 (en) 2009-03-23 2010-09-30 Erasmus University Medical Center Rotterdam Tumour gene profile
US7807349B2 (en) 2002-11-29 2010-10-05 Anima Cell Metrology Protein synthesis monitoring (PSM)
US7820983B2 (en) 2006-09-01 2010-10-26 Pacific Biosciences Of California, Inc. Substrates, systems and methods for analyzing materials
US7888073B2 (en) 2003-07-24 2011-02-15 Gen-Probe Incorporated Method for sequencing nucleic acid molecules
EP2270205A3 (en) * 2003-10-20 2011-03-02 Isis Innovation Ltd Nucleic acid sequencing methods
US7981604B2 (en) 2004-02-19 2011-07-19 California Institute Of Technology Methods and kits for analyzing polynucleotide sequences
US7993895B2 (en) 2005-12-02 2011-08-09 Pacific Biosciences Of California, Inc. Mitigation of photodamage in analytical reactions
US8193123B2 (en) 2006-03-30 2012-06-05 Pacific Biosciences Of California, Inc. Articles having localized molecules disposed thereon and methods of producing same
US8207509B2 (en) 2006-09-01 2012-06-26 Pacific Biosciences Of California, Inc. Substrates, systems and methods for analyzing materials
US8274040B2 (en) 2008-09-16 2012-09-25 Pacific Biosciences Of California, Inc. Substrates and optical system having at least one optical waveguide, at least one nanometer-scale aperture and at least one lens array and methods of use thereof
US8314216B2 (en) 2000-12-01 2012-11-20 Life Technologies Corporation Enzymatic nucleic acid synthesis: compositions and methods for inhibiting pyrophosphorolysis
US8442773B2 (en) 2002-11-29 2013-05-14 Anima Cell Metrology Protein synthesis monitoring (PSM)
WO2013079215A1 (en) 2011-12-01 2013-06-06 Erasmus University Medical Center Rotterdam Method for classifying tumour cells
US8465699B2 (en) 2010-02-19 2013-06-18 Pacific Biosciences Of California, Inc. Illumination of integrated analytical systems
US8501406B1 (en) 2009-07-14 2013-08-06 Pacific Biosciences Of California, Inc. Selectively functionalized arrays
US8535881B2 (en) 1999-08-30 2013-09-17 The United States Of America As Represented By The Secretary Of The Department Of Health And Human Services High speed parallel molecular nucleic acid sequencing
US8658434B2 (en) 2009-10-28 2014-02-25 Biotium, Inc. Fluorescent pyrene compounds
US8703734B2 (en) 2005-12-12 2014-04-22 The United States Of America, As Represented By The Secretary, Department Of Health And Human Services Nanoprobes for detection or modification of molecules
US8834847B2 (en) 2010-08-12 2014-09-16 Pacific Biosciences Of California, Inc. Photodamage mitigation compounds and systems
US8994946B2 (en) 2010-02-19 2015-03-31 Pacific Biosciences Of California, Inc. Integrated analytical system and method
US9012171B2 (en) 2007-10-09 2015-04-21 Anima Cell Metrology, Inc. Systems and methods for measuring translation activity in viable cells
US9012150B2 (en) 2004-05-26 2015-04-21 Anima Cell Metrology Methods for evaluating ribonucleotide sequences
US9034576B2 (en) 2009-09-24 2015-05-19 Anima Cell Metrology Inc. Systems and methods for measuring translation of target proteins in cells
US9051612B2 (en) 2006-09-28 2015-06-09 Illumina, Inc. Compositions and methods for nucleotide sequencing
US9097667B2 (en) 2007-12-14 2015-08-04 Biotium, Inc. Fluorescent compounds
US9096898B2 (en) 1998-05-01 2015-08-04 Life Technologies Corporation Method of determining the nucleotide sequence of oligonucleotides and DNA molecules
US9223084B2 (en) 2012-12-18 2015-12-29 Pacific Biosciences Of California, Inc. Illumination of optical analytical devices
US9372308B1 (en) 2012-06-17 2016-06-21 Pacific Biosciences Of California, Inc. Arrays of integrated analytical devices and methods for production
US9606068B2 (en) 2014-08-27 2017-03-28 Pacific Biosciences Of California, Inc. Arrays of integrated analytical devices
US9624539B2 (en) 2011-05-23 2017-04-18 The Trustees Of Columbia University In The City Of New York DNA sequencing by synthesis using Raman and infrared spectroscopy detection
US9624540B2 (en) 2013-02-22 2017-04-18 Pacific Biosciences Of California, Inc. Integrated illumination of optical analytical devices
US9657344B2 (en) 2003-11-12 2017-05-23 Fluidigm Corporation Short cycle methods for sequencing polynucleotides
US9670539B2 (en) 2007-10-19 2017-06-06 The Trustees Of Columbia University In The City Of New York Synthesis of cleavable fluorescent nucleotides as reversible terminators for DNA sequencing by synthesis
US20170166961A1 (en) 2013-03-15 2017-06-15 Illumina Cambridge Limited Modified nucleosides or nucleotides
US9708358B2 (en) 2000-10-06 2017-07-18 The Trustees Of Columbia University In The City Of New York Massive parallel method for decoding DNA and RNA
US9718852B2 (en) 2000-10-06 2017-08-01 The Trustees Of Columbia University In The City Of New York Massive parallel method for decoding DNA and RNA
US9951385B1 (en) 2017-04-25 2018-04-24 Omniome, Inc. Methods and apparatus that increase sequencing-by-binding efficiency
WO2018125759A1 (en) * 2016-12-30 2018-07-05 Omniome, Inc. Method and system employing distinguishable polymerases for detecting ternary complexes and identifying cognate nucleotides
US10077470B2 (en) 2015-07-21 2018-09-18 Omniome, Inc. Nucleic acid sequencing methods and systems
US10161003B2 (en) 2017-04-25 2018-12-25 Omniome, Inc. Methods and apparatus that increase sequencing-by-binding efficiency
US10246744B2 (en) 2016-08-15 2019-04-02 Omniome, Inc. Method and system for sequencing nucleic acids
US10260094B2 (en) 2007-10-19 2019-04-16 The Trustees Of Columbia University In The City Of New York DNA sequencing with non-fluorescent nucleotide reversible terminators and cleavable label modified nucleotide terminators
US10294514B2 (en) 2016-04-29 2019-05-21 Omniome, Inc. Sequencing method employing ternary complex destabilization to identify cognate nucleotides
US10365434B2 (en) 2015-06-12 2019-07-30 Pacific Biosciences Of California, Inc. Integrated target waveguide devices and systems for optical coupling
US10428378B2 (en) 2016-08-15 2019-10-01 Omniome, Inc. Sequencing method for rapid identification and processing of cognate nucleotide pairs
US10487102B2 (en) 2002-08-23 2019-11-26 Illumina Cambridge Limited Labelled nucleotides
US10487356B2 (en) 2015-03-16 2019-11-26 Pacific Biosciences Of California, Inc. Integrated devices and systems for free-space optical coupling
US10648026B2 (en) 2013-03-15 2020-05-12 The Trustees Of Columbia University In The City Of New York Raman cluster tagged molecules for biological imaging
US10975427B2 (en) 2017-01-20 2021-04-13 Omniome, Inc. Process for cognate nucleotide detection in a nucleic acid sequencing workflow
US10995111B2 (en) 2003-08-22 2021-05-04 Illumina Cambridge Limited Labelled nucleotides
US11098353B2 (en) 2006-12-01 2021-08-24 The Trustees Of Columbia University In The City Of New York Four-color DNA sequencing by synthesis using cleavable fluorescent nucleotide reversible terminators
US11193166B2 (en) 2017-10-19 2021-12-07 Omniome, Inc. Simultaneous background reduction and complex stabilization in binding assay workflows
US11705217B2 (en) 2008-03-28 2023-07-18 Pacific Biosciences Of California, Inc. Sequencing using concatemers of copies of sense and antisense strands

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5614386A (en) * 1995-06-23 1997-03-25 Baylor College Of Medicine Alternative dye-labeled primers for automated DNA sequencing
US5674743A (en) * 1993-02-01 1997-10-07 Seq, Ltd. Methods and apparatus for DNA sequencing
US5707804A (en) * 1994-02-01 1998-01-13 The Regents Of The University Of California Primers labeled with energy transfer coupled dyes for DNA sequencing
WO1999005315A2 (en) * 1997-07-28 1999-02-04 Medical Biosystems Ltd. Nucleic acid sequence analysis
WO2000053805A1 (en) * 1999-03-10 2000-09-14 Asm Scientific, Inc. A method for direct nucleic acid sequencing
WO2000070073A1 (en) * 1999-05-19 2000-11-23 Cornell Research Foundation, Inc. Method for sequencing nucleic acid molecules

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
GORDON GW ET AL: "Quantitative fluorescence resonance energy transfer measurements using fluorescence microscopy" BIOPHYSICAL JOURNAL, vol. 74, May 1998 (1998-05), pages 2702-2713, XP000990953 *

Cited By (225)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9725764B2 (en) 1998-05-01 2017-08-08 Life Technologies Corporation Method of determining the nucleotide sequence of oligonucleotides and DNA molecules
US7645596B2 (en) 1998-05-01 2010-01-12 Arizona Board Of Regents Method of determining the nucleotide sequence of oligonucleotides and DNA molecules
US9096898B2 (en) 1998-05-01 2015-08-04 Life Technologies Corporation Method of determining the nucleotide sequence of oligonucleotides and DNA molecules
US9212393B2 (en) 1998-05-01 2015-12-15 Life Technologies Corporation Method of determining the nucleotide sequence of oligonucleotides and DNA molecules
US9458500B2 (en) 1998-05-01 2016-10-04 Life Technologies Corporation Method of determining the nucleotide sequence of oligonucleotides and DNA molecules
US10214774B2 (en) 1998-05-01 2019-02-26 Life Technologies Corporation Method of determining the nucleotide sequence of oligonucleotides and DNA molecules
US10208341B2 (en) 1998-05-01 2019-02-19 Life Technologies Corporation Method of determining the nucleotide sequence of oligonucleotides and DNA molecules
US9957561B2 (en) 1998-05-01 2018-05-01 Life Technologies Corporation Method of determining the nucleotide sequence of oligonucleotides and DNA molecules
US7361466B2 (en) 1999-05-19 2008-04-22 Cornell Research Foundation, Inc. Nucleic acid analysis using terminal-phosphate-labeled nucleotides
US7416844B2 (en) 1999-05-19 2008-08-26 Cornell Research Foundation, Inc. Composition for nucleic acid sequencing
US7052847B2 (en) 1999-05-19 2006-05-30 Cornell Research Foundation, Inc. Method for sequencing nucleic acid molecules
US7056661B2 (en) 1999-05-19 2006-06-06 Cornell Research Foundation, Inc. Method for sequencing nucleic acid molecules
US7056676B2 (en) 1999-05-19 2006-06-06 Cornell Research Foundation, Inc. Method for sequencing nucleic acid molecules
US7943305B2 (en) 1999-05-19 2011-05-17 Cornell Research Foundation High speed nucleic acid sequencing
US7943307B2 (en) 1999-05-19 2011-05-17 Cornell Research Foundation Methods for analyzing nucleic acid sequences
US7485424B2 (en) 1999-05-19 2009-02-03 Cornell Research Foundation, Inc. Labeled nucleotide phosphate (NP) probes
US7033764B2 (en) 1999-05-19 2006-04-25 Cornell Research Foundation, Inc. Method for sequencing nucleic acid molecules
US8535881B2 (en) 1999-08-30 2013-09-17 The United States Of America As Represented By The Secretary Of The Department Of Health And Human Services High speed parallel molecular nucleic acid sequencing
US7650351B2 (en) 1999-11-05 2010-01-19 Herzenberg Leonard A Internet-linked system for directory protocol based data storage, retrieval and analysis
EP2100971A2 (en) 2000-07-07 2009-09-16 Visigen Biotechnologies, Inc. Real-time sequence determination
US10407459B2 (en) 2000-10-06 2019-09-10 The Trustees Of Columbia University In The City Of New York Massive parallel method for decoding DNA and RNA
US10662472B2 (en) 2000-10-06 2020-05-26 The Trustees Of Columbia University In The City Of New York Massive parallel method for decoding DNA and RNA
US9708358B2 (en) 2000-10-06 2017-07-18 The Trustees Of Columbia University In The City Of New York Massive parallel method for decoding DNA and RNA
US9718852B2 (en) 2000-10-06 2017-08-01 The Trustees Of Columbia University In The City Of New York Massive parallel method for decoding DNA and RNA
US10457984B2 (en) 2000-10-06 2019-10-29 The Trustees Of Columbia University In The City Of New York Massive parallel method for decoding DNA and RNA
US10407458B2 (en) 2000-10-06 2019-09-10 The Trustees Of Columbia University In The City Of New York Massive parallel method for decoding DNA and RNA
US10669582B2 (en) 2000-10-06 2020-06-02 The Trustees Of Columbia University In The City Of New York Massive parallel method for decoding DNA and RNA
US10428380B2 (en) 2000-10-06 2019-10-01 The Trustees Of Columbia University In The City Of New York Massive parallel method for decoding DNA and RNA
US9719139B2 (en) 2000-10-06 2017-08-01 The Trustees Of Columbia University In The City Of New York Massive parallel method for decoding DNA and RNA
US10669577B2 (en) 2000-10-06 2020-06-02 The Trustees Of Columbia University In The City Of New York Massive parallel method for decoding DNA and RNA
US10648028B2 (en) 2000-10-06 2020-05-12 The Trustees Of Columbia University In The City Of New York Massive parallel method for decoding DNA and RNA
US10633700B2 (en) 2000-10-06 2020-04-28 The Trustees Of Columbia University In The City Of New York Massive parallel method for decoding DNA and RNA
US10577652B2 (en) 2000-10-06 2020-03-03 The Trustees Of Columbia University In The City Of New York Massive parallel method for decoding DNA and RNA
US9868985B2 (en) 2000-10-06 2018-01-16 The Trustees Of Columbia University In The City Of New York Massive parallel method for decoding DNA and RNA
US9725480B2 (en) 2000-10-06 2017-08-08 The Trustees Of Columbia University In The City Of New York Massive parallel method for decoding DNA and RNA
US10570446B2 (en) 2000-10-06 2020-02-25 The Trustee Of Columbia University In The City Of New York Massive parallel method for decoding DNA and RNA
US10435742B2 (en) 2000-10-06 2019-10-08 The Trustees Of Columbia University In The City Of New York Massive parallel method for decoding DNA and RNA
US9845500B2 (en) 2000-12-01 2017-12-19 Life Technologies Corporation Enzymatic nucleic acid synthesis: compositions and methods for inhibiting pyrophosphorolysis
US8648179B2 (en) 2000-12-01 2014-02-11 Life Technologies Corporation Enzymatic nucleic acid synthesis: compositions and methods for inhibiting pyrophosphorolysis
US9243284B2 (en) 2000-12-01 2016-01-26 Life Technologies Corporation Enzymatic nucleic acid synthesis: compositions and methods for inhibiting pyrophosphorolysis
US8314216B2 (en) 2000-12-01 2012-11-20 Life Technologies Corporation Enzymatic nucleic acid synthesis: compositions and methods for inhibiting pyrophosphorolysis
WO2002088381A2 (en) * 2001-04-27 2002-11-07 Genovoxx Gmbh Method for determining gene expression
WO2002088381A3 (en) * 2001-04-27 2003-11-27 Genovoxx Gmbh Method for determining gene expression
WO2002095070A3 (en) * 2001-05-18 2003-09-18 Medical Biosystems Litd Polynucleotide sequencing method
US9476094B2 (en) 2001-05-18 2016-10-25 Gen-Probe Incorporated Polynucleotide sequencing method
WO2002095070A2 (en) * 2001-05-18 2002-11-28 Medical Biosystems Litd. Polynucleotide sequencing method
EP1430156A2 (en) * 2001-09-24 2004-06-23 Intel Corporation Nucleic acid sequencing by raman monitoring of uptake of precursors during molecular replication
EP1837407A2 (en) * 2001-09-24 2007-09-26 Intel Corporation Nucleic acid sequencing by monitoring of uptake of precursors during molecular replication, with molecule dispenser
EP1837407A3 (en) * 2001-09-24 2007-11-21 Intel Corporation Nucleic acid sequencing by monitoring of uptake of precursors during molecular replication, with molecule dispenser
EP1430156A4 (en) * 2001-09-24 2005-04-27 Intel Corp Nucleic acid sequencing by raman monitoring of uptake of precursors during molecular replication
US7465578B2 (en) 2001-09-24 2008-12-16 Intel Corporation Nucleic acid sequencing by Raman monitoring of uptake of precursors during molecular replication
US7364851B2 (en) 2001-09-24 2008-04-29 Intel Corporation Nucleic acid sequencing by Raman monitoring of uptake of precursors during molecular replication
US7785796B2 (en) 2001-12-04 2010-08-31 Illumina Cambridge Limited Labelled nucleotides
US8148064B2 (en) 2001-12-04 2012-04-03 Illumina Cambridge Limited Labelled nucleotides
US8158346B2 (en) 2001-12-04 2012-04-17 Illumina Cambridge Limited Labelled nucleotides
US7427673B2 (en) 2001-12-04 2008-09-23 Illumina Cambridge Limited Labelled nucleotides
US7566537B2 (en) 2001-12-04 2009-07-28 Illumina Cambridge Limited Labelled nucleotides
US10519496B2 (en) 2001-12-04 2019-12-31 Illumina Cambridge Limited Labelled nucleotides
US7772384B2 (en) 2001-12-04 2010-08-10 Illumina Cambridge Limited Labelled nucleotides
US9605310B2 (en) 2001-12-04 2017-03-28 Illumina Cambridge Limited Labelled nucleotides
US10480025B2 (en) 2001-12-04 2019-11-19 Illumina Cambridge Limited Labelled nucleotides
US6927779B2 (en) * 2002-05-13 2005-08-09 Large Scale Biology Corporation Web-based well plate information retrieval and display system
US11008359B2 (en) 2002-08-23 2021-05-18 Illumina Cambridge Limited Labelled nucleotides
US10513731B2 (en) 2002-08-23 2019-12-24 Illumina Cambridge Limited Modified nucleotides
US8071739B2 (en) 2002-08-23 2011-12-06 Illumina Cambridge Limited Modified nucleotides
US10487102B2 (en) 2002-08-23 2019-11-26 Illumina Cambridge Limited Labelled nucleotides
US7541444B2 (en) 2002-08-23 2009-06-02 Illumina Cambridge Limited Modified nucleotides
US7807349B2 (en) 2002-11-29 2010-10-05 Anima Cell Metrology Protein synthesis monitoring (PSM)
US8442773B2 (en) 2002-11-29 2013-05-14 Anima Cell Metrology Protein synthesis monitoring (PSM)
US7771973B2 (en) 2002-12-23 2010-08-10 Illumina Cambridge Limited Modified nucleotides
US7888073B2 (en) 2003-07-24 2011-02-15 Gen-Probe Incorporated Method for sequencing nucleic acid molecules
US8592183B2 (en) 2003-07-24 2013-11-26 Gen-Probe Incorporated Apparatus and method for sequencing nucleic acid molecules
US11028115B2 (en) 2003-08-22 2021-06-08 Illumina Cambridge Limited Labelled nucleotides
US10995111B2 (en) 2003-08-22 2021-05-04 Illumina Cambridge Limited Labelled nucleotides
US11028116B2 (en) 2003-08-22 2021-06-08 Illumina Cambridge Limited Labelled nucleotides
EP2270205A3 (en) * 2003-10-20 2011-03-02 Isis Innovation Ltd Nucleic acid sequencing methods
US9657344B2 (en) 2003-11-12 2017-05-23 Fluidigm Corporation Short cycle methods for sequencing polynucleotides
US7981604B2 (en) 2004-02-19 2011-07-19 California Institute Of Technology Methods and kits for analyzing polynucleotide sequences
US9012150B2 (en) 2004-05-26 2015-04-21 Anima Cell Metrology Methods for evaluating ribonucleotide sequences
WO2006013110A1 (en) * 2004-08-06 2006-02-09 Rudolf Rigler Parallel high throughput single molecule sequencing process
US7754427B2 (en) 2004-08-06 2010-07-13 Rudolf Rigler Parallel high throughput single molecule sequencing process
US7906284B2 (en) 2004-09-17 2011-03-15 Pacific Biosciences Of California, Inc. Arrays of optical confinements and uses thereof
US9588051B2 (en) 2004-09-17 2017-03-07 Pacific Biosciences Of California, Inc. Apparatus and method for performing nucleic acid analysis
US7170050B2 (en) 2004-09-17 2007-01-30 Pacific Biosciences Of California, Inc. Apparatus and methods for optical analysis of molecules
US9709503B2 (en) 2004-09-17 2017-07-18 Pacific Biosciences Of California, Inc. Apparatus and method for performing nucleic acid analysis
US8709725B2 (en) 2004-09-17 2014-04-29 Pacific Biosciences Of California, Inc. Arrays of optical confinements and uses thereof
WO2006044078A2 (en) 2004-09-17 2006-04-27 Pacific Biosciences Of California, Inc. Apparatus and method for analysis of molecules
EP3415641A1 (en) 2004-09-17 2018-12-19 Pacific Biosciences Of California, Inc. Method for analysis of molecules
US7805081B2 (en) 2005-08-11 2010-09-28 Pacific Biosciences Of California, Inc. Methods and systems for monitoring multiple optical signals from a single source
US7816503B2 (en) 2005-08-19 2010-10-19 Illumina Cambridge Limited Modified nucleosides and nucleotides and uses thereof
US7592435B2 (en) 2005-08-19 2009-09-22 Illumina Cambridge Limited Modified nucleosides and nucleotides and uses thereof
US7666593B2 (en) 2005-08-26 2010-02-23 Helicos Biosciences Corporation Single molecule sequencing of captured nucleic acids
US9868978B2 (en) 2005-08-26 2018-01-16 Fluidigm Corporation Single molecule sequencing of captured nucleic acids
US7405281B2 (en) 2005-09-29 2008-07-29 Pacific Biosciences Of California, Inc. Fluorescent nucleotide analogs and uses therefor
US8058031B2 (en) 2005-09-29 2011-11-15 Pacific Biosciences Of California, Inc. Labeled nucleotide analogs and uses therefor
US7777013B2 (en) 2005-09-29 2010-08-17 Pacific Biosciences Of California, Inc. Labeled nucleotide analogs and uses therefor
US7993891B2 (en) 2005-09-30 2011-08-09 Pacific Biosciences Of California, Inc. Method for binding reactive groups in observation area of zero mode waveguide
US8137942B2 (en) 2005-09-30 2012-03-20 Pacific Biosciences Of California, Inc. Method of preparing a modified surface
US7763423B2 (en) 2005-09-30 2010-07-27 Pacific Biosciences Of California, Inc. Substrates having low density reactive groups for monitoring enzyme activity
US7998717B2 (en) 2005-12-02 2011-08-16 Pacific Biosciences Of California, Inc. Mitigation of photodamage in analytical reactions
US8071346B2 (en) 2005-12-02 2011-12-06 Pacific Biosciences Of California, Inc. System for the mitigation of photodamage in analytical reactions
US8415128B2 (en) 2005-12-02 2013-04-09 Pacific Biosciences Of California, Inc. Mitigation of photodamage in analytical reactions
US7993895B2 (en) 2005-12-02 2011-08-09 Pacific Biosciences Of California, Inc. Mitigation of photodamage in analytical reactions
US8703734B2 (en) 2005-12-12 2014-04-22 The United States Of America, As Represented By The Secretary, Department Of Health And Human Services Nanoprobes for detection or modification of molecules
US8344121B2 (en) 2005-12-12 2013-01-01 The United States Of America As Represented By The Secretary Of The Department Of Health And Human Services Nanoprobes for detection or modification of molecules
WO2007070572A3 (en) * 2005-12-12 2008-04-17 Us Gov Health & Human Serv Probe for nucleic acid sequencing and methods of use
WO2007070572A2 (en) * 2005-12-12 2007-06-21 The Government Of The United States Of America, As Represented By The Secretary Of The Department Of Health And Human Services Probe for nucleic acid sequencing and methods of use
EP2311982A1 (en) * 2005-12-12 2011-04-20 The Government of the United States of America as represented by the Secretary of the Department of Health and Human Services Method of sequencing nucleic acids
US7871777B2 (en) 2005-12-12 2011-01-18 The United States Of America As Represented By The Department Of Health And Human Services Probe for nucleic acid sequencing and methods of use
US7715001B2 (en) 2006-02-13 2010-05-11 Pacific Biosciences Of California, Inc. Methods and systems for simultaneous real-time monitoring of optical signals from multiple sources
US7626704B2 (en) 2006-02-13 2009-12-01 Pacific Biosciences Of California, Inc. Methods and systems for simultaneous real-time monitoring of optical signals from multiple sources
US8264687B2 (en) 2006-02-13 2012-09-11 Pacific Biosciences Of California, Inc. Methods and systems for simultaneous real-time monitoring of optical signals from multiple sources
US7692783B2 (en) 2006-02-13 2010-04-06 Pacific Biosciences Of California Methods and systems for simultaneous real-time monitoring of optical signals from multiple sources
US7961314B2 (en) 2006-02-13 2011-06-14 Pacific Biosciences Of California, Inc. Methods and systems for simultaneous real-time monitoring of optical signals from multiple sources
US8149399B2 (en) 2006-02-13 2012-04-03 Pacific Biosciences Of California, Inc. Methods and systems for simultaneous real-time monitoring of optical signals from multiple sources
US7995202B2 (en) 2006-02-13 2011-08-09 Pacific Biosciences Of California, Inc. Methods and systems for simultaneous real-time monitoring of optical signals from multiple sources
US7630073B2 (en) 2006-02-13 2009-12-08 Pacific Biosciences Of California Methods and systems for simultaneous real-time monitoring of optical signals from multiple sources
US8802600B2 (en) 2006-03-30 2014-08-12 Pacific Biosciences Of California, Inc. Articles having localized molecules disposed thereon and methods of producing same
US11186871B2 (en) 2006-03-30 2021-11-30 Pacific Biosciences Of California, Inc. Articles having localized molecules disposed thereon and methods of producing same
US9944980B2 (en) 2006-03-30 2018-04-17 Pacific Biosciences Of California, Inc. Articles having localized molecules disposed thereon and methods of producing same
US8772202B2 (en) 2006-03-30 2014-07-08 Pacific Biosciences Of California, Inc. Articles having localized molecules disposed thereon and methods of producing same
US8193123B2 (en) 2006-03-30 2012-06-05 Pacific Biosciences Of California, Inc. Articles having localized molecules disposed thereon and methods of producing same
US10655172B2 (en) 2006-03-30 2020-05-19 Pacific Biosciences Of California, Inc. Articles having localized molecules disposed thereon and methods of producing same
US8975216B2 (en) 2006-03-30 2015-03-10 Pacific Biosciences Of California Articles having localized molecules disposed thereon and methods of producing same
US7563574B2 (en) 2006-03-31 2009-07-21 Pacific Biosciences Of California, Inc. Methods, systems and compositions for monitoring enzyme activity and applications thereof
US7820983B2 (en) 2006-09-01 2010-10-26 Pacific Biosciences Of California, Inc. Substrates, systems and methods for analyzing materials
US9222133B2 (en) 2006-09-01 2015-12-29 Pacific Biosciences Of California, Inc. Substrates, systems and methods for analyzing materials
US8618507B1 (en) 2006-09-01 2013-12-31 Pacific Biosciences Of California, Inc. Substrates, systems and methods for analyzing materials
US8053742B2 (en) 2006-09-01 2011-11-08 Pacific Biosciences Of California, Inc. Substrates, systems and methods for analyzing materials
US9587276B2 (en) 2006-09-01 2017-03-07 Pacific Biosciences Of California, Inc. Substrates, systems and methods for analyzing materials
US8207509B2 (en) 2006-09-01 2012-06-26 Pacific Biosciences Of California, Inc. Substrates, systems and methods for analyzing materials
US8471219B2 (en) 2006-09-01 2013-06-25 Pacific Biosciences Of California, Inc. Substrates, systems and methods for analyzing materials
US8471230B2 (en) 2006-09-01 2013-06-25 Pacific Biosciences Of California, Inc. Waveguide substrates and optical systems and methods of use thereof
US7838847B2 (en) 2006-09-01 2010-11-23 Pacific Biosciences Of California, Inc. Substrates, systems and methods for analyzing materials
US7834329B2 (en) 2006-09-01 2010-11-16 Pacific Biosciences Of California, Inc. Substrates, systems and methods for analyzing materials
US9029802B2 (en) 2006-09-01 2015-05-12 Pacific Biosciences Of California, Inc. Substrates, systems and methods for analyzing materials
US9051612B2 (en) 2006-09-28 2015-06-09 Illumina, Inc. Compositions and methods for nucleotide sequencing
US11098353B2 (en) 2006-12-01 2021-08-24 The Trustees Of Columbia University In The City Of New York Four-color DNA sequencing by synthesis using cleavable fluorescent nucleotide reversible terminators
EP2126072A1 (en) * 2007-02-21 2009-12-02 Life Technologies Corporation Materials and methods for single molecule nucleic acid sequencing
EP2126072A4 (en) * 2007-02-21 2011-07-13 Life Technologies Corp Materials and methods for single molecule nucleic acid sequencing
JP2010518862A (en) * 2007-02-21 2010-06-03 ライフ テクノロジーズ コーポレーション Materials and methods for single molecule nucleic acid sequencing
US9012171B2 (en) 2007-10-09 2015-04-21 Anima Cell Metrology, Inc. Systems and methods for measuring translation activity in viable cells
US10260094B2 (en) 2007-10-19 2019-04-16 The Trustees Of Columbia University In The City Of New York DNA sequencing with non-fluorescent nucleotide reversible terminators and cleavable label modified nucleotide terminators
US9670539B2 (en) 2007-10-19 2017-06-06 The Trustees Of Columbia University In The City Of New York Synthesis of cleavable fluorescent nucleotides as reversible terminators for DNA sequencing by synthesis
US11242561B2 (en) 2007-10-19 2022-02-08 The Trustees Of Columbia University In The City Of New York DNA sequencing with non-fluorescent nucleotide reversible terminators and cleavable label modified nucleotide terminators
US10144961B2 (en) 2007-10-19 2018-12-04 The Trustees Of Columbia University In The City Of New York Synthesis of cleavable fluorescent nucleotides as reversible terminators for DNA sequencing by synthesis
US11208691B2 (en) 2007-10-19 2021-12-28 The Trustees Of Columbia University In The City Of New York Synthesis of cleavable fluorescent nucleotides as reversible terminators for DNA sequencing by synthesis
US9791450B2 (en) 2007-12-14 2017-10-17 Biotium, Inc. Fluorescent compounds
US9097667B2 (en) 2007-12-14 2015-08-04 Biotium, Inc. Fluorescent compounds
US11705217B2 (en) 2008-03-28 2023-07-18 Pacific Biosciences Of California, Inc. Sequencing using concatemers of copies of sense and antisense strands
US10697012B2 (en) 2008-09-16 2020-06-30 Pacific Biosciences Of California, Inc. Analytic device comprising a nanohole extending through an opaque mask layer and into a waveguide cladding
US9222123B2 (en) 2008-09-16 2015-12-29 Pacific Biosciences Of California, Inc. Analytic devices comprising optical waveguides and nanometer-scale apertures and methods of uses thereof
US10968482B2 (en) 2008-09-16 2021-04-06 Pacific Biosciences Of California, Inc. Substrates and optical systems and methods of use thereof for performing sequencing by synthesis
US8274040B2 (en) 2008-09-16 2012-09-25 Pacific Biosciences Of California, Inc. Substrates and optical system having at least one optical waveguide, at least one nanometer-scale aperture and at least one lens array and methods of use thereof
US11560591B2 (en) 2008-09-16 2023-01-24 Pacific Biosciences Of California, Inc. Analytic device comprising a substrate, nanometer-scale wells, and shallow waveguide optically coupled to a deep waveguide
US9719138B2 (en) 2008-09-16 2017-08-01 Pacific Biosciences Of California, Inc. Substrates and optical systems and methods of use thereof having a single optically resolvable immobilized reaction component disposed within a nanometer-scale aperture
US10280457B2 (en) 2008-09-16 2019-05-07 Pacific Biosciences Of California, Inc. Substrates and optical systems having a waveguide, nanometer-scale apertures, a lens array, and sensing regions and methods of use thereof
WO2010108638A1 (en) 2009-03-23 2010-09-30 Erasmus University Medical Center Rotterdam Tumour gene profile
US8501406B1 (en) 2009-07-14 2013-08-06 Pacific Biosciences Of California, Inc. Selectively functionalized arrays
US9034576B2 (en) 2009-09-24 2015-05-19 Anima Cell Metrology Inc. Systems and methods for measuring translation of target proteins in cells
US8658434B2 (en) 2009-10-28 2014-02-25 Biotium, Inc. Fluorescent pyrene compounds
US10640825B2 (en) 2010-02-19 2020-05-05 Pacific Biosciences Of California, Inc. Integrated analytical system and method
US10724090B2 (en) 2010-02-19 2020-07-28 Pacific Biosciences Of California, Inc. Integrated analytical system and method
US8465699B2 (en) 2010-02-19 2013-06-18 Pacific Biosciences Of California, Inc. Illumination of integrated analytical systems
US8467061B2 (en) 2010-02-19 2013-06-18 Pacific Biosciences Of California, Inc. Integrated analytical system and method
US9822410B2 (en) 2010-02-19 2017-11-21 Pacific Biosciences Of California, Inc. Integrated analytical system and method
US8649011B2 (en) 2010-02-19 2014-02-11 Pacific Biosciences Of California, Inc. Integrated analytical system and method
US9488584B2 (en) 2010-02-19 2016-11-08 Pacific Biosciences Of California, Inc. Integrated analytical system and method
US8867038B2 (en) 2010-02-19 2014-10-21 Pacific Biosciences Of California, Inc. Integrated analytical system and method
US8994946B2 (en) 2010-02-19 2015-03-31 Pacific Biosciences Of California, Inc. Integrated analytical system and method
US9157864B2 (en) 2010-02-19 2015-10-13 Pacific Biosciences Of California, Inc. Illumination of integrated analytical systems
US10138515B2 (en) 2010-02-19 2018-11-27 Pacific Biosciences Of California, Inc. Illumination of integrated analytical systems
US9291568B2 (en) 2010-02-19 2016-03-22 Pacific Biosciences Of California, Inc. Integrated analytical system and method
US11001889B2 (en) 2010-02-19 2021-05-11 Pacific Biosciences Of California, Inc. Illumination of integrated analytical systems
US9291569B2 (en) 2010-02-19 2016-03-22 Pacific Biosciences Of California, Inc. Optics collection and detection system and method
US9410891B2 (en) 2010-02-19 2016-08-09 Pacific Biosciences Of California, Inc. Optics collection and detection system and method
US8834847B2 (en) 2010-08-12 2014-09-16 Pacific Biosciences Of California, Inc. Photodamage mitigation compounds and systems
US9732382B2 (en) 2010-08-12 2017-08-15 Pacific Biosciences Of California, Inc. Photodamage mitigation compounds and systems
US9624539B2 (en) 2011-05-23 2017-04-18 The Trustees Of Columbia University In The City Of New York DNA sequencing by synthesis using Raman and infrared spectroscopy detection
WO2013079215A1 (en) 2011-12-01 2013-06-06 Erasmus University Medical Center Rotterdam Method for classifying tumour cells
US10768362B2 (en) 2012-06-17 2020-09-08 Pacific Biosciences Of California, Inc. Arrays of integrated analytical devices and methods for production
US9946017B2 (en) 2012-06-17 2018-04-17 Pacific Biosciences Of California, Inc. Arrays of integrated analytical devices and methods for production
US9372308B1 (en) 2012-06-17 2016-06-21 Pacific Biosciences Of California, Inc. Arrays of integrated analytical devices and methods for production
US10310178B2 (en) 2012-06-17 2019-06-04 Pacific Biosciences Of California, Inc. Arrays of integrated analytical devices and methods for production
US9658161B2 (en) 2012-06-17 2017-05-23 Pacific Biosciences Of California, Inc. Arrays of integrated analytical devices and methods for production
US11137532B2 (en) 2012-12-18 2021-10-05 Pacific Biosciences Of California, Inc. Illumination of optical analytical devices
US10578788B2 (en) 2012-12-18 2020-03-03 Pacific Biosciences Of California, Inc. Illumination of optical analytical devices
US10018764B2 (en) 2012-12-18 2018-07-10 Pacific Biosciences Of California Illumination of optical analytical devices
US9223084B2 (en) 2012-12-18 2015-12-29 Pacific Biosciences Of California, Inc. Illumination of optical analytical devices
US11640022B2 (en) 2012-12-18 2023-05-02 Pacific Biosciences Of California, Inc. Illumination of optical analytical devices
US11384393B2 (en) 2013-02-22 2022-07-12 Pacific Biosciences Of California, Inc. Integrated illumination of optical analytical devices
US9624540B2 (en) 2013-02-22 2017-04-18 Pacific Biosciences Of California, Inc. Integrated illumination of optical analytical devices
US10570450B2 (en) 2013-02-22 2020-02-25 Pacific Biosciences Of California, Inc. Integrated illumination of optical analytical devices
US10144963B2 (en) 2013-02-22 2018-12-04 Pacific Biosciences Of California, Inc. Integrated illumination of optical analytical devices
US10648026B2 (en) 2013-03-15 2020-05-12 The Trustees Of Columbia University In The City Of New York Raman cluster tagged molecules for biological imaging
US10407721B2 (en) 2013-03-15 2019-09-10 Illumina Cambridge Limited Modified nucleosides or nucleotides
US10982277B2 (en) 2013-03-15 2021-04-20 Illumina Cambridge Limited Modified nucleosides or nucleotides
US20170166961A1 (en) 2013-03-15 2017-06-15 Illumina Cambridge Limited Modified nucleosides or nucleotides
US9606068B2 (en) 2014-08-27 2017-03-28 Pacific Biosciences Of California, Inc. Arrays of integrated analytical devices
US9915612B2 (en) 2014-08-27 2018-03-13 Pacific Biosciences Of California, Inc. Arrays of integrated analytical devices
US10234393B2 (en) 2014-08-27 2019-03-19 Pacific Biosciences Of California, Inc. Arrays of integrated analytical devices
US11467089B2 (en) 2014-08-27 2022-10-11 Pacific Biosciences Of California, Inc. Arrays of integrated analytical devices
US10859497B2 (en) 2014-08-27 2020-12-08 Pacific Biosciences Of California, Inc. Arrays of integrated analytical devices
US10487356B2 (en) 2015-03-16 2019-11-26 Pacific Biosciences Of California, Inc. Integrated devices and systems for free-space optical coupling
US11693182B2 (en) 2015-06-12 2023-07-04 Pacific Biosciences Of California, Inc. Integrated target waveguide devices and systems for optical coupling
US10365434B2 (en) 2015-06-12 2019-07-30 Pacific Biosciences Of California, Inc. Integrated target waveguide devices and systems for optical coupling
US11054576B2 (en) 2015-06-12 2021-07-06 Pacific Biosciences Of California, Inc. Integrated target waveguide devices and systems for optical coupling
US10077470B2 (en) 2015-07-21 2018-09-18 Omniome, Inc. Nucleic acid sequencing methods and systems
US11499185B2 (en) 2015-07-21 2022-11-15 Pacific Biosciences Of California, Inc. Nucleic acid sequencing methods and systems
US11203778B2 (en) 2016-04-29 2021-12-21 Omniome, Inc. Sequencing method employing ternary complex destabilization to identify cognate nucleotides
US10633692B2 (en) 2016-04-29 2020-04-28 Omniome, Inc. Sequencing method employing ternary complex destabilization to identify cognate nucleotides
US10294514B2 (en) 2016-04-29 2019-05-21 Omniome, Inc. Sequencing method employing ternary complex destabilization to identify cognate nucleotides
US11168364B2 (en) 2016-08-15 2021-11-09 Omniome, Inc. Method and system for sequencing nucleic acids
US11203779B2 (en) 2016-08-15 2021-12-21 Omniome, Inc. Sequencing method for rapid identification and processing of cognate nucleotide pairs
US10246744B2 (en) 2016-08-15 2019-04-02 Omniome, Inc. Method and system for sequencing nucleic acids
US10443098B2 (en) 2016-08-15 2019-10-15 Omniome, Inc. Method and system for sequencing nucleic acids
US10428378B2 (en) 2016-08-15 2019-10-01 Omniome, Inc. Sequencing method for rapid identification and processing of cognate nucleotide pairs
US11248254B2 (en) 2016-12-30 2022-02-15 Omniome, Inc. Method and system employing distinguishable polymerases for detecting ternary complexes and identifying cognate nucleotides
WO2018125759A1 (en) * 2016-12-30 2018-07-05 Omniome, Inc. Method and system employing distinguishable polymerases for detecting ternary complexes and identifying cognate nucleotides
US10975427B2 (en) 2017-01-20 2021-04-13 Omniome, Inc. Process for cognate nucleotide detection in a nucleic acid sequencing workflow
US9951385B1 (en) 2017-04-25 2018-04-24 Omniome, Inc. Methods and apparatus that increase sequencing-by-binding efficiency
US11447823B2 (en) 2017-04-25 2022-09-20 Pacific Biosciences Of California, Inc. Methods and apparatus that increase sequencing-by-binding efficiency
US10161003B2 (en) 2017-04-25 2018-12-25 Omniome, Inc. Methods and apparatus that increase sequencing-by-binding efficiency
US10655176B2 (en) 2017-04-25 2020-05-19 Omniome, Inc. Methods and apparatus that increase sequencing-by-binding efficiency
US11193166B2 (en) 2017-10-19 2021-12-07 Omniome, Inc. Simultaneous background reduction and complex stabilization in binding assay workflows

Also Published As

Publication number Publication date
WO2001016375A3 (en) 2001-10-04
AU7086800A (en) 2001-03-26

Similar Documents

Publication Publication Date Title
US6982146B1 (en) High speed parallel molecular nucleic acid sequencing
WO2001016375A2 (en) High speed parallel molecular nucleic acid sequencing
EP1960550B1 (en) Probe for nucleic acid sequencing and methods of use
EP1594987B1 (en) Nucleic acid sequencing methods, kits and reagents
US20080032307A1 (en) Use of Single-Stranded Nucleic Acid Binding Proteins In Sequencing
US6524829B1 (en) Method for DNA- or RNA-sequencing
CN100462433C (en) Real-time sequence determination
US7297518B2 (en) Methods and apparatus for analyzing polynucleotide sequences by asynchronous base extension
US20110165652A1 (en) Compositions, methods and systems for single molecule sequencing
US20060024711A1 (en) Methods for nucleic acid amplification and sequence determination
US20070031875A1 (en) Signal pattern compositions and methods
US20050239085A1 (en) Methods for nucleic acid sequence determination
US20070196832A1 (en) Methods for mutation detection
Rubens et al. Schneider et al.
Soper et al. DNA sequencing using fluorescence detection
JP2023523236A (en) Methods and compositions for sequencing fluorescent polynucleotides
Vo-Dinh DNA Sequencing Using Fluorescence Detection
Owens et al. Steven A. Soper

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A2

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CR CU CZ DE DK DM DZ EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT TZ UA UG US UZ VN YU ZA ZW

AL Designated countries for regional patents

Kind code of ref document: A2

Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG

121 EP: the EPO has been informed by WIPO that EP was designated in this application
DFPE Request for preliminary examination filed prior to expiration of 19th month from priority date (PCT application filed before 20040101)
AK Designated states

Kind code of ref document: A3

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CR CU CZ DE DK DM DZ EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT TZ UA UG US UZ VN YU ZA ZW

AL Designated countries for regional patents

Kind code of ref document: A3

Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG

WWE WIPO information: entry into national phase

Ref document number: 10070053

Country of ref document: US

REG Reference to national code

Ref country code: DE

Ref legal event code: 8642

122 EP: PCT application non-entry in European phase
NENP Non-entry into the national phase

Ref country code: JP