This disclosure provides systems and methods for sample processing and data analysis. Sample processing may include nucleic acid sample processing and subsequent sequencing. Some or all of a nucleic acid sample may be sequenced to provide sequence information, which may be stored or otherwise maintained in an electronic storage location. The sequence information may be analyzed with the aid of a computer processor, and the analyzed sequence information may be stored in an electronic storage location that may include a pool or collection of sequence information and analyzed sequence information generated from the nucleic acid sample. Methods and systems of the present disclosure can be used, for example, for the analysis of a nucleic acid sample, for producing one or more libraries, and for producing biomedical reports. Methods and systems of the disclosure can aid in the diagnosis, monitoring, treatment, and prevention of one or more diseases and conditions.
Legal claims defining the scope of protection, as filed with the USPTO.
. A method for personalized genetic testing, comprising:
. The method of, wherein the personalized primer set further comprises at least one fixed portion independent of the results of the first assay.
. The method of, wherein the reference genome is from the subject.
. The method of, wherein:
. The method of, wherein the second assay of step (f) comprises: (i) sequencing by synthesis, (ii) qPCR, or (iii) digital PCR.
. The method of, wherein the average sequence depth of the second assay is at least 1,000-fold.
. The method of, wherein the personalized primer set comprises 10 or more primers.
. The method of, wherein the personalized primer set comprises 50 or more primers.
. The method of, wherein the personalized primer set comprises 100 or more primers.
. The method of, wherein the nucleic acid molecules extracted from the one or more plasma samples are selectively amplified in a pool.
. The method of, wherein the nucleic acid molecules extracted from the tumor sample comprise: (i) DNA, (ii) RNA, or (iii) cDNA derived from RNA.
. The method of, wherein the nucleic acid molecules extracted from the one or more plasma samples comprise: (i) DNA, (ii) RNA, (iii) cDNA derived from RNA, (iv) cfDNA, or (v) cfRNA.
. The method of, wherein the set of one or more genetic characteristics comprises: (i) Single Nucleotide Polymorphisms (SNPs), (ii) Multiple Nucleotide Polymorphisms (MNPs), (iii) insertions and/or deletions (inDels), (iv) copy number variations, (v) structural variations, (vi) HLA typing, or any combination thereof.
. The method of, wherein the at least one biological sample comprises a plurality of biological samples, wherein the plurality of biological samples comprises the tumor sample of step (a)(i) and a sample reflective of the germline genome of the subject.
. The method of, wherein the sample reflective of the germline genome of the subject comprises white blood cells.
. The method of, wherein the nucleic acid molecules extracted from the plurality of biological samples comprise: (i) DNA, (ii) RNA, or (iii) cDNA derived from RNA.
. The method of, wherein the set of one or more genetic characteristics comprises: (i) Single Nucleotide Polymorphisms (SNPs), (ii) Multiple Nucleotide Polymorphisms (MNPs), (iii) insertions and/or deletions (inDels), (iv) copy number variations, (v) structural variations, (vi) mosaic variants, (vii) somatic variants, (viii) HLA typing, (ix) V(D)J recombination, or any combination thereof.
. The method of, wherein the at least one biological sample is obtained at a first time point and the one or more plasma samples are obtained at one or more time points subsequent to the first time point.
. The method of, wherein the biomedical information comprises a biomedical report.
. The method of, wherein the biomedical report comprises reporting: (i) information of the subject that is predictive, prognostic, or diagnostic of one or more biomedical features, (ii) information of the subject that is predictive, prognostic, or diagnostic of a status or outcome of a disease or condition in the subject, or (iii) one or more biomedical outputs.
. The method of, wherein the one or more biomedical features comprise: (i) status of a disease or condition, (ii) genetic risk of a disease or condition, (iii) risk of an adverse drug reaction, (iv) efficacy of a drug therapy, (v) prediction of optimal drug dosage, (vi) transplant tolerance, or a combination thereof.
. The method of, wherein the disease or condition comprises cancer.
. The method of, wherein the cancer comprises recurrent cancer and/or refractory cancer.
. The method of, wherein the predictive, prognostic, or diagnostic information of the biomedical report comprises a member selected from the group consisting of: (i) diagnosing a disease or condition, (ii) identifying a disease or condition, (iii) determining stage of a disease or condition, (iv) assessing risk of a disease or condition, (v) assessing risk of disease recurrence, (vi) assessing efficacy of a drug, (vii) assessing risk of an adverse drug reaction, (viii) predicting optimal drug dosage, and (ix) predicting drug resistance.
. The method of, wherein the disease or condition comprises cancer.
. The method of, wherein the cancer comprises recurrent cancer and/or refractory cancer.
. The method of, wherein the one or more biomedical outputs suggest, select, designate, recommend, or otherwise determine a course of treatment and/or prevention of a disease or condition.
. The method of, wherein the one or more biomedical outputs recommend modifying or continuing one or more therapies.
. The method of, wherein modifying one or more therapies comprises administering, initiating, reducing, increasing, and/or terminating one or more therapies.
. The method of, wherein the one or more therapies comprise an anticancer therapy, wherein the anticancer therapy comprises surgery, chemotherapy, radiation therapy, immunotherapy/biological therapy, photodynamic therapy, monoclonal antibodies, cancer vaccines, gene therapy, or a combination thereof.
Complete technical specification and implementation details from the patent document.
This application is a continuation application of U.S. patent application Ser. No. 18/593,406, filed Mar. 1, 2024, which is a continuation application of U.S. patent application Ser. No. 18/179,582, filed Mar. 7, 2023, now U.S. Pat. No. 11,952,625, which is a continuation application of U.S. patent application Ser. No. 17/747,436, filed May 18, 2022, now U.S. Pat. No. 11,643,685, which is a continuation application of U.S. patent application Ser. No. 17/688,072, filed Mar. 7, 2022, which is a continuation application of U.S. patent application Ser. No. 17/235,776, filed Apr. 20, 2021, now U.S. Pat. No. 11,299,783, which is a continuation in part of U.S. patent application Ser. No. 17/065,411, filed Oct. 7, 2020, which is a continuation application of U.S. patent application Ser. No. 16/843,115, filed Apr. 8, 2020, which is a continuation application of U.S. patent application Ser. No. 16/547,451, filed Aug. 21, 2019, which is a continuation application of U.S. patent application Ser. No. 15/793,845, filed Oct. 25, 2017, now U.S. Pat. No. 10,450,611, which is a continuation application of International Application Patent No. PCT/US2017/034823, filed May 26, 2017, which claims priority to U.S. Provisional Application No. 62/342,674, filed May 27, 2016, each of which is incorporated herein by reference in its entirety.
The history of deoxynucleic acid (DNA) sequencing and DNA synthesis has been intertwined, with advances in one often leading to advances in or applications of the other.
The double helix structure of DNA was discovered by Watson and Crick in 1953.
In the decades following that, chemists worked to develop methods to synthesize DNA strands (oligonucleotides) of predefined sequence. Caruthers, et al (U.S. Pat. No. 4,458,066 “Process for preparing oligonucleotides”, filed Mar. 24, 1981) introduced the phosphoramidite chemistry now widely used. It was implemented on substrates similar to chromatography columns, yielding one oligonucleotide per synthesis. At the end of this process, the synthesized molecules are cleaved from the substrates on which they have been synthesized, so they can be used in further reactions in solution.
Instrument manufacturers subsequently introduced equipment implementing this process on multiple columns in parallel. On Apr. 24, 2000 for example, PE Applied Biosystems issued a press release introducing its “ABI 3900 High Throughput DNA Synthesizer” with 48 columns operating concurrently. In a system of this type, each oligo was synthesized on a separate substrate and delivered in a separate tube (or other container). Relatively large amounts of each DNA sequence can be synthesized on these machines (the ABI 3900 specification was 40 nanomoles up to 1 micro-mole per sequence).
Methods for the synthesis of DNA sequences led to Polymerase Chain Reaction (PCR), which uses synthesized DNA priming sequences. Kary Mullis, who invented PCR and was later awarded the Nobel Prize for it, was working in a DNA synthesis lab at Cetus at the time. It was originally devised as a method to enable sequencing of the sickle cell anemia locus via Sanger sequencing. U.S. Pat. No. 4,683,202 “Process for amplifying nucleic acid sequences”, the original PCR patent, was filed in 1985.
This was further refined in methods which integrated DNA amplification and the Sanger chain terminating reaction, e.g., Murray, V., “Improved double-stranded DNA sequencing using the linear polymerase chain reaction” Nucleic Acids Research, Vol 17, No 21 Pg 8889, Nov. 11, 1989. Still further refinement along these lines was termed “Cycle Sequencing” (e.g., U.S. Pat. No. 5,432,065 filed Mar. 30, 1993). All of these combined the use of individually synthesized DNA sequences, as primers for further DNA synthesis with polymerase enzymes.
During this time, other groups developed methods for synthesis of DNA on a highly parallel microscopic scale, on a single substrate. This increased the parallelism of DNA synthesis by over a thousand-fold. Compared to the ABI 3900 instrument mentioned above for example, which can synthesize up to 48 sequences in parallel, some array-based methods can synthesize over 50,000 sequences in parallel without large manufacturing set-up costs.
One method of array-based synthesis was described in Pirrung, et al (U.S. Pat. No. 5,143,854 “Large scale photolithographic solid phase synthesis of polypeptides and receptor binding screening thereof”, priority date Jun. 7, 1989). It was developed by scientists at Affymax Corporation, later spun out as Affymetrix, Inc. This early work used fixed photolithographic masks, similar to those of the semiconductor industry. This enabled production of many “DNA arrays” with the same set of DNA sequences on them.
A group at the University of Wisconsin at Madison later devised a more flexible version of this using micro-mirror arrays (rather the fixed photolithographic masks) to dynamically define the spatial pattern of light in the system. This was spun out into the company Nimblegen in 1999, which was acquired by Roche in 2007.
Another method for synthesis of DNA on a highly parallel microscopic scale, on a single substrate, was developed using technology from ink-jet printing. Brennan (U.S. Pat. No. 5,472,672 “Apparatus and method for polymer synthesis using arrays” filed Oct. 22, 1993) described such a system including the dispensing of microscopic droplets of synthesis reagents through an array of nozzles on a moveable print head. This technology was commercialized by Agilent, Inc.
Early applications of these DNA arrays involved use of the oligonucleotides on the array substrates where they were synthesized. This typically involved hybridization of DNA (or complementary deoxyribonucleic acid (cDNA)) from a test sample to the oligonucleotides on the array. If the DNA (or cDNA) of the test sample was fluorescently labeled in advance, then imaging the array after hybridization and washing can quantify the amount of each sequence in the test sample. This was initially used to measure mRNA expression of genes and it was later used for genotyping.
Application of DNA array technology to DNA sequencing largely waited until DNA sequencing itself advanced. The original methods of DNA sequencing (Sanger, Maxim & Gilbert shared a 1975 Nobel prize) used electrophoresis for separation and subsequent readout. Each such electrophoretic separation and detection was spatially separate, though companies developed instruments with several in parallel (e.g., Applied Biosystems Model 370, introduced about 1987, supported up to 24 in parallel; Applied Biosystems Model 3700, introduced in 1999 supported up to 96 in parallel, and Amersham's Molecular Dynamics unit introduced a version of its MegaBace system about 2002 with 384 in parallel.)
Several groups did attempt to leverage DNA arrays for DNA sequencing (e.g., Lysov, et al, 1996, “Efficiency of sequencing by hybridization on oligonucleotide matrix supplemented by measurement of the distance between DNA segments.”). Affymetrix commercialized this approach for small applications (variants in CYP drug metabolizing genes, genotyping of HIV). These methods conduct the DNA sequencing reactions and fluorescent readout on the array and thus have been limited to one base per array spot and fairly small non-repetitive portions of genomes. Heidi Rehm, et al at the Harvard Medical School published a set of protocols for this in April 2011 “Targeted Sequencing Using Affymetrix CustomSeq Arrays” in Current Protocols in Human Genetics. In it the technology was described as suitable for re-sequencing portions of the human genome up to 300,000 bases in total length.
The field moved forward with the commercialization of “Next Generation DNA Sequencing” methods, which enabled measurement of hundreds of thousands of sequences at a time. One of the first such systems was commercialized by 454, Inc (previously a division of Curagen, Inc and later acquired by Roche) in 2005 (Margulies, M. et al. “Genome sequencing in microfabricated high-density picolitre reactors” Nature 437, 376-380 (2005). This initial system can measure up to 200,000 sequences in parallel, each on average 100 bases long.
Two years later, in 2007, a group at the Baylor College of Medicine used a 454 DNA sequencing instrument to sequence an exome (Albert, et al “Direct selection of human genomic loci by microarray hybridization” Nature Methods, November 2007, 4(11):903-5). The key to this work was that a DNA array was used not as a substrate for sequencing itself, but to enrich a genomic DNA sample for just the parts of the genome intended for sequencing. The original DNA sample, fragmented, was hybridized to the array. Portions of the genome which did not hybridize were washed off. Then the portions of the genome which did hybridize to the array were eluted off the array and sequenced separate from the array, using the 454 system. The DNA arrays used were from Nimblegen. Although that DNA synthesis technology had been available since 1999, it was its 2007 combination with huge parallelism of next generation DNA sequencing that made this application practical.
In the work described above, DNA sequences synthesized on an array were used in-place on the array substrate. During the early 2000's though, groups began to explore technologies by which DNA molecules can be synthesized on an array but attached to the substrate of the array by a cleavable linker. This meant that after array synthesis, the linkers can be cleaved (e.g., chemically) releasing the oligonucleotides into solution, where they can be used as a pool. One example of this work is U.S. Pat. No. 7,211,654 (Xiaolian, et al, “Linkers and co-coupling agents for optimization of oligonucleotide synthesis and purification on solid supports” May 1, 2007).
In 2007, a group at the Broad Institute, began to explore use of this approach to create pools of oligonucleotides in solution to capture select portions of the genome of a test sample. (See U.S. provisional application 61/063,489, Gnirke, et al, filed Feb. 4, 2008: “Selection of nucleic acids by solution hybridization to oligonucleotide baits”.) Dr. Carsten Russ of the Broad Institute described this approach at the February 2008 AGBT conference (reported by GenomeWeb). During 2008, Agilent licensed this technology. It was published on line Feb. 1, 2009 “Solution hybrid selection with ultra-long oligonucleotides for massively parallel sequencing” Nature Biotechnology 27, 182-189 (2009). In February 2009 Agilent launched this as a product line (trade name “SureSelect”) with its first human exome kit (“SureSelect All Exon”).
Dr. Gnirke, et al at the Broad Institute continued to innovate and applied targeted capture, using array synthesis of DNA, to RNA transcriptomes: “Targeted next-generation sequencing of a cancer transcriptome enhances detection of sequence variants and novel fusion transcripts” Joshua Levin, et al (including Andreas Gnirke). Genome Biology 2009, 10:R115.
In parallel with this, Next Generation DNA Sequencing technologies continued to advance. In June 2006, Solexa, Inc first shipped its Genome Analyzer system. This system measured 40 million DNA sequences in parallel, each initially 25 bases long. In 2008 Illumina, Inc acquired Solexa. Subsequent versions of this technology have continued to advance. The most current instrument (Illumina HiSeq-4000) can produce about 6 billion sequences in parallel, each 2×125 bases, for a total of 1.5 trillion bases, in a single run.
Exome sequencing has been broadly adopted as a research tool. As an example, the Exome Aggregation Consortium based at the Broad Institute has released a dataset based on human exome sequences from over 60,000 individuals (release v0.3 Jan. 2015).
Exome sequencing has also been adopted clinically. The first commercial clinical exome tests were announced by GeneDx and Ambry Genetics at the ASHG conference in October 2011. Others including the Baylor College of Medicine have also offered commercial clinical human exome-based tests, and over 8,000 have been performed.
DNA synthesis technologies have continued to advance, particularly focused on gene synthesis applications requiring very long DNA sequences. Many of these advances involve the construction of long DNA molecules by strategies which combine shorter synthetic DNA molecules. This was reviewed in: “Large-scale de novo DNA synthesis: technologies and applications” Sriram Kosuri and George Church, Nature Methods, Volume 11, No 5, May 2014; 499.
In spite of the advances described above, the clinical adoption of exome-scale sequencing has been limited by the costs involved. Health insurers, who are asked to pay for these tests, often refuse, given the scale of the expense. This problem is even worse in cancer, where the depth of deoxynucleic acid (DNA) sequencing required can be much higher (e.g., >500×) than that for inherited diseases (e.g., 30-100×).
While array-based DNA synthesis is now widely used to capture whole exomes, transcriptomes, or application-specific subsets of exomes (e.g., the genes involved with a specific Mendelian disease), a limitation of the field, as recognized herein, is the potential to leverage array synthesis of DNA in a personalized manner. The field has largely used array-based synthesis to develop standard products which are broadly applicable across a whole set of human patients and/or research subjects. Even where custom array synthesis is proposed, it is to sequence regions of the genome defined independent of a specific sample.
In one aspect, the disclosure provides a method for personalized genetic testing, comprising: (a) using a plurality of genetic characteristics to determine a nucleic acid configuration of an assay, which nucleic acid configuration includes nucleic acid sequences of a plurality of nucleic acid probe molecules, wherein the nucleic acid sequences are selective for genetic variants, wherein the plurality of genetic characteristics is determined by analyzing nucleic acid sequence data generated from at least one biological sample of a subject, and wherein the plurality of genetic characteristics include the genetic variants in the nucleic acid molecules from the at least one biological sample; (b) providing the plurality of nucleic acid probe molecules by (i) synthesizing the plurality of nucleic acid probe molecules using at least one array, or (ii) selecting the plurality of nucleic acid probe molecules from a collection of nucleic acid probe molecules; and (c) using the plurality of nucleic acid probe molecules provided in (b) to perform at least the assay on one or more biological samples from the subject or at least one biological relative of the subject, to generate data indicative of a presence or absence of at least a subset of the genetic variants in the subject or the at least one biological relative.
Some embodiments may further comprise generating the nucleic acid sequence data using a sequencing assay to sequence or quantify nucleic acid molecules from the at least one biological sample. In some embodiments providing the plurality of nucleic acid probe molecules comprises synthesizing the plurality of nucleic acid probe molecules using at least one array.
In some embodiments, in the sequencing assay, at least one biological sample is obtained from the subject at a first time point, and wherein in (c), the one or more biological samples are obtained from the subject or the at least one biological relative of the subject at a second time point subsequent to the first time point. In some embodiments, providing the plurality of nucleic acid probe molecules comprises selecting the plurality of nucleic acid probe molecules from a collection of nucleic acid probe molecules.
Some embodiments comprise outputting a report that is indicative of a presence or absence of the at least the subset of the genetic variants in the subject or the at least one biological relative. In some embodiments, the nucleic acid probe molecules comprise primers for amplifying the nucleic acid sequences.
Some embodiments further comprise outputting a report that is generated at least based on comparison of results from the sequencing assay with results from the second assay of (c).
In some embodiments, the one or more biological samples in (c) comprise a plurality of biological samples, and wherein (c) further comprises outputting a report that is generated at least based on comparison of results from the at least the assay from the plurality of biological samples assayed in (c) with each other.
In some embodiments, at least the assay comprises a plurality of the assay. In some embodiments, the plurality of the assay is performed on (i) a plurality of biological samples of the subject or (ii) a plurality of biological samples of the at least one biological relative of the subject.
Some embodiments further comprise providing a therapeutic intervention at least based on the presence or absence of the at least the subset of the genetic variants identified in (c).
In some embodiments, the sequencing assay comprises (i) exome sequencing, (ii) sequencing a panel of genes, (iii) whole genome sequencing, and/or (iv) sequencing a population of complementary deoxyribonucleic acid molecules derived from ribonucleic acid molecules. In some embodiments, the sequencing assay comprises sequencing the nucleic acid molecules generated in quantity or sequence by interaction with the at least one biological sample from the subject. In some embodiments, the sequencing assay comprises sequencing the nucleic acid molecules derived from antibody-oligonucleotide conjugates of the subject.
In some embodiments, the nucleic acid molecules from the at least one biological sample comprise nucleic acid molecules from cells of the subject and are representative of a germline genome of the subject. In some embodiments, the nucleic acid molecules from the at least one biological sample comprise nucleic acids from (i) white blood cells or (ii) non-cancerous cells adjacent to or embedded in a tumor or metastasis of the subject. In some embodiments, the nucleic acid molecules from the at least one biological sample are cell-free nucleic acid molecules. In some embodiments, at least one biological sample includes a blood sample and the nucleic acids molecules are from blood cells in the blood sample, and wherein the subject has been diagnosed with a blood-related cancer such that the nucleic acid molecules in (a) are representative of a cancer genome of the subject. In some embodiments, the nucleic acids molecules are derived from a buccal swab, and wherein the nucleic acid molecules are representative of an ectodermal genome of the subject. In some embodiments, at least one biological sample includes a tumor sample and the nucleic acids molecules are from cells in the tumor sample, and wherein the nucleic acid molecules are representative of a cancer genome of the subject. In some embodiments, the nucleic acid molecules are derived from T-cells and/or B-cells of an adaptive immune system of the subject, representing post-zygotic V(D)J recombination. In some embodiments, the nucleic acid molecules comprise non-human nucleic acid molecules derived from the at least one biological sample, representing a genome(s) of one or more microbial organisms.
In some embodiments, the sequencing assay comprises analysis of a single biological sample from the subject.
In some embodiments, at least one biological sample includes a plurality of biological samples, and wherein the first assay comprises analysis of the plurality of biological samples and at least one of the plurality of genetic characteristics determined in (b) is based on comparison of the analysis. In some embodiments, at least one biological sample includes a tumor of the subject, and wherein the first assay of (a) comprises analysis of the at least one biological sample and analysis of an additional biological sample which represents a germline genome of the subject. In some embodiments, at least one biological sample includes a tumor of the subject and the nucleic acid molecules include deoxyribonucleic acid (DNA) molecules and ribonucleic acid (RNA) molecules from the tumor, and wherein the first assay comprises analysis of the DNA and RNA.
In some embodiments, the plurality of genetic characteristics comprises one or more (i) single nucleotide polymorphisms, (ii) insertions and/or deletions, (iii) copy number variations, and (iv) structural variations. In some embodiments, the plurality of genetic characteristics include signatures combining multiple genetic variants. In some embodiments, the plurality of genetic characteristics comprise genetic variants in a germline sequence of the subject. In some embodiments, the plurality of genetic characteristics comprise post-zygotic variants from a germline sequence of the subject. In some embodiments, the plurality of genetic characteristics comprise post-zygotic recombination of elements from a germline sequence of the subject. In some embodiments, the plurality of genetic characteristics comprise levels of gene expression and/or sequencing read counts or read-depth in data derived from ribonucleic acid molecules or complementary deoxyribonucleic acid molecules derived from the at least one biological sample. In some embodiments, the plurality of genetic characteristics comprise levels of messenger ribonucleic acid expression of alleles from deoxyribonucleic acid molecules derived from the at least one biological sample. In some embodiments, the plurality of genetic characteristics comprise levels of methylation at specific locations or in specific regions of a genome.
In some embodiments, the plurality of genetic characteristics comprise locations in or regions of a genome, and wherein the plurality of nucleic acid probe molecules of the assay enrich or deplete a nucleic acid mixture of nucleic acid molecules which include the locations or regions of the genome or portions thereof.
In some embodiments, the plurality of genetic characteristics comprise numbers of sequences derived from oligo-antibody conjugates contacted with the at least one biological sample.
In some embodiments, the plurality of nucleic acid probe molecules of the assay enrich or deplete a nucleic acid mixture of nucleic acid molecules for target regions, by hybridization or amplification.
In some embodiments, each of the nucleic acid probe molecules of the assay includes a region targeted for a genomic locus or region. In some embodiments, each of the nucleic acid probe molecules of the second assay includes a barcode sequence. In some embodiments, each of the nucleic acid probe molecules of the assay includes a region for demultiplexing or selective amplification of at least a subset of nucleic acid molecules from the one or more biological samples, pooled across multiple genomic loci and/or across multiple subjects.
In some embodiments, the plurality of nucleic acid probe molecules includes sequences selected from a library of sequences. In some embodiments, the sequences capture coding exons of a genome of the subject or the at least one biological relative. In some embodiments, each of the plurality of nucleic acid probe molecules includes a variation from a reference sequence in the first assay of the subject.
Some embodiments further comprise synthesizing the plurality of nucleic acid probe molecules on a single solid substrate. Some embodiments further comprise synthesizing at least 100 nucleic acid sequences in parallel. Some embodiments further comprise synthesizing at least 1,000 nucleic acid sequences in parallel. Some embodiments further comprise synthesizing at least 10,000 nucleic acid sequences in parallel. Some embodiments further comprise synthesizing at least 50,000 nucleic acid sequences in parallel. Some embodiments further comprise synthesizing a plurality of nucleic acid sequences in spatially separate regions of the single solid substrate.
In some embodiments, the assay comprises generating nucleic acid sequence data from the one or more biological samples.
Some embodiments further comprise mapping the nucleic acid sequence data to a reference.
In some embodiments, each of the plurality of nucleic acid probe molecules is at least 50 bases in length.
In some embodiments, the assay comprises nucleic acid sequencing or gene expression analysis.
In some embodiments, each of the plurality of nucleic acid probe molecules of the assay includes oligonucleotide-directed genomic content comprising (i) at least one variable portion from a result of the sequencing assay and (ii) at least one fixed portion independent of the result of the sequencing assay. In some embodiments, the oligonucleotides of the at least one fixed portion are synthesized on the same array(s) as the at least one variable portion. In some embodiments oligonucleotides of the at least one fixed portion are synthesized on separate array(s) as the at least one variable portion. In some embodiments, at least one variable portion corresponds to genes which are more highly expressed than genes that correspond to the at least one fixed portion. In some embodiments, at least one variable portion corresponds to genes with a first expression profile and the at least one fixed portion corresponds to genes with a second expression profile, wherein the first expression profile has greater sample-to-sample variability than the second expression profile.
In some embodiments, the genomic content includes coding regions of genes. In some embodiments, the genomic content includes regions corresponding to non-coding ribonucleic acid (RNA), micro-RNA and/or intronic RNA.
Unknown
March 10, 2026
Browse 5M+ US patents with plain-English claim translations and AI-generated analysis.