Patentable/Patents/US-11646102
US-11646102

System and method for secondary analysis of nucleotide sequencing data

PublishedMay 9, 2023
Assigneenot available in USPTO data we have
Inventorsnot available in USPTO data we have
Technical Abstract

Disclosed herein are systems and methods for performing secondary analyses of nucleotide sequencing data in a time-efficient manner. Some embodiments include performing a secondary analysis iteratively while sequence reads are generated by a sequencing system. Secondary analyses can encompass both alignment of sequence reads to a reference sequence (e.g., the human reference genome sequence) and utilization of this alignment to detect differences between a sample and the reference. Secondary analysis can enable detection of genetic differences, variant detection and genotyping, identification of single nucleotide polymorphisms (SNPs), small insertions and deletion (indels) and structural changes in the DNA, such as copy number variants (CNVs) and chromosomal rearrangements.

Patent Claims
31 claims

Legal claims defining the scope of protection, as filed with the USPTO.

2

2. The system of claim 1, wherein the first confidence depends on a number of mismatches a probability of a correct match, a number of candidate positions at which the first nucleotide subsequence aligns to the reference sequence, or a diversity of bases in the first nucleotide subsequence.

3

3. The system of claim 1, wherein comparing the one or more additional nucleotides in the second nucleotide subsequence to the reference sequence comprises a simple alignment process, the simple alignment process being more computationally efficient, in memory usage or the number of computation operations, than a process used to align the first nucleotide subsequence with the reference sequence.

4

4. The system of claim 3, wherein the processor is further configured to determine a simple alignment score based on the simple alignment process.

5

5. The system of claim 3, wherein the simple alignment process aligns only the one or more additional nucleotides in the second nucleotide subsequence to the reference sequence based in part on a first plurality of candidate locations resulting from comparing the first nucleotide subsequence to the reference sequence.

6

6. The system of claim 1, wherein the processor is further configured to store data corresponding to at least one of a first plurality of candidate locations resulting from comparing the first nucleotide subsequence to the reference sequence.

7

7. The system of claim 6, wherein comparing the one or more additional nucleotides in the second nucleotide subsequence to the reference sequence comprises comparing the second nucleotide subsequence with corresponding sequences of the second nucleotide subsequence on the reference sequence based on the first plurality of candidate locations.

8

8. The system of claim 1, wherein the processor is further configured to store data corresponding to at least one of a second plurality of candidate locations resulting from comparing the second nucleotide subsequence to the reference sequence.

9

9. The system of claim 8, wherein the processor is further configured to determine a mapping quality (MapQ) score for each of the second plurality of candidate locations.

10

10. The system of claim 1, wherein determining whether the first nucleotide subsequence aligns to the reference sequence is initiated before the sequencing reactions are completed.

11

11. The system of claim 1, wherein the processor is further configured to perform variant calling for the first nucleotide subsequence or the second nucleotide subsequence.

13

13. The system of claim 11, wherein the variant calling is performed using the output of: a process used to align the first nucleotide subsequence with the reference sequence, or a process used to compare the one or more additional nucleotides in the second nucleotide subsequence to the reference sequence, based on a variant calling metric.

14

14. The system of claim 13, wherein the variant calling metric is determined based on a number of different base types called at a position of the reference sequence.

15

15. The system of claim 1, wherein processing the second nucleotide subsequence is initiated before the sequencing reactions are completed.

16

16. The system of claim 1, wherein the sequencing apparatus implements sequencing-by-synthesis.

17

17. The system of claim 1, further comprising aligning the entire second nucleotide subsequence to the reference sequence if the first nucleotide subsequence is not aligned to the reference sequence at the first confidence.

18

18. The system of claim 1, wherein the metric indicates whether variant calling has achieved a predetermined confidence.

19

19. The system of claim 1, wherein the metric indicates whether a particular target variant has been identified.

20

20. The system of claim 1, wherein the processor is further configured to terminate receipt of nucleotide subsequences from the sequencing apparatus based on the metric.

21

21. The method of claim 1, wherein the processor is further configured to terminate the sequencing run of the sequencing apparatus based on the metric.

24

24. The method of claim 23, wherein processing the first nucleotide subsequence using the second process comprises performing a simple alignment to determine a simple alignment score.

25

25. The method of claim 23, wherein results of the secondary analysis comprises output of the first process, output of the second process, or any combination thereof.

27

27. The method of claim 26, wherein results of the secondary analysis comprises output of the first variant calling process, output of the second variant calling process, or any combination thereof.

28

28. The method of claim 22, further comprising providing a user with results of the secondary analysis during the sequencing run.

29

29. The method of claim 28, wherein the results of the secondary analysis are provided to the user at fixed intervals.

30

30. The method of claim 28, wherein the results of the secondary analysis are provided to the user at request of the user.

31

31. The method of claim 22, wherein performing the secondary analysis comprises performing the secondary analysis of the first nucleotide subsequence of the read based on results from a prior sequencing interval of the sequencing run.

32

32. The method of claim 22, wherein the first nucleotide subsequence comprises one or more additional nucleotides compared to a previous iteration, wherein the first process aligns the entire first nucleotide subsequence to the reference sequence, and wherein the second process aligns the one or more additional nucleotides to the reference sequence based in part on results from the previous iteration.

33

33. The method of claim 22, wherein the metric indicates whether variant calling has achieved a predetermined confidence.

34

34. The method of claim 22, wherein the metric indicates whether a particular target variant has been identified.

35

35. The method of claim 22, further comprising terminating receipt of nucleotide subsequences from the sequencing apparatus based on the metric.

36

36. The method of claim 22, further comprising terminating the sequencing run of the sequencing apparatus based on the metric.

Classification Codes (CPC)

Cooperative Patent Classification codes for this invention. Click any code to explore related patents in that topic.

Patent Metadata

Filing Date

October 6, 2017

Publication Date

May 9, 2023

Want to explore more patents?

Browse 5M+ US patents with plain-English claim translations and AI-generated analysis.

Citation & reuse

Analysis on this page is generated by Patentable — an AI-powered patent intelligence platform. AI-generated summaries, explanations, and analysis may be reused with attribution and a visible link back to the canonical URL below. Patent abstracts and claims are USPTO public domain.

Cite as: Patentable. “System and method for secondary analysis of nucleotide sequencing data” (US-11646102). https://patentable.app/patents/US-11646102

© 2026 Patentable. All rights reserved.

Patentable is a research and drafting-assistant tool, not a law firm, and does not provide legal advice. Documents we generate are drafts for review by a licensed patent attorney.