9606986

Integrated Word N-Gram and Class M-Gram Language Models

PublishedMarch 28, 2017
Assigneenot available in USPTO data we have
Technical Abstract

Patent Claims
24 claims

Legal claims defining the scope of protection, as filed with the USPTO.

1

1. A method for processing discourse input comprising: at an electronic device with a processor and memory storing one or more programs for execution by the processor: receiving a discourse input from a user; determining a text string corresponding to the discourse input, wherein determining the text string comprises: determining, using a first language model, a conditional probability of a candidate word in the discourse input given one or more words in the discourse input; determining, using a second language model, a probability of the candidate word within a corpus; determining, using a third language model, a conditional probability of the candidate word given one or more classes associated with the one or more words; and determining an integrated probability of the candidate word and one or more subclasses associated with the candidate word based on: the conditional probability of the candidate word given the one or more words; the probability of the candidate word within the corpus; and the conditional probability of the candidate word given the one or more classes, wherein the text string is based on the integrated probability; and generating an output based on the text string.

2

2. The method of claim 1 , wherein the integrated probability is a joint probability of the candidate word and the one or more subclasses associated with the candidate word given the one or more words and the one or more classes.

3

3. The method of claim 1 , wherein the integrated probability is determined based on assuming that a conditional probability of a subclass associated with the candidate word given the candidate word, the one or more words, and the one or more classes is equal to a conditional probability of the subclass associated with the candidate word given the candidate word and the one or more classes.

4

4. The method of claim 1 , wherein the integrated probability is determined based on assuming that a conditional probability of the candidate word given a subclass associated with the candidate word and the one or more classes is equal to the conditional probability of the candidate word given the subclass associated with the candidate word.

5

5. The method of claim 1 , wherein the integrated probability is determined based on assuming that a conditional probability of the one or more classes given the candidate word and the one or more words is equal to a conditional probability of the one or more classes given the one or more words.

6

6. The method of claim 1 , wherein a class of the one or more classes includes one or more subclasses.

7

7. The method of claim 1 , wherein the integrated probability is determined at least in part by dividing the conditional probability of the candidate word given the one or more classes by the probability of the candidate word within the corpus.

8

8. The method of claim 1 , wherein the integrated probability is less dependent on the conditional probability of the candidate word given the one or more classes when the probability of the candidate word within the corpus is higher than when the probability of the candidate word within the corpus is lower.

9

9. The method of claim 1 , wherein the first language model and the second language model are a same language model.

10

10. A method for processing discourse input comprising: at an electronic device with a processor and memory storing one or more programs for execution by the processor: receiving a discourse input from a user; determining a text string corresponding to the discourse input, wherein determining the text string comprises: determining, using a first language model, a conditional probability of a candidate word in the discourse input given one or more words in the discourse input; and applying a weight to the conditional probability of the candidate word given the one or more words to obtain a weighted conditional probability of the candidate word given the one or more words, wherein the weight is based on a conditional probability of the candidate word given one or more classes associated with the one or more words, and wherein the text string is based on the weighted conditional probability of the candidate word; and generating an output based on the text string.

11

11. The method of claim 10 , wherein the weight is further based on a probability of the candidate word within a corpus.

12

12. The method of claim 11 , further comprising: determining, using a second language model, the probability of the candidate word within the corpus.

13

13. The method of claim 12 , wherein the first language model and the second language model are a same language model.

14

14. The method of claim 11 , wherein the weight is inversely proportional to the probability of the candidate word within the corpus.

15

15. The method of claim 11 , wherein the weight comprises the conditional probability of the candidate word given the one or more classes divided by the probability of the candidate word within the corpus.

16

16. The method of claim 15 , wherein the weighted conditional probability of the candidate word comprises a dot product of the weight and the conditional probability of the candidate word given the one or more words.

17

17. The method of claim 10 , further comprising: determining, using a third language model, the conditional probability of the candidate word given the one or more classes.

18

18. The method of claim 10 , wherein the weight is proportional to the conditional probability of the candidate word given the one or more classes.

19

19. The method of claim 10 , wherein the weighted conditional probability of the candidate word is proportional to the conditional probability of the candidate word given the one or more words.

20

20. The method of claim 10 , wherein the weighted conditional probability of the candidate word is based on an assumption that a conditional probability of a subclass associated with the candidate word given the candidate word, the one or more words, and the one or more classes is equal to a conditional probability of the subclass associated with the candidate word given the candidate word and the one or more classes.

21

21. The method of claim 10 , wherein the weighted conditional probability of the candidate word is based on an assumption that a conditional probability of the candidate word given a subclass associated with the candidate word and the one or more classes is equal to a conditional probability of the candidate word given the subclass associated with the candidate word.

22

22. The method of claim 10 , wherein the weighted conditional probability of the candidate word is based on an assumption that a conditional probability of the one or more classes given the candidate word and the one or more words is equal to a conditional probability of the one or more classes given the one or more words.

23

23. A non-transitory computer-readable storage medium comprising instructions for causing one or more processor to: receiving a discourse input from a user; determining a text string corresponding to the discourse input, wherein determining the text string comprises: determining, using a first language model, a conditional probability of a candidate word in the discourse input given one or more words in the discourse input; and applying a weight to the conditional probability of the candidate word given the one or more words to obtain a weighted conditional probability of the candidate word given the one or more words, wherein the weight is based on a conditional probability of the candidate word given one or more classes associated with the one or more words, and wherein the text string is based on the weighted conditional probability of the candidate word; and generating an output based on the text string.

24

24. An electronic device comprising: one or more processors; memory; and one or more programs, wherein the one or more programs are stored in the memory and configured to be executed by the one or more processors, the one or more programs including instructions for: receiving a discourse input from a user; determining a text string corresponding to the discourse input, wherein determining the text string comprises: determining, using a first language model, a conditional probability of a candidate word in the discourse input given one or more words in the discourse input; and applying a weight to the conditional probability of the candidate word given the one or more words to obtain a weighted conditional probability of the candidate word given the one or more words, wherein the weight is based on a conditional probability of the candidate word given one or more classes associated with the one or more words, and wherein the text string is based on the weighted conditional probability of the candidate word; and generating an output based on the text string.

Patent Metadata

Filing Date

Unknown

Publication Date

March 28, 2017

Inventors

Jerome R. BELLEGARDA

Want to explore more patents?

Browse 5M+ US patents with plain-English claim translations and AI-generated analysis.

Citation & reuse

Analysis on this page is generated by Patentable — an AI-powered patent intelligence platform. AI-generated summaries, explanations, and analysis may be reused with attribution and a visible link back to the canonical URL below. Patent abstracts and claims are USPTO public domain.

Cite as: Patentable. “INTEGRATED WORD N-GRAM AND CLASS M-GRAM LANGUAGE MODELS” (9606986). https://patentable.app/patents/9606986

© 2026 Patentable. All rights reserved.

Patentable is a research and drafting-assistant tool, not a law firm, and does not provide legal advice. Documents we generate are drafts for review by a licensed patent attorney.