9940384

Statistical Clustering Inferred From Natural Language to Drive Relevant Analysis and Conversation With Users

PublishedApril 10, 2018
Assigneenot available in USPTO data we have
Technical Abstract

Patent Claims
20 claims

Legal claims defining the scope of protection, as filed with the USPTO.

1

1. A method, in a data processing system, for statistical clustering inferred from natural language to drive relevant analysis, the method comprising: receiving a natural language text from a user; processing the natural language text to identify an entity of interest and a focus of statistical analysis; performing a follow-up question and answer conversation with the user to receive from the user one or more driving factor values for one or more driving factors for the focus of the statistical analysis; determining at least one cluster of entities matching the one or more driving factor values; and generating at least one data visualization of the data in a corpus for the focus of statistical analysis having a scope that is narrowed based on the at least one cluster of entities matching the one or more driving factor values.

2

2. The method of claim 1 , wherein performing a follow-up question and answer conversation comprises performing a clustering operation on data in the corpus for the focus of statistical analysis and determining one or more driving factors for the focus of the statistical analysis based on results of the clustering operation.

3

3. The method of claim 2 , wherein performing a follow-up question and answer conversation comprises detecting the most important driving factors for the focus of analysis based on the results of the clustering operation.

4

4. The method of claim 1 , wherein performing a follow-up question and answer conversation comprises generating one or more follow-up questions to be presented to the user to gather information required about the entity of interest and receiving responses to the one or more questions from the user.

5

5. The method of claim 4 , wherein performing a follow-up question and answer conversation further comprises parsing the responses to determine values for attributes that form the driving factors.

6

6. The method of claim 4 , wherein generating one or more follow-up questions comprises using slot tiller templates to generate the follow-up questions.

7

7. The method of claim 1 , wherein determining at least one cluster of entities matching the one or more driving factor values comprises creating clusters based on the driving factors and matching the entity of interest to at least one of the clusters.

8

8. A computer program product comprising a computer readable storage medium having a computer readable program stored therein, wherein the computer readable program, when executed on a computing device, causes the computing device to: receive a natural language text from a user; process the natural language text to identify an entity of interest and a focus of statistical analysis; perform a follow-up question and answer conversation with the user to receive from the user one or more driving factor values for one or more driving factors for the focus of the statistical analysis; determine at least one cluster of entities matching the one or more driving factor values; and generate at least one data visualization of the data in a corpus for the focus of statistical analysis having a scope that is narrowed based on the at least one cluster of entities matching the one or more driving factor values.

9

9. The computer program product of claim 8 , wherein performing a follow-up question and answer conversation comprises performing a clustering operation on data in the corpus for the focus of statistical analysis and determining one or more driving factors for the focus of the statistical analysis based on results of the clustering operation.

10

10. The computer program product of claim 9 , wherein performing a follow-up question and answer conversation comprises detecting the most important driving factors for the focus of analysis based on the results of the clustering operation.

11

11. The computer program product of claim 8 , wherein performing a follow-up question and answer conversation comprises generating one or more follow-up questions to be presented to the user to gather information required about the entity of interest and receiving responses to the one or more questions from the user.

12

12. The computer program product of claim 11 , wherein performing a follow-up question and answer conversation further comprises parsing the responses to determine values for attributes that form the driving factors.

13

13. The computer program product of claim 11 , wherein generating one or more follow-up questions comprises using slot filler templates to generate the follow-up questions.

14

14. The computer program product of claim 8 , wherein determining at least one cluster of entities matching the one or more driving factor values comprises creating clusters based on the driving factors and matching the entity of interest to at least one of the clusters.

15

15. An apparatus comprising: a processor; and a memory coupled to the processor, wherein the memory comprises instructions which, When executed by the processor, cause the processor to: receive a natural language text from a user; process the natural language text to identify an entity of interest and a focus of statistical analysis; perform a follow-up question and answer conversation with the user to receive from the user one or more driving factor values for one or more driving factors for the focus of the statistical analysis; determine at least one cluster of entities matching the one or more driving factor values; and generate at least one data visualization of the data in a corpus for the focus of statistical analysis having a scope that is narrowed based on the at least one cluster of entities matching the one or more driving factor values.

16

16. The apparatus of claim 15 , wherein performing a follow-up question and answer conversation comprises performing a clustering operation on data in the corpus for the focus of statistical analysis and determining one or more driving factors for the focus of the statistical analysis based on results of the clustering operation.

17

17. The apparatus of claim 16 , wherein performing a follow-up question and answer conversation comprises detecting the most important driving factors for the focus of analysis based on the results of the clustering operation.

18

18. The apparatus of claim 15 , wherein performing a follow-up question and answer conversation comprises generating one or more follow-up questions to be presented to the user to gather information required about the entity of interest and receiving responses to the one or more questions from the user.

19

19. The apparatus of claim 18 , wherein performing a follow-up question and answer conversation further comprises parsing the responses to determine values for attributes that form the driving factors.

20

20. The apparatus of claim 15 , wherein determining at least one cluster of entities matching the one or more driving factor values comprises creating clusters based on the driving factors and matching the entity of interest to at least one of the clusters.

Patent Metadata

Filing Date

Unknown

Publication Date

April 10, 2018

Inventors

Stephen D. Gibson
Alireza Pourshahid
Vinay N. Wadhwa
Graham A. Watts

Want to explore more patents?

Browse 5M+ US patents with plain-English claim translations and AI-generated analysis.

Citation & reuse

Analysis on this page is generated by Patentable — an AI-powered patent intelligence platform. AI-generated summaries, explanations, and analysis may be reused with attribution and a visible link back to the canonical URL below. Patent abstracts and claims are USPTO public domain.

Cite as: Patentable. “Statistical Clustering Inferred From Natural Language to Drive Relevant Analysis and Conversation With Users” (9940384). https://patentable.app/patents/9940384

© 2026 Patentable. All rights reserved.

Patentable is a research and drafting-assistant tool, not a law firm, and does not provide legal advice. Documents we generate are drafts for review by a licensed patent attorney.