Legal claims defining the scope of protection, as filed with the USPTO.
3. The method of claim 2, further comprising associating the data entity name with user-provided synonyms and/or inherited synonyms.
6. The method of claim 5, further comprising removing synonyms that are lexically similar to the respective data entity name from the matching similar words.
10. The method of claim 1, wherein the set of words of the natural language is generated using an n-gram language model for the natural language.
11. The method of claim 1, further comprising generating the plurality of word embeddings using one or more trained neural network models, wherein the one or more trained neural network models are trained on a large corpus of text of the natural language.
12. The method of claim 11, wherein the one or more neural network models includes a Word2vec model, and the plurality of word embeddings are word vectors output by the Word2vec model.
14. The method of claim 1, further comprising selecting the word similarity model from a plurality of word models.
15. The method of claim 1, further comprising, storing the semantic annotations to the published data source.
16. The method of claim 1, further comprising displaying a data visualization based on the retrieved dataset.
18. The method of claim 1, wherein generating the semantic annotations for the published data source using the trained word similarity model is performed concurrently for a plurality of data entity names of the published data source, using a distributed, multitenant-capable text search engine.
Unknown
November 1, 2022
Browse 5M+ US patents with plain-English claim translations and AI-generated analysis.