{"schema_version":"1.0","canonical_url":"https://patentable.app/patents/US-8533148","patent":{"patent_number":"US-8533148","title":"Document relevancy analysis within machine learning systems including determining closest cosine distances of training examples","assignee":null,"inventors":[],"filing_date":"2012-10-01T00:00:00.000Z","publication_date":"2013-09-10T00:00:00.000Z","cpc_codes":["G06F","G06F","G06F","G06N","G06N"],"num_claims":17,"abstract":"Systems and methods that quantify document relevance for a document relative to a training corpus and select a best match or best matches are provided herein. Methods may include generating an example-based explanation for relevancy of a document to a training corpus by executing a support vector machine classifier, the support vector machine classifier performing a centroid classification of a relevant document in a term frequency-inverse document frequency features space relative to training examples in a training corpus, and generating an example-based explanation by selecting a best match for the relevant document from the training examples based upon the centroid classification. Determining the training example having the closest cosine distance to the relevant document includes ranking the training examples by stretching the internal best match scores for the training examples linearly to cover a complete unit interval."},"analysis":{"summary":null,"layman_explanation":null,"technical_analysis":null,"business_analysis":null,"faqs":null,"topics":[],"tech_cluster":null},"seo":{"title":"Document relevancy analysis within machine learning systems including determining closest cosine distances of training examples","description":"Systems and methods that quantify document relevance for a document relative to a training corpus and select a best match or best matches are provided herein. Methods may include generating an example","keywords":[]},"attribution":{"source":"Patentable","source_url":"https://patentable.app","canonical_url":"https://patentable.app/patents/US-8533148","license":"CC-BY-4.0-like","license_terms":"AI-generated analysis on this page (summary, layman_explanation, technical_analysis, business_analysis, faqs) may be reused with attribution and a visible link back to the canonical URL above. Patent abstracts, claims, and bibliographic data are USPTO public domain.","required_link":"https://patentable.app/patents/US-8533148","citation_suggestion":"Patentable. \"Document relevancy analysis within machine learning systems including determining closest cosine distances of training examples\" (US-8533148). https://patentable.app/patents/US-8533148","copyright_holder":"Nomic Interactive Technology LLC"},"links":{"html":"https://patentable.app/patents/US-8533148","json":"https://patentable.app/api/llm-context/US-8533148","site":"https://patentable.app","llms_txt":"https://patentable.app/llms.txt"},"generated_at":"2026-05-30T10:30:07.501Z"}