{"schema_version":"1.0","canonical_url":"https://patentable.app/patents/US-11276394","patent":{"patent_number":"US-11276394","title":"Method for re-aligning corpus and improving the consistency","assignee":null,"inventors":[],"filing_date":"2020-01-30T00:00:00.000Z","publication_date":"2022-03-15T00:00:00.000Z","cpc_codes":["G10L","G06F","G06F","G10L"],"num_claims":20,"abstract":"Vocabulary consistency for a language model may be improved by splitting a target token in an initial vocabulary into a plurality of split tokens, calculating an entropy of the target token and an entropy of the plurality of split tokens in a bootstrap language model, and determining whether to delete the target token from the initial vocabulary based on at least the entropy of the target token and the entropy of the plurality of split tokens."},"analysis":{"summary":null,"layman_explanation":null,"technical_analysis":null,"business_analysis":null,"faqs":null,"topics":[],"tech_cluster":null},"seo":{"title":"Method for re-aligning corpus and improving the consistency","description":"Vocabulary consistency for a language model may be improved by splitting a target token in an initial vocabulary into a plurality of split tokens, calculating an entropy of the target token and an ent","keywords":[]},"attribution":{"source":"Patentable","source_url":"https://patentable.app","canonical_url":"https://patentable.app/patents/US-11276394","license":"CC-BY-4.0-like","license_terms":"AI-generated analysis on this page (summary, layman_explanation, technical_analysis, business_analysis, faqs) may be reused with attribution and a visible link back to the canonical URL above. Patent abstracts, claims, and bibliographic data are USPTO public domain.","required_link":"https://patentable.app/patents/US-11276394","citation_suggestion":"Patentable. \"Method for re-aligning corpus and improving the consistency\" (US-11276394). https://patentable.app/patents/US-11276394","copyright_holder":"Nomic Interactive Technology LLC"},"links":{"html":"https://patentable.app/patents/US-11276394","json":"https://patentable.app/api/llm-context/US-11276394","site":"https://patentable.app","llms_txt":"https://patentable.app/llms.txt"},"generated_at":"2026-05-30T08:26:34.612Z"}