{"schema_version":"1.0","canonical_url":"https://patentable.app/patents/US-12008829","patent":{"patent_number":"US-12008829","title":"System and method for improved OCR efficacy through image segmentation","assignee":null,"inventors":[],"filing_date":"2022-02-16T00:00:00.000Z","publication_date":"2024-06-11T00:00:00.000Z","cpc_codes":["G06V","G06V","G06V","G06V"],"num_claims":12,"abstract":"A method to improve the efficacy of optical character recognition (OCR) includes scanning an electronically stored representation of a whole or partial document, identifying an image having text in the electronically stored representation of a whole or partial document, identifying the text within the image, and generating a plurality of bounding boxes around the identified text using blob detection. The method also includes grouping together certain text bounding boxes of the plurality of text bounding boxes that are vertically aligned with each other to generate a plurality of aligned text bounding boxes and performing OCR on the aligned text bounding boxes to generate a plurality of OCR groups of text. In addition, the method includes generating a resultant representation of a whole or partial document electronically using the plurality of OCR groups of text and saving the resultant representation of a whole or partial document electronically."},"analysis":{"summary":null,"layman_explanation":null,"technical_analysis":null,"business_analysis":null,"faqs":null,"topics":[],"tech_cluster":null},"seo":{"title":"System and method for improved OCR efficacy through image segmentation","description":"A method to improve the efficacy of optical character recognition (OCR) includes scanning an electronically stored representation of a whole or partial document, identifying an image having text in th","keywords":[]},"attribution":{"source":"Patentable","source_url":"https://patentable.app","canonical_url":"https://patentable.app/patents/US-12008829","license":"CC-BY-4.0-like","license_terms":"AI-generated analysis on this page (summary, layman_explanation, technical_analysis, business_analysis, faqs) may be reused with attribution and a visible link back to the canonical URL above. Patent abstracts, claims, and bibliographic data are USPTO public domain.","required_link":"https://patentable.app/patents/US-12008829","citation_suggestion":"Patentable. \"System and method for improved OCR efficacy through image segmentation\" (US-12008829). https://patentable.app/patents/US-12008829","copyright_holder":"Nomic Interactive Technology LLC"},"links":{"html":"https://patentable.app/patents/US-12008829","json":"https://patentable.app/api/llm-context/US-12008829","site":"https://patentable.app","llms_txt":"https://patentable.app/llms.txt"},"generated_at":"2026-05-31T12:20:42.335Z"}