{"schema_version":"1.0","canonical_url":"https://patentable.app/patents/US-10534846","patent":{"patent_number":"US-10534846","title":"Page stream segmentation","assignee":null,"inventors":[],"filing_date":"2019-06-07T00:00:00.000Z","publication_date":"2020-01-14T00:00:00.000Z","cpc_codes":["G06F","G06F","G06V","G06V"],"num_claims":28,"abstract":"Described are system, method, and computer-program product embodiments for performing language-agnostic page stream segmentation. In some embodiments, a method includes receiving a multi-page file associated with a plurality of documents. A plurality of characters present on each page of the set of consecutive pages, including a first page and a second page, of the multi-page file can be detected. A plurality of structural data for each page can be computed based on a position and a font format for one or more of the detected characters. The plurality of structural data between the first page and the second page can be compared to determine whether the second page corresponds to a boundary between two documents of the plurality of documents. The multi-page file can be segmented at the second page in response to determining that the second page corresponds to the boundary."},"analysis":{"summary":null,"layman_explanation":null,"technical_analysis":null,"business_analysis":null,"faqs":null,"topics":[],"tech_cluster":null},"seo":{"title":"Page stream segmentation","description":"Described are system, method, and computer-program product embodiments for performing language-agnostic page stream segmentation. In some embodiments, a method includes receiving a multi-page file ass","keywords":[]},"attribution":{"source":"Patentable","source_url":"https://patentable.app","canonical_url":"https://patentable.app/patents/US-10534846","license":"CC-BY-4.0-like","license_terms":"AI-generated analysis on this page (summary, layman_explanation, technical_analysis, business_analysis, faqs) may be reused with attribution and a visible link back to the canonical URL above. Patent abstracts, claims, and bibliographic data are USPTO public domain.","required_link":"https://patentable.app/patents/US-10534846","citation_suggestion":"Patentable. \"Page stream segmentation\" (US-10534846). https://patentable.app/patents/US-10534846","copyright_holder":"Nomic Interactive Technology LLC"},"links":{"html":"https://patentable.app/patents/US-10534846","json":"https://patentable.app/api/llm-context/US-10534846","site":"https://patentable.app","llms_txt":"https://patentable.app/llms.txt"},"generated_at":"2026-05-30T09:36:35.843Z"}