US-11509795

On-device artificial intelligence systems and methods for document auto-rotation

PublishedNovember 22, 2022

Assigneenot available in USPTO data we have

Inventorsnot available in USPTO data we have

Technical Abstract

An auto-rotation module having a single-layer neural network on a user device can convert a document image to a monochrome image having black and white pixels and segment the monochrome image into bounding boxes, each bounding box defining a connected segment of black pixels in the monochrome image. The auto-rotation module can determine textual snippets from the bounding boxes and prepare them into input images for the single-layer neural network. The single-layer neural network is trained to process each input image, recognize a correct orientation, and output a set of results for each input image. Each result indicates a probability associated with a particular orientation. The auto-rotation module can examine the results, determine what degree of rotation is needed to achieve a correct orientation of the document image, and automatically rotate the document image by the degree of rotation needed to achieve the correct orientation of the document image.

Patent Claims

8 claims

Legal claims defining the scope of protection, as filed with the USPTO.

2. The method according to claim 1, wherein the converting comprises performing an adaptive binarization on the document image in which, for every pixel in the document image, a neighborhood of pixels is examined so as to separate characters in the document image from background of the document image or from one another.

3. The method according to claim 1, wherein the bounding boxes overlap one another and wherein the connected segments of pixels do not overlap with one another.

7. The method according to claim 1, wherein the single-layer neural network is configured for outputting a probability value indicating that a zero degree of rotation or zero number of turns is needed to correct the orientation of the input textual snippet, a probability value indicating that a 90 degree of rotation or a single right or left turn is needed to correct the orientation of the input textual snippet, a probability value indicating that a 180 degree of rotation is or two right or left turns are needed to correct the orientation of the input textual snippet, and a probability value indicating that a 270 degree of rotation is or three right or left turns are needed to correct the orientation of the input textual snippet.

9. The system according to claim 8, wherein the converting comprises performing an adaptive binarization on the document image in which, for every pixel in the document image, a neighborhood of pixels is examined so as to separate characters in the document image from background of the document image or from one another.

10. The system according to claim 8, wherein the bounding boxes overlap one another and wherein the connected segments of pixels do not overlap with one another.

14. The system according to claim 8, wherein the single-layer neural network is configured for outputting a probability value indicating that a zero degree of rotation or zero number of turns is needed to correct the orientation of the input textual snippet, a probability value indicating that a 90 degree of rotation or a single right or left turn is needed to correct the orientation of the input textual snippet, a probability value indicating that a 180 degree of rotation is or two right or left turns are needed to correct the orientation of the input textual snippet, and a probability value indicating that a 270 degree of rotation is or three right or left turns are needed to correct the orientation of the input textual snippet.

16. The computer program product according to claim 15, wherein the converting comprises performing an adaptive binarization on the document image in which, for every pixel in the document image, a neighborhood of pixels is examined so as to separate characters in the document image from background of the document image or from one another.

17. The computer program product according to claim 15, wherein the bounding boxes overlap one another and wherein the connected segments of pixels do not overlap with one another.

Classification Codes (CPC)

Cooperative Patent Classification codes for this invention. Click any code to explore related patents in that topic.

G06N G06V H04N

Patent Metadata

Filing Date

June 14, 2021

Publication Date

November 22, 2022

Want to explore more patents?

Browse 5M+ US patents with plain-English claim translations and AI-generated analysis.

Browse All Patents Try Prior Art Search