An image is passed through an image identifier to identify a coarse category for the image and a bounding box for a categorized object. A mask is used to identify the portion of the image that represents the object. Given the foreground mask, the convex hull of the mask is located and an aligned rectangle of minimum area that encloses the hull is fitted. The aligned bounding box is rotated and scaled, so that the foreground object is roughly moved to a standard orientation and size (referred to as calibrated). The calibrated image is used as an input to a fine-grained categorization module, which determines the fine category within the coarse category for the input image.
Legal claims defining the scope of protection, as filed with the USPTO.
1. A method comprising: accessing an image; identifying, using one or more hardware processors, an object in the image, the identified object being associated with a coarse category; based on the coarse category, analyzing the image to identify a fine category; and causing presentation of information related to the identified fine category.
2. The method of claim 1 , further comprising: determining a bounding box that encompasses the object in the image.
3. The method of claim 1 , further comprising: performing a search for the information related to the identified fine category.
4. The method of claim 1 , wherein the fine category is a sub-category of the coarse category.
5. The method of claim 1 , wherein the fine category is identifiable by a name.
6. The method of claim 1 , further comprising: rotating at least a portion of the image based on orientation.
7. The method of claim 1 , wherein the fine category is identified from a set of available fine categories for the coarse category.
8. A system comprising: one or more hardware processors; and a storage medium storing instructions that; when executed by the one or more hardware processors, causes the one or more hardware processors to perform operations comprising: accessing an image; identifying an object in the image, the identified object being associated with a coarse category; based on the coarse category, analyzing the image to identify a fine category; and causing presentation of information related to the identified fine category.
9. The system of claim 8 , wherein the operations further comprise: determining a bounding box that encompasses the object in the image.
10. The system of claim 8 , wherein the operations further comprise: performing a search for the information related to the identified fine category.
11. The system of claim 8 , wherein the fine category is a sub-category of the coarse category.
12. The system of claim 8 , wherein the fine category is identifiable by a name.
13. The system of claim 8 , wherein the operations further comprise: rotating at least a portion of the image based on orientation.
14. The system of claim 8 , wherein the fine category is identified from a set of available fine categories for the coarse category.
15. A machine-readable storage device storing instructions that, when executed by one or more processors of a machine, cause the machine to perform operations comprising: accessing an image; identifying an object in the image, the identified object being associated with a coarse category; based on the coarse category, analyzing the image to identify a fine category; and causing presentation of information related to the identified fine category.
16. The machine-readable storage device of claim 15 , wherein the operations further comprise: determining a bounding box that encompasses the object in the image.
17. The machine-readable storage device of claim 15 , wherein the operations further comprise: performing a search for the information related to the identified fine category.
18. The machine-readable storage device of claim 15 , wherein the fine category is a sub-category of the coarse category.
19. The machine-readable storage device of claim 15 , wherein the fine category is identifiable by a name.
20. The machine-readable storage device of claim 15 , wherein the operations further comprise: rotating at least a portion of the image based on orientation.
Cooperative Patent Classification codes for this invention. Click any code to explore related patents in that topic.
March 28, 2019
January 5, 2021
Browse 5M+ US patents with plain-English claim translations and AI-generated analysis.