US-10803350

Object detection and image cropping using a multi-detector approach

PublishedOctober 13, 2020

Assigneenot available in USPTO data we have

Inventorsnot available in USPTO data we have

Technical Abstract

Computer-implemented methods for detecting objects within digital image data based on color transitions include: receiving or capturing a digital image depicting an object; sampling color information from a first plurality of pixels of the digital image; optionally sampling color information from a second plurality of pixels of the digital image; generating or receiving a representative background color profile based on the color information sampled from the first plurality of pixels; generating or receiving a representative foreground color profile based on the color information sampled from the second plurality of pixels and/or the first plurality of pixels; assigning each pixel a label; binarizing the digital image based on the labels; detecting contour(s) within the binarized digital image; and defining edge(s) of the object based on the detected contour(s). Corresponding systems and computer program products configured to perform the inventive methods are also described.

Patent Claims

18 claims

Legal claims defining the scope of protection, as filed with the USPTO.

1. A computer-implemented method of detecting objects within digital image data based at least in part on color transitions within the digital image data, the method comprising: receiving or capturing a digital image depicting an object; analyzing the digital image data using one or more color transition detectors, each color transition detector being independently configured to detect one or more objects within digital images according to a unique set of analysis parameters; determining a confidence score for each of a plurality of analysis results produced by the one or more color transition detectors; selecting the analysis result having a highest confidence score among the plurality of analysis results as an optimum object location result; and either or both of: outputting, based on the optimum object location result, a projected location of one or more edges of the object to a memory; and rendering, based on the optimum object location result, a projected location of the one or more edges of the object on a display.

2. A computer-implemented method of detecting objects within digital image data based at least in part on color transitions within the digital image data, the method comprising: receiving or capturing a digital image depicting an object; sampling color information from a first plurality of pixels of the digital image, wherein each of the first plurality of pixels is located in a background region of the digital image; optionally sampling color information from a second plurality of pixels of the digital image, wherein each of the second plurality of pixels is located in a foreground region of the digital image; generating or receiving a representative background color profile, the representative background color profile being based on the color information sampled from the first plurality of pixels; generating or receiving a representative foreground color profile based on the color information sampled from the second plurality of pixels and/or the color information sampled from the first plurality of pixels; assigning each pixel within the digital image a label of either foreground or background using an adaptive label learning process; binarizing the digital image based on the labels assigned to each pixel; detecting one or more contours within the binarized digital image; and defining one or more edges of the object based on the detected contour(s).

3. The method as recited in claim 2 , wherein the adaptive label learning process comprises: selecting or estimating at least one initial Gaussian model of the representative foreground color profile and/or the representative background color profile; and performing a maximum likelihood analysis of un-labeled pixels of the digital image using the at least one initial Gaussian model.

4. The method as recited in claim 3 , wherein the adaptive learning process comprises a plurality of iterations; and wherein for each iteration of the adaptive learning process, one or more Gaussian models of the representative foreground color profile and/or the representative background color profile is/are updated based on labels assigned to pixels in an immediately previous iteration of the adaptive learning process.

5. The method as recited in claim 4 , wherein the adaptive learning process comprises performing the plurality of iterations until parameters of the one or more Gaussian models achieve convergence.

6. The method as recited in claim 5 , wherein convergence is achieved within about 4 to about 8 iterations of the adaptive learning process.

7. The method as recited in claim 3 , wherein the maximum likelihood analysis comprises minimizing a total potential energy across all pixels within the digital image based on the representative foreground color profile and the representative background color profile, wherein a potential energy of each pixel comprises: a negative log likelihood of a Gaussian model; and an interaction energy β describing a probability of adjacent pixels exhibiting a transition from one color to another.

10. The method as recited in claim 2 , further comprising performing a color space transformation on the digital image.

11. The method as recited in claim 10 , wherein the color space transformation comprises a RGB to CIELUV transformation.

12. The method as recited in claim 10 , wherein the color space transformation produces a plurality of Luv vectors; and wherein each Luv vector is modeled as a random vector in a 3D CIELUV color space.

13. The method as recited in claim 2 , wherein generating the representative foreground color profile based on the color information sampled from the first plurality of pixels comprises inverting color values of the first plurality of pixels.

14. The method as recited in claim 2 , wherein the foreground region comprises a central region of the digital image.

15. The method as recited in claim 14 , wherein the central region comprises approximately 20% of a total area of the digital image.

16. The method as recited in claim 2 , further comprising downscaling the received or captured digital image, wherein the downscaling preserves an aspect ratio of the received or captured digital image.

17. The method as recited in claim 2 , wherein the object is surrounded by either at least 2 rows of background pixels or at least 2 columns of background pixels on each side.

18. The method as recited in claim 2 , further comprising computing a segmentation confidence score for the defined edge(s) of the object using one or more measures selected from the group consisting of: edge strength, angle between adjacent edges of the object, angle between opposite edges of the object, color contrast between foreground and background of the image, a least mean squares fitness, and combinations thereof.

19. The method as recited in claim 2 , wherein a first edge of the object is defined based on a largest of the detected contours; and wherein additional edges of the object are derived by a least mean squares fitting process.

20. A computer program product for detecting objects within digital image data based at least in part on color transitions within the digital image data, the computer program product comprising a computer readable storage medium having embodied therewith computer readable program instructions configured to cause a processor, upon execution of the computer readable program instructions, to perform a method comprising: receiving or capturing a digital image depicting an object; sampling, using the processor, color information from a first plurality of pixels of the digital image, wherein each of the first plurality of pixels is located in a background region of the digital image; optionally sampling, using the processor, color information from a second plurality of pixels of the digital image, wherein each of the second plurality of pixels is located in a foreground region of the digital image; generating, using the processor, or receiving, by the processor, a representative background color profile, the representative background color profile being based on the color information sampled from the first plurality of pixels; generating, using the processor, or receiving, by the processor, a representative foreground color profile based on the color information sampled from the second plurality of pixels and/or the color information sampled from the first plurality of pixels; assigning, using the processor, each pixel within the digital image a label of either foreground or background using an adaptive label learning process; binarizing, using the processor, the digital image based on the labels assigned to each pixel; detecting, using the processor, one or more contours within the binarized digital image; and defining, using the processor, one or more edges of the object based on the detected contour(s).

Classification Codes (CPC)

Cooperative Patent Classification codes for this invention. Click any code to explore related patents in that topic.

G06V G06F G06N G06T

Patent Metadata

Filing Date

November 30, 2018

Publication Date

October 13, 2020

Want to explore more patents?

Browse 5M+ US patents with plain-English claim translations and AI-generated analysis.

Browse All Patents Try Prior Art Search