Legal claims defining the scope of protection, as filed with the USPTO.
1. An information processing system for detecting objects in a digital image, the information processing system comprising: a memory; a processor communicatively coupled to the memory; and an object detection system communicatively coupled to the memory and the processor, the object detection system configured to perform a method comprising: receiving at least one image representing at least one frame of a video sequence of an external environment; extracting a given color channel of the at least one image; identifying at least one blob that stands out from a background of the given color channel that has been extracted; extracting one or more features from the at least one blob that has been identified; providing the one or more features that have been extracted to a plurality of pre-learned object models each comprising a set of pre-defined features associated with a pre-defined blob type; comparing, in response to the providing, the one or more features to the set of pre-defined features of each of the plurality of pre-learned object models; determining, based on the comparing, that the at least one blob is of a type that substantially matches a pre-defined blob type associated with one of the pre-learned object models; and visually indicating, in response to the determining, at least a location of an object within the at least one image that corresponds to the at least one blob.
2. The information processing system of claim 1 , wherein the method performed by the information processing system further comprises: applying a bounding box around the at least one blob; analyzing two or more corner regions of the bounding box; and determining a rough shape of the at least one blob based on the two or more corner regions comprising one of a portion of background and a potential portion of the at least one blob.
3. The information processing system of claim 1 , wherein the method performed by the information processing system further comprises: determining that a classification probability of the at least one blob with respect to the plurality of pre-learned object models is below a given threshold; comparing the at least one blob with one or more blobs from at least one preceding image that has been assigned a blob type from the plurality of pre-learned object models; determining that the at least one blob substantially matches with at least one of the one or more blobs; and assigning the at least one blob to the blob type that has been assigned to the at least one of the one or more blobs in the at least one preceding image.
4. The information processing system of claim 1 , wherein the method performed by the information processing system further comprises: tracking the object for at least one or more subsequent images, wherein the tracking comprises: identifying a search area S in the one or more subsequent images; and localizing the object within S.
5. The information processing system of claim 4 , wherein identifying the search area S comprises: determining a size of S based on a size of the object within the at least one image, and a moving direction of the object, wherein the moving direction is based on a displacement of the object in an x-direction and a y-direction in a preceding image of the at least one image, wherein the size of the search area S is further determined based on a size change of the object from a preceding image to the at least one image and a size of the object in the at least one image.
6. A computer program product for detecting objects in a digital image, the computer program product comprising: a non-transitory storage medium readable by a processing circuit and storing instructions for execution by the processing circuit for performing a method comprising: receiving at least one image representing at least one frame of a video sequence of an external environment; extracting a given color channel of the at least one image; identifying at least one blob that stands out from a background of the given color channel that has been extracted; extracting one or more features from the at least one blob that has been identified; providing the one or more features that have been extracted to a plurality of pre-learned object models each comprising a set of pre-defined features associated with a pre-defined blob type; comparing, in response to the providing, the one or more features to the set of pre-defined features of each pre-learned object model; determining, based on the comparing, that the at least one blob is of a type that substantially matches a pre-defined blob type associated with one of the pre-learned object models; and visually indicating, in response to the determining, at least a location of an object within the at least one image that corresponds to the at least one blob.
7. The computer program product of claim 6 , wherein the instructions are further for: applying a bounding box around the at least one blob; analyzing two or more corner regions of the bounding box; and determining a rough shape of the at least one blob based on the two or more corner regions comprising one of a portion of background and a potential portion of the at least one blob.
8. The computer program product of claim 6 , wherein the instructions are further for: determining that a classification probability of the at least one blob with respect to the plurality of pre-learned object models is below a given threshold; comparing the at least one blob with one or more blobs from at least one preceding image that has been assigned a blob type from the plurality of pre-learned object models; determining that the at least one blob substantially matches with at least one of the one or more blobs; and assigning the at least one blob to the blob type that has been assigned to the at least one of the one or more blobs in the at least one preceding image.
9. The computer program product of claim 6 , wherein the instructions are further for: tracking the object for at least one or more subsequent images, wherein the tracking comprises: identifying a search area S in the one or more subsequent images; and localizing the object within S.
10. The computer program product of claim 8 , wherein identifying the search area S comprises: determining a size of S based on a size of the object within the at least one image, and a moving direction of the object, wherein the moving direction is based on a displacement of the object in an x-direction and a y-direction in a preceding image of the at least one image, wherein the size of the search area S is further determined based on a size change of the object from a preceding image to the at least one image and a size of the object in the at least one image.
Unknown
August 13, 2013
Browse 5M+ US patents with plain-English claim translations and AI-generated analysis.