Low complexity event detection for video programs

Published June 5, 2012

Assignee not available in USPTO data we have

Inventors not available in USPTO data we have

Technical Abstract

A video processing method for detecting significant events from a video program includes computing short-time sub-band energies in the audio for plural audio sub-bands, detecting scene boundaries where a weighted sum of these short-time sub-band energies is less than an energy threshold for longer than a time interval, segmenting the video program into a plurality of scenes at the boundaries, removing scenes shorter than a segment time interval, and classifying and ranking the remaining scenes by audio. A second segmenting and removal is based upon a second energy threshold and a second time interval or when energy in a lowest frequency sub-band is greater than a predetermined bass energy threshold. The first segment time interval may be recomputed based upon the distribution of lengths of the remaining scenes.
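As an illustrative sketch only, and not the patented implementation, the first two steps of the abstract (computing short-time sub-band energies, then finding stretches where their weighted sum stays below a threshold) might look as follows in Python. The function names, the naive DFT, the non-overlapping 64-sample frames, the equal-width bands, the weights, and every numeric value are assumptions chosen for the example; none of them come from the patent.

```python
import cmath
import math


def subband_energies(samples, frame_len=64, n_bands=4):
    """Return one list of n_bands energies per non-overlapping frame."""
    half = frame_len // 2              # DFT bins below Nyquist: 0 .. half-1
    band_width = half // n_bands       # equal-width bands (an assumption)
    frames = []
    for start in range(0, len(samples) - frame_len + 1, frame_len):
        frame = samples[start:start + frame_len]
        # Naive DFT, magnitude squared per bin; adequate for a small demo.
        power = []
        for k in range(half):
            acc = sum(x * cmath.exp(-2j * math.pi * k * n / frame_len)
                      for n, x in enumerate(frame))
            power.append(abs(acc) ** 2)
        frames.append([sum(power[b * band_width:(b + 1) * band_width])
                       for b in range(n_bands)])
    return frames


def quiet_runs(frames, weights, energy_threshold, min_frames):
    """Return (start, end) frame ranges, end exclusive, where the weighted
    sum of sub-band energies stays below energy_threshold for more than
    min_frames consecutive frames."""
    runs, run_start = [], None
    for i, bands in enumerate(frames + [None]):   # sentinel flushes last run
        quiet = (bands is not None and
                 sum(w * e for w, e in zip(weights, bands)) < energy_threshold)
        if quiet and run_start is None:
            run_start = i
        elif not quiet and run_start is not None:
            if i - run_start > min_frames:
                runs.append((run_start, i))
            run_start = None
    return runs


def tone(n, cycles_per_frame=5.0, frame_len=64):
    """Synthetic test tone (hypothetical helper, for the demo only)."""
    return [math.sin(2 * math.pi * cycles_per_frame * t / frame_len)
            for t in range(n)]


# Four frames of tone, three frames of silence, four frames of tone.
signal = tone(64 * 4) + [0.0] * (64 * 3) + tone(64 * 4)
frames = subband_energies(signal)
print(quiet_runs(frames, weights=[1, 1, 1, 1],
                 energy_threshold=1.0, min_frames=2))   # [(4, 7)]
```

The three silent frames (indices 4 to 6) form the only run longer than two frames, so they are reported as a single candidate boundary. A production system would presumably use an FFT and overlapping windows; the naive DFT is used here only to keep the sketch dependency-free.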

Patent Claims

2 claims

Legal claims defining the scope of protection, as filed with the USPTO.

1. A video processing method for detecting significant events from a video program comprising the steps of:
computing short-time sub-band energies in an audio of the video program for plural audio sub-bands;
detecting scene boundaries from the audio of the video program by detecting whether the short-time sub-band energies are less than a predetermined first energy threshold for longer than a predetermined first energy time interval;
segmenting the video program into a plurality of scenes using the detected scene boundaries as boundaries;
removing scenes from the video program having a length shorter than a first segment time interval;
following said step of removing scenes detecting second scene boundaries in the audio of the video program by detecting whether the short-time sub-band energies are less than a predetermined second energy threshold for longer than a predetermined second energy time interval, and whether the short-time sub-band energy in a lowest frequency sub-band is greater than a predetermined bass energy threshold;
further segmenting the video program into a plurality of scenes using the detected second scene boundaries as boundaries;
further removing scenes from the video program having a length shorter than a predetermined second segment time interval; and
classifying and ranking remaining scenes based upon short-time sub-band energy in the audio from largest to smallest.
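The segmenting, removing, and ranking steps of claim 1 can be pictured with a small Python sketch. It is a hedged illustration under stated assumptions, not the claimed method itself: scenes and boundaries are represented as frame-index ranges, the helper names are hypothetical, and ranking by mean per-frame energy is one possible reading of "based upon short-time sub-band energy", which the claim does not spell out.

```python
def split_scenes(n_frames, boundaries):
    """Scenes are the frame ranges left between boundary runs.
    All ranges are (start, end) with end exclusive."""
    scenes, cursor = [], 0
    for b_start, b_end in sorted(boundaries):
        if b_start > cursor:
            scenes.append((cursor, b_start))
        cursor = max(cursor, b_end)
    if cursor < n_frames:
        scenes.append((cursor, n_frames))
    return scenes


def drop_short(scenes, min_len):
    """Remove scenes shorter than min_len frames."""
    return [(s, e) for s, e in scenes if e - s >= min_len]


def rank_by_energy(scenes, frame_energy):
    """Order scenes from largest to smallest mean per-frame energy
    (the mean is an assumption made for this sketch)."""
    def mean_energy(scene):
        s, e = scene
        return sum(frame_energy[s:e]) / (e - s)
    return sorted(scenes, key=mean_energy, reverse=True)


# Made-up per-frame energies and already-detected low-energy boundary runs.
frame_energy = [5, 5, 5, 0, 0, 9, 0, 0, 2, 2, 2, 2, 0, 0, 8, 8, 8]
boundaries = [(3, 5), (6, 8), (12, 14)]

scenes = split_scenes(len(frame_energy), boundaries)
kept = drop_short(scenes, min_len=2)
ranked = rank_by_energy(kept, frame_energy)

print(scenes)   # [(0, 3), (5, 6), (8, 12), (14, 17)]
print(kept)     # [(0, 3), (8, 12), (14, 17)]
print(ranked)   # [(14, 17), (0, 3), (8, 12)]
```

The one-frame scene at index 5 is discarded even though it is loud, which matches the claim's order of operations: short scenes are removed before ranking. The second pass in the claim could plausibly reuse the same helpers with the second threshold, second interval, and bass condition supplying a new `boundaries` list, though how those conditions combine is left to the claim language.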

2. The method of claim 1, further comprising: following said step of removing scenes: recomputing the first segment time interval based upon a distribution of length of the remaining scenes; and further removing scenes from the video program having a length shorter than the recomputed first segment time interval.
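Claim 2 does not say which statistic of the length distribution drives the recomputation. Purely as an assumption for illustration, the Python sketch below sets the new minimum length to a fraction of the median scene length; the function names, the `fraction` parameter, and the sample scene list are all hypothetical.

```python
import statistics


def recompute_min_length(scenes, fraction=0.5):
    """New minimum scene length as a fraction of the median length.
    Both the median and the fraction are assumptions, not from the claim."""
    if not scenes:
        return 0.0
    lengths = [end - start for start, end in scenes]
    return fraction * statistics.median(lengths)


def remove_shorter_than(scenes, min_len):
    """Keep only scenes at least min_len frames long."""
    return [(s, e) for s, e in scenes if e - s >= min_len]


# Scenes that survived the first removal step, as (start, end) frame ranges.
remaining = [(0, 40), (50, 58), (60, 110), (120, 160), (170, 175)]

new_min = recompute_min_length(remaining)
print(new_min)                                  # 20.0
print(remove_shorter_than(remaining, new_min))  # [(0, 40), (60, 110), (120, 160)]
```

Tying the threshold to the observed lengths lets the cutoff adapt to the program: material with long scenes gets a higher cutoff than material with short ones, without retuning a fixed constant.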

Classification Codes (CPC)

Cooperative Patent Classification codes for this invention.

H04N

Patent Metadata

Filing Date

April 13, 2009

Publication Date

June 5, 2012
