Detecting Key Roles and Their Relationships from Video

Published: February 23, 2016

Assignee: not available in USPTO data we have

Inventors: Tao Mei, Xian-Sheng Hua, Shipeng Li, Yan Wang

Patent Claims

20 claims

Legal claims defining the scope of protection, as filed with the USPTO.

1. A method comprising: receiving a video from which to identify key roles; performing video structuring on the video to identify key frames; processing faces from the key frames to generate processed faces; discovering a community from the processed faces, wherein the discovering the community comprises: correlating roles that co-occur in a scene, wherein the roles are associated with the processed faces; determining a strength of a relationship between a first role of the roles and a second role of the roles that co-occur in the scene based at least in part on a lapse of time between a first time that the first role occurs and a second time that the second role occurs in the scene; and identifying the key roles and relationships between the key roles based at least in part on the strength of the relationship; and generating a user-interface presentation that visually summarizes content of the video by depicting the key roles that have been identified.

2. A method as recited in claim 1, wherein the video includes internet protocol television (IPTV) content or video on demand (VOD) content.

3. A method as recited in claim 1, wherein performing the video structuring on the video comprises: identifying a hierarchical structure of the video, the hierarchical structure of the video including scenes, shots, and the key frames; extracting a shot from the video, wherein the shot represents a continuous section of video shot by a camera; identifying a key frame in the shot, wherein the key frame includes a plurality of images from the shot; and grouping a plurality of shots to form a scene, the user-interface presentation at least partly depicting the scene.

4. A method as recited in claim 1, wherein: the processing the faces from the key frames includes determining an importance of a role associated with at least one processed face of the processed faces; and generating the user-interface presentation is based at least in part on the importance of the role associated with the at least one processed face.

5. A method as recited in claim 1, wherein the discovering the community from the processed faces includes constructing a community graph representing interrelationships between the roles.

6. A method as recited in claim 5, wherein the community graph further represents strengths of the interrelationships between the roles.

7. A method as recited in claim 1, wherein the user-interface presentation includes a key frame style presentation based at least on a key frame representing the video in terms of one or more of color, theme, or visual quality.

8. A method as recited in claim 1, wherein the user-interface presentation includes multiple pictures arranged in a collage.

9. A method as recited in claim 1, wherein the user-interface presentation includes images of the key roles embedded on a background representative of the video in terms of one or more of color, theme, or visual quality.

10. A method as recited in claim 1, wherein the key frames include at least one face and represent a shot of the video at least in terms of color, background image, or action.

11. A method as recited in claim 1, wherein the discovering the community further comprises: determining that the first role and the second role each appear a number of times above a predetermined threshold; determining that the first role and the second role are key roles; and determining that a strength of the relationship between the first role and the second role meets or exceeds a threshold value based at least in part on the lapse of time being within a predetermined threshold of time.

12. A computer storage device having encoded thereon computer-executable instructions to configure a computer to perform operations comprising: receiving a video from which to ascertain a key role; processing faces from the video to obtain processed faces, wherein an individual processed face of the processed faces is associated with an individual role of a plurality of roles; discovering a community from the processed faces, wherein the community represents interrelationships between characters in the video, the discovering the community comprising: identifying two or more roles of the plurality of roles that co-occur in a scene; and determining a relationship between the two or more roles that co-occur in the scene within a predetermined threshold of time, wherein a strength of the relationship meets or exceeds a threshold value; ascertaining the key role from the video based at least on the two or more roles; and generating a user-interface presentation that visually summarizes content of the video, the user-interface presentation including the key role.

13. A computer storage device as recited in claim 12, wherein: processing the faces from the video includes determining an importance of the individual role; and generating the user-interface presentation is based at least in part on the importance of the individual role.

14. A computer storage device as recited in claim 12, wherein ascertaining the key role from the video is performed independent of metadata associated with the video.

15. A computer storage device as recited in claim 12, wherein discovering the community from the processed faces includes: identifying individual processed faces most frequently processed from the video and having a threshold level of relationships to other individual processed faces; and employing the individual processed faces being identified as vertices to construct a community graph including correlations between the individual processed faces.

16. A computer storage device as recited in claim 12, wherein: generating the user-interface presentation is based at least in part on at least one key frame and at least the key role; and the user-interface presentation comprises an image of at least the key role embedded on a representative background obtained from the at least one key frame.

17. A computer storage device as recited in claim 12, further comprising instructions to configure the computer to perform operations comprising: extracting a shot from the video; and identifying a key frame in the shot.

18. An apparatus comprising: a processor; and a video tool comprising: a video structuring component configured to: receive a video; analyze the video; and segment the video into hierarchical levels of scenes, shots, and key frames; a face grouping component configured to generate face clusters for faces identified in the key frames; a community discovery component configured to identify one or more key roles and relationships between the one or more key roles by: determining, from a face cluster of the face clusters, that at least one role occurs at a frequency above a predetermined threshold in a scene of the scenes; and determining a relationship between the at least one role and a second role based at least in part on a determination that the at least one role and the second role co-occur in the scene within a predetermined threshold of time, wherein a strength of the relationship meets or exceeds a threshold value; and a generation tool configured to generate a user-interface presentation that visually summarizes content of the video, the user-interface presentation based at least on the one or more key roles and the relationships.

19. An apparatus as recited in claim 18, wherein the generation tool is further configured to: receive a community graph representing a community, the community representing the one or more key roles and the relationships between the one or more key roles; and generate the user-interface presentation based at least in part on the community graph.

20. An apparatus as recited in claim 18, wherein the generation tool is further configured to: determine an importance of the one or more key roles; and generate the user-interface presentation based at least in part on the importance of the one or more key roles.
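
Illustrative sketch (not part of the patent text)

The community-discovery step recited in claims 1, 5, 6, and 11 can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the patented implementation. It assumes face grouping has already mapped each detected face to a role label, so the input is a list of (scene, role, time) records. The function name `discover_community`, the parameters `max_lapse`, `min_count`, and `min_strength`, and the weighting `1 / (1 + lapse)` are all hypothetical: the claims say only that relationship strength depends on the lapse of time between two roles occurring in the same scene and that thresholds are applied.

```python
from collections import Counter, defaultdict
from itertools import combinations


def discover_community(appearances, max_lapse=5.0, min_count=2, min_strength=1.0):
    """Return (key_roles, edges) from (scene_id, role, time_seconds) records.

    All thresholds and the strength formula are illustrative assumptions.
    """
    # How often each role appears anywhere in the video.
    counts = Counter(role for _, role, _ in appearances)

    # Group appearances by scene so only co-occurring roles are compared.
    by_scene = defaultdict(list)
    for scene, role, t in appearances:
        by_scene[scene].append((role, t))

    # Accumulate a relationship strength for every pair of distinct roles
    # that appear in the same scene within max_lapse seconds of each other.
    strength = defaultdict(float)
    for items in by_scene.values():
        for (r1, t1), (r2, t2) in combinations(items, 2):
            if r1 == r2:
                continue
            lapse = abs(t1 - t2)
            if lapse <= max_lapse:
                # Assumed weighting: a shorter lapse contributes more.
                strength[frozenset((r1, r2))] += 1.0 / (1.0 + lapse)

    # Keep edges whose strength meets the threshold and whose endpoints
    # both appear often enough; these edges form the community graph.
    frequent = {r for r, c in counts.items() if c >= min_count}
    edges = {
        pair: s
        for pair, s in strength.items()
        if s >= min_strength and pair <= frequent
    }

    # Key roles are the vertices that survive in the community graph.
    key_roles = set().union(*edges)
    return key_roles, edges


if __name__ == "__main__":
    sample = [
        ("s1", "A", 0.0), ("s1", "B", 1.0), ("s1", "C", 30.0),
        ("s2", "A", 100.0), ("s2", "B", 100.0), ("s2", "C", 103.0),
    ]
    roles, graph = discover_community(sample)
    print(sorted(roles), graph)
```

In the sample data, roles A and B appear close together in both scenes, so their edge passes the strength threshold, while C co-occurs only weakly and is not reported as a key role. The returned edge dictionary plays the part of the community graph of claims 5 and 6, with the stored values standing in for the strengths of the interrelationships.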

Patent Metadata

Filing Date: Unknown

Publication Date: February 23, 2016

Inventors: Tao Mei, Xian-Sheng Hua, Shipeng Li, Yan Wang
