Legal claims defining the scope of protection, as filed with the USPTO.
1. A method comprising: receiving a video from which to identify key roles; performing video structuring on the video to identify key frames; processing faces from the key frames to generate processed faces; discovering a community from the processed faces, wherein the discovering the community comprises: correlating roles that co-occur in a scene, wherein the roles are associated with the processed faces; determining a strength of a relationship between a first role of the roles and a second role of the roles that co-occur in the scene based at least in part on a lapse of time between a first time that the first role occurs and a second time that the second role occurs in the scene; and identifying the key roles and relationships between the key roles based at least in part on the strength of the relationship; and generating a user-interface presentation that visually summarizes content of the video by depicting the key roles that have been identified.
2. A method as recited in claim 1, wherein the video includes internet protocol television (IPTV) content or video on demand (VOD) content.
3. A method as recited in claim 1, wherein performing the video structuring on the video comprises: identifying a hierarchical structure of the video, the hierarchical structure of the video including scenes, shots, and the key frames; extracting a shot from the video, wherein the shot represents a continuous section of video shot by a camera; identifying a key frame in the shot, wherein the key frame includes a plurality of images from the shot; and grouping a plurality of shots to form a scene, the user-interface presentation at least partly depicting the scene.
4. A method as recited in claim 1, wherein: the processing the faces from the key frames includes determining an importance of a role associated with at least one processed face of the processed faces; and generating the user-interface presentation is based at least in part on the importance of the role associated with the at least one processed face.
5. A method as recited in claim 1, wherein the discovering the community from the processed faces includes constructing a community graph representing interrelationships between the roles.
6. A method as recited in claim 5, wherein the community graph further represents strengths of the interrelationships between the roles.
7. A method as recited in claim 1, wherein the user-interface presentation includes a key frame style presentation based at least on a key frame representing the video in terms of one or more of color, theme, or visual quality.
8. A method as recited in claim 1, wherein the user-interface presentation includes multiple pictures arranged in a collage.
9. A method as recited in claim 1, wherein the user-interface presentation includes images of the key roles embedded on a background representative of the video in terms of one or more of color, theme, or visual quality.
10. A method as recited in claim 1, wherein the key frames include at least one face and represent a shot of the video at least in terms of color, background image, or action.
11. A method as recited in claim 1, wherein the discovering the community further comprises: determining that the first role and the second role each appear a number of times above a predetermined threshold; determining that the first role and the second role are key roles; and determining that a strength of the relationship between the first role and the second role meets or exceeds a threshold value based at least in part on the lapse of time being within a predetermined threshold of time.
12. A computer storage device having encoded thereon computer-executable instructions to configure a computer to perform operations comprising: receiving a video from which to ascertain a key role; processing faces from the video to obtain processed faces, wherein an individual processed face of the processed faces is associated with an individual role of a plurality of roles; discovering a community from the processed faces, wherein the community represents interrelationships between characters in the video, the discovering the community comprising: identifying two or more roles of the plurality of roles that co-occur in a scene; and determining a relationship between the two or more roles that co-occur in the scene within a predetermined threshold of time, wherein a strength of the relationship meets or exceeds a threshold value; ascertaining the key role from the video based at least on the two or more roles; and generating a user-interface presentation that visually summarizes content of the video, the user-interface presentation including the key role.
13. A computer storage device as recited in claim 12, wherein: processing the faces from the video includes determining an importance of the individual role; and generating the user-interface presentation is based at least in part on the importance of the individual role.
14. A computer storage device as recited in claim 12, wherein ascertaining the key role from the video is performed independent of metadata associated with the video.
15. A computer storage device as recited in claim 12, wherein discovering the community from the processed faces includes: identifying individual processed faces most frequently processed from the video and having a threshold level of relationships to other individual processed faces; and employing the individual processed faces being identified as vertices to construct a community graph including correlations between the individual processed faces.
16. A computer storage device as recited in claim 12, wherein: generating the user-interface presentation is based at least in part on at least one key frame and at least the key role; and the user-interface presentation comprises an image of at least the key role embedded on a representative background obtained from the at least one key frame.
17. A computer storage device as recited in claim 12, further comprising instructions to configure the computer to perform operations comprising: extracting a shot from the video; and identifying a key frame in the shot.
18. An apparatus comprising: a processor; and a video tool comprising: a video structuring component configured to: receive a video; analyze the video; and segment the video into hierarchical levels of scenes, shots, and key frames; a face grouping component configured to generate face clusters for faces identified in the key frames; a community discovery component configured to identify one or more key roles and relationships between the one or more key roles by: determining, from a face cluster of the face clusters, that at least one role occurs at a frequency above a predetermined threshold in a scene of the scenes; and determining a relationship between the at least one role and a second role based at least in part on a determination that the at least one role and the second role co-occur in the scene within a predetermined threshold of time, wherein a strength of the relationship meets or exceeds a threshold value; and a generation tool configured to generate a user-interface presentation that visually summarizes content of the video, the user-interface presentation based at least on the one or more key roles and the relationships.
19. An apparatus as recited in claim 18, wherein the generation tool is further configured to: receive a community graph representing a community, the community representing the one or more key roles and the relationships between the one or more key roles; and generate the user-interface presentation based at least in part on the community graph.
20. An apparatus as recited in claim 18, wherein the generation tool is further configured to: determine an importance of the one or more key roles; and generate the user-interface presentation based at least in part on the importance of the one or more key roles.
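The community-discovery steps recited in claims 1, 5, 6, and 11 can be illustrated with a minimal sketch. It is an assumption-laden illustration, not the patented implementation: the claims state only that relationship strength depends at least in part on the lapse of time between two roles occurring in a scene, so the linear decay used here, the function names `relationship_strengths` and `key_roles`, the input format, and every threshold value are hypothetical.

```python
from collections import defaultdict
from itertools import combinations


def relationship_strengths(appearances, max_lapse=10.0):
    """Accumulate a strength score for each pair of roles that co-occur in a scene.

    appearances: iterable of (scene_id, role, time_in_seconds) tuples
                 (hypothetical input format).
    max_lapse:   assumed "predetermined threshold of time" in seconds.
    """
    by_scene = defaultdict(list)
    for scene_id, role, t in appearances:
        by_scene[scene_id].append((role, t))

    strengths = defaultdict(float)
    for sightings in by_scene.values():
        for (role_a, t_a), (role_b, t_b) in combinations(sightings, 2):
            if role_a == role_b:
                continue
            lapse = abs(t_a - t_b)
            if lapse <= max_lapse:
                edge = tuple(sorted((role_a, role_b)))
                # Assumed weighting: a shorter lapse contributes more strength.
                strengths[edge] += 1.0 - lapse / max_lapse
    return dict(strengths)


def key_roles(appearances, strengths, min_count=2, min_strength=0.5):
    """Keep edges whose roles appear often enough and whose strength meets a threshold.

    Returns (sorted list of key roles, community graph as {edge: strength}).
    """
    counts = defaultdict(int)
    for _, role, _ in appearances:
        counts[role] += 1

    graph = {
        edge: s
        for edge, s in strengths.items()
        if s >= min_strength and all(counts[r] >= min_count for r in edge)
    }
    roles = sorted({r for edge in graph for r in edge})
    return roles, graph
```

The graph here is a plain dictionary keyed by role pairs, which keeps the sketch free of dependencies; a graph library could hold the same vertices and weighted edges.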
February 23, 2016