Legal claims defining the scope of protection, as filed with the USPTO.
1. A method at a first client conferencing system associated with a first participant of a videoconference and operably in communication with at least a second client conferencing system and a third client conferencing system associated with a second participant and a third participant, respectively, of the videoconference, the method comprising: generating first metadata that includes that information that associates the first participant, the second participant, and the third participant with an adjusted eye gaze for the first participant based on a location of the second participant and the third participant on a first display on the first client conferencing system; and modifying an eye region of the first participant according to the generated first metadata on a second display on the second client conferencing system and on a third display on the third client conferencing system; wherein the first client conferencing system comprises at least a first display, and the second client conferencing system comprises a first video camera and a second display, and wherein the third client conferencing system comprises a second video camera, and wherein the method further comprising receiving, from the third client conferencing system, a second video signal of the third participant, a second video signal being acquired by the second video camera; and displaying the received second video signal on a second area of the first display; wherein the first client conferencing system further comprises a third video camera, and wherein the method further comprises acquiring, by the third video camera, a third video signal, the third video signal comprising at least one second video frame including an image of the first participant looking at a fourth area of the first display configured to display a video signal of a fourth participant of the videoconference; determining, from the image, a gaze direction of the first participant toward a position within the fourth area of the first display; generating second metadata associated with said at least one second video frame and including, based on the determined gaze direction, an identity of the fourth participant of the videoconference; and sending the second metadata with the associated second video frame to at least one client conferencing system associated with a participant of the videoconference; wherein: either the fourth participant corresponds to said second participant, and the fourth area of the first display corresponds to a first area configured to display the first video signal; or the fourth participant corresponds to said third participant and the fourth area of the first display corresponds to said second area configured to display the second video signal; and wherein sending the second metadata with the associated second video frame to at least one client conferencing system associated with a participant of the videoconference comprises sending the second metadata to at least the third client conferencing system, if the second metadata includes an identity of the second participant, or sending the second metadata to at least the second client conferencing system, if the second metadata includes an identity of the third participant; and wherein determining the gaze direction of the first participant toward a position within the fourth area of the first display comprises: using stored gaze directions of the first participant toward the corners of the first display, the stored gaze directions being determined from images of the first participant acquired by the third video camera during a calibration phase.
2. The method of claim 1, further comprising: detecting and cropping at least one eye region of the second participant in the image; determining a target gaze direction based on the position of the second area in the first display; and modifying the cropped eye region of the second participant according to the determined target gaze direction.
3. The method of claim 1, further comprising: detecting and cropping a head region of the second participant in the image; determining a target gaze direction based on the position of the second area in the first display; and reorienting the head within the head region of the second participant according to the determined target gaze direction.
4. A client conferencing system including at least a video camera and a display, associated with a first participant of a videoconference and operably in communication with at least a second client conferencing system and a third client conferencing system associated with a second participant and a third participant, respectively, of the video conference configured to perform the steps of the method comprising: generating first metadata that includes information that associates the first participant, the second participant, and the third participant with an adjusted eye gaze for the first participant based on a location of the second participant and the third participant on a first display on the first client conferencing system; and modifying an eye region of the first participant according to the generated first metadata on a second display on the second client conferencing system and on a third display on the third client conferencing system; wherein the first client conferencing system comprises at least a first display, and the second client conferencing system comprises a first video camera and a second display, and wherein the third client conferencing system comprises a second video camera, and wherein the method further comprising receiving, from the third client conferencing system, a second video signal of the third participant, the second video signal being acquired by a second video camera; and displaying the received second video signal on the second area of the first display; wherein the first client conferencing system further comprises a third video camera, and wherein the method further comprises acquiring, by the third video camera, a third video signal, the third video signal comprising at least one second video frame including an image of the first participant looking at a fourth area of the first display configured to display a video signal of a fourth participant of the video conference; determining, from the image, a gaze direction of the first participant toward a position within the fourth area of the first display; generating second metadata associated with said at least one second video frame and including, based on the determined gaze direction, an identity of the fourth participant of the videoconference; and sending the second metadata with the associated second video frame to at least one client conferencing system associated with a participant of the videoconference; wherein: either the fourth participant corresponds to said second participant and the fourth area of the first display corresponds to said first area configured to display the first video signal; or the fourth participant corresponds to said third participant and the fourth area of the first display corresponds to said second area configured to display the second video signal; and wherein sending the second metadata with the associated second video frame to at least one client conferencing system associated with a participant of the videoconference comprises:, sending the second metadata to at least the third client conferencing system, if the second metadata includes an identity of the second participant, or sending the second metadata to at least the second client conferencing system, if the second metadata includes an identity of the third participant, wherein determining the gaze direction of the first participant toward a position within the fourth area of the first display comprises: using stored gaze directions of the first participant toward the corners of the first display, the stored gaze directions being determined from images of the first participant acquired by the third video camera during a calibration phase.
5. The client conferencing system of claim 4, wherein the method further comprising: detecting and cropping at least one eye region of the second participant in the image; determining a target gaze direction based on the position of the second area in the first display; and modifying the cropped eye region of the second participant according to the determined target gaze direction.
6. The client conferencing system of claim 4, wherein the method further comprising: detecting and cropping a head region of the second participant in the image; determining a target gaze direction based on the position of the second area in the first display; and reorienting the head within the head region of the second participant according to the determined target gaze direction.
7. A videoconferencing setup system comprising at least: a first client conferencing system associated with a first participant of a videoconference and including a first video camera and a first display; a second client conferencing system associated with a second participant of the videoconference; and a third client conferencing system associated with a third participant of the videoconference and including at least a second display, wherein the first, second and third client conferencing systems are operably in communication one to each other, and wherein the first, second and third client conferencing systems are configured to perform the steps of the method comprising: generating a first metadata that includes information that associates the first participant, the second participant, and the third participant with an adjusted eye gaze for the first participant based on a location of the second participant and the third participant on a first display on the first client conferencing system; and modifying an eye region of the first participant according to the generated first metadata on a second display on the second client conferencing system and on a third display on the third client conferencing system; wherein the second client conferencing system comprises a first video camera and a second display, and wherein the third client conferencing system comprises a second video camera, and wherein the method further comprising: receiving, from the third client conferencing system, a second video signal of the third participant, the second video signal being acquired by the second video camera; and displaying the received second video signal on the second area of the first display; wherein the first client conferencing system further comprises a third video camera, and wherein the method further comprises: acquiring, by the third video camera, a third video signal, the third video signal comprising at least one second video frame including an image of the first participant looking at a fourth area of the first display configured to display a video signal of a fourth participant of the videoconference; determining, from the image, a gaze direction of the first participant toward a position within the fourth area of the first display; generating second metadata associated with said at least one second video frame and including, based on the determined gaze direction, an identity of the fourth participant of the videoconference; and sending the second metadata with the associated second video frame to at least one client conferencing system associated with a participant of the videoconference; wherein: either the fourth participant corresponds to said second participant and the fourth area of the first display corresponds to said first area configured to display the first video signal, or the fourth participant corresponds to said third participant and the fourth area of the first display corresponds to said second area configured to display the second video signal; and wherein sending the second metadata with the associated second video frame to at least one client conferencing system associated with a participant of the videoconference comprises:, sending the second metadata to at least the third client conferencing system, if the second metadata includes an identity of the second participant, or sending the second metadata to at least the second client conferencing system, if the second metadata includes an identity of the third participant.
8. The videoconferencing setup system of claim 7, wherein determining the gaze direction of the first participant toward a position within the fourth area of the first display comprises: using stored gaze directions of the first participant toward the corners of the first display, the stored gaze directions being determined from images of the first participant acquired by the third video camera during a calibration phase.
9. The videoconferencing setup system of claim 7, wherein the method further comprising: detecting and cropping at least one eye region of the second participant in the image; determining a target gaze direction based on the position of the second area in the first display; and modifying the cropped eye region of the second participant according to the determined target gaze direction.
Unknown
June 3, 2025
Browse 5M+ US patents with plain-English claim translations and AI-generated analysis.