Legal claims defining the scope of protection, as filed with the USPTO.
1. A system for providing real-time feedback of remote collaborative communication between a first user and a second user, the system comprising: a non-transitory computer-readable medium storing instructions; and a processing system, including at least one specialized processor, that is configured to execute the stored instructions to: extract a plurality of first speech-related features from first audio signals generated in response to speech supplied from the first user; extract a plurality of first physiological features from the first audio signals, first physiological data generated in response to physiological activity of the first user, or both, wherein the first physiological data is obtained from one or more first physiological sensors; extract a plurality of second speech-related features from second audio signals generated in response to speech supplied from the second user; extract a plurality of second physiological features from the second audio signals, second physiological data generated in response to physiological activity of the second user, or both, wherein the second physiological data is obtained from one or more second physiological sensors; process the first and second speech-related features to determine if the speech from the first and second users exhibits positive or negative vocal entrainment; process the first and second physiological features to determine if physiological states of the first and second users are aligned or misaligned; and generate commands that cause at least one device to supply real-time, adaptive multimodal feedback that indicates: (i) whether the speech from the first and second users exhibits positive or negative vocal entrainment, and (ii) whether the physiological activity of the first and second users is aligned or misaligned.
2. The system of claim 1, wherein the processing system is further configured to: process the first and second physiological features to determine if the physiological states of the first and second users are increasing or decreasing; and generate commands that cause the at least one device to supply feedback that indicates if the physiological states of the first and second users are increasing or decreasing.
3. The system of claim 1, further comprising: a first microphone configured to receive the speech supplied from the first user and, in response thereto, supply the first audio signals to the processing system; and a second microphone configured to receive the speech supplied from the second user and, in response thereto, supply the second audio signals to the processing system.
4. The system of claim 1, further comprising: the plurality of first physiological sensors disposed on the first user and configured to supply the first physiological data to the processing system; and the plurality of second physiological sensors disposed on the second user and configured to supply the second physiological data to the processing system.
5. The system of claim 1, further comprising: a first human-machine interface coupled to receive the commands from the processing system; and a second human-machine interface coupled to receive the commands from the processing system.
6. The system of claim 5, further comprising: a third human-machine interface disposed remote from the first and second human-machine interfaces, the third human-machine interface coupled to receive the commands from the processing system.
7. The system of claim 1, wherein the first and second speech-related features that the processing system is configured to extract comprise: pitch, speech rate, intensity, jitter, and shimmer.
8. The system of claim 1, wherein the first and second physiological features that the processing system is configured to extract comprise one or more of: heart rate, breath rate, oxygen saturation level, pupil diameter, and galvanic skin response.
9. The system of claim 1, wherein the processing system is configured to correlate the physiological state of the first and second users to a stress state of the first and second user, respectively, and determine if the stress state of the first and second users is aligned or misaligned.
10. The system of claim 1, wherein the processing system comprises: a first local processor coupled to receive and process the first audio signals and the first physiological data and to transmit the first speech-related features and the first physiological features; a second local processor coupled to receive and process the second audio signals and the second physiological data and to transmit the second speech-related features and the second physiological features; and a cloud processor coupled to receive and process (i) the first and second speech-related features to determine if the speech from the first and second users exhibits positive or negative vocal entrainment and (ii) the first and second physiological features to determine if the physiological states of the first and second users are aligned or misaligned.
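For illustration only, the two determinations recited in claim 1 (positive or negative vocal entrainment, and aligned or misaligned physiological states) could be sketched as below. The claims do not say how either determination is computed, so everything here is an assumption: the function names, the use of Pearson correlation over per-window pitch values as a stand-in for entrainment, and the heart-rate tolerance of 10 bpm are hypothetical choices, not the claimed method.

```python
from math import sqrt
from typing import Dict, Sequence


def pearson(x: Sequence[float], y: Sequence[float]) -> float:
    """Pearson correlation of two equal-length series (0.0 if either is constant)."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("series must have equal length of at least 2")
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        return 0.0
    return cov / sqrt(var_x * var_y)


def vocal_entrainment(first_pitch: Sequence[float], second_pitch: Sequence[float]) -> str:
    """Assumption: a correlation above zero is labeled 'positive', otherwise 'negative'."""
    return "positive" if pearson(first_pitch, second_pitch) > 0 else "negative"


def physiological_alignment(
    first_hr: Sequence[float],
    second_hr: Sequence[float],
    tolerance_bpm: float = 10.0,  # hypothetical threshold, not taken from the claims
) -> str:
    """Assumption: states are 'aligned' when mean heart rates differ by at most the tolerance."""
    mean_first = sum(first_hr) / len(first_hr)
    mean_second = sum(second_hr) / len(second_hr)
    return "aligned" if abs(mean_first - mean_second) <= tolerance_bpm else "misaligned"


def build_feedback(
    first_pitch: Sequence[float],
    second_pitch: Sequence[float],
    first_hr: Sequence[float],
    second_hr: Sequence[float],
) -> Dict[str, str]:
    """Bundle both determinations into one hypothetical feedback command payload."""
    return {
        "vocal_entrainment": vocal_entrainment(first_pitch, second_pitch),
        "physiological_state": physiological_alignment(first_hr, second_hr),
    }
```

In this sketch a call such as `build_feedback(pitch_a, pitch_b, hr_a, hr_b)` returns a small dictionary that a human-machine interface could render. Under the split described in claim 10, the feature series would be produced on the local processors and a function like this would run on the cloud processor.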
July 22, 2025