A device initiates voice control through gaze detection. The device detects that a user is gazing at a gaze target. Responsive to that detection, the device captures audio and performs automatic speech recognition of the captured audio to turn the audio into text. The device performs natural language understanding on the text to determine an application-specific command. The device performs application-specific processing for the application-specific command.
Legal claims defining the scope of protection, as filed with the USPTO.
2. The method of claim 1, wherein prior to detecting that the user is gazing at the gaze target, detecting that there is motion by the user.
3. The method of claim 1, wherein the captured audio is only stored locally on the device.
4. The method of claim 1, wherein the gaze target is on a different device.
5. The method of claim 1, wherein the gaze target is a display, and the device is a media computing device or gaming device.
6. The method of claim 1, wherein the application-specific processing is specific for a television or a streaming device.
8. The device of claim 7, wherein the processing circuitry is further configured to perform the following step prior to detection that the user is gazing at the gaze target, detect that there is motion by the user.
9. The device of claim 7, wherein the captured audio is to be only stored locally on the device.
10. The device of claim 7, wherein the gaze target is on a different device.
11. The device of claim 7, wherein the gaze target is a display, and the device is a media computing device or gaming device.
12. The device of claim 7, wherein the application-specific processing is specific for a television or a streaming device.
14. The non-transitory machine-readable medium of claim 13, wherein prior to detecting that the user is gazing at the gaze target, detecting that there is motion by the user.
15. The non-transitory machine-readable medium of claim 13, wherein the captured audio is only stored locally on the device.
16. The non-transitory machine-readable medium of claim 13, wherein the gaze target is on a different device.
17. The non-transitory machine-readable medium of claim 13, wherein the gaze target is a display, and the device is a media computing device or gaming device.
18. The non-transitory machine-readable medium of claim 13, wherein the application-specific processing is specific for a television or a streaming device.
Cooperative Patent Classification codes for this invention. Click any code to explore related patents in that topic.
December 21, 2018
August 23, 2022
Browse 5M+ US patents with plain-English claim translations and AI-generated analysis.