Legal claims defining the scope of protection, as filed with the USPTO.
1. A non-transitory computer-readable storage medium storing one or more programs, the one or more programs comprising instructions, which when executed by one or more processors of a first electronic device having a display and a microphone, cause the first electronic device to: display, on the display, an extended reality (XR) environment corresponding to a copresence session that includes a first user associated with the first electronic device and a second user associated with a second electronic device, wherein the first user is different from the second user and the first electronic device is different from the second electronic device, and wherein the copresence session is configured such that the first user and the second user simultaneously experience a shared environment; and while displaying the XR environment corresponding to the copresence session: sample, with the microphone, a first audio input; determine whether the first audio input is intended for a first digital assistant operating on an external electronic device; and in accordance with a determination that the first audio input is intended for the first digital assistant: cause the first digital assistant to provide an audible response to the first audio input, wherein the audible response is not transmitted to the second electronic device over a shared communication channel for the copresence session.
2. The non-transitory computer-readable storage medium of claim 1, wherein the one or more programs further comprise instructions, which when executed by the one or more processors, cause the first electronic device to: in accordance with a determination that the first audio input is not intended for the first digital assistant operating on the external electronic device: forgo causing the first digital assistant to provide the audible response.
3. The non-transitory computer-readable storage medium of claim 1, wherein the one or more programs further comprise instructions, which when executed by the one or more processors, cause the first electronic device to: transmit, over the shared communication channel, the first audio input to the second electronic device.
4. The non-transitory computer-readable storage medium of claim 3, wherein transmitting the first audio input is performed in accordance with a determination that the first audio input is intended for the first digital assistant.
5. The non-transitory computer-readable storage medium of claim 1, wherein the one or more programs further comprise instructions, which when executed by the one or more processors, cause the first electronic device to: determine whether the first audio input corresponds to a personal domain, wherein causing the first digital assistant to provide the audible response that is not transmitted over the shared communication channel is performed in accordance with a determination that the first audio input corresponds to the personal domain.
6. The non-transitory computer-readable storage medium of claim 1, wherein the one or more programs further comprise instructions, which when executed by the one or more processors, cause the first electronic device to: display, in the XR environment, a physical representation of the external electronic device.
7. The non-transitory computer-readable storage medium of claim 6, wherein displaying the XR environment includes: displaying a first view of the XR environment, the first view corresponding to a first pose associated with the first electronic device, wherein: the first view includes an avatar associated with the second electronic device; and the first view does not include the physical representation of the external electronic device.
8. The non-transitory computer-readable storage medium of claim 7, wherein the one or more programs further comprise instructions, which when executed by the one or more processors, cause the first electronic device to: detect a change from the first pose associated with the first electronic device to a second pose associated with the first electronic device, wherein the external electronic device is visible from the second pose but not from the first pose; and in accordance with detecting the change: replace the display of the first view with a display of a second view of the XR environment, the second view corresponding to the second pose, wherein: the second view includes the physical representation of the external electronic device displayed at a physical location of the external electronic device.
9. The non-transitory computer-readable storage medium of claim 6, wherein the physical representation is displayed at a physical location of the external electronic device, and wherein determining whether the first audio input is intended for the first digital assistant operating on the external electronic device includes: determining that a pose associated with the first electronic device corresponds to the physical location while sampling the first audio input.
10. The non-transitory computer-readable storage medium of claim 9, wherein determining that the pose corresponds to the physical location while sampling the first audio input includes: determining that a gaze is directed at the physical representation while sampling the first audio input.
11. The non-transitory computer-readable storage medium of claim 9, wherein determining that the pose corresponds to the physical location while sampling the first audio input includes: detecting, using one or more sensors of the first electronic device, the external electronic device.
12. The non-transitory computer-readable storage medium of claim 11, wherein the one or more sensors include a camera and detecting the external electronic device using the one or more sensors includes: recognizing the external electronic device from image data captured by the camera.
13. The non-transitory computer-readable storage medium of claim 6, wherein the one or more programs further comprise instructions, which when executed by the one or more processors, cause the first electronic device to: concurrently display, in the XR environment, a digital assistant indicator with the physical representation, wherein determining that the first audio input is intended for the first digital assistant includes determining that the first audio input is sampled while the digital assistant indicator is concurrently displayed with the physical representation.
14. The non-transitory computer-readable storage medium of claim 13, wherein concurrently displaying the digital assistant indicator with the physical representation includes displaying the digital assistant indicator above the physical representation at a location defined by the vertical axis of the physical representation.
15. The non-transitory computer-readable storage medium of claim 13, wherein the one or more programs further comprise instructions, which when executed by the one or more processors, cause the first electronic device to: before sampling the first audio input, sample a second audio input, and wherein concurrently displaying the digital assistant indicator with the physical representation includes: concurrently displaying the digital assistant indicator with the physical representation in accordance with a determination that the second audio input includes a spoken trigger for the first digital assistant.
16. The non-transitory computer-readable storage medium of claim 13, wherein the physical representation is displayed at a physical location of the external electronic device, and wherein concurrently displaying the digital assistant indicator with the physical representation includes: concurrently displaying the digital assistant indicator with the physical representation in accordance with a determination that a pose associated with the first electronic device corresponds to the physical location.
17. The non-transitory computer-readable storage medium of claim 13, wherein the one or more programs further comprise instructions, which when executed by the one or more processors, cause the first electronic device to: while concurrently displaying the digital assistant indicator with the physical representation, disable the shared communication channel, including: forgoing transmitting, over the shared communication channel, audio to the second electronic device, wherein the first digital assistant provides the audible response while the shared communication channel is disabled.
18. The non-transitory computer-readable storage medium of claim 17, wherein disabling the shared communication channel further includes: forgoing outputting, at the first electronic device, audio received at the second electronic device.
19. The non-transitory computer-readable storage medium of claim 17, wherein the one or more programs further comprise instructions, which when executed by the one or more processors, cause the first electronic device to: after disabling the shared communication channel, activate the shared communication channel, including: transmitting, over the shared communication channel, audio sampled by the microphone to the second electronic device.
20. The non-transitory computer-readable storage medium of claim 19, wherein activating the shared communication channel is performed in accordance with determining that a second audio input is not sampled within a predetermined duration after providing the audible response.
21. The non-transitory computer-readable storage medium of claim 19, wherein the one or more programs further comprise instructions, which when executed by the one or more processors, cause the first electronic device to: cease to display the digital assistant indicator, wherein activating the shared communication channel is performed in accordance with ceasing to display the digital assistant indicator.
22. The non-transitory computer-readable storage medium of claim 1, wherein causing the first digital assistant to provide the audible response includes: causing the external electronic device to provide the audible response with one or more speakers of the external electronic device.
23. The non-transitory computer-readable storage medium of claim 1, wherein causing the first digital assistant to provide the audible response includes: receiving, from the external electronic device, the audible response; and outputting, using one or more speakers of the first electronic device, the audible response.
24. The non-transitory computer-readable storage medium of claim 1, wherein the one or more programs further comprise instructions, which when executed by the one or more processors, cause the first electronic device to: while displaying the XR environment, receive a notification; in accordance with receiving the notification, determine whether the first electronic device is engaged in the copresence session; and in accordance with a determination that the first electronic device is engaged in the copresence session, cause the external electronic device to provide an audio output representing the notification.
25. The non-transitory computer-readable storage medium of claim 1, wherein the one or more programs further comprise instructions, which when executed by the one or more processors, cause the first electronic device to: in accordance with a determination that the first audio input is intended for the first digital assistant operating on the external electronic device: receive, from the external electronic device, an indication that the first digital assistant is incapable of responding to a request included in the first audio input; and in accordance with receiving the indication: determine whether a second digital assistant operating on the first electronic device is capable of responding to the request; and in accordance with a determination that the second digital assistant is capable of responding to the request, output, using the second digital assistant, a second audible response to the request, wherein causing the first digital assistant to provide the audible response is performed in accordance with not receiving the indication.
26. The non-transitory computer-readable storage medium of claim 1, wherein determining whether the first audio input is intended for the first digital assistant is performed without identifying a second spoken trigger in the first audio input.
27. The non-transitory computer-readable storage medium of claim 1, wherein determining whether the first audio input is intended for the first digital assistant includes determining whether a direction of the first audio input corresponds to a physical location of the external electronic device.
28. The non-transitory computer-readable storage medium of claim 1, wherein determining whether the first audio input is intended for the first digital assistant includes analyzing a content of the first audio input.
29. A first electronic device, comprising: a display; a microphone; one or more processors; a memory; and one or more programs, wherein the one or more programs are stored in the memory and configured to be executed by the one or more processors, the one or more programs including instructions for: displaying, on the display, an extended reality (XR) environment corresponding to a copresence session that includes a first user associated with the first electronic device and a second user associated with a second electronic device, wherein the first user is different from the second user and the first electronic device is different from the second electronic device, and wherein the copresence session is configured such that the first user and the second user simultaneously experience a shared environment; and while displaying the XR environment corresponding to the copresence session: sampling, with the microphone, a first audio input; determining whether the first audio input is intended for a first digital assistant operating on an external electronic device; and in accordance with a determination that the first audio input is intended for the first digital assistant: causing the first digital assistant to provide an audible response to the first audio input, wherein the audible response is not transmitted to the second electronic device over a shared communication channel for the copresence session.
30. The first electronic device of claim 29, wherein the one or more programs further include instructions for: in accordance with a determination that the first audio input is not intended for the first digital assistant operating on the external electronic device: forgoing causing the first digital assistant to provide the audible response.
31. The first electronic device of claim 29, wherein the one or more programs further include instructions for: transmitting, over the shared communication channel, the first audio input to the second electronic device.
32. The first electronic device of claim 31, wherein transmitting the first audio input is performed in accordance with a determination that the first audio input is intended for the first digital assistant.
33. The first electronic device of claim 29, wherein the one or more programs further include instructions for: determining whether the first audio input corresponds to a personal domain, wherein causing the first digital assistant to provide the audible response that is not transmitted over the shared communication channel is performed in accordance with a determination that the first audio input corresponds to the personal domain.
34. The first electronic device of claim 29, wherein the one or more programs further include instructions for: displaying, in the XR environment, a physical representation of the external electronic device.
35. The first electronic device of claim 34, wherein displaying the XR environment includes: displaying a first view of the XR environment, the first view corresponding to a first pose associated with the first electronic device, wherein: the first view includes an avatar associated with the second electronic device; and the first view does not include the physical representation of the external electronic device.
36. The first electronic device of claim 35, wherein the one or more programs further include instructions for: detecting a change from the first pose associated with the first electronic device to a second pose associated with the first electronic device, wherein the external electronic device is visible from the second pose but not from the first pose; and in accordance with detecting the change: replacing the display of the first view with a display of a second view of the XR environment, the second view corresponding to the second pose, wherein: the second view includes the physical representation of the external electronic device displayed at a physical location of the external electronic device.
37. The first electronic device of claim 34, wherein the physical representation is displayed at a physical location of the external electronic device, and wherein determining whether the first audio input is intended for the first digital assistant operating on the external electronic device includes: determining that a pose associated with the first electronic device corresponds to the physical location while sampling the first audio input.
38. The first electronic device of claim 37, wherein determining that the pose corresponds to the physical location while sampling the first audio input includes: determining that a gaze is directed at the physical representation while sampling the first audio input.
39. The first electronic device of claim 37, wherein determining that the pose corresponds to the physical location while sampling the first audio input includes: detecting, using one or more sensors of the first electronic device, the external electronic device.
40. The first electronic device of claim 34, wherein the one or more programs further include instructions for: concurrently displaying, in the XR environment, a digital assistant indicator with the physical representation, wherein determining that the first audio input is intended for the first digital assistant includes determining that the first audio input is sampled while the digital assistant indicator is concurrently displayed with the physical representation.
41. The first electronic device of claim 40, wherein the one or more programs further include instructions for: before sampling the first audio input, sampling a second audio input, and wherein concurrently displaying the digital assistant indicator with the physical representation includes: concurrently displaying the digital assistant indicator with the physical representation in accordance with a determination that the second audio input includes a spoken trigger for the first digital assistant.
42. The first electronic device of claim 40, wherein the physical representation is displayed at a physical location of the external electronic device, and wherein concurrently displaying the digital assistant indicator with the physical representation includes: concurrently displaying the digital assistant indicator with the physical representation in accordance with a determination that a pose associated with the first electronic device corresponds to the physical location.
43. The first electronic device of claim 40, wherein the one or more programs further include instructions for: while concurrently displaying the digital assistant indicator with the physical representation, disabling the shared communication channel, including: forgoing transmitting, over the shared communication channel, audio to the second electronic device, wherein the first digital assistant provides the audible response while the shared communication channel is disabled.
44. The first electronic device of claim 43, wherein disabling the shared communication channel further includes: forgoing outputting, at the first electronic device, audio received at the second electronic device.
45. The first electronic device of claim 43, wherein the one or more programs further include instructions for: after disabling the shared communication channel, activating the shared communication channel, including: transmitting, over the shared communication channel, audio sampled by the microphone to the second electronic device.
46. The first electronic device of claim 45, wherein activating the shared communication channel is performed in accordance with determining that a second audio input is not sampled within a predetermined duration after providing the audible response.
47. The first electronic device of claim 45, wherein the one or more programs further include instructions for: ceasing to display the digital assistant indicator, wherein activating the shared communication channel is performed in accordance with ceasing to display the digital assistant indicator.
48. The first electronic device of claim 29, wherein causing the first digital assistant to provide the audible response includes: causing the external electronic device to provide the audible response with one or more speakers of the external electronic device.
49. The first electronic device of claim 29, wherein causing the first digital assistant to provide the audible response includes: receiving, from the external electronic device, the audible response; and outputting, using one or more speakers of the first electronic device, the audible response.
50. A method, comprising: at a first electronic device having one or more processors, memory, a microphone, and a display: displaying, on the display, an extended reality (XR) environment corresponding to a copresence session that includes a first user associated with the first electronic device and a second user associated with a second electronic device, wherein the first user is different from the second user and the first electronic device is different from the second electronic device, and wherein the copresence session is configured such that the first user and the second user simultaneously experience a shared environment; and while displaying the XR environment corresponding to the copresence session: sampling, with the microphone, a first audio input; determining whether the first audio input is intended for a first digital assistant operating on an external electronic device; and in accordance with a determination that the first audio input is intended for the first digital assistant: causing the first digital assistant to provide an audible response to the first audio input, wherein the audible response is not transmitted to the second electronic device over a shared communication channel for the copresence session.
51. The method of claim 50, further comprising: in accordance with a determination that the first audio input is not intended for the first digital assistant operating on the external electronic device: forgoing causing the first digital assistant to provide the audible response.
52. The method of claim 50, further comprising: transmitting, over the shared communication channel, the first audio input to the second electronic device.
53. The method of claim 52, wherein transmitting the first audio input is performed in accordance with a determination that the first audio input is intended for the first digital assistant.
54. The method of claim 50, further comprising: determining whether the first audio input corresponds to a personal domain, wherein causing the first digital assistant to provide the audible response that is not transmitted over the shared communication channel is performed in accordance with a determination that the first audio input corresponds to the personal domain.
55. The method of claim 50, further comprising: displaying, in the XR environment, a physical representation of the external electronic device.
56. The method of claim 55, wherein displaying the XR environment includes: displaying a first view of the XR environment, the first view corresponding to a first pose associated with the first electronic device, wherein: the first view includes an avatar associated with the second electronic device; and the first view does not include the physical representation of the external electronic device.
57. The method of claim 56, further comprising: detecting a change from the first pose associated with the first electronic device to a second pose associated with the first electronic device, wherein the external electronic device is visible from the second pose but not from the first pose; and in accordance with detecting the change: replacing the display of the first view with a display of a second view of the XR environment, the second view corresponding to the second pose, wherein: the second view includes the physical representation of the external electronic device displayed at a physical location of the external electronic device.
58. The method of claim 55, wherein the physical representation is displayed at a physical location of the external electronic device, and wherein determining whether the first audio input is intended for the first digital assistant operating on the external electronic device includes: determining that a pose associated with the first electronic device corresponds to the physical location while sampling the first audio input.
59. The method of claim 58, wherein determining that the pose corresponds to the physical location while sampling the first audio input includes: determining that a gaze is directed at the physical representation while sampling the first audio input.
60. The method of claim 58, wherein determining that the pose corresponds to the physical location while sampling the first audio input includes: detecting, using one or more sensors of the first electronic device, the external electronic device.
61. The method of claim 55, further comprising: concurrently displaying, in the XR environment, a digital assistant indicator with the physical representation, wherein determining that the first audio input is intended for the first digital assistant includes determining that the first audio input is sampled while the digital assistant indicator is concurrently displayed with the physical representation.
62. The method of claim 61, further comprising: before sampling the first audio input, sampling a second audio input, and wherein concurrently displaying the digital assistant indicator with the physical representation includes: concurrently displaying the digital assistant indicator with the physical representation in accordance with a determination that the second audio input includes a spoken trigger for the first digital assistant.
63. The method of claim 61, wherein the physical representation is displayed at a physical location of the external electronic device, and wherein concurrently displaying the digital assistant indicator with the physical representation includes: concurrently displaying the digital assistant indicator with the physical representation in accordance with a determination that a pose associated with the first electronic device corresponds to the physical location.
64. The method of claim 61, further comprising: while concurrently displaying the digital assistant indicator with the physical representation, disabling the shared communication channel, including: forgoing transmitting, over the shared communication channel, audio to the second electronic device, wherein the first digital assistant provides the audible response while the shared communication channel is disabled.
65. The method of claim 64, wherein disabling the shared communication channel further includes: forgoing outputting, at the first electronic device, audio received at the second electronic device.
66. The method of claim 64, further comprising: after disabling the shared communication channel, activating the shared communication channel, including: transmitting, over the shared communication channel, audio sampled by the microphone to the second electronic device.
67. The method of claim 66, wherein activating the shared communication channel is performed in accordance with determining that a second audio input is not sampled within a predetermined duration after providing the audible response.
68. The method of claim 66, further comprising: ceasing to display the digital assistant indicator, wherein activating the shared communication channel is performed in accordance with ceasing to display the digital assistant indicator.
69. The method of claim 50, wherein causing the first digital assistant to provide the audible response includes: causing the external electronic device to provide the audible response with one or more speakers of the external electronic device.
70. The method of claim 50, wherein causing the first digital assistant to provide the audible response includes: receiving, from the external electronic device, the audible response; and outputting, using one or more speakers of the first electronic device, the audible response.
Unknown
April 8, 2025
Browse 5M+ US patents with plain-English claim translations and AI-generated analysis.