A multi-channel and multi-zone audio environment is provided. Various inventions are disclosed that allow playback devices on one or more networks to provide an effective multi-channel and a multi-zone audio environment using timing information. According to one example, timing information is used to coordinate playback devices connected over a low-latency network to provide audio along with a video display. In another example, timing information is used to coordinate playback devices connected over a mesh network to provide audio in one or more zones or zone groups.
Legal claims defining the scope of protection, as filed with the USPTO.
an out-loud playback device comprising one or more loudspeakers; one or more network interfaces; one or more processors; and tangible, non-transitory computer-readable media comprising instruction code that, when executed by the one or more processors, configures the media playback system to: while a wearable device is configured to communicate data according to a first networking mode with the media playback system, receive an instruction for the wearable device to join a bonded zone that comprises the out-loud playback device; after receiving the instruction for the wearable device to join the bonded zone that comprises the out-loud playback device, configure the wearable device to communicate data according to a second networking mode that is different from the first networking mode; while the wearable device is configured to communicate data according to the second networking mode, (i) provide audio data to the wearable device and (ii) coordinate performance of lip synchronization between video content that is output via a display device and audio content that is played back via one or more playback devices of the bonded zone; assign a playback role within the bonded zone to the wearable device, wherein assigning the playback role within the bonded zone to the wearable device comprises determining first audio data to be played back via the wearable device; determine, based on a user location, a first playback delay associated with the wearable device; and cause the wearable device to play back the first audio data based on the first playback delay. . A media playback system, comprising:
claim 1 . The media playback system of, wherein when the wearable device is configured to communicate data according to the first networking mode, the wearable device is configured to communicate audio data via a first spectrum interface, and wherein when the wearable device is configured to communicate data according to the second networking mode, the wearable device is configured to communicate audio data via a second spectrum interface.
claim 2 . The media playback system of, wherein the first spectrum interface comprises a 2.4 GHz wireless interface, and wherein the second spectrum interface comprises a 5 GHz wireless interface.
claim 1 . The media playback system of, wherein the first networking mode comprises an 802.11 compatible communication protocol.
claim 1 . The media playback system of, wherein the first networking mode comprises an 802.15 compatible communication protocol.
claim 1 . The media playback system of, wherein the wearable device comprises a wireless headphone.
claim 1 while the wearable device is configured to communicate data according to the second networking mode, receive an instruction for the wearable device to leave the bonded zone that comprises the out-loud playback device; and after receiving the instruction to leave the bonded zone that comprises the out-loud playback device, configure the wearable device to communicate data according to the first networking mode. . The media playback system of, wherein the instruction code, when executed by the one or more processors, configures the media playback system to:
claim 1 while the wearable device is configured to communicate data according to the second networking mode, receive an instruction for the wearable device to leave the bonded zone that comprises the out-loud playback device; and after receiving the instruction to leave the bonded zone that comprises the out-loud playback device, configure the wearable device to communicate data according to a third networking mode, wherein the third networking mode is different from both of (i) the first networking mode and (ii) the second networking mode. . The media playback system of, wherein the instruction code, when executed by the one or more processors, configures the media playback system to:
claim 1 . The media playback system of, wherein the media playback system comprises a display device, and wherein the out-loud playback device is physically connected to an interface of the display device.
claim 1 . The media playback system of, wherein while the wearable device comprises a display.
while a wearable device is configured to communicate data according to a first networking mode with the media playback system, receiving an instruction for the wearable device to join a bonded zone that comprises an out-loud playback device, wherein the out-loud playback device comprises one or more loudspeakers; after receiving the instruction for the wearable device to join the bonded zone that comprises the out-loud playback device, configuring the wearable device to communicate data according to a second networking mode that is different from the first networking mode; while the wearable device is configured to communicate data according to the second networking mode, (i) providing audio data to the wearable device and (ii) coordinating performance of lip synchronization between video content that is output via a display device and audio content that is played back via one or more playback devices of the bonded zone; assigning a playback role within the bonded zone to the wearable device, wherein assigning the playback role within the bonded zone to the wearable device comprises determining first audio data to be played back via the wearable device; determining, based on a user location, a first playback delay associated with the wearable device; and causing the wearable device to play back the first audio data based on the first playback delay. . A tangible, non-transitory computer-readable media comprising program instructions, wherein the program instructions, when executed by one or more processors, cause a media playback system to perform functions comprising:
claim 11 . The tangible, non-transitory computer-readable media of, wherein when the wearable device is configured to communicate data according to the first networking mode, the wearable device is configured to communicate audio data via a first spectrum interface, and wherein when the wearable device is configured to communicate data according to the second networking mode, the wearable device is configured to communicate audio data via a second spectrum interface.
claim 12 . The tangible, non-transitory computer-readable media of, wherein the first spectrum interface comprises a 2.4 GHz wireless interface, and wherein the second spectrum interface comprises a 5 GHz wireless interface.
claim 11 . The tangible, non-transitory computer-readable media of, wherein the first networking mode comprises one of (i) an 802.11 compatible communication protocol or (ii) an 802.15 compatible communication protocol.
claim 11 . The tangible, non-transitory computer-readable media of, wherein the wearable device comprises a wireless headphone.
claim 11 while the wearable device is configured to communicate data according to the second networking mode, receiving an instruction for the wearable device to leave the bonded zone that comprises the out-loud playback device; and after receiving the instruction to leave the bonded zone that comprises the out-loud playback device, configuring the wearable device to communicate data according to the first networking mode. . The tangible, non-transitory computer-readable media of, wherein the functions further comprise:
claim 11 while the wearable device is configured to communicate data according to the second networking mode, receiving an instruction for the wearable device to leave the bonded zone that comprises the out-loud playback device; and after receiving the instruction to leave the bonded zone that comprises the out-loud playback device, configuring the wearable device to communicate data according to a third networking mode, wherein the third networking mode is different from both of (i) the first networking mode and (ii) the second networking mode. . The tangible, non-transitory computer-readable media of, wherein the functions further comprise:
claim 11 . The tangible, non-transitory computer-readable media of, wherein the media playback system comprises a display device, and wherein the out-loud playback device is physically connected to an interface of the display device.
claim 11 . The tangible, non-transitory computer-readable media of, wherein while the wearable device is configured to communicate data according to the second networking mode, the wearable device is configured to automatically adjust audio playback to enhance speech content associated with the video content.
while a wearable device is configured to communicate data according to a first networking mode with the media playback system, receiving an instruction for the wearable device to join a bonded zone that comprises an out-loud playback device, wherein the out-loud playback device comprises one or more loudspeakers; after receiving the instruction for the wearable device to join the bonded zone that comprises the out-loud playback device, configuring the wearable device to communicate data according to a second networking mode that is different from the first networking mode; while the wearable device is configured to communicate data according to the second networking mode, (i) providing audio data to the wearable device and (ii) coordinating performance of lip synchronization between video content that is output via a display device and audio content that is played back via one or more playback devices of the bonded zone; assigning a playback role within the bonded zone to the wearable device, wherein assigning the playback role within the bonded zone to the wearable device comprises determining first audio data to be played back via the wearable device; determining, based on a user location, a first playback delay associated with the wearable device; and causing the wearable device to play back the first audio data based on the first playback delay. . A method performed by a media playback system, the method comprising:
Complete technical specification and implementation details from the patent document.
This application is a continuation of U.S. patent application Ser. No. 18/058,864 titled “Providing a Multi-Channel and a Multi-Zone Audio Environment,” filed Nov. 26, 2022, and issued as U.S. Pat. No. 12,432,512 on Sep. 30, 2025; U.S. patent application Ser. No. 18/058,864 is a continuation of U.S. patent application Ser. No. 16/933,861 titled “Providing a Multi-Channel and a Multi-Zone Audio Environment,” filed Jul. 20, 2020, and issued as U.S. Pat. No. 11,516,611 on Nov. 29, 2022; U.S. patent application Ser. No. 16/933,861 is a continuation of U.S. patent application Ser. No. 16/102,695 titled “Providing a Multi-Channel and a Multi-Zone Audio Environment,” filed Aug. 13, 2018, and issued as U.S. Pat. No. 10,721,575 on Jul. 21, 2020; U.S. patent application Ser. No. 16/102,695 is a continuation of U.S. patent application Ser. No. 15/296,304 titled “Providing Playback Timing in a Multi-Zone Audio Environment,” filed Oct. 18, 2016, and issued as U.S. Pat. No. 10,051,398 on Aug. 14, 2018; U.S. patent application Ser. No. 15/296,304 is a continuation of U.S. patent application Ser. No. 13/632,731 titled “Providing A Multi-Channel And A Multi-Zone Audio Environment,” filed Oct. 1, 2012, and issued as U.S. Pat. No. 9,516,440 on Dec. 6, 2016. The entire contents of U.S. Pats. Ser. No. 18/058,864; Ser. No. 16/933,861; Ser. No. 16/102,695; Ser. No. 15/296,304; and Ser. No. 13/632,731 are incorporated herein by reference.
The disclosure is related to consumer goods and, more particularly, to systems, products, features, services, and other items directed to media playback or some aspect thereof.
Technological advancements have increased the accessibility of music content, as well as other types of media, such as television content, movies, and interactive content. For example, a user can access audio, video, or both audio and video content over the Internet through an online store, an Internet radio station, a music service, a movie service, and so on, in addition to the more traditional avenues of accessing audio and video content. Demand for audio, video, and both audio and video content inside and outside of the home continues to increase.
In addition, the drawings are for the purpose of illustrating example embodiments, but it is understood that the inventions are not limited to the arrangements and instrumentality shown in the drawings.
Embodiments disclosed herein enable coordinated playback amongst players for use in a home theater environment, a zone group environment, or both. Particularly, the embodiments provide for the distribution of audio information that corresponds to video.
Certain embodiments provide one or more methods, systems, apparatus, and/or machine-readable mediums to distribute audio information, such as audio information that corresponds to video.
Certain embodiments provide a method to distribute audio information. The example method includes receiving an audio signal at a first device from an audio source. The example method includes identifying home theater audio information contained in the audio signal for playback by a plurality of home theater players. The example method includes identifying zone group audio information contained in the audio signal for playback by one or more players in a zone group. The example method includes assigning one or more timestamps to home theater audio information that indicates a time at which audio is to be played by the plurality of home theater players. The example method includes assigning a zone group timestamp to the zone group audio information that indicates a time at which audio is to be played by the one or more players in the zone group. The example method includes sending, from the first device, the home theater audio information with the one or more timestamps to the plurality of home theater players and sending the zone group audio information with the zone group timestamp to the one or more players in the zone group.
Certain embodiments provide a tangible machine-readable medium having instructions stored thereon that, when executed, cause a machine to at least receive an audio signal at a first device from an audio source. The example instructions cause the machine to at least identify home theater audio information contained in the audio signal for playback by a plurality of home theater players. The example instructions cause the machine to at least identify zone group audio information contained in the audio signal for playback by one or more players in a zone group. The example instructions cause the machine to at least assign one or more timestamps to home theater audio information that indicates a time al which audio is to be played by the plurality of home theater players. The example instructions cause the machine to at least assign a zone group timestamp to the zone group audio information that indicates a time at which audio is to be played by the one or more players in the zone group. The example instructions cause the machine to at least send, from the first device, the home theater audio information with the one or more timestamps to the plurality of home theater players and sending the zone group audio information with the zone group timestamp to the one or more players in the zone group.
1 FIG. 100 Referring now to the drawings, in which like numerals can refer to like parts throughout the figures,shows an example system configurationin which one or more embodiments disclosed herein can be practiced or implemented.
100 102 124 102 124 130 100 130 100 130 100 100 1 FIG. Byway of illustration, the system configurationrepresents a home with multiple zones, though the home could have been configured with only one zone. Each zone, for example, may represent a different room or space, such as an office, bathroom, bedroom, kitchen, dining room, family room, home theater room, utility or laundry room, and patio. A zone may also include an area inside a home, building, or vehicle or outside and as such may include any spatial area. A single zone might also include multiple rooms or spaces if so configured. One or more of zone players-are shown in each respective zone. A zone player-, also referred to herein as a playback device, multimedia unit, speaker, player, and so on, provides audio, video, and/or audiovisual output. A controller(e.g., shown in the kitchen for purposes of illustration) provides control to the system configuration. Controllermay be fixed to a zone, or alternatively, mobile such that it can be moved about the zones. The system configurationmay also include more than one controller. The system configurationillustrates an example whole house audio system, though it is understood that the technology described herein is not limited to its particular place of application or to an expansive system like a whole house audio systemof.
2 2 2 FIGS.A,B, andC 2 28 2 FIGS.A,, andC 1 FIG. 200 202 204 102 124 200 204 show example types of zone players. Zone players,, andof, respectively, can correspond to any of the zone players-of, for example. In some embodiments, audio is reproduced using only a single zone player, such as by a full-range player. In some embodiments, audio is reproduced using two or more zone players, such as by using a combination of full-range players or a combination of full-range and specialized players. In some embodiments, zone players-may also be referred to as a “smart speaker,” because they contain processing capabilities beyond the reproduction of audio, more of which is described below.
2 FIG.A 4 FIG. 200 208 200 208 200 200 200 200 illustrates zone playerthat includes sound producing equipmentcapable of reproducing full-range sound. The sound may come from an audio signal that is received and processed by zone playerover a wired or wireless data network. Sound producing equipmentincludes one or more built-in amplifiers and one or more acoustic transducers (e.g., speakers). A built-in amplifier is described more below with respect to. A speaker or acoustic transducer can include, for example, any of a tweeter, a mid-range driver, a low-range driver, and a subwoofer. In some embodiments, zone playercan be statically or dynamically configured to play stereophonic audio, monaural audio, or both. In some embodiments, zone playeris configured to reproduce a subset of full-range sound, such as when zone playeris grouped with other zone players to play stereophonic audio, monaural audio, and/or surround audio or when the audio content received by zone playeris less than full-range.
2 FIG.B 202 210 202 202 210 illustrates zone playerthat includes a built-in amplifier to power a set of detached speakers. A detached speaker can include, for example, any type of loudspeaker. Zone playermay be configured to power one, two, or more separate loudspeakers. Zone playermay be configured to communicate an audio signal (e.g., right and left channel audio or more channels depending on its configuration) to the detached speakersvia a wired path.
2 FIG.C 204 214 illustrates zone playerthat does not include a built-in amplifier, but is configured to communicate an audio signal, received over a data network, to an audio (or “audio/video”) receiverwith built-in amplification.
1 FIG. 102 124 128 130 Referring back to, in some embodiments, one, some, or all of the zone playerstocan retrieve audio directly from a source. For example, a zone player may contain a playlist or queue of audio items to be played (also referred to herein as a “playback queue”). Each item in the queue may comprise a uniform resource identifier (URI) or some other identifier. The URI or identifier can point the zone player to the audio source. The audio source might be found on the Internet (e.g., the cloud), locally from another device over data network(described further below), from the controller, stored on the zone player itself, or from an audio source communicating directly to the zone player. In some embodiments, the zone player can reproduce the audio itself, send it to another zone player for reproduction, or both where the audio is played by the zone player and one or more additional zone players in synchrony. In some embodiments, the zone player can play a first audio content (or not play at all), while sending a second, different audio content to another zone player(s) for reproduction.
2 2 2 FIGS.A,B, andC By way of illustration, SONOS, Inc. of Santa Barbara, Calif. presently offers for sale zone players currently referred to as a “PLAY:5,” “PLAY:3,” “CONNECT:AMP.” “CONNECT,” and “SUB.” Any other past, present, and/or future zone players can additionally or alternatively be used to implement the zone players of example embodiments disclosed herein. Additionally, it is understood that a zone player is not limited to the particular examples illustrated inor to the SONOS product offerings. For example, a zone player may include a wired or wireless headphone. In yet another example, a zone player might include a sound bar for television. In yet another example, a zone player can include or interact with a docking station for an Apple IPOD™ or similar device.
3 FIG. 1 FIG. 300 302 300 130 302 300 300 304 300 100 100 100 300 128 illustrates an example wireless controllerin docking station. By way of illustration, controllercan correspond to controlling deviceof. Docking station, if provided, may be used to charge a battery of controller. In some embodiments, controlleris provided with a touch screenthat allows a user to interact through touch with the controller, for example, to retrieve and navigate a playlist of audio items, control operations of one or more zone players, and provide overall control of the system configuration. In certain embodiments, any number of controllers can be used to control the system configuration. In some embodiments, there can be a limit set on the number of controllers that can control the system configuration. The controllers might be wireless like wireless controlleror wired to data network.
100 100 128 In some embodiments, if more than one controller is used in system, then each controller may be coordinated to display common content, and may all be dynamically updated to indicate changes made from a single controller. Coordination can occur, for instance, by a controller periodically requesting a state variable directly or indirectly from one or more zone players; the state variable may provide information about system, such as current zone group configuration, what is playing in one or more zones, volume levels, and other items of interest. The state variable may be passed around on data networkbetween zone players (and controllers, if so desired) as needed or as often as programmed.
130 130 100 128 200 In addition, an application running on any network-enabled portable device, such as an IPHONE®, IPAD®, ANDROID™ powered phone, or any other smart phone or network-enabled device can be used as controller. An application running on a laptop or desktop personal computer (PC) or MAC® can also be used as controller. Such controllers may connect to systemthrough an interface with data network, a zone player, a wireless router, or using some other configured connection path. Example controllers offered by Sonos, Inc. of Santa Barbara, Calif. include a “Controller,” “SONOS® CONTROL,” “SONOS® Controller for IPHONE®.” “SONOS® Controller for IPAD®,” “SONOS® Controller for ANDROID™,” “SONOS® Controller for MAC® or PC.”
102 124 128 130 128 128 128 100 128 102 124 128 102 124 128 102 124 128 102 124 128 128 1 FIG. Zone playerstoofare coupled directly or indirectly to a data network, such as data network. Controllermay also be coupled directly or indirectly to data networkor individual zone players. Data networkis represented by an octagon in the figure to stand out from other representative components. While data networkis shown in a single location, it is understood that such a network is distributed in and around system. Particularly, data networkcan be a wired network, a wireless network, or a combination of both wired and wireless networks. In some embodiments, one or more of the zone players-are wirelessly coupled to data networkbased on a proprietary mesh network. In some embodiments, one or more of the zone players-are wirelessly coupled to data networkusing a non-mesh topology. In some embodiments, one or more of the zone players-arc coupled via a wire to data networkusing Ethernet or similar technology. In addition to the one or more zone players-connecting to data network, data networkcan further allow access to a wide area network, such as the Internet.
102 124 128 102 124 128 102 124 In some embodiments, connecting any of the zone players-, or some other connecting device, to a broadband router, can create data network. Other zone players-can then be added wired or wirelessly to the data network. For example, a zone player (e.g., any of zone players-) can be added to the system
100 128 100 128 128 configurationby simply pressing a button on the zone player itself (or perform some other action), which enables a connection to be made to data network. The broadband router can be connected to an Internet Service Provider (ISP), for example. The broadband router can be used to form another data network within the system configuration, which can be used in other applications (e.g., web surfing). Data networkcan also be used in other applications, if so programmed. An example, second network may implement SONOSNET™ protocol, developed by SONOS, Inc. of Santa Barbara. SONOSNET™ represents a secure, AES-encrypted, peer-to-peer wireless mesh network. Alternatively, in certain embodiments, the data networkis the same network, such as a traditional wired or wireless network, used for other applications in the household.
1 FIG. 106 108 102 130 130 130 A particular zone can contain one or more zone players. For example, the family room ofcontains two zone playersand, while the kitchen is shown with one zone player. In another example, the home theater room contains additional zone players to play audio from a 5.1 channel or greater audio source (e.g., a movie encoded with 5.1 or greater audio channels). In some embodiments, one can position a zone player in a room or space and assign the zone player to a new or existing zone via controller. As such, zones may be created, combined with another zone, removed, and given a specific name (e.g. “Kitchen”), if so desired and programmed to do so with controller. Moreover, in some embodiments, zone configurations may be dynamically changed even after being configured using controlleror some other mechanism.
106 108 106 108 106 108 106 108 In some embodiments, if a zone contains two or more zone players, such as the two zone playersandin the family room, then the two zone playersandcan be configured to play the same audio source in synchrony, or the two zone playersandcan be paired to play two separate sounds in left and right channels, for example. In other words, the stereo effects of a sound can be reproduced or enhanced through the two zone playersand, one for the left sound and the other for the right sound. In certain embodiments, paired zone players (also referred to as “bonded zone players”) can play audio in synchrony with other zone players in the same or different zones.
In some embodiments, two or more zone players can be sonically consolidated to form a single, consolidated zone player. A consolidated zone player (though made up of multiple, separate devices) can be configured to process and reproduce sound differently than an unconsolidated zone player or zone players that are paired, because a consolidated zone player will have additional speaker drivers from which sound can be passed. The consolidated zone player can further be paired with a single zone player or yet another consolidated zone player. Each playback device of a consolidated playback device can be set in a consolidated mode, for example.
130 According to some embodiments, one can continue to do any of: group, consolidate, and pair zone players, for example, until a desired configuration is complete. The actions of grouping, consolidation, and pairing are preferably performed through a control interface, such as using controller, and not by physically connecting and re-connecting speaker wire, for example, to individual, discrete speakers to create different configurations. As such, certain embodiments described herein provide a more flexible and dynamic platform through which sound reproduction can be offered to the end-user.
124 102 110 124 110 124 In some embodiments, each zone can play from the same audio source as another zone or each zone can play from a different audio source. For example, someone can be grilling on the patio and listening to jazz music via zone player, while someone is preparing food in the kitchen and listening to classical music via zone player. Further, someone can be in the office listening to the same jazz music via zone playerthat is playing on the patio via zone player. In some embodiments, the jazz music played via zone playersandis played in synchrony. Synchronizing playback amongst zones allows for someone to pass through zones while seamlessly (or substantially seamlessly) listening to the audio. Further, zones can be put into a “party mode” such that all associated zones will play audio in synchrony.
102 124 128 128 128 102 124 128 130 Sources of audio content to be played by zone players-are numerous. In some embodiments, music on a zone player itself may be accessed and a played. In some embodiments, music from a personal library stored on a computer or networked-attached storage (NAS) may be accessed via the data networkand played. In some embodiments, Internet radio stations, shows, and podcasts can be accessed via the data network. Music or cloud services that let a user stream and/or download music and audio content can be accessed via the data network. Further, music can be obtained from traditional sources, such as a turntable or CD player, via a line-in connection to a zone player, for example. Audio content can also be accessed using a different protocol, such as AIRPLAY™, which is a wireless technology by Apple, Inc., for example. Audio content received from one or more sources can be shared amongst the zone playerstovia data networkand/or controller. The above-disclosed sources of audio content are referred to herein as network-based audio information sources. However, network-based audio information sources are not limited thereto.
116 118 120 132 132 116 118 120 132 102 124 100 In some embodiments, the example home theater zone players,,are coupled to an audio information source such as a television. In some examples, the televisionis used as a source of audio for the home theater zone players,,, while in other examples audio information from the televisioncan be shared with any of the zone players-in the audio system.
4 FIG. 2 FIG.A 2 FIG.B 2 FIG.C 400 400 402 408 410 412 414 416 418 416 418 416 400 400 Referring now to, there is shown an example block diagram of a zone playerin accordance with an embodiment. Zone playerincludes a network interface, a processor, a memory, an audio processing component, one or more modules, an audio amplifier, and a speaker unitcoupled to the audio amplifier.shows an example illustration of such a zone player. Other types of zone players may not include the speaker unit(e.g., such as shown in) or the audio amplifier(e.g., such as shown in). Further, it is contemplated that the zone playercan be integrated into another component. For example, the zone playercould be constructed as part of a television, lighting, or some other device for indoor or outdoor use.
402 400 128 128 400 402 400 In some embodiments, network interfacefacilitates a data flow between zone playerand other devices on a data network. In some embodiments, in addition to getting audio from another zone player or device on data network, zone playermay access audio directly from the audio source, such as over a wide area network or on the local network. In some embodiments, the network interfacecan further handle the address part of each packet so that it gets to the right destination or intercepts packets destined for the zone player. Accordingly, in certain embodiments, each of the packets includes an Internet Protocol (IP)-based source address as well as an IP-based destination address.
402 404 406 404 400 128 404 404 400 420 406 400 404 406 404 406 400 404 406 In some embodiments, network interfacecan include one or both of a wireless interfaceand a wired interface. The wireless interface, also referred to as a radio frequency (RF) interface, provides network interface functions for the zone playerto wirelessly communicate with other devices (e.g., other zone player(s), speakers), receiver(s), component(s) associated with the data network, and so on) in accordance with a communication protocol (e.g., any wireless standard including IEEE 802.11a, 802.11b, 802.11g, 802.11n, or 802.15). Wireless interfacemay Include one or more radios. To receive wireless signals and to provide the wireless signals to the wireless interfaceand to transmit wireless signals, the zone playerincludes one or more antennas. The wired interfaceprovides network interface functions for the zone playerto communicate over a wire with other devices in accordance with a communication protocol (e.g., IEEE 802.3). In some embodiments, a zone player includes multiple wirelessinterfaces. In some embodiments, a zone player includes multiple wiredinterfaces. In some embodiments, a zone player includes both of the interfacesand. In some embodiments, a zone playerincludes only the wireless interfaceor the wired interface.
408 410 In some embodiments, the processoris a clock-driven electronic device that is configured to process input data according to instructions stored in memory.
410 414 408 410 408 400 400 400 400 The memoryis data storage that can be loaded with one or more software module(s), which can be executed by the processorto achieve certain tasks. In the illustrated embodiment, the memoryis a tangible machine-readable medium storing instructions that can be executed by the processor. In some embodiments, a task might be for the zone playerto retrieve audio data from another zone player or a device on a network (e.g., using a uniform resource locator (URL) or some other identifier). In some embodiments, a task may be for the zone playerto send audio data to another zone player or device on a network. In some embodiments, a task may be for the zone playerto synchronize playback of audio with one or more additional zone players. In some embodiments, a task may be to pair the zone playerwith one or more zone players to create a multi-channel audio environment.
414 408 Additional or alternative tasks can be achieved via the one or more software module(s)and the processor.
412 412 408 402 412 412 416 418 412 400 The audio processing componentcan include one or more digital-to-analog converters (DAC), an audio preprocessing component, an audio enhancement component or a digital signal processor, and so on. In some embodiments, the audio processing componentmay be part of processor. In some embodiments, the audio that is retrieved via the network interfaceis processed and/or intentionally altered by the audio processing component. Further, the audio processing componentcan produce analog audio signals. The processed analog audio signals are then provided to the audio amplifierfor play back through speakers. In addition, the audio processing componentcan include circuitry to process analog or digital signals as inputs to play from zone player, send to another zone player on a network, or both play and send to another zone player on the network. An example input includes a line-in connection (e. g, an auto-detecting 3.5 mm audio line-in connection).
416 418 418 The audio amplifieris a device(s) that amplifies audio signals to a level for driving one or more speakers. The one or more speakerscan Include an individual transducer (e.g., a “driver”) or a complete speaker system that includes an enclosure including one or more drivers. A particular driver can be a subwoofer (e.g., for low frequencies), a mid-range driver (e.g., for middle frequencies), and a tweeter (e.g., for high frequencies), for example. An enclosure can be scaled or ported, for example. Each transducer may be driven by its own individual amplifier.
A commercial example, presently known as the PLAY:5™, is a zone player with a built-in amplifier and speakers that is capable of retrieving audio directly from the source, such as on the Internet or on the local network, for example. In particular, the PLAY:5™ is a five-amp, five-driver speaker system that includes two tweeters, two mid-range drivers, and one woofer. When playing audio content via the PLAY:5, the left audio data of a track is sent out of the left tweeter and left mid-range driver, the right audio data of a track is sent out of the right tweeter and the right mid-range driver, and mono bass is sent out of the subwoofer. Further, both mid-range drivers and both tweeters have the same equalization (or substantially the same equalization). That is, they are both sent the same frequencies but from different channels of audio, Audio from Internet radio stations, online music and video services, downloaded music, analog audio inputs, television, DVD, and so on, can be played from the PLAY:5™.
5 FIG. Referring now to, there is shown an example block diagram
500 130 500 500 102 124 508 500 500 1 FIG. 1 FIG. for controller, which can correspond to thecontrolling devicein. Controllercan be used to facilitate the control of multimedia applications, automation and others in a system. In particular, the controllermay be configured to facilitate a selection of a plurality of audio sources available on the network and enable control of one or more zone players (e.g., the zone players-in) through a wireless or wired network interface. According to one embodiment, the wireless communications is based on an industry standard (e.g., infrared, radio, wireless standards including IEEE 802.11a, 802.11b, 802.11g, 802.1 in, 802.15, and so on). Further, when a particular audio is being accessed via the controlleror being played via a zone player, a picture (e.g., album art) or any other data, associated with the audio and/or audio source can be transmitted from a zone player or other electronic device to controllerfor display.
500 502 514 500 502 500 500 504 506 510 512 506 514 512 512 506 512 504 502 Controlleris provided with a screenand an input interfacethat allows a user to interact with the controller, for example, to navigate a playlist of many multimedia items and to control operations of one or more zone players. The screenon the controllercan be an LCD screen, for example. The screencommunicates with and is commanded by a screen driverthat is controlled by a microcontroller (e.g., a processor). The memorycan be loaded with one or more application modulesthat can be executed by the microcontrollerwith or without a user input via the user interfaceto achieve certain tasks. In some embodiments, an application moduleis configured to facilitate grouping a number of selected zone players into a zone group and synchronizing the zone players for audio play back, in some embodiments, an application moduleis configured to control the audio sounds (e.g., volume) of the zone players in a zone group. In operation, when the microcontrollerexecutes one or more of the application modules, the screen drivergenerates control signals to drive the screento display an application specific user interface accordingly.
500 508 508 508 500 102 124 1 FIG. The controllerincludes a network interfacethat facilitates wired or wireless communication with a zone player. In some embodiments, the commands such as volume control and audio playback synchronization are sent via the network interface. In some embodiments, a saved zone group configuration is transmitted between a zone player and a controller via the network interface. The controllercan control one or more zone players, such as-of. There can be more than one controller for a particular system, and each controller may share common information with another controller, or retrieve the common information from a zone player, if such a zone player stores configuration data (e.g., such as a state variable). Further, a controller can be integrated into a zone player.
It should be noted that other network-enabled devices such as an IPHONE®, IPAD® or any other smart phone or network-enabled device (e.g., a networked computer such as a PC or MAC®) can also be used as a controller to interact or control zone players in a particular environment. In some embodiments, a software application or upgrade can be downloaded onto a network-enabled device to perform the functions described herein.
500 500 In certain embodiments, a user can create a zone group (also referred to as a bonded zone) including at least two zone players from the controller. The zone players in the zone group can play audio in a synchronized fashion, such that all of the zone players in the zone group play back an identical audio source or a list of identical audio sources in a synchronized manner such that no (or substantially no) audible delays or hiccups are to be heard. Similarly, in some embodiments, when a user increases the audio volume of the group from the controller, the signals or data of increasing the audio volume for the group are sent to one of the zone players and causes other zone players in the group to be increased together in volume.
500 A user via the controllercan group zone players into a zone group by activating a “Link Zones” or “Add Zone” soft button, or de-grouping a zone group by activating an “Unlink Zones” or “Drop Zone” button. For example, one mechanism for ‘joining’ zone players together for audio play back is to link a number of zone players together to form a group. To link a number of zone players together, a user can manually link each zone player or room one after the other. For example, assume that there is a multi-zone system that includes the following zones: Bathroom, Bedroom, Den, Dining Room, Family Room, and Foyer.
In certain embodiments, a user can link any number of the six zone players, for example, by starting with a single zone and then manually linking each zone to that zone.
In certain embodiments, a set of zones can be dynamically linked together using a command to create a zone scene or theme (subsequent to first creating the zone scene). For instance, a “Morning” zone scene command can link the Bedroom, Office, and Kitchen zones together in one action. Without this single command, the user would manually and individually link each zone. The single command may include a mouse click, a double mouse click, a button press, a gesture, or some other programmed action. Other kinds of zone scenes can be programmed.
In certain embodiments, a zone scene can be triggered based on time (e.g., an alarm clock function). For instance, a zone scene can be set to apply at 8:00 am. The system can link appropriate zones automatically, set specific music to play, and then stop the music after a defined duration. Although any particular zone can be triggered to an “On” or “Off” state based on time, for example, a zone scene enables any zone(s) linked to the scene to play a predefined audio (e.g., a favorable song, a predefined playlist) at a specific time and/or for a specific duration. If, for any reason, the scheduled music failed to be played (e.g., an empty playlist, no connection to a share, failed Universal Plug and Play (UPnP), no Internet connection for an Internet Radio station, and so on), a backup buzzer can be programmed to sound. The buzzer can include a sound file that is stored in a zone player, for example.
6 FIG. 6 FIG. Certain particular examples are now provided in connection withto describe, for purposes of illustration, certain systems and methods to provide and facilitate connection to a playback network.shows that there are three zone
602 604 606 608 610 610 610 602 604 606 608 610 610 610 6 FIG. players,andand a controllerthat form a network branch that is also referred to as an Ad-Hoc network. The networkmay be wireless, wired, or a combination of wired and wireless. In general, an Ad-Hoc (or “spontaneous”) network is a local area network or other small network in which there is generally no one access point for all traffic. With an established Ad-Hoc network, the devices,,andcan all communicate with each other in a “peer-to-peer” style of communication, for example. Furthermore, devices may join and/or leave from the network, and the networkmay automatically reconfigure itself without needing the user to reconfigure the network. While an Ad-Hoc network is referenced in, it is understood that a playback network may be based on a type of network that is completely or partially different from an Ad-Hoc network.
610 602 604 606 608 602 604 606 602 604 606 608 610 6 FIG. Using the Ad-Hoc network, the devices,,, andcan share or exchange one or more audio sources and be dynamically grouped to play the same or different audio sources. For example, the devicesandare grouped to playback one piece of music, and at the same time, the deviceplays back another piece of music. In other words, the devices,,and, as shown in, form a HOUSEHOLD that distributes audio and/or reproduces sound. As used herein, the term HOUSEHOLD (provided in uppercase letters to disambiguate from the user's domicile) is used to represent a collection of networked devices that are cooperating to provide an application or service. An instance of a HOUSEHOLD is identified with a household(or household identifier), though a HOUSEHOLD may be identified with a different area or place.
610 In certain embodiments, a household identifier (HHID) is a short string or an identifier that is computer-generated to help ensure that it is unique. Accordingly, the networkcan be characterized by a unique HHID and a unique set of configuration variables or parameters, such as channels (e.g., respective frequency bands), service set identifier (SSID) (a sequence of alphanumeric characters as a name of a wireless network), and WEP keys (wired equivalent privacy or other security keys). In certain embodiments, SSID is set to be the same as HHID.
308 308 In certain embodiments, each HOUSEHOLD includes two types of network nodes: a control point (CP) and a zone player (ZP). The control point controls an overall network setup process and sequencing, including an automatic generation of required network parameters (e.g., WEP keys). In an embodiment, the CP also provides the user with a HOUSEHOLD configuration user interface. The CP function can be provided by a computer running a CP application module, or by a handheld controller (e.g., the controller) also running a CP application module, for example. The zone player is any other device on the network that is placed to participate in the automatic configuration process. The ZP, as a notation used herein, includes the controlleror a computing device, for example. In some embodiments, the functionality, or certain parts of the functionality, in both the CP and the ZP are combined at a single node (e.g., a ZP contains a CP or vice-versa).
In certain embodiments, configuration of a HOUSEHOLD involves multiple CPs and ZPs that rendezvous and establish a known configuration such that they can use a standard networking protocol (e.g., IP over Wired or Wireless Ethernet) for communication. In an embodiment, two types of networks/protocols are employed: Ethernet 802.3 and Wireless 802.11g. Interconnections between a CP and a ZP can use either of the networks/protocols. A device in the system as a member of a HOUSEHOLD can connect to both networks simultaneously.
606 612 602 604 608 6 FIG. In an environment that has both networks in use, it is assumed that at least one device in a system is connected to both as a bridging device, thus providing bridging services between wired/wireless networks for others. The zone playerinis shown to be connected to both networks, for example. The connectivity to the networkis based on Ethernet and/or Wireless, while the connectivity to other devices,andis based on Wireless and Ethernet if so desired.
606 604 602 602 602 It is understood, however, that in some embodiments each zone player,,may access the Internet when retrieving media from the cloud (e.g., the Internet) via the bridging device. For example, zone playermay contain a uniform resource locator (URL) that specifies an address to a particular audio track in the cloud. Using the URL, the zone playermay retrieve the audio track from the cloud, and ultimately play the audio out of one or more zone players.
7 FIG. 1 FIG. 700 700 702 704 706 702 704 706 708 700 708 100 130 illustrates an example network audio systemconstructed in accordance with an embodiment. The network audio systemincludes a playback device, a first group of one or more playback devices generally identified by reference numeral, and a second group of one or more playback devices generally identified by reference numeral. The playback devices,, andoperate under the control of one or more user interface modules generally identified by reference numeral. In an embodiment, the network audio system, including the user interface (UI), are shown inas the systemand the controller, respectively.
7 FIG. 702 710 710 702 722 As shown in, the playback deviceis connected directly or in-directly to a video devicethat contains, or is connected to, a display for the purpose of viewing video. The video devicemay include any of a digital video receiver like an Apple TV, cable or satellite box. Blu-ray player, television set, monitor, projector, or any other device or devices that is an audio source to the playback devicevia a wired or wireless connection, more of which is described below.
100 702 706 712 714 716 716 712 702 704 706 714 1 FIG. 7 FIG. Similar to the zone players described with respect to the systemof, the playback devicesandare connected to one or more additional audio sources, examples of which are identified ingenerally as local audioand cloud contentvia a network interface. The network interfacemay include any of a network router, modem, or other network device that allows access to devices on the local area network and/or a wide area network such as the Internet. Local audiomay include any of local audio content, such as audio stored and/or accessible from a local device like a personal computer, laptop, mobile device, network accessible storage (NAS) device, an audio reproduction device connected to one of the playback devices,, and, or similar devices capable of storing digital information in volatile or non-volatile form or streaming the digital audio information. Audio source at the cloud contentmay include Internet audio content or any other source of audio information via a wide area network.
704 712 714 702 718 704 712 714 720 7 FIG. In one embodiment, the playback devicesare connected to the local audioand the cloud contentvia the playback deviceover the HT network. In another embodiment (not shown in) , the playback devicesare connected to the local audioand the cloud contentvia the ZG network.
7 FIG. 702 704 718 706 720 With reference to the example illustrated in, the playback devicemay be connected to the playback devicesvia a star network, generally referred to in the figure as home theater (HT) network, and to the playback devicesvia a mesh network, generally referred to as zone group (ZG) network.
720 128 720 706 702 1 FIG. In an embodiment, the ZG networkis like the data networkof. The ZG networksupports the playback of audio in synchrony amongst playback devicesandthat may be in ear-shot of each other.
718 720 718 710 In an embodiment, the HI networkis a low-latency network to support a higher coordination between audio and the corresponding video than needed by the ZG network. That is, the HT networkis configured as a low-latency network to prevent or reduce lip synchronization issues between the audio and corresponding video being played via the video device.
702 704 706 702 704 718 720 According to such embodiments, the playback devicesandmay be configured in a home theater arrangement in a single location, such as the same mom (e.g., a home theater room, family room, or some other area where surround sound is desired), and playback devicesmay be located in the same or different areas from the playback devicesand. While the present disclosure is not limited to the particular network types described herein, aspects of the HT networkand the ZG networkare described more below.
702 710 710 702 0 1 702 708 702 704 702 710 In an embodiment, the playback devicemay be used with the video deviceto provide the sole reproduction of audio (or substantially the sole reproduction) for the video device. In this embodiment, the playback devicemay be configured, for example, to support the playback of audio either in a single channel or multi-channel mode. By way of illustration, traditional multi-channel modes include, among others, 2.0 (e.g., stereophonic sound), 2.1, 3.1, virtual 5.1, or virtual 7.1 configurations, where the “.” typically represents the low-frequency effects channel and virtual 5.1 and 7.1 configurations represent a simulated surround sound effect. The playback devicemay playback the audio according to a certain configuration based on its particular speaker design, how the incoming audio is encoded, based on a user configuration via the UI, or any combination of the three. According to this embodiment, the playback deviceneed not be coupled to the playback devices, because the playback device, by itself or without the use of another playback device, is used to reproduce the audio for the video device.
702 704 704 704 704 702 704 In another embodiment, the playback devicemay operate with the playback devicesto reproduce a home theater-style listening environment. A home theater-style listening environment may often include a multi-channel surround field, such as 2.1, 3.1, 5.1, or 7.1 channel audio reproductions to recreate the ambiance of a movie theater, but may include other types of multi-channel or single-channel audio playback modes. For instance, the playback devicesmight include any of a subwoofer or group of subwoofers to reproduce the low-frequency effects channel (SUB), a playback device for each of the right rear and left rear channels (SAT), and other playback devices depending on the setup. As such, the playback devicesmay only include one or more playback devices depending on the desired setup. In certain embodiments, playback devices may be dynamically added or removed from the playback devicesand the system (e.g., the playback devicesand) reconfigures itself accordingly.
702 706 720 710 706 702 100 702 706 720 724 716 702 712 714 724 1 FIG. In yet another embodiment, a coupling of the playback deviceto playback devicesvia ZO networkenables audio from the video deviceto be played in substantial synchrony by other playback devices, and furthermore allows the playback deviceto be included in an overall audio system (e.g., systemshown in) , In which additional benefits am provided. If the playback deviceis not coupled to the playback devicesvia the ZG network, then a connectionmay be made directly (or indirectly) to network device, such that the playback devicecan be controlled by a wireless enabled controller, has access to local audio content, has access to cloud content, and so on. In one embodiment, the connectionmay include wired Ethernet.
702 704 718 706 720 710 704 706 704 718 706 720 In yet another embodiment, a coupling of the playback deviceto the playback devicesvia the HT networkand the playback devicesvia the ZG networkenables audio from the video deviceto be played back concurrently by different sets of playback devicesand, where the playback devicesoperate using the low-latency HT networkand the playback devicesoperate using the ZG network.
702 704 706 702 710 712 714 Regardless of whether the playback deviceis connected to playback devicesand/or playback devices, the playback devicecan advantageously provide sound reproduction for the video device(e.g., a television set) and wired or wireless access to local audioand/or cloud contentfor audio playback.
130 1 FIG. In an embodiment, similar to the controllerof,
708 702 704 706 708 702 704 706 702 702 708 702 702 708 702 708 700 7 FIG. the UIincan be used to, among other things, configure any of playback devices,, and, such as to modify the audio equalization (EQ) settings, and enable zone group joining and disengaging. For instance, the UIcan enable dynamic joining and disengaging of any playback devices of the playback devices,, andto one or more zone groups (also referred to as synchrony groups). In other words, for instance, if the playback deviceis currently not a member of a zone group, for example, the playback devicemay be joined to a zone group via UI, after which it will play the audio of that zone group. If the playback deviceis currently a member of a zone group, the playback devicemay be disengaged from that zone group via the UI. Alternatively, a new zone group may be created with the playback deviceand another playback device. Additionally, the UIcan control other aspects of the network audio system, including but not limited to the selection of the audio information source that a particular playback device is to utilize, the volume of the audio playback, to turn audio information source(s) on and off, and so on.
708 702 704 706 708 With respect to volume adjustment, in an embodiment, the UImay be used to adjust playback volume by any of the individual playback devices,, and/orin addition to adjusting a group volume. In operation, the UIcan provide information identifying the particular device whose volume is to be adjusted, and the level at which the volume is to be set to a master device or to the playback device, itself.
702 704 708 702 704 702 704 In an embodiment, the volume for the playback devicesandmay act in a coordinated fashion such that the UIdisplays a single volume control. As such, the manipulation of the single volume control may cause the playback devicesandto increase or decrease in volume. Each playback device ofandmay increase or decrease at the same rate or different rates depending on the configuration.
702 712 714 402 1202 410 1210 In an embodiment, the playback devicedetermines the type of incoming audio to be played and automatically adjusts the audio playback to enhance the output. For example, stereo (or 2-channel) audio may be processed and played differently than audio consisting of more (or less) than 2 channels. In another example, audio that is associated with video is processed and played differently than audio that is not associated with video. In some embodiments, the adjustments are made based on the source of the audio content (e.g., local audioand/or cloud content). In other embodiments, the adjustments are made based on interface over which the audio content arrives to the playback device (e.g., audio that arrives over the network interfaceor, or audio that arrives over the audio interfaceor). In yet other embodiments, the adjustments are made based on the application that was used to initiate the audio playback (e.g., audio that is played using a YOUTUBE™ application vs. audio played that is played using a SONOS® application).
702 702 702 708 702 702 710 722 In one embodiment, the playback devicemay be configured to implement a mode of operation referred to herein as “dialog enhancement” that provides an enhancement to audio that is associated with video so that a user can more clearly understand speech. Particularly, in one embodiment, the playback deviceboosts the center channel, rolls off the bass frequencies, lowers the volume of the satellites, and emphasizes the speech spectrum (e.g., 300 to 3400 Hz). In one embodiment, the playback deviceis configured via the UIto implement “dialog enhancement” mode of operation. In one embodiment, “dialog enhancement,” once configured with the playback device, is implemented based on characteristics of the audio content (e.g., is the audio associated with video, what interface does the audio content arrive to the playback device, what application was used to initiate the audio playback, and so on). For example, if “dialog enhancement” is configured on the playback deviceand the audio content arrives from the video devicevia connection, then the audio is adjusted according to the “dialog enhancement”mode.
702 702 708 702 702 702 In another embodiment, the playback devicemay be configured to implement a mode of operation referred to herein as “night mode” that applies dynamic range compression in addition to implementing “dialog enhancement.” In one embodiment, the playback device, is configured via the UIto implement “night mode” mode of operation. In one embodiment, “night mode,” once configured with the playback device, is implemented based on characteristics of the audio content (e.g., is the audio associated with video, what interface does the audio content arrive to the playback device, what application was used to initiate the audio playback, and so on) as well as a temporal or environmental characteristics (e.g., what time of day is it, is it dark outside, and so on). In one embodiment, the playback deviceuses a light sensor to determine the relative brightness in the room to determine if the environmental characteristic is met for “night mode.” For example, if the light sensor detects, for example, a lux value of less than (<) 1, the environmental characteristic may be met for “night mode” use. In another embodiment, the playback deviceuses a real-time clock to determine if the temporal characteristic is met for “night mode” use. For example, if the real-time clock indicates the time is between, for example, 8 pm and 6 am, the temporal characteristic may be met for “night mode” use.
702 710 712 714 704 718 702 706 720 702 706 702 702 704 706 In certain embodiments, the playback deviceobtains audio information from an audio information source, such as the video device(and/or any other audio source such as local audioand/or cloud content), adds playback timing information, and transmits the combined audio and playback timing information to the playback devicesover the HT networkfor coordinated playback. The playback devicemay also transmit the combined audio and playback timing information to playback devicesover the ZG networkwhen the playback deviceis part of a zone group including any of playback devicesand the playback deviceis the master device for the zone group. The playback timing information that is provided with the audio information, together with clock timing information provided by the playback device, enables the playback devicesand, if so grouped, to coordinate playback of the audio information. As will be described more below, the playback timing information is determined for each particular playback device (or a particular group of playback devices) and its role in audio playback.
702 706 706 704 702 702 704 718 702 704 702 706 In certain embodiments, the playback deviceis part of a zone group including any of playback devices, where one of the playback devicesis the master device for the zone group. If any of the playback devicesare grouped with the playback device, then the playback devicemodifies the playback timing information it receives from the master device to match its local clock, and transmits the combined audio and modified timing information to the playback devicesover the HT networkfor coordinated playback. The playback timing information that is provided with the audio information, together with the clock timing information provided by the playback device, enables the playback devicescoordinate playback of the audio information withwithout having to explicitly coordinate with the one of the playback deviceswhich is the master device for the zone group.
702 718 724 720 702 704 706 In an embodiment, the playback devicetransmits the audio and playback liming information in messages over the HT network, the network connection, and/or ZG networkusing a multi-cast message transmission methodology. In some embodiments, each of the messages includes a multi-cast address (or some other ID or identifying address) that is used to identify the multi-cast group or members of the multi-cast group for which the message is intended. Each playback device monitors the messages on the network, and when a playback device detects a message with its address or a multi-cast group address to which the playback device belongs, they will receive and process the contents of the message. It is understood, however, that the playback devicemay make use of any convenient multi-cast or uni-cast (or other) message transmission methodology in transmitting the audio and playback timing information to the playback devicesand/or.
702 702 704 706 Irrespective of the transmission methodology, in an embodiment, the audio and playback timing information is in the form of a series of frames, with each frame having a timestamp. The timestamp indicates a time, relative to the time indicated by a clock maintained by the playback deviceor some other designated reference device, at which the frame is to be played. Depending on the size or sizes of the messages used in the selected multi-cast message transmission methodology and the size or sizes of the frames, a message may contain one frame, or multiple frames, or, alternatively, a frame may extend across several messages. It is understood that the information included in the timestamp(s) may alternatively be provided by the playback deviceto the playback devicesand/orin periodic or non-periodic intervals instead of, or in addition to, in a series of frames.
702 704 706 718 720 724 700 702 702 704 706 704 706 702 In an embodiment, the playback devicemay provide clock time information to the playback devicesand/orindividually over networksand/orand/or, respectively, using a highly accurate clock time information transmission methodology. In one embodiment, the systemutilizes an out-of-band protocol, such as the SNTP (Simple Network Time Protocol), to obtain current clock time information from the playback device. The SNTP makes use of a unicast message transfer methodology, in which one device, such as the playback device, provides clock time information to a specific other device, such as any of the playback devicesand. According to this embodiment, each of the playback devicesand/orwill periodically initiate SNTP transactions to obtain the clock time information from the playback device.
704 706 702 702 704 706 702 704 706 The playback devicesand/orcan make use of the clock time information to determine the time differential between the time indicated by the playback device's () clock and the time indicated by its respective clock, and use that time differential value, along with the playback time information associated with the audio information and the respective device's local time as indicated by its clock to determine when the various frames are to be played. This enables the playback devices,and/orto coordinate playback. Coordinated playback may not always result in perfect, or substantially perfect, synchrony as a result of using a wireless technology between playback devices,, and/or. In some instances, coordinated playback includes a user imposed delay, such that the audio is intentionally not played in synchrony, but rather exhibits a more theater-like listening experience.
702 704 706 702 718 720 700 702 Additionally, it is understood that the playback devicemay not transmit audio and playback timing information over the networks it playback devicesandare not used. The playback devicecan maintain a count of the number of playback devices as they join and disengage, and, if the number drops to zero, it can stop transmitting the audio and playback timing information over any of the networksand/or. Furthermore, it is understood that there may be multiple synchrony groups in the network audio system, and further that, for example, playback devicemay operate both as a master or primary device (e.g., a device that may be configured to provide audio and timing information) and a slave device (e.g., a device that may be configured to receive the audio and timing information).
702 704 706 702 In an embodiment, when the playback deviceprovides the audio information to the playback devicesand/or, the playback devicemay divide the audio stream or file into a series of frames, with each frame containing digital audio information for a predetermined period of time. A particular frame in a digital audio stream may contain a series of audio samples. Associated with each frame is a header that includes a number of fields for storing other information that is useful in controlling playback of the audio samples in the respective frame. For instance, the header associated with a frame may include a frame sequence number field, an encoding type field, a sampling rate information field, a timestamp field, an end of track flag, and a length flag field. The header may also include fields for storing other information that is useful in controlling playback. Generally, the frame sequence number field receives a sequence number that identifies the relative position of the frame in the sequence of frames containing the digital audio stream. The encoding type field receives a value that identifies the type of encoding and/or compression that has been used in generating the digital audio stream. Depending on the audio source, conventional encoding and/or compression schemes include, for example, MP3, WMA, AAC, and WAV encoding and/or compression schemes, although it will be appreciated that other schemes may be provided for as well. The sampling rate information field receives sampling rate information that indicates the sampling rate for the audio samples. The condition of the end of work flag indicates whether the frame contains the last digital audio samples for the audio track associated with the framed digital audio work. If the frame does not contain the audio samples that are associated with the end of the digital audio stream for a respective audio work, the end of work flag will be clear. On the other hand, if the frame does contain the audio samples that are associated with the end of the digital audio stream for a respective audio work, the end of work flag will be set. In addition, since the number of valid audio samples in the frame, that is, the samples that are not padding, may be less than “S,” the default number of audio samples in a frame, the length flag field will contain a value that identifies the number of audio samples in the last frame of the audio work.
702 702 704 706 According to an embodiment, the timestamp field stores a timestamp that identifies the time at which a particular playback device is to play the respective frame. More specifically, for each frame of a framed digital audio stream that is buffered in the audio information buffer, the playback device, using timing information from its digital to analog converter clock, can determine a time at which a particular playback device is to play the respective frame, and stores a timestamp identifying the playback time in the timestamp field. The timestamp associated with each frame can be used by a playback scheduler (e.g., contained within each of the playback devices,, and) to determine when the portion of the digital audio stream stored in the frame is to be coupled to the digital to analog converter to initiate playback. It will be appreciated that the timestamps that are associated with frames in sequential frames may be such that they can be played back in order, and without an interruption between the sequential frames comprising the digital audio stream. The timestamps may also be such that frames will be played back after some slight time delay after they have been buffered in the audio information buffer.
8 FIG. 7 FIG. 8 FIG. 700 702 704 718 706 720 704 718 720 702 704 706 702 718 720 702 illustrates the example network audio systemof, where the playback deviceis coupled to three different playback devices at, via the HT network, and is grouped in a zone with at least one other playback device, via the ZG network. Particularly, the three playback devicesinclude a subwoofer (SUB) and two satellite speakers (SAT). In the illustrative embodiment of, it is assumed that the HT networkand the ZG networkare using wireless networks, such as described above, in which audio information is being sent wirelessly from the playback deviceto the playback devicesand. As such, certain timing delays are introduced by the playback deviceto enable the receiving devices to receive and process the audio information. In other embodiments, the HT networkand/or the ZG networkare using wired networks (e.g., 100 Mb or 1 Gb Ethernet networks). As such, certain timing delays may be different than the delays introduced for wireless networks. In yet other embodiments, the timing delays may be different depending on the number of zone players that are grouped together and/or the total number of zone players in the system. In addition, certain timing delays may be introduced to provide a home-theater listening environment. In an embodiment, these timing delays are reflected in the timestamps assigned to various audio channels, groups of audio channels, or audio to be sent to a zone group by the playback device.
702 704 702 702 706 702 710 722 702 704 706 706 704 In an embodiment, the playback devicedetermines that the playback devices, and in this example a SUB and two SATs, are part of its configuration. To do so, for example, the playback devicecan look to a locally stored state variable that lists members of a group, if any. Furthermore, the playback devicedetermines that it is a part of a zone group with one or more playback devices. Again, according to an example, the playback devicecan look to the locally stored state variable to make this determination. Upon receiving audio information from the video devicevia connection, the playback deviceidentifies the audio that is to be transmitted to the playback devicesand the audio that is to be transmitted to the playback devices. In some instances, the audio sent to various playback devices or groups of playback devices may be different. In other words, the audio sent to the playback devicesmight include all of the available frequencies and/or channels, for example, whereas the audio sent to the playback devicesmight include different sets of frequencies and/or channels depending on the playback device and its purpose.
702 704 702 702 702 702 For audio to be played by the playback deviceand playback devices, the playback devicemay separate the audio into different channels, such as front-right, front-left, center, left-rear, right-rear, surround-right, surround-left, and subwoofer. If two or more channels are to be played by the same device, then the playback devicemay group them together. For instance, the playback devicemay be configured to play the front-right, center, and front-left channels. Then, for each audio channel or group of channels, the playback deviceassigns a timestamp to the audio frame specifying the time at which the audio should be played by its respective device.
8 FIG. 702 710 702 708 702 704 720 With reference to, a timestamp may be provided that indicates a time to play the three-channel audio (e.g., front-right, center, and front-left channels) from the playback device, such that the time is delayed by, for example, 5 milliseconds (ms) from when the corresponding video is to be displayed at time (t)=0.0 via the video device. With respect to the SUB channel, the playback devicemay assign a timestamp of 10 ms past t=0.0. With respect to the SAT channels, the playback device may assign each a timestamp of 15 ms past t=0.0. Note that the SAT channels (and SUB channels, if multiple SUBs are grouped together) do not need to be assigned the same timestamp as each other, and can be assigned different timestamps. For instance, if a user is sitting closer to the rear-left SAT, then its timing value may be different from the rear-right SAT. The timing value may be adjusted via the UI. The playback devicemay then send the audio and timing information (audio and timing information may also be collectively referred to as “audio information”) to the respective playback devicesand assign them to a high priority queue, so as to be processed more quickly than traditional audio (e.g., audio from over the ZG network) where video synchronization isn't available and/or necessary.
702 In another embodiment, a designated device other than the playback deviceassigns the timestamp.
704 706 704 706 702 In yet another embodiment, the playback devicesand/orthemselves apply the delay at their respective devices to the received audio information. In such an embodiment, the delay value(s) may be assigned to the playback devicesand/orvia the playback deviceor some other designated device.
706 702 710 706 706 704 718 For the playback devices, the playback devicemay assign a timestamp that indicates a time to play the zone group audio information at 20 ms past when the corresponding video is displayed at t=0 via the video device. A copy of the audio frames may be sent to each playback device within the playback devices, as each will most likely receive and play the same audio information. In an embodiment, the priority to send these audio frames to the playback devicesvia the ZG network may be lower than frames sent to the playback devicesvia the HT network.
706 702 10 710 702 704 It is understood that in one embodiment, the lower priority given to the playback devicesis due to the desire to use the processing capabilities at the playback devicemaintain a high coordination between the video being displayed via the video deviceand the audio being played via the playback deviceand/or the playback devices. If, however, the processing capabilities are not limited or are fast enough to process the audio information without human detection, then the use of priority becomes less relevant or unnecessary.
702 704 702 In an embodiment, the timestamp indicates a lime to play the audio by a particular player. As such, the timestamp includes a delay, if any, for a particular player or group of players. For instance, a delay of 5 ms may be applied to all playback devicesandas a base-line delay. The base-line delay may be for time called for to process and/or distribute the audio information to one or more players. Additional delays that are specific to the players may also be applied. Accordingly, the playback devicemight be configured to play at t+5 ms, the SUB might be configured to play at t+5 ms+5 ms, and the SATs might each be configured to play at t+5 ms+10 ms, for example.
9 FIG. 900 704 718 900 702 900 702 shows a flowchartrepresentative of an example method to provide audio with timestamps to the playback devicesvia HT network. In one embodiment, the methodis implemented in software and/or hardware of the playback device. Alternatively, the methodmay be implemented by a designated device that is different from the playback device.
902 702 710 722 722 702 At block, the playback devicemay receive an audio signal from an audio source such as the video devicevia the connection. The connectionmay utilize any of the Ethernet standards, an optical fiber connection system like TOSLINK, an HDMI connection, an IEEE 802.11 compliant wireless connection, or some other connection that allows transmission of the audio signal to the playback device.
904 702 702 702 At block, the playback devicemay process the incoming audio signal by separating audio information contained in the audio signal into different audio channels. The audio channels may include, for example, any of a front-left, front-right, center, low-frequency, surround-left, surround-right, surround-back-left, and surround-back-right. The playback devicemay likewise group certain channels together to be played by a single playback device, for example. An example of channel grouping includes grouping the front right, center and front left channels together to be played by the playback device.
906 702 702 704 704 At block, the playback deviceassigns a timestamp to the audio information based on the channel, where the timestamp indicates a time at which the audio is to be played by a particular playback device. In one example, the front-left, front-right, and center are assigned a timestamp at or below 5 ms for playback device; the surround-left and surround-right are assigned a timestamp at or below 15 ms for SAT playback devices; the subwoofer is assigned a timestamp at or below 10 ms for the SUB playback device. The timestamp for the rear surrounds may be based on adding an intentional delay to provide a surround effect. The timestamp for the subwoofer may be based on providing enough time to receive audio packets wirelessly via the HT network and based on the assumption that bass frequencies do not require the level of synchronization as higher frequencies. Additionally, the time delays may also be based on content, or the source of content. For example, stereo audio may be played at different delays.
908 702 704 At block, the playback devicemay send the different channels of audio information with their respective timestamps to the playback devices.
702 704 718 Alternatively, it is understood that the playback devicemay forward the audio signal from the audio source onto the playback devicesvia the HT networkwithout separating the audio into separate channels or groups of channels. As such, the respective, receiving playback device can parse the data to find its channel information to playback.
702 906 1000 704 706 1000 702 1000 702 10 FIG. Additionally, the respective, receiving playback devices may store the delay information and use that information to determine a playback time instead of the playback devicepropagating the timestamp information with the audio information, as described above in block.shows a flowchartrepresentative of an example method to provide audio with timestamps to the playback devicesand playback devices. In one embodiment, the methodis implemented in software and/or hardware of the playback device. Alternatively, the methodmay be implemented by a designated device that is different from the playback device.
1002 702 710 722 At block, the playback devicemay receive an audio signal from an audio source, such as the video devicevia the connection.
1004 702 704 At block, the playback devicemay identify home theater audio information contained in the audio signal for playback by the playback devices(e.g., home theater “HT” players).
1006 702 706 At block, the playback devicemay identify zone group audio information contained in the audio signal for playback by the playback devices(e.g., zone group “ZG” players). It is understood that zone group audio information is likely to overlap with home theater audio information. That is, the zone group audio information may contain all, or substantially all, of the frequencies found in the home theater audio information.
1008 702 At block, the playback devicemay assign one or more timestamps to home theater audio information that indicates a time at which audio is to be played by the plurality of home theater players.
1010 702 At block, the playback devicemay assign a zone group timestamp to the zone group audio information that indicates a time at which audio is to be played by the one or more players in the zone group.
1012 702 At block, the playback devicemay send the home theater audio information with the one or more timestamps to the plurality of home theater players and send the zone group audio information with the zone group timestamp to the one or more players in the zone group.
11 FIG. 1100 704 706 1100 702 1100 702 shows a flowchartrepresentative of an example method to adjust time delay for the playback devicesand/or. According to the figure, the system is intelligently adjusting time delays for home theater and non-home theater groups of players based on the user adjustment. In one embodiment, the methodis implemented in software and/or hardware of the playback device. Alternatively, the methodmay be implemented by a designated device that is different from the playback device.
1102 702 710 722 722 702 At block, the playback devicereceives an audio signal from an audio source, such as the video devicevia the connection. The audio signal comprises digital audio information, although it is understood that the audio information may be analog and then converted to a digital format. An example connectionis a TOSLINK, which is a standardized optical fiber connection. The playback devicereceives and decodes the audio stream.
1104 702 702 706 7 FIG. At block, the playback deviceassigns a first timestamp to home theater audio information contained in the audio signal, wherein the first timestamp indicates a first time at which audio is to be played by a first player of a plurality of home theater (HT) components or players. According to, HT components are represented by any of playback devicesand. The first timestamp includes a particular delay component. For example, a first timestamp might be to play at 10 ms, where 10 ms=t+5 ms+5 ms, where “t” represents an approximate time that the video is displayed, 5 ms represents a time value that all home theater players are delayed (e.g., a base delay), and 5 ms represents a specific time delay tailored for an individual player or groups of players.
1106 702 706 7 FIG. At block, the playback deviceassigns a second timestamp to the zone group audio information contained in the audio signal, wherein the second timestamp indicates a second time at which audio is to be played by a second player of a plurality of non-HT components or players, wherein the first and second players are different and the first and second times are different. According to, non-HT components are represented by any of the playback devices. The second timestamp includes a second delay component. For example, a second timestamp might be to play at 20 ms, where 20 ms=t+20 ms.
1104 702 At block, the playback devicereceives a user command to adjust a delay that is to be played by the plurality of players. For example, the user may adjust the delay between the video and the audio to improve synchronization between the video and audio. In another example, the user may adjust the delay to improve the multichannel audio affects depending on, for example, the location of the user within the room. In one example, different delays are used for speakers depending on, for example, if the speaker is less than two feet from the user, the speaker is between two feet and ten feet from the user, and the speaker is more than ten feet from the user.
1108 702 At block, the playback deviceadjusts the first time differently than the second time responsive to the user command. For example, the first time corresponding to the HT components is adjusted, whereas the second time corresponding to the non-HT components are not adjusted. An example includes a first time to be played at t+5 ms+5 ms and a second time to be played at t+20 ms, where again “t” represents an approximate time that the video is displayed. When an adjustment is made, such as +5 ms, then the first time might look like play at t+5 ms+10 ms and the second time might stay the same. In other words, the system is trying to achieve a balance between a delay for HT components and a delay for non-HT components.
1110 702 At block, the playback devicesends the audio to the plurality of players with the at least adjusted first time. In some instances, the first time is the only time adjusted. In some instances, the first time and the second time are both adjusted, and in some instances, only the second time is adjusted. The first time and/or second time is adjusted via the timestamps.
7 FIG. 718 702 704 718 718 According todescribed above, the HT networkis used by the playback devicesandto exchange (e.g., transmit and receive) information. In an embodiment, the HT networkrepresents a low-latency network to enable multi-channel audio, and in particular, to enable multi-channel audio in a home-theater like environment. The following definitions may be used throughout this disclosure, and in particular, used to describe an example operation of a low latency network like HT network.
The terms “spectrum” or “wireless spectrum” refer to a range of wireless communications frequencies, where different “spectra” (multiple spectrum) refer to different ranges of wireless frequencies. Different spectra may or may not overlap. Different spectra may or may not be contiguous (i.e., may or may not have spectra between them). In some examples disclosed herein, the term spectrum refers to a regulatory spectrum as defined by a regulatory agency such as the Federal Communications Commission (FCC) in the United States. For example, the FCC has allocated the “2.4 GHz spectrum” (or spectral band) to include the frequency range of 2400 MHz to 2500 MHz for Industrial, Scientific, and Medical applications. Additionally, the FCC has allocated the “5 GHz spectrum” (or spectral band) to include the frequency range of about 5.17 GHz to about 5.835 GHz, with some excepted bands within that range.
The terms “channel,” “audio channel.” “control channel.” and/or, more generally, “wireless channel,” all refer to a distinct frequency or distinct sub-range(s) of frequencies within one or more spectra that may be used to transmit particular information. A channel may be a band of frequencies, a non-contiguous set of frequencies and bands, a frequency hopping configuration, time division multiplexing, code division multiplexing, and/or any other type of communication frequency arrangement.
The terms “primary” or “primary zone player” refer to a zone player configured to control a low-latency audio network having zero or more “satellite” zone players. A “satellite” or “satellite zone player” refers to a zone player configured to provide low-latency audio in combination with a primary zone player. Both primary zone players and satellite zone players may be configurable to operate in other audio playback arrangements, such as in a zone group.
7 FIG. 702 704 Referring back to, the playback devicemay be considered a primary zone player and the playback devicesmay be considered the satellite zone players, it is understood, however, that a standalone device may perform all, or most of, the functions associated with a primary zone player, and as such, the disclosure is not limited to a zone player operating as the primary device.
The term “low latency audio” generally refers to audio played within a threshold time of another event (e.g., video played on a monitor substantially synchronized with audio played via a zone player). For instance, in some embodiments audio is considered low-latency when the audio is played within a threshold time (e.g., at or around 30 milliseconds) of the corresponding video being shown. In some embodiments, audio is considered low-latency when the audio is played (e.g., from a speaker) within a threshold time (e.g., at or around 30 milliseconds) of being provided from a source of the audio (e.g., from a media source such as a television, a cable set top box, a digital media player, a DVD player, a Blu-ray disc player, etc.). Other measures of latency and low-latency may be used in different embodiments and even have different meanings depending on the application.
12 FIG. 12 FIG. 1 FIG. 7 FIG. 1200 102 124 1200 116 118 120 1200 702 1200 shows an internal functional block diagram of an example primary zone player to provide low-latency audio. The example primary zone playerofmay be used to implement any of the example zone players-of. In some embodiments, the example primary zone playermay be used to implement one of the home theater zone players,,. In some embodiments, the primary zone playermay be used to implement the playback devicein. In some embodiments, the primary zone playeris a sound bar. As used herein, a “sound bar” refers to a single playback device including an array of speakers configured to replicate audio for video and to replicate audio in general. In some instances, a sound bar may play three channels of audio including a front right, center, and front left. In some instances, a sound bar may simulate or partially simulate a surround sound experience.
400 1200 408 410 412 414 416 418 420 1200 1202 1204 1206 406 406 1200 406 1204 1206 12 FIG. 12 FIG. Like the example zone playerof FIG, 4, the example primary zone playerofincludes a processor, memory, an audio processing component, a module, an audio amplifier, speakers, and one or more antenna(s). These components are discussed in more detail above. More or less components may be included depending on the desired configuration. The example primary zone playerofincludes a network interfacehaving a first interface(Spectrum 1 interface) to communicate via a first wireless spectrum (e.g., the 2.4 GHz spectrum), a second interface(Spectrum 2 interface) to communicate via a second wireless spectrum different from the first wireless spectrum (e.g., the 5 GHz spectrum), and a wired interface. The wired interfaceis discussed above. The example primary zone playermay simultaneously or substantially simultaneously communicate via any or all of the interfaces,, and.
406 1204 1206 406 1204 1206 1200 406 1204 1206 12 FIG. Each of the example interfaces,,ofmay have a unique identifier such as a unique Media Access Control (MAC) address. Thus, each of the example interfaces,,may be addressed separately, and the example primary zone playermay communicate using any or all of the interfaces,,simultaneously if so desired.
1200 1208 1210 1208 1204 1206 1208 1204 1208 1204 1208 1206 406 1204 1206 12 FIG. The example primary zone playeroffurther includes a control interfaceand an audio interface. The control interfacetransmits and/or receives control information (e.g., configuration information) wirelessly, for example, via the first and second spectrum interfaces,. For example, the control interfacemay communicate configuration information to one or more satellite zone players and/or communicate configuration information to one or more other zone players via the first spectrum interface. In some examples, the control interfacereceives configuration information via the first spectrum interfacefrom other zone players. The example control interfaceadditionally or alternatively communicates control information (e.g., channel probes, keep-alive probes, etc.) to satellite zone players via the second spectrum interface. It is also understood that the wired interfacecould instead (or in addition to the wireless interfacesand/or) to transmit and receive control information.
1210 406 412 1204 1206 1210 1210 1204 1204 1206 1210 1208 12 FIG. The example audio interfaceoftransmits audio information and/or receives audio information via the interfaces,,,. For example, the audio interfacemay receive digital audio information from an Internet source, from a local networked source (e.g., a computer via a LAN), and/or from another home theater component such as a video device (e.g., a television set, a cable box, an optical media player (DVD, Blu-ray disc, etc.), a digital media player, a video game console) and/or any other type of audio source. The example audio interfacefurther transmits received audio information to one or more zone players, including standard zone players (e.g., via line-out connection such as RCA or optical output, or via a mesh network via the first spectrum interface, such as a 2.4 GHZ interface) and/or satellite zone players (e.g., via a star network via the first spectrum interfaceand/or the second spectrum interface). In some examples, the audio interfacetransmits the audio information based on control information provided by the control interface.
1202 1212 1212 1204 1212 1204 1212 1206 To control which channels are used in the first and second spectra, the example network interfacefurther includes a channel selector. The example channel selectorselects channels in the first spectrum. The example first spectrum interfacetransmits and/or receives information via the selected channel. In some examples, the channel is selected by a different device (e.g., an external device such as another zone player), and the channel selectoris provided with the channel information via the first spectrum interface. The example channel selectoralso selects channels for use in the second spectrum. The second spectrum interfacetransmits and receives data via the selected channel(s) in the second spectrum.
1212 1208 1208 1212 1206 1210 In some examples, the currently selected channel may become unsuitable for low latency audio, and another, more suitable channel is available. The example channel selectormay select a new channel within the same spectrum (e.g., the 5 GHz spectrum) and provide the channel information to the control interface. The example control interfacegenerates and sends a probe or other configuration information, including a command to switch channels and the new channel information, to any connected satellite zone players. The example channel selectorthen causes the second spectrum interfaceto change to the new selected channel. The audio interfacemay then continue to transmit audio information on the new selected channel.
1212 1208 1208 1210 In some examples, the currently selected channel may become unsuitable for low latency audio, and no other suitable channels are available within the same spectrum. The example channel selectormay select a channel within a different spectrum (e.g., the 2.4 GHz spectrum) and provide the channel information to the control interface. The example control interfacegenerates and sends a probe or other configuration information, including a command to switch channels and the new channel and spectrum information, to any connected satellite zone players. The audio interfacemay then continue to transmit audio information on the new channel of the different spectrum
1200 1214 1200 410 1200 1214 1214 1200 12 FIG. 12 FIG. The example primary zone playeroffurther includes a spanning tree protocol controllerto control a spanning tree protocol configuration. Spanning tree protocol refers to a network protocol that structures a network to avoid bridge loops by, in general, 1) designating a root node, 2) calculating the least cost path from other nodes to the root node, and 3) disabling other paths. The example primary zone playerofadvantageously uses spanning tree protocol to communicate with satellite zone players and/or other zone players in a mesh network. The use of spanning tree protocol enables the delivery of low-latency audio by determining shortest paths between points and by reducing (e.g., avoiding) unnecessary hops of the low-latency audio data between zone players. An example spanning tree protocol configuration may be a spanning tree protocol table (e.g., stored in the memory) that includes the ports and/or devices to which the example primary zone playeris connected. The example spanning tree protocol controllerreconfigures the spanning tree protocol table when additional zone players are added and/or when configurations of zone players change. For example, the spanning tree protocol controllerchanges the spanning tree protocol table when the primary zone playerdisconnects from a satellite zone player (e.g., connected via a mesh networking mode) and reconnects to the same satellite zone player in a different networking mode (e.g., a star networking mode).
1208 1204 1208 1200 1200 1210 1210 In an example mode of operation, the control interfaceinitially (e.g., on startup, on adding a satellite zone player to a zone player network) communicates with one or more satellite zone players via the first spectrum interface. The control interfacetransmits control information to the satellite zone player(s) via a selected channel in the first spectrum. The example control information includes at least a selected channel in the second spectrum and an identifier of the primary zone player(e.g., to differentiate the primary zone playerfrom any other primary zone players that may be on the same network). After transmitting the control information (and, in some embodiments, receiving acknowledgement from the satellite zone player(s)), the example audio interfacemay begin transmitting audio information to the satellite zone player(s). In some embodiments, the audio interfacetransmits zone player specific audio information to each of multiple zone players (e.g., left surround channel audio to a zone player configured as a left surround speaker, right surround channel audio to a zone player configured as a right surround speaker).
1208 1200 1204 1212 12 FIG. Continuing with the example, if the selected channel in the second spectrum is inadequate (e.g., too much interference, too much latency, etc.), the example control interfaceoftransmits control information to the satellite zone players to cause the satellite zone players to revert to communicating with the primary zone playervia a channel in the first spectrum and the first spectrum interface. The example channel selectorselects a different channel in the second spectrum and transmits control information to the satellite zone players identifying the newly selected channel.
13 FIG. 12 FIG. 13 FIG. 1 FIG. 7 FIG. 1300 1200 1300 102 124 704 1200 116 118 120 shows an internal functional block diagram of an example satellite zone playerto provide low latency audio in combination with the example primary zone playerof. The example satellite zone playerofmay be used to implement any of the example zone players-ofand/or the playback devicesin. In some embodiments, the example primary zone playermay be used to implement any of the home theater zone players,,and/or may be satellite speakers (e.g., left/right surround speakers, subwoofers, etc.) to complement a sound bar-type surround sound configuration.
400 1200 1300 408 410 412 414 416 418 420 1300 1302 1304 406 406 4 FIG. 12 FIG. 13 FIG. 13 FIG. Like the example zone playerofand the example primary zone playerof, the example satellite zone playerofincludes a processor, memory, an audio processing component, a module, an audio amplifier, speakers, and one or more antenna(s). These components are discussed in detail above. More or less components may be included depending on the desired configuration. The example satellite zone playerofincludes a network interfacehaving a dual spectrum interface(Spectrum 1/2 interface) to communicate via a first wireless spectrum (e.g., the 2.4 GHz spectrum) and a second wireless spectrum different from the first wireless spectrum (e.g., the 5 GHz spectrum), and a wired interface. The wired interfaceis discussed above.
12 13 FIGS.and In the examples of, the dual spectrum
1304 1204 1206 1304 1204 1206 1304 1204 1206 12 FIG. interfacecommunicates in the same two spectra as the first and second wireless interfaces,. The example dual spectrum interfacemay communicate in either the first spectrum (e.g., on a wireless channel in the first spectrum) or the second spectrum (e.g., on a wireless channel in the second spectrum) at a given time. In some other examples, the dual spectrum wireless interfaces,may communicate in both spectra simultaneously or substantially simultaneously. In some examples, the dual spectrum interfaceis replaced with separate first and second wireless interfaces, which may be similar or identical to the first and second wireless interfaces,of. In some embodiments, each wireless interface is assigned a unique address (e.g., a MAC address).
1300 1306 1308 1306 1308 1304 406 1306 1200 1300 1200 1306 1300 1200 13 FIG. The example satellite zone playeroffurther includes a control interfaceand an audio interface. The control interfaceand the audio interfacetransmit and/or receive information (e.g., control information, audio information) via the dual spectrum interfaceand/or the wired interface. The example control interfacereceives control information via a channel (e.g., from the primary zone playerof FIG. 12) in a first spectrum (e.g., Spectrum 1). The control information via the first spectrum indicates an audio channel in a second spectrum (e.g., Spectrum 2) via which audio information in addition to control information is to be transmitted to the satellite zone playerfrom the primary zone player. The example control interfacealso transmits control information, such as probe acknowledgements, configuration information, device information, and/or other information used for control and/or configuration of the satellite zone playerto a primary zone playerand/or to standard zone players.
1308 418 1304 1308 13 FIG. The example audio interfaceofreceives audio information to be played (e.g., via the speakers). The audio information may be received via an audio channel via which the dual spectrum interfaceis communicating. In some examples, the audio interfacedetermines that the received audio information has a high quality of service characteristic and/or a low latency, indicating that the audio is to be played as soon as possible and/or at a designated time.
1302 1310 1310 1304 13 FIG. The example network interfaceoffurther includes a channel selector. The example channel selectorselects a wireless communications channel in the first spectrum or the second spectrum and causes the dual spectrum interfaceto begin communicating on the selected channel.
1306 1306 1310 1304 1306 1306 1310 1304 In some examples, the control interfacemay receive a notification from a primary zone player that communication is to be changed to a different channel in the second spectrum (e.g., the 5 GHz spectrum). This may occur if, for example, the channel being used has become unsuitable for low-latency audio and another, more suitable channel is available. The example control interfaceprovides the new channel to the channel selector, which causes the dual spectrum interfaceto change channels within the same spectrum. In some examples, the control interfacemay receive a notification from a primary zone player that communication is to be changed to a channel in the first spectrum (e.g., the 2.4 GHz spectrum). This may occur if, for example, the channel being used has become unsuitable for low-latency audio and no other more suitable channel is available in the second spectrum (e.g., 5 GHz spectrum). The example control interfaceprovides the channel to the channel selector, which causes the dual spectrum interfaceto change channels and spectrum.
1300 1312 410 1300 1312 1300 1312 1300 The example satellite zone playerfurther includes a spanning tree protocol controllerto control a spanning tree protocol configuration based on receiving control information (e.g., from a primary zone player). For example, a spanning tree protocol configuration may be a spanning tree protocol table (e.g., stored in the memory) that includes the ports and/or devices to which the example satellite zone playeris connected. The example spanning tree protocol controllerreconfigures the spanning tree protocol table when additional zone players are added and/or when configurations of the satellite zone playerchanges. For example, the spanning tree protocol controllerchanges the spanning tree protocol table when the satellite zone playeris reconfigured to connect to a primary zone player in a different networking mode (e.g., reconfigured from a mesh networking mode to a star networking mode).
14 FIG. 1400 1402 1404 1406 1408 1410 1412 1414 1402 1404 1406 shows a block diagram of an example zone player networkin which a primary zone playeris connected to satellite zone players,via a star networkand to additional zone players,via a mesh network. The example primary zone playerand the example satellite zone players,may be configured in a home theater arrangement in a single location (e.g., in the same room).
1410 1412 1402 1406 1400 700 1402 702 1404 1406 704 1410 1412 706 1408 718 1414 720 7 FIG. 7 FIG. The example zone players,may be located in the same or different rooms from the example primary and/or satellite zone players-. In an embodiment, the zone player networkis like the networkin, where the primary zone playeris like the playback device, the satellite zone players,are like the playback devices, and the additional zone players,are like playback devices. Additionally, in the embodiment, the star networkis like the HT networkinand the mesh networkis like the ZG network.
1402 1410 1412 1402 1410 1412 1414 1414 1402 1410 1412 1414 1402 1410 1412 1402 1410 1412 1404 1406 1408 14 FIG. The example primary zone playerand the zone players,are communicatively coupled via a first spectrum (e.g., a 2.4 GHz spectrum channel). The example zone players,,in the mesh networkmay communicate and/or synchronize audio information and/or control information as described in U.S. Pat. No. 8,234,395, entitled, “System and method for synchronizing operations among a plurality of independently clocked digital data processing devices,” the entirety of which is hereby incorporated by reference. At the direction of a user, additional zone players may be added to and/or removed from the mesh network; any of the zone players,,may be removed from and/or rejoined to the mesh network; and/or any combination of the primary zone player, the zone player, and/or the zone playermay be grouped into zone groups for synchronized playback of audio. In the example of, a user may choose to cause audio provided to the primary zone playerthat is part of a zone group with one or more of the zone player(s),to also be played via the satellite zone players,in the star network.
14 FIG. 1402 1416 1418 1404 1406 1416 1418 1402 1404 1406 1404 1406 1404 1406 In the example of, in which the primary zone playeris coupled via respective star wireless connections,to the satellite zone players,. The establishment of the example connections,is described in more detail below. The example primary zone playerand the satellite zone players,are communicatively coupled via a second spectrum (e.g., a 5 GHz spectrum channel). The satellite zone players,may be, for example, left and right rear surround speakers such as the Sonos PLAY:3™ zone player. In some examples, one or more of the satellite zone players,is a subwoofer zone player such as the Sonos SUB zone player.
1402 1410 1412 1408 1402 1404 1406 1402 1404 1406 1402 1404 1406 The primary zone playerreceives audio information from an audio source (e.g., a television, a networked source, an Internet source, via the zone player(s),, etc.) to be played via the star network. The example primary zone playerdetermines the audio information to be transmitted to respective ones of the example satellite zone players,. For example, the Dolby®). Digital family of surround sound technology provides for audio channels corresponding to surround (e.g., rear) speakers. The primary zone playertransmits, via the second spectrum (e.g., 5 GHz spectrum) audio channel, the respective audio information to the satellite zone players,, which decode and play the audio information. In an embodiment, the primary zone playerincludes timing information along with the audio information to the satellite zone players,.
14 FIG. 1402 1402 1404 1406 1402 1410 1412 1402 1410 1412 1402 0 1404 1406 1410 1412 1408 In the example of, the primary zone playerassigns different quality of service indicators to different types of information. For example, the primary zone playermay assign the highest quality of service (e.g., quality of service level 3) to audio information requiring a low latency (e.g., less than a threshold latency) that is transmitted to the satellite zone players,. The primary zone playermay assign a next highest quality of service (e.g., quality of service level 2) to audio information from one or more predetermined types of sources (e.g., a wired source, a home theater source, etc.) that is not transmitted to a satellite zone player (e.g., is transmitted to the zone players,). The example primary zone playerassigns a next highest quality of service (e.g., quality of service level 1) to audio information from other types of sources (or any source, if quality of service level 2 is not used) that is not transmitted to a satellite zone player (e.g., is transmitted to the zone players,). The example primary zone playeruses the lowest quality of service (e.g., quality of service level) to transmit background data (e.g., non-audio information, control information, configuration information, etc.) lo other zone players (e.g., zone players,,,). The use of quality of service enables the star networkto provide low-latency audio, which improves user perception of a home theater presentation and reduces lag between video and audio to acceptable levels.
The description discloses various example systems, methods, apparatus, and articles of manufacture including, among other components, firmware and/or software executed on hardware. However, such examples are merely illustrative and should not be considered as limiting. For example, it is contemplated that any or all of these firmware, hardware, and/or software components can be embodied exclusively in hardware, exclusively in software, exclusively in firmware, or in any combination of hardware, software, and/or firmware. Accordingly, while the following describes example systems, methods, apparatus, and/or articles of manufacture, the examples provided are not the only way(s) to implement such systems, methods, apparatus, and/or articles of manufacture.
Additionally, reference herein to “embodiment” means that a particular feature, structure, or characteristic described in connection with the embodiment can be included in at least one example embodiment of the invention. The appearances of this phrase in various places in the specification are not necessarily all referring to the same embodiment, nor are separate or alternative embodiments mutually exclusive of other embodiments. As such, the embodiments described herein, explicitly and implicitly understood by one skilled in the art, can be combined with other embodiments.
The specification is presented largely in terms of illustrative environments, systems, procedures, steps, logic blocks, processing, and other symbolic representations that directly or indirectly resemble the operations of data processing devices coupled to networks. These process descriptions and representations are typically used by those skilled in the art to most effectively convey the substance of their work to others skilled in the art. Numerous specific details are set forth to provide a thorough understanding of the present disclosure. However, it is understood to those skilled in the art that certain embodiments of the present disclosure can be practiced without certain, specific details. In other instances, well known methods, procedures, components, and circuitry have not been described in detail to avoid unnecessarily obscuring aspects of the embodiments. Accordingly, the scope of the present disclosure is defined by the appended claims rather than the forgoing description of embodiments.
When any of the appended claims are read to cover a purely software and/or firmware implementation, at least one of the elements in at least one example is hereby expressly defined to include a tangible medium such as a memory, DVD, CD, Blu-ray, and so on, storing the software and/or firmware.
Cooperative Patent Classification codes for this invention. Click any code to explore related patents in that topic.
September 25, 2025
April 30, 2026
Browse 5M+ US patents with plain-English claim translations and AI-generated analysis.