Methods and systems for managing telemetry data for endpoint devices are disclosed. Telemetry data may be generated by sensors distributed among sub-systems of an endpoint device. A management controller of the endpoint device may collect the telemetry data from the sensors without traversing in band components of the sub-systems. Therefore, the telemetry data may be aggregated by the management controller regardless of a power or operational state of hardware resources of the sub-systems. The management controller may provide the aggregated telemetry data to a remote server according to a subscription indicated by the remote server.
Legal claims defining the scope of protection, as filed with the USPTO.
. A method of managing an endpoint device that comprises sub-systems, the method comprising:
. The method of, further comprising:
. The method of, wherein the sub-systems comprise at least one device selected from a list of devices consisting of:
. The method of, wherein the third device is located remote to the management controller and the third portion of the telemetry data is obtained from the third device according to a subscription selected by the management controller.
. The method of, wherein providing the aggregated telemetry data comprises:
. The method of, wherein providing the aggregated telemetry data further comprises:
. The method of, wherein the plurality of sensors is distributed across the sub-systems of the endpoint device.
. The method of, wherein the telemetry data is obtained when hardware resources of the endpoint device are unpowered.
. The method of, wherein the endpoint device comprises a network module adapted to separately advertise network endpoints for the management controller and hardware resources of the endpoint device, the network endpoints being usable by the remote server to address communications to the hardware resources and the management controller.
. The method of, wherein the management controller and the network module are on separate power domains from the hardware resources so that the management controller and the network module are operable while the hardware resources are inoperable.
. The method of, wherein the out of band communication channel runs through the network module, and an in band communication channel that services the hardware resources also runs through the network module.
. The method of, wherein the network module hosts a transmission control protocol/internet protocol (TCP/IP) stack to facilitate network communications via the out of band communication channel.
. The method of, further comprising:
. A non-transitory machine-readable medium having instructions stored therein, which when executed by a processor, cause the processor to perform operations for managing an endpoint device that comprises sub-systems, the operations comprising:
. The non-transitory machine-readable medium of, further comprising:
. The non-transitory machine-readable medium of, wherein the sub-systems comprise at least one device selected from a list of devices consisting of:
. The non-transitory machine-readable medium of, wherein the third device is located remote to the management controller and the third portion of the telemetry data is obtained from the third device according to a subscription selected by the management controller.
. A data processing system, comprising:
. The data processing system of, further comprising:
. The data processing system of, wherein the sub-systems comprise at least one device selected from a list of devices consisting of:
Complete technical specification and implementation details from the patent document.
Embodiments disclosed herein relate generally to managing telemetry data. More particularly, embodiments disclosed herein relate to systems and methods to manage telemetry data across sub-systems of an endpoint device using a management controller of the endpoint device.
Computing devices may provide computer-implemented services. The computer-implemented services may be used by users of the computing devices and/or devices operably connected to the computing devices. The computer-implemented services may be performed with hardware components such as processors, memory modules, storage devices, and communication devices. The operation of these components and the components of other devices may impact the performance of the computer-implemented services.
Various embodiments will be described with reference to details discussed below, and the accompanying drawings will illustrate the various embodiments. The following description and drawings are illustrative and are not to be construed as limiting. Numerous specific details are described to provide a thorough understanding of various embodiments. However, in certain instances, well-known or conventional details are not described in order to provide a concise discussion of embodiments disclosed herein.
Reference in the specification to “one embodiment” or “an embodiment” means that a particular feature, structure, or characteristic described in conjunction with the embodiment can be included in at least one embodiment. The appearances of the phrases “in one embodiment” and “an embodiment” in various places in the specification do not necessarily all refer to the same embodiment.
References to an “operable connection” or “operably connected” means that a particular device is able to communicate with one or more other devices. The devices themselves may be directly connected to one another or may be indirectly connected to one another through any number of intermediary devices, such as in a network topology.
In general, embodiments disclosed herein relate to methods and systems for managing endpoint devices. The endpoint devices may include any number of sub-systems (e.g., data processing systems, devices) that may provide computer-implemented services. The computer-implemented services may include any quantity and type of such services. The computer-implemented services may be provided as a result of operation of the sub-systems, which may depend on hardware resources and/or other components operating as intended. Therefore, monitoring operation of the sub-systems may support timely intervention in the event of undesired operation of the sub-systems and, therefore, may support ongoing provision of the computer-implemented services as desired by downstream consumers of the computer-implemented services.
To monitor the operation of the sub-systems, an endpoint device may include any number of sensors distributed across the sub-systems and each sensor may generate operation data indicating operation of a sub-system of the sub-systems. The operation data may include, for example, telemetry data that may be aggregated by the endpoint device. Portions of the telemetry data may be provided to remote entities (e.g., a remote server) for use in monitoring the operation of the endpoint device.
However, in band components of the endpoint device (e.g., hardware resources, in band communication channels) may be subject to failures, which may interfere with obtaining the telemetry data from the sensors. For example, hardware and/or software components of the endpoint device may experience: (i) power failures, (ii) compromise due to malware, (iii) network connectivity issues, and/or (iv) other issues. Therefore, the in band components may not be reliable and/or available to aggregate the telemetry data as desired by the remote server. Unavailability of the telemetry data may result in delayed intervention following undesired operation of at least a portion of the sub-systems. Consequently, computer-implemented services provided by the endpoint device may be interrupted, may be provided at a lower quality, and/or may otherwise be undesirably impacted.
To increase a reliability of telemetry data aggregation and distribution, embodiments disclosed herein relate to methods and systems for managing telemetry data across sub-systems of an endpoint device using a management controller and an out of band communication channel. The management controller and a network module that manages communication channels for the data processing system may be powered separately from hardware resources of the endpoint device. Therefore, the management controller may obtain the telemetry data via data transmission paths that do not include the sub-systems while the hardware resources of the endpoint device are unpowered, compromised, and/or otherwise unavailable. The management controller may aggregate the telemetry data to obtain aggregated telemetry data and may provide at least a portion of the aggregated telemetry data to the remote server via an out of band communication channel that does not traverse in band components of the endpoint device.
By doing so, embodiments disclosed herein may provide a system for managing telemetry data for an endpoint device using out of band methods. A management controller of the endpoint device may respond to requests for telemetry data (e.g., as part of subscriptions indicated by a remote server) regardless of a state (e.g., a power state, a connectivity state, a compromise state) of hardware resources of the endpoint device and without intervention by a user of the endpoint device. Thus, the endpoint device may be more likely to provide computer-implemented services as desired to downstream consumers of the computer-implemented services.
In an embodiment, a method of managing an endpoint device that includes sub-systems is provided. The method may include: obtaining, by a management controller of the endpoint device, telemetry data from a plurality of sensors, the telemetry data indicating operation of the sub-systems and the telemetry data being obtained via a data transmission path that excludes the sub-systems; aggregating, by the management controller, the telemetry data to obtain aggregated telemetry data; and providing, by the management controller and via an out of band communication channel, the aggregated telemetry data to a remote server.
The method may also include: obtaining, by the management controller and via the out of band communication channel, a response to the aggregated telemetry data; and providing, by the management controller, computer-implemented services based at least in part on the response.
The sub-systems may include at least one device selected from a list of devices consisting of: a first device that provides a first portion of the telemetry data to the management controller via a physical interface; a second device that provides a second portion of the telemetry data to the management controller via a side band communication channel of the endpoint device; and a third device that provides a third portion of the telemetry data to the management controller via the out of band communication channel.
The third device may be located remote to the management controller and the third portion of the telemetry data may be obtained from the third device according to a subscription selected by the management controller.
Providing the aggregated telemetry data may include: providing, by the management controller and via the out of band communication channel, a first portion of the telemetry data to the remote server according to a subscription selected by the remote server.
Providing the aggregated telemetry data may also include: identifying, by the management controller, that a second portion of the telemetry data comprises urgent telemetry data; and providing, by the management controller and via the out of band communication channel, the second portion of the telemetry data to the remote server, the second portion of the telemetry data not being provided to the remote server according to the subscription.
The plurality of sensors may be distributed across the sub-systems of the endpoint device.
The telemetry data may be obtained when hardware resources of the endpoint device are unpowered.
The endpoint device may include a network module adapted to separately advertise network endpoints for the management controller and hardware resources of the endpoint device, the network endpoints being usable by the remote server to address communications to the hardware resources and the management controller.
The management controller and the network module may be on separate power domains from the hardware resources so that the management controller and the network module are operable while the hardware resources are inoperable.
The out of band communication channel may run through the network module, and an in band communication channel that services the hardware resources may also run through the network module.
The network module may host a transmission control protocol/internet protocol (TCP/IP) stack to facilitate network communications via the out of band communication channel.
The method may also include: prior to obtaining the telemetry data: advertising, by the management controller and via the out of band channel, access to operation data for the endpoint device; obtaining, by the management controller and via the out of band channel, a subscription request from the remote server that is responsive to the advertising, the subscription request indicating that the remote server desires access to the operation data; and establishing, by the management controller and based on the subscription request, a subscription for the aggregate telemetry data for the remote server.
In an embodiment, a non-transitory media is provided. The non-transitory media may include instructions that when executed by a processor cause the computer-implemented method to be performed.
In an embodiment, a data processing system is provided. The data processing system may include the non-transitory media and a processor, and may perform the method when the computer instructions are executed by the processor.
Turning to, a block diagram illustrating a distributed environment (e.g., system) in accordance with an embodiment is shown. The system shown inmay provide for management of endpoint devices that may provide, at least in part, computer-implemented services. The system may include any number of endpoint devices(e.g., computing devices) that may each include any number of sub-systems (e.g., devices, data processing systems). Each of the sub-systems may include hardware components (e.g., processors, memory modules, storage devices, communications devices). The hardware components may support execution of any number and types of applications (e.g., software components). Refer tofor additional details regarding the sub-systems.
The computer-implemented services may include any type and quantity of computer-implemented services. The computer-implemented services may include, for example, database services, data processing services, electronic communication services, and/or any other services that may be provided using one or more computing devices. The computer-implemented services may be provided by, for example, endpoint devices, remote serverand/or any other type of devices (not shown in). Other types of computer-implemented services may be provided by the system shown inwithout departing from embodiments disclosed herein.
To provide the aforementioned computer-implemented services, any of endpoint devices(e.g.,A,N) may need to operate in a predetermined manner. For example, certain hardware components and/or software components of the sub-systems may need to be operational for endpoint devicesto provide a desired type and/or quantity of computer-implemented services.
An endpoint device (e.g.,A) may include any number of sensors tasked with monitoring operation of the sub-systems. While providing the computer-implemented services, the sensors may collect operation data that may include telemetry data (e.g., metrics, events, logs, traces). The telemetry data may be used (e.g., locally, by remote server) to monitor performance of endpoint deviceA. For example, remote servermay request that certain types and/or quantities of the telemetry data be provided in accordance with a subscription.
By monitoring the performance of endpoint deviceA, undesired operation of any of the sub-systems may be efficiently identified and remedied. In addition, monitoring performance of endpoint deviceA may provide useful insights for potential improvement of the operation of the sub-systems and, therefore, potential improvement in a quality and/or reliability of the computer-implemented services.
However, the in band components of one or more of the sub-systems (e.g., hardware components, the operating system, in band communication channels) may be compromised, inoperable, depowered, and/or otherwise unavailable for use by a user. Therefore, endpoint deviceA may be unable to collect and aggregate telemetry data from each sub-system. Consequently, provision of the telemetry data to remote server(and/or any other entity) may be interrupted thereby causing potential delays and/or a reduction in quality of the computer-implemented services.
In general, embodiments disclosed herein relate to systems, devices, and methods for managing endpoint devices that include sub-systems using out of band methods so that telemetry data may be aggregated and provided to remote systems regardless of a power or operational state of the endpoint devices. Endpoint devicesmay include out of band components (e.g., a management controller) that may communicate with remote systems (e.g., remote server) without traversing in band communication channels and without utilizing the in band components of the sub-systems (e.g., hardware resources of the sub-systems).
To do so, the sub-systems of the endpoint device may host any number of sensors that monitor operation of the endpoint device by collecting operation data. The operation data may include telemetry data such as event logs and/or other metrics usable to investigate performance of the endpoint device. The management controller may communicate with the sensors via transmission paths that exclude the sub-systems. The management controller may request a list of available operation data from each sensor.
Following an identification of available types of operation data, the management controller may advertise, via an out of band communication channel, the available operation data to a remote server (e.g., remote server). Remote servermay provide a response indicating portions of the available operation data (e.g., telemetry data) that are of interest to remote server. The management controller may establish a subscription for remote serverbased on the requested telemetry data.
The management controller may obtain the telemetry data indicated by the subscription from the sensors and may aggregate the telemetry data to obtain aggregated telemetry data. The management controller may provide the aggregated telemetry data to remote servervia the out of band communication channel.
Prior to providing the telemetry data to remote server, the management controller may determine whether the telemetry data includes urgent telemetry data (e.g., indicating an operational failure, suspected compromise of a sub-system). If the telemetry data includes urgent telemetry data, the management controller may provide aggregated telemetry data that includes the urgent telemetry data to remote serveras it is identified rather than following a schedule indicated by the subscription.
If the telemetry data does not include urgent telemetry data, the management controller may provide the aggregated telemetry data to remote serveraccording to a schedule indicated by the subscription.
To perform the above-mentioned functionality, the system ofmay include endpoint devices, remote server, and/or other components. Endpoint devices, remote server, and/or any other type of devices not shown inmay perform all, or a portion of the computer-implemented services independently and/or cooperatively. Each of these components is discussed below.
Endpoint devicesmay include any number and/or type of endpoint devices (e.g.,A-N). Any of endpoint devicesmay include sub-systems that may include in band components (e.g., hardware resources). Any of endpoint devicesmay also include sub-systems that may include in band components (e.g., hardware resources) and functionality that may allow the out of band components to interact with remote systems independently from the in band components. For more information regarding out of band components of endpoint devices, refer to the discussion of.
Endpoint devicesmay also include any number of sensors positioned to monitor operation of the sub-systems. The sensors may collect operation data (e.g., telemetry data) and may provide portions of the operation data to the management controller (e.g., upon request, according to a subscription). Refer tofor additional information regarding sub-systems and sensors.
Remote servermay be implemented using physical devices located remote to endpoint devicesthat monitor operation of endpoint devices. Remote servermay subscribe to telemetry data (e.g., portions of the operation data advertised by the management controller) and may receive aggregated telemetry data via the out of band communication channel. Remote servermay utilize the aggregated telemetry data to make decisions, diagnose performance issues, improve performance of endpoint devices, and/or for other purposes.
Thus, telemetry data indicating operation of endpoint devicesmay be provided to remote serverusing out of band methods. The telemetry data may be obtained and more reliably provided to remote serverwhen implemented using the out of band methods versus in band methods that may rely on availability (e.g., operability, security) of in band components and in band communication channels. The telemetry data may be used to diagnose performance issues of endpoint devicesand/or to improve performance thereby improving a quality and/or reliability of the computer-implemented services provided by endpoint devices.
When providing their functionality, any of endpoint devices, remote server, and/or other devices may perform all, or a portion of the method shown in.
Any of (and/or components thereof) endpoint devices, and/or remote servermay be implemented using a computing device (also referred to as a data processing system) such as a host or a server, a personal computer (e.g., desktops, laptops, and tablets), a “thin” client, a personal digital assistant (PDA), a Web enabled appliance, a mobile phone (e.g., smartphone), an embedded system, local controllers, an edge node, and/or any other type of data processing device or system. For additional details regarding computing devices, refer to the discussion of.
In an embodiment, one or more of endpoint devicesand/or remote serverare implemented using an internet of things (IoT) device, which may include a computing device. The IoT device may operate in accordance with a communication model and/or management model known to endpoint devices, remote server, and/or other devices.
Any of the components illustrated inmay be operably connected to each other (and/or components not illustrated) with communication system. In an embodiment, communication systemincludes one or more networks that facilitate communication between any number of components. The networks may include wired networks and/or wireless networks (e.g., and/or the Internet). The networks may operate in accordance with any number and/or types of communication protocols (e.g., such as the internet protocol).
Communication systemmay be implemented with one or more local communications links (e.g., a bus interconnecting a processor of any of endpoint devicesand remote server).
Communication systemmay include out of band communication channels, in band communication channels, and/or other types of communication channels.
Refer tofor additional details regarding the management controller, network module, out of band communication channel, and/or hardware resources of endpoint devices.
While illustrated inas including a limited number of specific components, a system in accordance with an embodiment may include fewer, additional, and/or different components than those illustrated therein.
Unknown
October 30, 2025
Browse 5M+ US patents with plain-English claim translations and AI-generated analysis.