10535343

Implementations for Voice Assistant on Devices

Published: January 14, 2020
Assignee: not available in USPTO data we have

Patent Claims
20 claims

Legal claims defining the scope of protection. Each claim is shown in both the original legal language and a plain English translation.

Claim 1

Original Legal Text

1. A method, comprising: at an electronic device having a first device type and comprising an audio input system, one or more processors, and memory storing one or more programs for execution by the one or more processors: downloading a device-agnostic voice assistant library configured to execute across a plurality of different electronic device types, including the first device type, wherein the voice-assistant library includes a plurality of voice processing modules, each of the voice processing modules providing one or more voice processing operations that are accessible to application programs executing or executable on the different electronic device types; configuring the device-agnostic voice assistant library to execute on the electronic device based on the electronic device having the first device type, including: selecting an implementation for the voice assistant library based on the electronic device having the first device type, wherein the implementation for the voice assistant library is selected from a group consisting of: in an application installed on the electronic device, in an operating system of the electronic device, and in firmware of the electronic device; after the configuring, receiving, via a microphone of the audio input system, a verbal input from a user; extracting request information from the verbal input by processing the verbal input using the device-agnostic voice assistant library executing on the electronic device; transmitting a request to a remote system, the request including the extracted request information; receiving a response to the request, wherein the response is generated by the remote system in accordance with the extracted request information; and performing an operation in accordance with the response by one or more voice processing modules of the configured voice assistant library.

Plain English translation pending...
Claim 2

Original Legal Text

2. The method of claim 1, wherein at least some voice processing operations associated with the voice processing modules are performed on the remote system, which is interconnected with the electronic device via a wide area network.

Plain English Translation

This claim relates to distributed voice processing, in which some voice processing tasks are offloaded from an electronic device to a remote system over a wide area network. The method of claim 1 runs on an electronic device whose voice assistant library contains voice processing modules. Under this claim, at least some of the operations associated with those modules are performed on the remote system, which is connected to the device over a wide area network such as the internet. Offloading work in this way can reduce the processing burden on the local device and conserve its resources, which is particularly useful for mobile or low-power devices that lack the computational power for advanced voice processing. The remote system may take on specialized or resource-intensive tasks, such as speech recognition or voice synthesis, while simpler operations remain on the device. The claim does not specify which operations run remotely or how the split is decided.
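The local/remote split described above can be sketched as a small router. This is a minimal illustration, not the patented implementation: the `OperationRouter` class, the operation names, and the `remote_call` callback are all hypothetical, and the callback stands in for a network request to the remote system.

```python
from typing import Callable, Dict, Set

LocalOperation = Callable[[str], str]
RemoteCall = Callable[[str, str], str]


class OperationRouter:
    """Runs an operation on the device if registered locally, else remotely."""

    def __init__(self, remote_call: RemoteCall) -> None:
        self._local: Dict[str, LocalOperation] = {}
        self._remote: Set[str] = set()
        # Hypothetical stand-in for a request sent over a wide area network.
        self._remote_call = remote_call

    def register_local(self, name: str, operation: LocalOperation) -> None:
        self._local[name] = operation

    def register_remote(self, name: str) -> None:
        self._remote.add(name)

    def run(self, name: str, payload: str) -> str:
        if name in self._local:
            return self._local[name](payload)
        if name in self._remote:
            return self._remote_call(name, payload)
        raise KeyError(f"unknown voice processing operation: {name}")
```

Which operations are registered as local or remote is a deployment decision in this sketch; the claim itself only requires that at least some run on the remote system.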

Claim 3

Original Legal Text

3. The method of claim 1, wherein processing the verbal input comprises performing speech processing on the verbal input, and the speech processing is performed by a module of the voice processing modules of the voice assistant library.

Plain English Translation

A voice assistant system processes verbal inputs from users to perform tasks such as executing commands, retrieving information, or controlling devices. A challenge in such systems is efficiently handling and interpreting verbal inputs to ensure accurate and timely responses. This invention addresses this by using a modular voice processing architecture within a voice assistant library. The system includes multiple voice processing modules, each specialized for different tasks like speech recognition, natural language understanding, or command execution. When a verbal input is received, the system processes it by routing it through one or more of these modules. Specifically, the speech processing module within the voice assistant library performs speech recognition to convert the verbal input into a machine-readable format. This modular approach allows for flexible and scalable processing, enabling the system to adapt to different types of verbal inputs and improve performance over time. The use of a dedicated module for speech processing ensures that the system can accurately interpret user commands, even in noisy environments or with varying speech patterns. This enhances the overall reliability and usability of the voice assistant system.

Claim 4

Original Legal Text

4. The method of claim 1, wherein processing the verbal input comprises performing audio input processing on audio data of the verbal input, and the audio input processing is performed by a module of the voice processing modules of the voice assistant library.

Plain English Translation

A voice assistant system processes verbal inputs from users to perform tasks or provide information. A challenge in such systems is efficiently handling and interpreting audio data to accurately execute user commands. This invention addresses this by using a modular voice processing architecture. The system includes a voice assistant library containing multiple voice processing modules, each specialized for different tasks like speech recognition, natural language understanding, or command execution. When a user provides a verbal input, the system processes the audio data through one or more of these modules. Specifically, the audio input processing is performed by a dedicated module within the voice assistant library, ensuring that the audio data is analyzed and interpreted correctly before further actions are taken. This modular approach allows for flexibility, scalability, and improved accuracy in handling verbal commands. The system can dynamically select or combine modules based on the nature of the input, optimizing performance and user experience. By separating audio processing into distinct modules, the system can be updated or customized without overhauling the entire architecture, making it adaptable to new technologies or user needs.

Claim 5

Original Legal Text

5. The method of claim 1, wherein performing an operation in accordance with the response comprises decoding audio, and the audio decoding is performed by a module of the voice processing modules of the voice assistant library.

Plain English Translation

A voice assistant system includes a voice assistant library with multiple voice processing modules, each handling a specific task such as speech recognition, natural language processing, or audio decoding. In the method of claim 1, the device sends request information extracted from the user's verbal input to a remote system and then performs an operation in accordance with the response. Under this claim, performing that operation includes decoding audio, and the decoding is done by one of the library's voice processing modules rather than by device-specific code. The module may use standard or proprietary algorithms to convert encoded audio into a usable format, for example so that the device can play it back. The claim does not name a codec or say what the decoded audio contains.
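As a rough illustration of what an audio decoding module might do, the sketch below converts raw 16-bit little-endian PCM bytes into floating-point samples. The choice of PCM16 and the function name `decode_pcm16le` are assumptions made for the example; the claim does not identify any audio format.

```python
import struct
from typing import List


def decode_pcm16le(payload: bytes) -> List[float]:
    """Decode signed 16-bit little-endian PCM bytes to floats in [-1.0, 1.0)."""
    if len(payload) % 2 != 0:
        raise ValueError("PCM16 payload must contain an even number of bytes")
    count = len(payload) // 2
    # "<" selects little-endian byte order, "h" a signed 16-bit integer.
    samples = struct.unpack(f"<{count}h", payload)
    return [sample / 32768.0 for sample in samples]
```

A real module would more likely wrap a compressed codec; uncompressed PCM is used here only because it can be decoded with the standard library.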

Claim 6

Original Legal Text

6. The method of claim 1, further comprising: at a second electronic device having a second device type, distinct from the first device type: downloading the device-agnostic voice assistant library; configuring the device-agnostic voice assistant library to execute on the second electronic device based on the second electronic device having the second device type; after the configuring the device-agnostic voice assistant library, receiving a second verbal input from a second user; and performing a second operation at the second electronic device in response to the second verbal input by one or more voice processing modules of the configured voice assistant library.

Plain English Translation

This invention relates to a voice assistant system designed to operate across multiple electronic devices with different device types. The system addresses the challenge of developing a single voice assistant library that can be adapted for use on various devices, such as smartphones, smart speakers, or other computing platforms, without requiring separate, device-specific implementations. The core method involves deploying a device-agnostic voice assistant library to a first electronic device, configuring it based on the device's type, and then processing verbal inputs to perform operations. The library includes voice processing modules that handle tasks like speech recognition, natural language understanding, and command execution. The system ensures compatibility by dynamically adjusting the library's configuration to match the hardware and software capabilities of the target device. This allows the same voice assistant functionality to be deployed across different devices while maintaining performance and reliability. The invention further extends this approach to a second electronic device with a distinct device type, where the library is similarly downloaded, configured, and used to process verbal inputs from a second user, demonstrating the system's scalability and adaptability across diverse hardware platforms.

Claim 7

Original Legal Text

7. The method of claim 1, wherein performing the operation comprises outputting an audible response to the user via the audio input system.

Plain English Translation

This claim narrows the operation performed in claim 1. After the device captures the user's verbal input, extracts request information, and receives a response from the remote system, the operation it performs includes outputting an audible response to the user. As worded, the claim has the audible response output via the audio input system, that is, the same audio system that received the verbal input. Audible feedback lets users receive confirmation or information without a separate screen or interface, which suits hands-free or voice-controlled applications such as smart home devices, virtual assistants, or automotive systems.

Claim 8

Original Legal Text

8. The method of claim 1, wherein configuring the device-agnostic voice assistant library includes enabling a voice assistant functionality on the electronic device.

Plain English Translation

A voice assistant system for electronic devices enables voice-controlled operations without requiring device-specific integration. The system includes a device-agnostic voice assistant library that standardizes voice processing across different devices, allowing seamless interaction with various hardware and software configurations. The library processes voice commands, converts them into executable actions, and interfaces with device-specific components to perform tasks such as launching applications, adjusting settings, or retrieving information. By enabling voice assistant functionality on an electronic device, the system ensures compatibility and consistent performance across diverse platforms, eliminating the need for custom development for each device. The library handles voice recognition, natural language processing, and command execution, while adapting to the device's capabilities to provide a unified user experience. This approach reduces development complexity and enhances accessibility, allowing users to interact with their devices using voice commands regardless of the underlying hardware or software environment.

Claim 9

Original Legal Text

9. A device-agnostic voice assistant library for electronic devices that include respective audio input systems, comprising: one or more implementation modules configured to implement the voice assistant library across each of a plurality of different electronic devices based on a corresponding device type, wherein the implementation for the voice assistant library is selected from a group consisting of: in an application installed on the electronic device, in an operating system of the electronic device, and in firmware of the electronic device; a plurality of voice processing modules, each of the voice processing modules providing one or more voice processing operations that are accessible to application programs executing or executable on the different electronic device types; and one or more application programming interfaces (APIs) configured to provide interfaces between the plurality of voice processing operations and hardware and/or software of the electronic devices; whereby the one or more voice processing modules and APIs enable portability across the plurality of different electronic device types of voice-enabled applications configured to interact with one or more of the voice processing operations.

Plain English Translation

This invention relates to a device-agnostic voice assistant library designed for electronic devices equipped with audio input systems. The library addresses the challenge of developing voice-enabled applications that can operate consistently across diverse electronic devices with varying hardware and software configurations. The library includes implementation modules that allow it to be integrated into different parts of a device, such as an installed application, the operating system, or firmware, depending on the device type. This flexibility ensures compatibility with a wide range of devices. The library also features multiple voice processing modules, each offering specific voice processing functions that applications can access. These functions may include speech recognition, natural language processing, or voice command execution. Additionally, the library provides application programming interfaces (APIs) that facilitate communication between the voice processing modules and the device's hardware or software components. By standardizing these interfaces, the library enables voice-enabled applications to be portable across different device types, reducing development complexity and ensuring consistent performance. The overall system allows developers to create voice-assisted applications that work seamlessly across various electronic devices without requiring device-specific customization.
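The portability idea above (voice processing modules reaching device hardware only through an API layer) can be sketched with a small interface. Everything here is hypothetical: `PlatformAudio`, the two device adapters, and `ResponseModule` are illustrative names, and the adapters record output in a list instead of driving real hardware.

```python
from typing import List, Protocol


class PlatformAudio(Protocol):
    """Hypothetical API between voice processing operations and the device."""

    def play(self, text: str) -> None: ...


class SpeakerAudio:
    def __init__(self) -> None:
        self.log: List[str] = []

    def play(self, text: str) -> None:
        self.log.append(f"speaker:{text}")


class TvAudio:
    def __init__(self) -> None:
        self.log: List[str] = []

    def play(self, text: str) -> None:
        self.log.append(f"tv:{text}")


class ResponseModule:
    """A voice processing module that depends only on the PlatformAudio API."""

    def __init__(self, audio: PlatformAudio) -> None:
        self._audio = audio

    def respond(self, text: str) -> None:
        self._audio.play(text)


def voice_app(module: ResponseModule) -> None:
    # Application code is identical regardless of the device type underneath.
    module.respond("hello")
```

Because `voice_app` touches only `ResponseModule`, the same application code runs unchanged against either adapter, which is the kind of portability the claim attributes to the modules and APIs.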

Claim 10

Original Legal Text

10. The voice assistant library of claim 9, wherein at least some voice processing operations associated with the voice processing modules are performed on a backend server interconnected with the electronic devices via a wide area network.

Plain English Translation

This claim relates to the voice assistant library of claim 9, whose voice processing modules handle tasks such as speech recognition, natural language processing, and voice command execution. Under this claim, at least some of the voice processing operations associated with those modules are performed on a backend server connected to the electronic devices over a wide area network, so processing can be split between the device and the server. Offloading operations to the backend server can reduce the computational load on the device: the server may handle computationally intensive tasks while simpler operations are processed locally. This lets devices use advanced voice processing without extensive local computational resources. The claim does not specify which operations run on the server or how tasks are allocated between device and server.

Claim 11

Original Legal Text

11. The voice assistant library of claim 10, wherein the voice processing operations include device-specific operations configured to control devices coupled with the electronic devices.

Plain English Translation

This invention relates to a voice assistant library designed to enhance voice processing capabilities in electronic devices. The library enables devices to perform device-specific operations, such as controlling connected devices, through voice commands. The system processes voice inputs to execute these operations, ensuring compatibility with various electronic devices and their associated peripherals. The library includes a voice processing module that interprets voice commands and translates them into executable actions, allowing seamless integration with different devices. This approach improves user interaction by enabling direct control of connected devices via voice, eliminating the need for manual input. The library is adaptable to different hardware configurations, ensuring broad applicability across multiple electronic devices. The invention addresses the challenge of integrating voice control features into diverse electronic systems, providing a unified solution for device management through voice commands.

Claim 12

Original Legal Text

12. The voice assistant library of claim 10, wherein the voice processing operations include information and media request operations configured to provide requested information and/or media content to users of the respective electronic devices, or on devices coupled with the respective electronic devices.

Plain English Translation

This invention relates to a voice assistant library designed to enhance the functionality of electronic devices by enabling voice-based interactions. The library processes voice commands to perform various operations, including information and media request operations. These operations allow users to retrieve requested information or media content directly on their electronic devices or on devices connected to them. The library is configured to handle these requests efficiently, ensuring seamless access to data and media through voice commands. The system integrates with multiple electronic devices, enabling consistent voice assistant functionality across different platforms. The library may also include additional voice processing operations, such as natural language processing, to interpret and execute user commands accurately. The overall goal is to provide a unified and intuitive voice interface for accessing information and media, improving user convenience and interaction efficiency.

Claim 13

Original Legal Text

13. The voice assistant library of claim 9, wherein the plurality of voice processing operations comprises hotword detection.

Plain English Translation

A voice assistant library is designed to enhance the functionality of voice-enabled systems by providing a modular framework for processing voice commands. The library includes a set of voice processing operations that can be dynamically loaded and executed to handle various aspects of voice interaction, such as speech recognition, natural language processing, and command execution. One of the key operations included in this library is hotword detection, which allows the system to continuously monitor audio input for a specific wake word or phrase, triggering the activation of the voice assistant when detected. This modular approach enables developers to customize and extend the capabilities of the voice assistant without modifying the core system, improving flexibility and scalability. The library may also include additional operations such as speech-to-text conversion, intent recognition, and response generation, ensuring comprehensive support for voice-based interactions. By integrating these operations into a unified framework, the library simplifies the development of voice-enabled applications while maintaining high performance and accuracy in processing user commands.
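The wake-word behavior described above can be illustrated with a toy detector. Note the simplifying assumption: production hotword detection works on the audio signal, whereas this sketch scans already-transcribed word tokens. The function name `find_hotword` and the example phrase are hypothetical.

```python
from typing import List, Optional


def find_hotword(tokens: List[str], hotword: List[str]) -> Optional[int]:
    """Return the index of the first token after the hotword, or None.

    `hotword` is a list of lowercase words, e.g. ["hey", "device"].
    Matching is case-insensitive on `tokens`.
    """
    size = len(hotword)
    if size == 0:
        return None
    lowered = [token.lower() for token in tokens]
    for start in range(len(lowered) - size + 1):
        if lowered[start:start + size] == hotword:
            return start + size
    return None
```

The returned index marks where the user's actual request begins, so a caller could pass `tokens[index:]` on to later processing stages.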

Claim 14

Original Legal Text

14. The voice assistant library of claim 9, wherein the plurality of voice processing operations comprises speech processing.

Plain English Translation

A voice assistant library is designed to enhance the functionality of voice assistant systems by providing a modular framework for executing various voice processing operations. The library includes a plurality of voice processing operations, which may include speech processing. Speech processing involves analyzing and interpreting spoken language to convert it into a format that can be understood and acted upon by the voice assistant system. This may include tasks such as speech recognition, natural language understanding, and speech synthesis. The library is structured to allow for the integration of different voice processing modules, enabling customization and scalability based on specific application requirements. By modularizing these operations, the library facilitates easier updates, maintenance, and expansion of the voice assistant's capabilities. The system is particularly useful in environments where voice interaction is a primary mode of user engagement, such as smart home devices, virtual assistants, and automotive systems. The modular design ensures that new voice processing techniques can be added without disrupting existing functionality, making the system adaptable to evolving technological advancements.

Claim 15

Original Legal Text

15. The voice assistant library of claim 9, wherein the plurality of voice processing operations comprises audio input processing.

Plain English Translation

A voice assistant library is designed to enhance the functionality of voice-enabled systems by providing a modular framework for executing voice processing operations. The library addresses the challenge of integrating diverse voice processing tasks into a unified system, improving efficiency and scalability. The library includes a plurality of voice processing operations, one of which is audio input processing. Audio input processing involves capturing, filtering, and preparing raw audio signals for further analysis, such as speech recognition or natural language understanding. This operation ensures that the audio data is clean and structured, enabling accurate and reliable voice assistant performance. The library may also include other voice processing operations, such as speech recognition, natural language processing, and response generation, to support a comprehensive voice interaction pipeline. By modularizing these operations, the library allows developers to customize and extend voice assistant capabilities without rebuilding the entire system from scratch. This approach simplifies integration and reduces development time, making it easier to deploy voice-enabled applications across various platforms and devices.
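One simple way to picture "preparing raw audio signals" is a preprocessing step that removes any constant offset and scales the signal to a fixed peak. The sketch below is only an example of such a step; the function name `preprocess` and the specific operations chosen are assumptions, since the claim does not say what audio input processing consists of.

```python
from typing import List


def preprocess(samples: List[float]) -> List[float]:
    """Remove the DC offset, then scale so the largest magnitude is 1.0."""
    if not samples:
        return []
    mean = sum(samples) / len(samples)
    centered = [sample - mean for sample in samples]
    peak = max(abs(sample) for sample in centered)
    if peak == 0:
        # Constant input: nothing to scale after the offset is removed.
        return centered
    return [sample / peak for sample in centered]
```

Real pipelines typically add steps such as noise suppression or echo cancellation; those are omitted here to keep the example self-contained.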

Claim 16

Original Legal Text

16. An electronic device having a first device type, comprising: an audio input system; one or more processors; and memory storing one or more programs to be executed by the one or more processors, the one or more programs comprising instructions for: downloading a device-agnostic voice assistant library configured to execute across a plurality of different electronic device types, including the first device type, wherein the voice-assistant library includes a plurality of voice processing modules, each of the voice processing modules providing one or more voice processing operations that are accessible to application programs executing or executable on the different electronic device types; configuring the device-agnostic voice assistant library to execute on the electronic device based on the electronic device having the first device type of the plurality of different electronic device types, including: selecting an implementation for the voice assistant library based on the electronic device having the first device type, wherein the implementation for the voice assistant library is selected from a group consisting of: in an application installed on the electronic device, in an operating system of the electronic device, and in firmware of the electronic device; after the configuring, receiving, via a microphone of the audio input system, a verbal input from a user; extracting request information from the verbal input by processing the verbal input using the device-agnostic voice assistant library executing on the electronic device; transmitting a request to a remote system, the request including the extracted request information; receiving a response to the request, wherein the response is generated by the remote system in accordance with the extracted request information; and performing an operation in accordance with the response by one or more voice processing modules of the configured voice assistant library.

Plain English Translation

The invention relates to a voice assistant system for electronic devices, addressing the challenge of providing consistent voice processing capabilities across diverse device types. The system includes an electronic device with an audio input system, processors, and memory storing a device-agnostic voice assistant library. This library is designed to operate across multiple device types, including the device's specific type, and contains modular voice processing functions accessible to applications. The library is configured based on the device type, with implementations possible within an installed application, the operating system, or firmware. Upon receiving a user's verbal input via a microphone, the device processes the input using the library to extract request information. This information is transmitted to a remote system, which generates a response based on the request. The device then performs an operation in accordance with the response using the configured voice assistant library. The modular design allows for flexible deployment and consistent functionality across different device types, ensuring compatibility and scalability.
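The sequence described above (select an implementation from the device type, extract request information, send it to a remote system, act on the response) can be sketched end to end. This is an illustrative outline under several assumptions: the device-type table is invented, a transcript string stands in for microphone audio, and `fake_remote` replaces the remote system. None of these names come from the patent.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Callable, Dict


class Implementation(Enum):
    APPLICATION = "application"
    OPERATING_SYSTEM = "operating_system"
    FIRMWARE = "firmware"


# Hypothetical mapping; the claims do not say which device type gets which option.
IMPLEMENTATION_BY_DEVICE_TYPE: Dict[str, Implementation] = {
    "phone": Implementation.APPLICATION,
    "tv": Implementation.OPERATING_SYSTEM,
    "speaker": Implementation.FIRMWARE,
}


@dataclass
class VoiceAssistantLibrary:
    implementation: Implementation

    def extract_request(self, verbal_input: str) -> dict:
        # Stand-in for on-device processing of the verbal input.
        return {"query": verbal_input.strip().lower()}

    def perform(self, response: dict) -> str:
        return f"speak: {response['answer']}"


def configure(device_type: str) -> VoiceAssistantLibrary:
    """Select the implementation based on the device type."""
    return VoiceAssistantLibrary(IMPLEMENTATION_BY_DEVICE_TYPE[device_type])


def handle_verbal_input(
    library: VoiceAssistantLibrary,
    verbal_input: str,
    remote: Callable[[dict], dict],
) -> str:
    request = library.extract_request(verbal_input)  # extract request information
    response = remote(request)                       # transmit and receive
    return library.perform(response)                 # act on the response


def fake_remote(request: dict) -> dict:
    # Placeholder for the remote system that generates the response.
    return {"answer": f"result for '{request['query']}'"}
```

For example, `handle_verbal_input(configure("speaker"), "What time is it", fake_remote)` walks through all three steps using the firmware option from the hypothetical table.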

Claim 17

Original Legal Text

17. The device of claim 16, comprising instructions for performing hotword detection on the verbal input, wherein the instructions for performing hotword detection on the verbal input are included in a module of the voice processing modules of the voice assistant library.

Plain English Translation

This invention relates to voice processing systems, specifically a device for handling verbal input in a voice assistant application. The problem addressed is the need for efficient and modular processing of voice commands, particularly in detecting hotwords (trigger phrases) that activate voice assistant functions. The device includes a voice assistant library with multiple voice processing modules, each handling different aspects of voice input processing. One such module is dedicated to hotword detection, which analyzes verbal input to identify predefined trigger phrases. This module operates within the broader voice processing framework, ensuring seamless integration with other modules for tasks like speech recognition, natural language processing, and command execution. The hotword detection module is designed to quickly and accurately identify hotwords in real-time, minimizing latency and improving user experience. By isolating this function in a dedicated module, the system allows for easier updates, customization, and optimization of hotword detection without affecting other voice processing components. The modular architecture also supports scalability, enabling the addition of new hotwords or detection algorithms as needed. This approach enhances the flexibility and efficiency of voice assistant applications, making them more responsive and adaptable to user needs. The invention is particularly useful in environments where rapid and reliable hotword detection is critical, such as smart home devices, virtual assistants, and automotive voice control systems.

Claim 18

Original Legal Text

18. The device of claim 16, comprising instructions for performing speech processing on the verbal input, wherein the instructions for performing speech processing on the verbal input are included in a module of the voice processing modules of the voice assistant library.

Plain English Translation

This invention relates to voice assistant technology, specifically improving the integration and functionality of voice processing modules within a voice assistant library. The problem addressed is the need for efficient and modular speech processing within voice assistant systems, ensuring that verbal inputs are accurately processed and executed. The invention provides a device that includes a voice assistant library with multiple voice processing modules, each designed to handle specific tasks such as speech recognition, natural language understanding, or command execution. The device further includes instructions for performing speech processing on verbal inputs, where these instructions are embedded within one of the voice processing modules in the library. This modular approach allows for flexible and scalable integration of speech processing capabilities, enabling the voice assistant to adapt to different user needs and environments. The system ensures that verbal inputs are processed efficiently, improving the overall responsiveness and accuracy of the voice assistant. By organizing speech processing instructions within dedicated modules, the invention simplifies updates and customization, making it easier to enhance or modify the system's functionality. This modular design also supports interoperability with other voice processing modules, allowing for seamless integration into broader voice assistant frameworks. The invention aims to provide a robust and adaptable solution for handling verbal inputs in voice assistant applications.

Claim 19

Original Legal Text

19. The device of claim 16, comprising instructions for performing audio input processing on audio data of the verbal input, wherein the instructions for performing audio input processing on audio data of the verbal input are included in a module of the voice processing modules of the voice assistant library.

Plain English Translation

This invention relates to a voice assistant system designed to process verbal inputs from users. The system includes a voice assistant library with multiple voice processing modules, each handling specific tasks such as speech recognition, natural language processing, or audio enhancement. The invention specifically addresses the need for efficient and modular audio input processing within such systems, ensuring that verbal inputs are accurately captured and prepared for further analysis. The device includes instructions for performing audio input processing on the audio data of the verbal input. These instructions are integrated into one of the voice processing modules within the voice assistant library. The audio input processing may involve noise reduction, speech enhancement, or other preprocessing steps to improve the quality of the audio data before it is analyzed by other modules. By modularizing this functionality, the system allows for flexible and scalable processing, where different modules can be updated or replaced independently without disrupting the overall workflow. This approach enhances the system's ability to handle diverse audio environments and user inputs, ensuring reliable performance in real-world applications. The modular design also facilitates easier maintenance and customization, allowing developers to adapt the system to specific use cases or integrate new processing techniques as needed.

Claim 20

Original Legal Text

20. The device of claim 16, wherein the plurality of voice processing modules includes a ducking module, the ducking module providing a ducking operation that is accessible to application programs executing or executable on the different electronic device types, the ducking operation including: while the electronic device is producing an audible output, receiving an activation input to the electronic device indicating that a user is about to submit verbal input to the electronic device; and in response to the activation input, adjusting by the electronic device the audible output from the first volume level to a second volume level, less than the first volume level.

Plain English Translation

This invention relates to audio processing in electronic devices, specifically improving user interaction by lowering audio output when the user signals that verbal input is about to begin. The problem addressed is that audible output from the device, such as music or notifications, can interfere with voice recognition accuracy and user experience while the user is speaking. The invention describes a device with multiple voice processing modules, including a ducking module. The ducking module provides a ducking operation that reduces the volume of the device's audible output when the user is about to speak. The process begins while the device is playing audio at a first volume level. Upon receiving an activation input indicating that the user is about to speak (for example a button press or touch gesture; the claim does not limit the form of the input), the device lowers the audio output to a second, lower volume level. This helps the device capture verbal input more clearly and improves voice recognition performance. The ducking operation is accessible to application programs running on various electronic device types, allowing consistent behavior across different platforms. The claim recites only the lowering step; returning to the first volume level once the verbal input is complete is a natural follow-up but is not part of the claim language.
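
The ducking operation can be sketched in a few lines. This is an illustrative sketch only: the class name `DuckingModule`, the method `on_activation_input`, and the multiplicative `duck_factor` are assumptions, since the claim requires only that the second volume level be less than the first and does not say how it is chosen.

```python
class DuckingModule:
    """Hypothetical ducking module; names and scaling rule are assumptions."""

    def __init__(self, duck_factor: float = 0.25) -> None:
        # A factor in [0, 1) guarantees the ducked level is below any
        # positive starting level.
        if not 0.0 <= duck_factor < 1.0:
            raise ValueError("duck_factor must be in [0.0, 1.0)")
        self.duck_factor = duck_factor

    def on_activation_input(self, first_volume: float,
                            producing_output: bool) -> float:
        """Return the volume to use after an activation input is received.

        Ducking applies only while the device is producing audible output;
        otherwise the volume is returned unchanged.
        """
        if not producing_output or first_volume <= 0.0:
            return first_volume
        return first_volume * self.duck_factor


ducker = DuckingModule(duck_factor=0.25)
second_volume = ducker.on_activation_input(first_volume=0.8,
                                           producing_output=True)
```

Restoring the first volume level is deliberately left out of the sketch because the claim text does not recite it. A fixed lower level or a platform audio-focus mechanism would satisfy the same "less than the first volume level" condition.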

Patent Metadata

Filing Date

Unknown

Publication Date

January 14, 2020

Inventors

Kenneth Mixter

Citation & reuse

Analysis on this page is generated by Patentable — an AI-powered patent intelligence platform. AI-generated summaries, explanations, FAQs, and analysis may be reused with attribution and a visible link back to the canonical URL below. Patent abstracts and claims are USPTO public domain.

Cite as: Patentable. “Implementations for Voice Assistant on Devices” (10535343). https://patentable.app/patents/10535343
