Legal claims defining the scope of protection. Each claim is shown in both the original legal language and a plain English translation.
1. A method for processing one or more multi-modal user interactions in a natural language voice services environment that includes one or more electronic devices, the method being implemented by a computer system that includes one or more physical processors executing one or more computer program instructions which, when executed, perform the method, the method comprising: detecting a multi-modal user interaction received via one or more electronic devices, the multi-modal user interaction comprising at least a non-voice input and a natural language utterance, wherein the non-voice input is received from a non-voice input component of the one or more electronic devices, and wherein the natural language utterance is received from a voice input component of the one or more electronic devices and is related to the non-voice input; obtaining an indication of a first time at which the non-voice input was received by the non-voice input component; obtaining an indication of a second time at which the natural language utterance was received by the voice input component; determining that the non-voice input and the natural language utterance are related and are to be interpreted together based on the first time and the second time; and responsive to determining that the non-voice input and the natural language utterance are related and are to be interpreted together based on the first time and the second time, performing the following steps: determining first context information relating to the non-voice input; determining second context information relating to the natural language utterance; determining an intent of the multi-modal user interaction based on the first context information and the second context information; identifying a transaction lead based on the determined intent; and transmitting the identified transaction lead to a user via the one or more electronic devices.
This invention relates to processing multi-modal user interactions in natural language voice services, particularly where users combine non-voice inputs (e.g., touch, gestures, or text) with spoken language. The system detects simultaneous or sequential inputs from different modalities, such as a user tapping a screen while speaking a related command. It analyzes timestamps to determine if inputs are related, then extracts context from both the non-voice and voice inputs to infer the user's intent. For example, a user might tap a product image while saying "buy this," combining visual and voice cues. The system processes these inputs together to identify transaction opportunities (e.g., purchases or service requests) and transmits relevant leads to the user. The approach improves interaction accuracy by leveraging multi-modal context, reducing ambiguity in voice-only systems. The system operates on electronic devices with voice and non-voice input components, using timestamps to correlate inputs and machine learning or rule-based methods to interpret combined context. The output is actionable transaction leads, such as product recommendations or service bookings, delivered to the user.
2. The method of claim 1 , wherein the one or more processors, the non-voice input component, and the voice input component are housed within a single electronic device.
This invention relates to a system for processing both voice and non-voice inputs within a single electronic device. The device includes one or more processors, a non-voice input component, and a voice input component, all integrated into a unified housing. The non-voice input component captures non-verbal user inputs, such as gestures, touch, or other non-auditory signals, while the voice input component records spoken commands or queries. The processors analyze these inputs to determine user intent, enabling the device to respond appropriately. The integration of both input types into a single device enhances user interaction by allowing seamless switching between voice and non-voice interactions. This design eliminates the need for separate devices or components, improving portability and convenience. The system may also include additional features like data storage, wireless communication, or user feedback mechanisms to further enhance functionality. The invention is particularly useful in applications where users need to interact with a device in environments where voice input alone may be impractical or where non-voice inputs provide more precise control. The unified design ensures consistent performance and reduces complexity for the user.
3. The method of claim 1 , wherein the one or more processors are housed in a first electronic device, the non-voice input component is housed in a second electronic device, and the voice input component is housed in a third electronic device.
This invention relates to a distributed system for processing user inputs, addressing the challenge of integrating voice and non-voice inputs from multiple devices into a cohesive processing framework. The system includes one or more processors configured to receive and process inputs from both voice and non-voice sources, such as touchscreens, keyboards, or sensors. The processors analyze these inputs to determine user intent and execute corresponding actions, such as controlling applications or devices. A key feature is the ability to correlate and synchronize inputs from different sources, ensuring seamless interaction even when inputs are received simultaneously or in rapid succession. The system also prioritizes inputs based on predefined rules, such as giving precedence to voice commands over non-voice inputs in certain contexts. Additionally, the system can adapt its processing logic based on historical input patterns to improve accuracy and responsiveness. The processors are housed in a first electronic device, while the non-voice input components are in a second device, and the voice input components are in a third device. This distributed architecture allows for flexible deployment across different hardware configurations, enabling applications in smart home systems, wearable devices, or multi-device user interfaces. The invention enhances user experience by providing a unified input processing solution that works across disparate devices.
4. The method of claim 1 , wherein the one or more processors are housed in a first electronic device, and wherein the non-voice input component and the voice input component are housed in a second electronic device.
This claim describes a method where the computing is done in one device, while the user's voice and other input (like touch) are captured by a separate device.
5. The method of claim 1 , wherein the one or more processors and the non-voice input component are housed in a first electronic device, and wherein the voice input component is housed in a second electronic device.
This invention relates to a system for processing voice and non-voice inputs across multiple electronic devices. The technology addresses the challenge of integrating voice and non-voice inputs from separate devices into a unified processing system, improving user interaction and functionality. The system includes a voice input component and a non-voice input component, each housed in distinct electronic devices. The voice input component captures spoken commands or queries, while the non-voice input component handles other forms of input, such as touch, gestures, or text. Both components are connected to one or more processors that analyze and process the inputs to generate a response or action. The processors may execute algorithms to interpret the inputs, coordinate between the devices, and execute tasks based on the combined input data. The system ensures seamless interaction between the voice and non-voice inputs, allowing users to control or interact with applications or services using multiple input modalities. The separation of components into different devices enables flexibility in device placement and usage, such as a voice-enabled remote control paired with a touchscreen device. The processors may also synchronize data between the devices to maintain consistency in user interactions. This approach enhances accessibility and convenience by accommodating different input preferences and environments.
6. The method of claim 1 , wherein the one or more processors and the voice input component are housed in a first electronic device, and wherein the non-voice input component is housed in a second electronic device.
This invention relates to a distributed voice interaction system that processes voice and non-voice inputs across multiple electronic devices. The system addresses the challenge of integrating voice and non-voice inputs in a seamless manner, particularly in scenarios where different input modalities are handled by separate devices. The primary device contains one or more processors and a voice input component, such as a microphone, to capture and process spoken commands or queries. A secondary device houses a non-voice input component, such as a keyboard, touchscreen, or other non-auditory input mechanism. The system enables coordination between these devices to process inputs from both modalities, allowing users to interact with the system using voice or non-voice inputs as needed. The processors in the primary device may analyze the voice inputs, while the secondary device may handle non-voice inputs, with data exchanged between the devices to ensure coherent system operation. This setup enhances flexibility and usability, particularly in environments where multiple input methods are required or preferred. The invention may be applied in smart home systems, virtual assistants, or other interactive computing applications where distributed input processing is beneficial.
7. The method of claim 1 , wherein the non-voice input comprises a point of focus input on a display of the non-voice input component.
A method for processing non-voice inputs in a computing system involves detecting and interpreting user interactions with a display to determine a point of focus. The system includes a non-voice input component, such as a touchscreen or pointing device, that captures user gestures or selections on a display. The point of focus input indicates a specific area or element on the display where the user is directing attention, such as by tapping, hovering, or selecting an item. The system analyzes this input to determine the user's intent, which may involve selecting an object, navigating a user interface, or triggering an action. The method may also involve integrating this input with other non-voice inputs, such as text or graphical inputs, to enhance interaction accuracy. The system dynamically adjusts its response based on the detected point of focus, improving user experience by reducing ambiguity in input interpretation. This approach is particularly useful in applications requiring precise control, such as graphical design tools, virtual reality interfaces, or accessibility features for users with limited mobility. The method ensures that non-voice interactions are efficiently processed, enabling seamless and intuitive user-system communication.
8. The method of claim 1 , wherein the non-voice input comprises a highlighting of text on a display of the non-voice input component.
A system and method for processing non-voice inputs in a communication device involves detecting and interpreting user interactions with a display to generate commands or responses. The method includes receiving a non-voice input, such as a user highlighting text on a display of a non-voice input component, and converting this input into a machine-readable format. The system then processes the highlighted text to determine its context or meaning, enabling actions like sending the text as a message, performing a search, or triggering a function based on the highlighted content. The method may also involve analyzing the highlighted text to extract keywords, phrases, or commands, which are then used to generate an appropriate response or action. The system can integrate with voice-based communication systems to provide a hybrid input method, allowing users to interact with the device through both voice and non-voice inputs. This approach enhances user convenience by enabling quick, non-verbal interactions, particularly in environments where voice input is impractical or undesirable. The method ensures accurate interpretation of the highlighted text by applying natural language processing or pattern recognition techniques to derive meaningful commands or data from the input.
9. The method of claim 1 , the method further comprising: obtaining preference information of a user, wherein the transaction lead is identified based further on the preference information.
A system and method for identifying and presenting transaction leads to users based on their preferences. The technology operates in the domain of digital transaction facilitation, addressing the problem of inefficient lead matching by incorporating user preferences to improve relevance and engagement. The method involves collecting and analyzing user preference data, which may include historical interactions, explicit selections, or behavioral patterns. This preference information is used to refine the identification of transaction leads, ensuring they align with the user's interests or needs. The system dynamically adjusts lead recommendations based on the updated preference data, enhancing the likelihood of successful transactions. By integrating user preferences into the lead identification process, the method improves the efficiency and personalization of transaction opportunities, reducing the time and effort required for users to find suitable leads. The approach leverages data-driven insights to tailor recommendations, optimizing the user experience in digital transaction environments.
10. The method of claim 1 , wherein the transaction lead comprises at least one of an advertisement or a recommendation related to the determined intent of the multi-modal user interaction.
This invention relates to systems for processing multi-modal user interactions, such as those involving voice and text inputs, to generate transaction leads. The problem addressed is the need to accurately interpret user intent from diverse input modalities and provide relevant transaction opportunities, such as advertisements or recommendations, based on that intent. The method involves analyzing a multi-modal user interaction to determine the user's intent, then generating a transaction lead that aligns with that intent. The transaction lead may include an advertisement or recommendation tailored to the user's inferred intent. For example, if a user's interaction suggests interest in purchasing a product, the system may generate an ad for that product or a related recommendation. The method ensures that transaction leads are contextually relevant, improving user engagement and conversion rates. The system may also prioritize leads based on relevance or other factors to enhance the user experience. This approach leverages multi-modal input analysis to deliver personalized and timely transaction opportunities, addressing the challenge of matching user intent with appropriate commercial content.
11. The method of claim 1 , the method further comprising: receiving a further input after the transaction lead was transmitted; determining a second intent of the further input; and providing further information relating to the transaction lead based on the second intent.
This invention relates to transaction lead processing systems that dynamically respond to user inputs to provide relevant information. The problem addressed is the static nature of traditional transaction lead systems, which often fail to adapt to evolving user needs during interactions. The invention improves upon prior systems by enabling real-time intent detection and adaptive information delivery. The method involves receiving an initial transaction lead, which may include details such as user identity, transaction type, or context. The system processes this lead to determine a primary intent, then transmits relevant information based on that intent. The key innovation is the ability to receive additional inputs after the initial response, analyze these inputs to determine a secondary intent, and provide further tailored information accordingly. This ensures that the system remains responsive to changing user requirements throughout the interaction. The method may also include steps such as validating the transaction lead, categorizing the intent, or filtering information based on user preferences. The adaptive response mechanism allows for more efficient and personalized interactions, reducing the need for manual follow-ups or redundant queries. This approach is particularly useful in financial, customer service, or sales automation contexts where dynamic responses enhance user satisfaction and operational efficiency.
12. The method of claim 1 , the method further comprising: receiving a further input after the transaction lead was transmitted; determining a second intent of the further input; and completing a purchase transaction in response to receiving the further input based on the determined second intent.
This invention relates to transaction processing systems that handle user inputs to complete purchases. The problem addressed is the need for systems to dynamically respond to additional user inputs after an initial transaction lead is transmitted, ensuring seamless and accurate transaction completion based on evolving user intent. The method involves receiving an initial input from a user, which triggers the transmission of a transaction lead to a processing system. After this transmission, the system receives a further input from the user. The system then analyzes this further input to determine a second intent, which may differ from the initial intent. Based on this second intent, the system completes a purchase transaction, ensuring the transaction aligns with the user's updated preferences or actions. This allows for flexible and adaptive transaction processing, accommodating changes in user behavior or requirements during the transaction flow. The system may also include steps for processing the initial input, such as analyzing the input to determine an initial intent and transmitting a transaction lead based on that intent. The further input can be received through various channels, such as voice, text, or graphical interfaces, and the system may use natural language processing or other techniques to interpret the second intent accurately. The purchase transaction is then executed in accordance with the determined second intent, ensuring the transaction is completed as intended by the user.
13. The method of claim 12 , wherein the further input comprises a second natural language utterance.
This invention relates to natural language processing systems that handle user inputs, particularly in conversational interfaces or virtual assistants. The problem addressed is improving the accuracy and relevance of system responses by dynamically incorporating additional context from subsequent user inputs. The method involves a system that processes an initial natural language utterance to generate a response. If the system detects ambiguity or insufficient context in the initial input, it prompts the user for further clarification. The further input, which may be a second natural language utterance, is then analyzed to refine the system's understanding of the user's intent. This additional context is used to adjust the system's response, ensuring it aligns more closely with the user's needs. The system may also use the further input to update its internal knowledge base or user profile, improving future interactions. The method includes techniques for parsing and interpreting the second utterance, such as identifying key phrases, resolving ambiguities, or extracting new contextual information. The system may employ machine learning models or rule-based logic to determine how the second utterance modifies the initial response. This approach enhances the system's ability to handle complex or multi-turn conversations, where user intent may evolve over time. The invention is particularly useful in applications like customer service chatbots, voice assistants, or automated support systems, where accurate and context-aware responses are critical. By dynamically incorporating follow-up inputs, the system avoids misinterpretations and provides more precise and helpful outputs.
14. The method of claim 12 , wherein the further input comprises a second non-voice input.
A system and method for processing user inputs in a voice-controlled device involves detecting and interpreting non-voice inputs, such as gestures or touch inputs, to enhance interaction. The device includes a microphone for capturing voice commands and a sensor for detecting non-voice inputs. When a voice command is received, the device processes it to perform a corresponding action. If a non-voice input is detected simultaneously or shortly after the voice command, the device interprets this input as a modifier or additional instruction, adjusting the action accordingly. For example, a voice command to "turn on the lights" paired with a swipe gesture could adjust brightness or color settings. The system distinguishes between intentional non-voice inputs and accidental triggers using timing, context, or user preferences. This method improves user interaction by allowing combined voice and non-voice inputs, reducing the need for multiple commands and enhancing precision in device control. The invention is particularly useful in smart home systems, wearable devices, or automotive interfaces where multi-modal inputs improve usability.
15. The method of claim 1 , wherein the non-voice input component comprises a map display, and wherein the transaction lead is presented as a point on the map display.
This invention relates to a system for processing non-voice inputs in a transactional environment, particularly for visualizing transaction leads on a map display. The system captures non-voice inputs, such as text, gestures, or other non-auditory signals, and processes them to generate transaction leads. These leads are then displayed as points on a map, allowing users to interact with and manage transactions spatially. The map display may include additional features like zoom, pan, and filtering to enhance usability. The system may also integrate with other data sources to provide contextual information alongside the transaction leads. This approach improves transaction management by providing a visual, location-based interface for tracking and organizing leads, making it easier for users to identify and prioritize opportunities based on geographic data. The invention is particularly useful in fields like real estate, logistics, or field service management, where spatial awareness is critical. The method ensures that non-voice inputs are efficiently converted into actionable insights displayed in a map-based format, streamlining workflows and improving decision-making.
16. A system of processing one or more multi-modal user interactions in a natural language voice services environment that includes one or more electronic devices, the system comprising: one or more physical processors programmed with one or more computer program instructions which, when executed, cause the one or more physical processors to: detect a multi-modal user interaction received via one or more electronic devices, the multi-modal user interaction comprising at least a non-voice input and a natural language utterance, wherein the non-voice input is received from a non-voice input component of the one or more electronic devices, and wherein the natural language utterance is received from a voice input component of the one or more electronic devices and is related to the non-voice input; obtain an indication of a first time at which the non-voice input was received by the non-voice input component; obtain an indication of a second time at which the natural language utterance was received by the voice input component; determine that the non-voice input and the natural language utterance are related and are to be interpreted together based on the first time and the second time; and responsive to determining that the non-voice input and the natural language utterance are related and are to be interpreted together based on the first time and the second time, perform the following steps: determine first context information relating to the non-voice input; determine second context information relating to the natural language utterance; determine an intent of the multi-modal user interaction based on the first context information and the second context information; identify a transaction lead based on the determined intent; and transmit the identified transaction lead to a user via the one or more electronic devices.
The system processes multi-modal user interactions in a natural language voice services environment, addressing the challenge of integrating and interpreting combined voice and non-voice inputs to derive meaningful actions. The system includes electronic devices equipped with voice and non-voice input components, such as microphones and touchscreens, to capture user interactions. When a user provides a non-voice input (e.g., a touch or gesture) and a related natural language utterance (e.g., a spoken command), the system records the timestamps of both inputs. By analyzing the temporal proximity of these inputs, the system determines if they are related and should be interpreted together. If so, it extracts context from both inputs—such as the type of non-voice interaction and the semantic content of the voice input—to determine the user's intent. Based on this intent, the system identifies a transaction lead (e.g., a product recommendation or service action) and transmits it to the user. This approach enhances user experience by seamlessly combining multiple input modalities to provide context-aware responses.
17. The system of claim 16 , wherein the one or more processors, the non-voice input component, and the voice input component are housed within a single electronic device.
This invention relates to an integrated electronic system for processing both voice and non-voice inputs. The system addresses the challenge of managing multiple input modalities in a compact, unified device, improving user interaction efficiency and reducing hardware complexity. The system includes one or more processors, a non-voice input component (such as a touchscreen, keyboard, or gesture sensor), and a voice input component (such as a microphone and speech recognition module). These components are housed within a single electronic device, eliminating the need for separate hardware or external processing units. The processors handle the coordination between the voice and non-voice inputs, ensuring seamless integration and real-time processing. The system may also include additional features like display interfaces, communication modules, and memory storage to support various applications. By consolidating these components into a single device, the invention simplifies user interaction while maintaining robust functionality for diverse input methods. This design is particularly useful in portable devices, smart home systems, and interactive interfaces where space and efficiency are critical.
18. The system of claim 16 , wherein the one or more processors are housed in a first electronic device, the non-voice input component is housed in a second electronic device, and the voice input component is housed in a third electronic device.
This invention relates to a distributed electronic system for processing both voice and non-voice inputs across multiple devices. The system addresses the challenge of integrating voice and non-voice inputs from separate devices into a cohesive processing framework, ensuring seamless interaction and data exchange. The system includes one or more processors configured to process voice and non-voice inputs, a voice input component for capturing spoken commands or queries, and a non-voice input component for receiving non-verbal inputs such as gestures, touch, or text. The processors analyze the inputs, determine their relevance, and execute corresponding actions. The system may also include a display for presenting output or feedback based on the processed inputs. A key feature is the distributed architecture, where the processors are housed in a first electronic device, the non-voice input component is in a second device, and the voice input component is in a third device. This modular design allows for flexible deployment across different devices, such as smartphones, tablets, or dedicated input peripherals, while maintaining synchronized operation. The system may further include communication interfaces to facilitate data exchange between the devices, ensuring real-time coordination of inputs and outputs. The invention enhances user interaction by enabling multi-modal input processing in a distributed environment.
19. The system of claim 16 , wherein the one or more processors are housed in a first electronic device, and wherein the non-voice input component and the voice input component are housed in a second electronic device.
This invention relates to a distributed electronic system for processing user inputs, addressing the challenge of integrating voice and non-voice inputs across separate devices. The system includes one or more processors configured to receive and process both voice and non-voice inputs, such as text, gestures, or touch inputs, to generate a unified output. The processors analyze the inputs to determine user intent, resolve ambiguities, and execute corresponding actions. The system may also include a natural language processing module to interpret voice commands and a context-aware module to adapt responses based on environmental or user-specific data. The processors can be housed in a first electronic device, such as a server or cloud-based system, while the input components (voice and non-voice) are housed in a second electronic device, such as a smartphone or wearable device. This separation allows for scalable processing and flexible deployment, enabling seamless interaction across multiple devices. The system may further include a communication interface to facilitate data exchange between the devices, ensuring real-time synchronization of inputs and outputs. The invention aims to enhance user experience by providing a cohesive, multi-modal input system that operates efficiently across distributed hardware.
20. The system of claim 16 , wherein the one or more processors and the non-voice input component are housed in a first electronic device, and wherein the voice input component is housed in a second electronic device.
This invention relates to a distributed electronic system for processing voice and non-voice inputs. The system addresses the challenge of integrating voice and non-voice inputs from separate devices while maintaining seamless functionality. The system includes one or more processors, a non-voice input component, and a voice input component. The non-voice input component captures non-voice data, such as text, gestures, or touch inputs, while the voice input component captures spoken commands or audio signals. The processors analyze and process these inputs to generate a unified output or perform a specific task. A key feature is the physical separation of components: the processors and non-voice input component are housed in a first electronic device, while the voice input component is housed in a second electronic device. This distributed architecture allows for flexible deployment, such as in wearable devices, smart home systems, or multi-device setups, where voice and non-voice inputs are processed collaboratively across devices. The system may also include communication interfaces to enable data exchange between the devices, ensuring synchronized operation. The invention improves user interaction by enabling multi-modal input handling across separate hardware components.
21. The system of claim 16 , wherein the one or more processors and the voice input component are housed in a first electronic device, and wherein the non-voice input component is housed in a second electronic device.
This invention relates to a distributed voice and non-voice input system for electronic devices. The system addresses the challenge of integrating voice and non-voice inputs across multiple devices, improving user interaction by allowing seamless input from different sources. The system includes one or more processors, a voice input component, and a non-voice input component. The voice input component captures spoken commands or queries, while the non-voice input component handles other forms of input, such as touch, gestures, or keyboard entries. The processors process these inputs to execute corresponding actions or commands. A key feature is the distribution of components across separate electronic devices. The voice input component and processors are housed in a first device, while the non-voice input component is in a second device. This separation enables flexible deployment, such as using a smartphone for voice input and a tablet for touch input, or a smart speaker for voice and a remote control for non-voice commands. The system ensures synchronized processing of inputs from both devices, allowing users to interact with multiple input methods without switching between devices. The invention enhances usability by leveraging the strengths of different input modalities and devices, providing a more intuitive and efficient user experience.
22. The system of claim 16 , wherein the non-voice input comprises a point of focus input on a display of the non-voice input component.
A system for processing non-voice inputs in a computing environment, particularly for devices with limited or no voice input capabilities, addresses the challenge of enabling efficient user interaction without relying on speech recognition. The system includes a non-voice input component that captures user inputs, such as touch, gesture, or other non-speech interactions, and processes these inputs to generate commands or data for the computing device. The system further includes a processing module that interprets the non-voice inputs and translates them into executable actions or data entries, ensuring seamless integration with the device's operating system or applications. In one embodiment, the non-voice input comprises a point of focus input on a display of the non-voice input component, where the user selects or interacts with specific on-screen elements to convey commands or data. This allows for precise control and interaction, particularly in environments where voice input is impractical or unavailable. The system may also include feedback mechanisms, such as visual or haptic responses, to confirm input recognition and execution. By providing an alternative to voice-based interaction, the system enhances accessibility and usability across various computing platforms, including mobile devices, wearables, and embedded systems.
23. The system of claim 16 , wherein the non-voice input comprises a highlighting of text on a display of the non-voice input component.
A system for processing non-voice inputs in a communication device includes a non-voice input component that captures user interactions, such as text highlighting on a display. The system interprets these inputs to generate commands or data for transmission to a remote device. The non-voice input component may include a touchscreen or other input mechanism that detects user selections, such as highlighting text, gestures, or other visual interactions. The system converts these inputs into a format compatible with the remote device, ensuring seamless integration with voice-based communication. The remote device processes the non-voice input alongside voice data, enabling a unified communication experience. The system may also include a voice input component for capturing spoken commands or messages, which are synchronized with the non-voice inputs. The overall system enhances communication by allowing users to convey information through both voice and non-voice means, improving clarity and efficiency in interactions. The highlighting of text on the display serves as a non-voice input method, allowing users to emphasize or select specific content for transmission or further processing. This feature is particularly useful in scenarios where visual context is critical, such as remote collaboration or real-time annotation.
24. The system of claim 16 , wherein the one or more physical processors are further programmed to: obtain preference information of a user, wherein the transaction lead is identified based further on the preference information.
A system for identifying and presenting transaction leads to users based on their preferences. The system operates in the domain of transaction facilitation, addressing the challenge of efficiently matching users with relevant transaction opportunities. The system includes one or more physical processors configured to collect and analyze user preference information, such as historical transaction data, user-provided inputs, or behavioral patterns. This preference information is used to refine the identification of transaction leads, ensuring they align with the user's interests or needs. The system may also incorporate additional data sources, such as market trends or external databases, to enhance the accuracy of lead identification. By leveraging preference information, the system improves the relevance and effectiveness of transaction leads, increasing the likelihood of successful user engagement. The system may be applied in various industries, including e-commerce, financial services, or real estate, where personalized transaction recommendations are valuable. The core innovation lies in dynamically adjusting lead identification based on user preferences, optimizing the transaction process for both users and providers.
25. The system of claim 16 , wherein the transaction lead comprises at least one of an advertisement or a recommendation related to the determined intent of the multi-modal user interaction.
The system is designed for processing multi-modal user interactions, such as those involving voice, text, or gestures, to determine user intent and generate relevant responses. The system analyzes input from multiple modalities to identify the user's intent, which may involve tasks like information retrieval, transaction processing, or content recommendations. Once the intent is determined, the system generates a transaction lead, which can include an advertisement or a recommendation tailored to the identified intent. This ensures that the system provides contextually relevant outputs, enhancing user engagement and satisfaction. The system may also integrate with external services or databases to fetch additional data needed to fulfill the user's intent, such as product details, pricing, or availability. By dynamically adapting to the user's input across different modalities, the system improves the accuracy and relevance of its responses, making it suitable for applications in customer service, e-commerce, or virtual assistants. The system's ability to process and respond to multi-modal inputs in real-time ensures a seamless and intuitive user experience.
26. The system of claim 16 , wherein the one or more physical processors are further programmed to: receive a further input after the transaction lead was transmitted; determine a second intent of the further input; and provide further information relating to the transaction lead based on the second intent.
This invention relates to a system for processing transaction leads, particularly in a financial or sales context, where the system dynamically responds to user inputs to provide relevant information. The system includes one or more physical processors configured to receive an initial transaction lead, analyze it to determine a first intent, and transmit the lead to a recipient based on that intent. The system further processes subsequent inputs related to the lead, determining a second intent from these inputs and providing additional information tailored to that intent. For example, if the initial lead involves a financial transaction, the system may first route it to an appropriate financial institution based on the transaction type. Later, if a user provides additional input—such as a request for details or a follow-up action—the system assesses this new input to determine the user's intent (e.g., seeking clarification, requesting modifications, or initiating a related process) and delivers further information or actions accordingly. This dynamic response mechanism ensures that the system adapts to evolving user needs, improving efficiency and accuracy in transaction processing. The system may also integrate with databases or external services to retrieve or generate the required information based on the determined intent.
27. The system of claim 16 , wherein the one or more physical processors are further programmed to: receive a further input after the transaction lead was transmitted; determine a second intent of the further input; and complete a purchase transaction in response to receiving the further input based on the determined second intent.
This invention relates to a system for processing transaction leads in a digital environment, particularly for completing purchases based on user intent. The system addresses the challenge of efficiently converting user interactions into completed transactions by dynamically responding to subsequent inputs and determining user intent to streamline the purchase process. The system includes one or more physical processors configured to receive an initial transaction lead, which may be generated from a user's interaction with a digital interface, such as a website or application. The processors analyze the lead to determine a first intent, which could involve identifying a user's interest in a product or service. Based on this intent, the system transmits a transaction lead to a relevant party, such as a merchant or payment processor, to initiate a transaction. The system further enhances this process by receiving additional inputs from the user after the initial lead is transmitted. These inputs are analyzed to determine a second intent, which may reflect a refined or updated desire to complete a purchase. If the second intent aligns with a transactional action, the system automatically completes the purchase without requiring further manual steps, thereby improving user experience and conversion rates. This dynamic intent-based processing ensures that transactions are completed efficiently and accurately, reducing friction in the purchasing process.
28. The system of claim 27 , wherein the further input comprises a second natural language utterance.
The invention relates to a system for processing natural language inputs, particularly in the context of interactive computing environments. The system is designed to handle user interactions where a first natural language input is received and processed to generate a response. The system then receives a second natural language input, which may be a follow-up or related query, and uses this additional input to refine or modify the initial response. This allows for more dynamic and context-aware interactions, improving the accuracy and relevance of the system's outputs. The system may incorporate machine learning models or other computational techniques to analyze the relationship between the first and second inputs, ensuring that the final response is coherent and contextually appropriate. This approach is particularly useful in applications such as virtual assistants, customer service bots, or any system requiring multi-turn dialogue processing. The invention addresses the challenge of maintaining context and coherence in natural language interactions, where follow-up inputs may depend on or modify the meaning of prior inputs. By dynamically adjusting responses based on subsequent inputs, the system provides a more intuitive and effective user experience.
29. The system of claim 27 , wherein the further input comprises a second non-voice input.
A system for processing user inputs in a voice-controlled environment includes a voice interface and a non-voice input interface. The system receives a primary voice command from a user and executes a corresponding action. Additionally, the system accepts a secondary input, which may be a non-voice input such as a gesture, touch, or button press, to modify or supplement the voice command. The secondary input can trigger additional functions, override default settings, or provide context for the primary command. For example, a user might issue a voice command to adjust a device setting, and a simultaneous non-voice input could specify the exact parameter to adjust or the duration of the adjustment. The system integrates these inputs to enhance control flexibility and precision, addressing limitations in voice-only interfaces where ambiguity or environmental noise may hinder accurate command interpretation. The non-voice input ensures more reliable and nuanced interactions, particularly in scenarios where voice commands alone may be insufficient or impractical.
30. The system of claim 16 , wherein the non-voice input component comprises a map display, and wherein the transaction lead is presented as a point on the map display.
This invention relates to a system for processing and displaying transaction leads, particularly in a spatial or geographic context. The system addresses the challenge of efficiently presenting and managing transaction leads, such as sales opportunities or service requests, in a way that enhances user interaction and decision-making. The system includes a non-voice input component that allows users to interact with the system without relying solely on spoken commands. This component includes a map display, which visually represents transaction leads as points on the map. By integrating transaction leads with a map interface, the system enables users to quickly identify and assess leads based on their geographic location. The map display may include additional features, such as zoom functionality, filtering options, or contextual information, to further assist users in analyzing and acting on the leads. The system may also incorporate other input methods, such as touch or gesture-based interactions, to provide a flexible and intuitive user experience. The overall goal is to improve the efficiency and effectiveness of lead management by leveraging spatial data visualization.
Unknown
February 4, 2020
Browse 5M+ US patents with plain-English claim translations and AI-generated analysis.