10839800

Information Processing Apparatus

PublishedNovember 17, 2020
Assigneenot available in USPTO data we have
Technical Abstract

Patent Claims
14 claims

Legal claims defining the scope of protection. Each claim is shown in both the original legal language and a plain English translation.

Claim 1

Original Legal Text

1. An information processing apparatus comprising: a choice presentation unit configured to present a plurality of choices to a user; a speech recognition unit configured to recognize utterance contents of the user for selecting one of the plurality of choices; a selection result specification unit configured to specify the choice selected by the user based on whether or not a phrase included in the recognized utterance contents of the user corresponds to a phrase included in a dictionary corresponding to each of the plurality of choices prepared in advance; and an outputting unit configured to calculate a feature amount of a sound signal including the utterance of the user to evaluate an emotion of the user and perform outputting according to a result of the evaluation of the emotion and the choice selected by the user.

Plain English Translation

This invention relates to an information processing apparatus designed to enhance user interaction by analyzing both the content and emotional tone of spoken responses. The apparatus addresses the challenge of accurately interpreting user selections from multiple choices while also assessing the user's emotional state based on their speech. The system presents a set of choices to the user and uses speech recognition to capture their verbal selection. A selection result specification unit then determines the chosen option by comparing phrases in the recognized speech against predefined dictionaries associated with each choice. Additionally, the apparatus evaluates the user's emotion by analyzing acoustic features of the sound signal, such as pitch, volume, or speech patterns. Based on this emotional evaluation and the selected choice, the system generates an appropriate output, which may include tailored responses, feedback, or further actions. This approach improves user experience by combining semantic understanding with emotional context, making interactions more intuitive and responsive. The invention is particularly useful in applications like voice assistants, customer service systems, or interactive interfaces where accurate interpretation of user intent and emotional state is critical.

Claim 2

Original Legal Text

2. The information processing apparatus according to claim 1 , wherein the dictionary corresponding to each of the plurality of choices includes at least one of phrases relating to the phrase of the choice and phrases obtained by translating the phrase of the choice into a different language.

Plain English Translation

This invention relates to an information processing apparatus designed to enhance user interaction by dynamically generating and displaying contextually relevant phrases for multiple choices in a user interface. The apparatus addresses the challenge of providing users with meaningful and varied options when selecting or inputting information, particularly in multilingual or context-sensitive applications. The apparatus includes a display unit that presents a plurality of choices to a user, each associated with a phrase. A dictionary storage unit stores dictionaries corresponding to each choice, where each dictionary contains phrases related to the choice's phrase or translations of that phrase into different languages. A dictionary selection unit dynamically selects a dictionary based on the user's interaction, such as selecting a choice or entering text. A phrase selection unit then retrieves phrases from the selected dictionary to present additional options or suggestions to the user. This system improves user experience by offering contextually appropriate phrases, reducing input effort, and supporting multilingual interactions. The dynamic selection of dictionaries ensures that the displayed phrases remain relevant to the user's current context, whether for language translation, autocomplete, or choice refinement. The apparatus is particularly useful in applications requiring natural language processing, such as chatbots, translation tools, or interactive forms.

Claim 3

Original Legal Text

3. The information processing apparatus according to claim 1 , wherein the choice presentation unit adds, to each of the plurality of choices, a label for identifying the choice and presents the choices to the user, and the dictionary corresponding to each of the plurality of choices includes a phrase indicative of a label added to the choice.

Plain English Translation

This invention relates to an information processing apparatus designed to enhance user interaction with multiple-choice selections. The apparatus addresses the challenge of efficiently presenting and managing choices in a user interface, particularly when dealing with complex or numerous options. The system includes a choice presentation unit that displays a set of choices to a user, each labeled with a unique identifier for easy recognition. Each choice is linked to a corresponding dictionary containing phrases that describe or relate to the label assigned to that choice. This structure allows the apparatus to dynamically associate additional context or metadata with each selection, improving clarity and usability. The apparatus may also include a dictionary management unit that organizes and updates these dictionaries, ensuring that the phrases remain relevant and accurate. By integrating labels and dictionaries, the system streamlines the selection process, making it easier for users to navigate and understand their options. This approach is particularly useful in applications requiring structured data input, such as surveys, forms, or decision-making tools, where clear and organized choices are essential for effective user engagement.

Claim 4

Original Legal Text

4. The information processing apparatus according to claim 1 , wherein the dictionary corresponding to each of the plurality of choices includes a phrase indicative of a displaying mode of the choice.

Plain English Translation

This invention relates to an information processing apparatus designed to enhance user interaction by dynamically displaying choices in a user interface. The apparatus addresses the problem of static or inflexible choice presentation, which can lead to poor user experience or inefficient decision-making. The apparatus includes a display unit that presents multiple choices to a user, where each choice is associated with a dictionary containing metadata. This metadata includes a phrase that specifies how the choice should be displayed, allowing for customization of appearance, formatting, or behavior. The apparatus also includes a control unit that processes user input to select a choice and retrieves the corresponding dictionary to determine the appropriate display mode. This ensures that each choice is presented in a way that aligns with its intended purpose or context, improving usability and clarity. The invention may be applied in systems requiring adaptive user interfaces, such as software applications, interactive kiosks, or virtual assistants, where dynamic presentation of options enhances user engagement and efficiency.

Claim 5

Original Legal Text

5. The information processing apparatus according to claim 4 , wherein the phrase indicative of the displaying mode of the choice includes at least one of a displaying position, a displaying order, and a displaying color of the choice.

Plain English Translation

This invention relates to an information processing apparatus that enhances user interaction by dynamically adjusting the display of choices based on user behavior. The apparatus monitors user actions, such as selection patterns or input timing, to determine an optimal display mode for presenting choices. The display mode includes visual attributes like position, order, and color of the choices to improve usability and efficiency. For example, frequently selected options may be highlighted or positioned prominently, while less common choices may be deprioritized. The apparatus also adapts to user preferences over time, refining the display to align with individual usage habits. This approach reduces cognitive load and speeds up decision-making by presenting choices in a contextually relevant manner. The system may be applied in interfaces like menus, forms, or recommendation systems where adaptive presentation improves user experience. The invention addresses the problem of static, one-size-fits-all interfaces that do not account for user variability, leading to inefficiencies and frustration. By dynamically adjusting display attributes, the apparatus ensures choices are presented in a way that aligns with user expectations and behavior.

Claim 6

Original Legal Text

6. The information processing apparatus according to claim 1 , wherein, where the phrase included in the utterance contents coincides with some of the phrases included in the dictionary, the selection result specification unit decides that both of the phrases correspond to each other.

Plain English Translation

This invention relates to an information processing apparatus designed to improve the accuracy of phrase matching in natural language processing, particularly in voice or text-based systems. The problem addressed is the difficulty in reliably determining whether phrases in user utterances match predefined phrases in a dictionary, which is critical for tasks like voice commands, chatbots, or search queries. The apparatus includes a selection result specification unit that evaluates whether a phrase from an utterance matches any phrase in a dictionary. When a match is found, the unit confirms that both phrases correspond to each other, enabling precise interpretation of user input. This matching process is essential for systems that rely on predefined phrase sets to trigger actions or retrieve information. The apparatus may also include components for preprocessing input data, such as noise reduction or normalization, to enhance matching accuracy. The dictionary may be dynamically updated or customized for specific applications, ensuring adaptability to different domains or user preferences. The invention aims to reduce false positives and negatives in phrase recognition, improving the reliability of automated systems that depend on natural language understanding.

Claim 7

Original Legal Text

7. The information processing apparatus according to claim 1 , wherein, where a displacement between the phrase included in the utterance contents and a phrase included in the dictionary corresponds to a pattern determined in advance, the selection result specification unit decides that both of the phrases correspond to each other.

Plain English Translation

This invention relates to an information processing apparatus designed to improve the accuracy of phrase matching in natural language processing, particularly when comparing phrases from spoken or written input against a reference dictionary. The core problem addressed is the ambiguity that arises when phrases in user utterances differ slightly from dictionary entries due to variations in wording, speech recognition errors, or informal language use. The apparatus includes a selection result specification unit that evaluates whether two phrases correspond to each other by analyzing their displacement patterns. If the displacement between the input phrase and a dictionary phrase matches a predefined pattern, the system determines they are equivalent. This approach allows for flexible matching beyond exact word-for-word comparisons, accommodating common linguistic variations while maintaining precision. The apparatus may also include components for extracting phrases from input data, generating candidate phrases from a dictionary, and calculating similarity scores between them. The predefined displacement patterns could include minor word order changes, synonym substitutions, or grammatical variations. By dynamically adjusting matching criteria based on these patterns, the system enhances the reliability of phrase recognition in applications like voice assistants, translation tools, or search engines.

Claim 8

Original Legal Text

8. The information processing apparatus according to claim 1 , further comprising: an outputting unit configured to calculate a feature amount of a sound signal including the utterance of the user to evaluate an emotion of the user and perform outputting according to a result of the evaluation of the emotion and the choice selected by the user.

Plain English Translation

This invention relates to an information processing apparatus designed to enhance user interaction by evaluating emotional states from speech and adapting responses based on both emotional analysis and user selections. The apparatus processes sound signals containing user utterances to extract feature amounts, which are analyzed to assess the user's emotional state. The system then generates outputs—such as responses, actions, or adjustments—based on this emotional evaluation combined with the user's explicit choices. This approach integrates affective computing with user input to create more personalized and contextually appropriate interactions. The technology addresses the challenge of static, one-size-fits-all responses in human-computer interfaces by dynamically incorporating emotional context alongside direct user preferences. The apparatus may be applied in virtual assistants, customer service systems, or healthcare tools where emotional awareness improves engagement and effectiveness. The core innovation lies in the fusion of emotion detection from speech with user-selected options to tailor system behavior, ensuring responses align with both cognitive and affective user states.

Claim 9

Original Legal Text

9. The information processing apparatus according to claim 1 , wherein the feature amount includes an elapsed time period until the user performs utterance for selecting one of the plurality of choices after the plurality of choices are presented to the user, and, when the emotion of the user is to be evaluated using the elapsed time period, the outputting unit varies an evaluation reference according to a presentation order when the choice specified by the selection result specification unit is presented to the user.

Plain English Translation

This invention relates to an information processing apparatus that evaluates a user's emotional state based on their interaction with a system presenting multiple choices. The apparatus measures the time a user takes to select an option after choices are displayed, using this elapsed time as a feature to assess the user's emotional response. The system adjusts the evaluation criteria for this time measurement based on the order in which the selected choice was presented to the user. For example, if a choice appears earlier in the sequence, the system may apply a stricter time threshold for determining a positive or negative emotional reaction, whereas a later-presented choice might allow a longer response time before triggering a negative evaluation. This dynamic adjustment accounts for cognitive biases, such as the tendency for users to favor earlier options, ensuring more accurate emotional assessments. The apparatus may also incorporate additional user features, like speech patterns or physiological data, to refine the evaluation. The system is designed for applications in human-computer interaction, such as voice assistants or decision-support tools, where understanding user emotions can improve responsiveness and user experience.

Claim 10

Original Legal Text

10. The information processing apparatus according to claim 1 , further comprising: a reproduction unit configured to reproduce speech to be presented to the user by voice, wherein, where the speech recognition unit detects the utterance of the user during the reproduction of the speech, the reproduction unit stops the reproduction of the speech.

Plain English Translation

This invention relates to an information processing apparatus designed to enhance user interaction by dynamically adjusting speech reproduction based on user input. The apparatus includes a speech recognition unit that monitors for user utterances and a reproduction unit that outputs speech to the user. During speech reproduction, if the speech recognition unit detects a user utterance, the reproduction unit immediately stops the speech output. This ensures that the user can interrupt the system at any time, preventing delays or disruptions in communication. The system is particularly useful in applications requiring real-time interaction, such as voice assistants, customer service systems, or interactive voice response (IVR) platforms, where seamless and responsive user engagement is critical. By pausing speech output upon detecting user input, the apparatus avoids overlapping speech and ensures clear, uninterrupted communication. The invention improves user experience by prioritizing user input over automated speech, reducing frustration and enhancing efficiency in voice-based interactions.

Claim 11

Original Legal Text

11. The information processing apparatus according to claim 10 , wherein the speech recognition unit determines whether or not the utterance of the user is to be detected during the reproduction of the speech in response to a length of the speech.

Plain English Translation

This invention relates to an information processing apparatus that enhances speech recognition during audio playback by dynamically adjusting detection based on speech length. The apparatus includes a speech recognition unit that analyzes user utterances during audio playback, such as voice guidance or synthesized speech. The key innovation is the ability to determine whether to detect user speech based on the length of the reproduced speech. For example, if the audio segment is short, the system may prioritize immediate recognition of user input, while longer segments may trigger delayed or conditional detection to avoid interference. The apparatus also includes a speech output unit that generates the audio playback and a control unit that manages the interaction between speech recognition and playback. The system ensures seamless user interaction by adapting recognition sensitivity to the context of the audio content, improving accuracy and responsiveness in applications like voice assistants, navigation systems, or multimedia interfaces. The invention addresses challenges in balancing real-time user input with uninterrupted audio playback, particularly in environments where background noise or overlapping speech could degrade recognition performance.

Claim 12

Original Legal Text

12. The information processing apparatus according to claim 10 , wherein the speech recognition unit varies a detection reference for the utterance of the user in response to an elapsed time period after the reproduction of the speech is started.

Plain English Translation

This invention relates to speech recognition in information processing systems, particularly for adjusting detection sensitivity based on elapsed time during audio playback. The system includes a speech recognition unit that dynamically modifies its detection criteria for user utterances in response to the duration since audio reproduction began. This addresses the problem of inconsistent speech recognition performance during interactive audio playback, where user responses may vary in timing and clarity. The speech recognition unit initially applies a strict detection reference to filter out background noise or unintended sounds at the start of playback. As time progresses, the detection criteria are relaxed to accommodate natural delays in user responses or to better capture speech that may be less distinct due to environmental factors. The system may also include a playback control unit that manages audio output and a response processing unit that interprets recognized speech to trigger subsequent actions. This adaptive approach improves accuracy in voice interaction systems by aligning detection sensitivity with expected user behavior patterns over time.

Claim 13

Original Legal Text

13. An information processing method comprising: presenting a plurality of choices to a user; recognizing utterance contents of the user for selecting one of the plurality of choices; specifying a choice selected by the user based on whether or not a phrase included in the recognized utterance contents of the user corresponds to a phrase included in a dictionary corresponding to each of the plurality of choices prepared in advance; and calculating a feature amount of a sound signal including the utterance of the user to evaluate an emotion of the user and performing outputting according to a result of the evaluation of the emotion and the choice selected by the user.

Plain English Translation

This invention relates to an information processing method that enhances user interaction by combining voice recognition with emotion analysis. The method addresses the challenge of accurately interpreting user selections from multiple choices while also assessing the user's emotional state during the interaction. The system presents a set of choices to a user and captures their spoken response. It then processes the utterance to determine which choice was selected by comparing recognized phrases against a predefined dictionary associated with each option. Additionally, the method analyzes the sound signal of the user's voice to extract features that indicate emotional state, such as tone or pitch. Based on the selected choice and the evaluated emotion, the system generates an appropriate output response. This approach improves user experience by tailoring interactions to both the user's explicit selection and their implicit emotional context, making it useful in applications like virtual assistants, customer service systems, or interactive interfaces where emotional awareness is beneficial. The method ensures accurate choice recognition while dynamically adapting to the user's emotional feedback.

Claim 14

Original Legal Text

14. A non-transitory, computer-readable information storage medium storing a program, the program when executed by a computer, causing the computer to implement actions, comprising: by a choice presentation unit, presenting a plurality of choices to a user; by a speech recognition unit, recognizing utterance contents of the user for selecting one of the plurality of choices; by a selection result specification unit, specifying the choice selected by the user based on whether or not a phrase included in the recognized utterance contents of the user corresponds to a phrase included in a dictionary corresponding to each of the plurality of choices prepared in advance; and by an outputting unit, calculating a feature amount of a sound signal including the utterance of the user to evaluate an emotion of the user and performing outputting according to a result of the evaluation of the emotion and the choice selected by the user.

Plain English Translation

This invention relates to a computer-implemented system for interactive user engagement, particularly in voice-based interfaces. The system addresses the challenge of accurately interpreting user selections from a set of choices while also assessing the user's emotional state through speech analysis. The program, when executed, presents multiple choices to a user and uses speech recognition to capture the user's spoken selection. A selection specification module determines the chosen option by comparing phrases in the recognized speech against a predefined dictionary linked to each choice. Additionally, the system analyzes the audio features of the user's utterance to evaluate their emotional state, such as tone or stress levels. Based on this emotional assessment and the selected choice, the system generates an appropriate output response. This approach enhances user interaction by combining selection accuracy with emotional context, improving responsiveness in applications like virtual assistants, customer service bots, or interactive voice response systems. The system ensures robustness by leveraging pre-prepared dictionaries for choice matching and advanced audio feature extraction for emotion detection.

Patent Metadata

Filing Date

Unknown

Publication Date

November 17, 2020

Inventors

Shinichi Honda
Megumi Kikuchi
Takashi Satake

Want to explore more patents?

Browse 5M+ US patents with plain-English claim translations and AI-generated analysis.

Citation & reuse

Analysis on this page is generated by Patentable — an AI-powered patent intelligence platform. AI-generated summaries, explanations, FAQs, and analysis may be reused with attribution and a visible link back to the canonical URL below. Patent abstracts and claims are USPTO public domain.

Cite as: Patentable. “INFORMATION PROCESSING APPARATUS” (10839800). https://patentable.app/patents/10839800

© 2026 Nomic Interactive Technology LLC. Machine-readable context available at /api/llm-context/10839800. See llms.txt for full attribution policy.

INFORMATION PROCESSING APPARATUS