9704484

Speech Recognition Method and Speech Recognition Device

Published: July 11, 2017
Assignee: not available in the USPTO data we have
Inventors: Shiro Iwai

Patent Claims
13 claims

Legal claims defining the scope of protection, as filed with the USPTO.

1. A system for recognizing speech, said system comprising: a microcomputer comprising a trigger generation section and a speech recognition section; the trigger generation section configured to generate a trigger signal based on at least a condition of present-absent of opening of a mouth; and the speech recognition section configured to, in response to the trigger signal, receive audio signals and start speech recognition relative to the received audio signals, wherein, when the trigger generation section generates the trigger signal based solely on the condition of present-absent of opening of the mouth, the trigger generation section generates the trigger signal for a predetermined time duration retroactively from a time point at which the condition of present-absent of opening of the mouth is present, and, when the trigger generation section generates the trigger signal based on the condition of the present-absent of opening of the mouth and generates another trigger signal based on present-absent of a change in a view direction of eyes, and/or present-absent of a change in an orientation of a face, the trigger generation section generates the trigger signal and the another trigger signal when one of the present-absent conditions is present, and when only the trigger signal based on the condition of the present-absent of opening of the mouth is generated, the speech recognition section utilizes the trigger signal as is, and when the another trigger signal based on the condition of the present-absent of the change in the view direction of eyes and/or the present-absent of the change in an orientation of the face is also generated, the speech recognition section utilizes the another trigger signal.
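The trigger selection described in claim 1 can be sketched roughly as follows. This is only an illustrative reading, not the patented implementation: the names `Trigger`, `mouth_trigger`, `select_trigger`, and `RETRO_SECONDS` are hypothetical, and "retroactively" is assumed to mean that the trigger's start time is moved back by the predetermined duration from the moment the mouth opening is detected.

```python
from dataclasses import dataclass
from typing import Optional

# Assumed value; the dependent claims give a range of 2-3 seconds.
RETRO_SECONDS = 2.0


@dataclass(frozen=True)
class Trigger:
    source: str        # hypothetical labels: "mouth" or "gaze_or_face"
    start_time: float  # time (s) from which audio is handed to the recognizer


def mouth_trigger(mouth_open_time: float, retro: float = RETRO_SECONDS) -> Trigger:
    """Mouth-only trigger, backdated by `retro` seconds (clamped at 0)."""
    return Trigger("mouth", max(0.0, mouth_open_time - retro))


def select_trigger(mouth: Optional[Trigger],
                   other: Optional[Trigger]) -> Optional[Trigger]:
    """Use the gaze/face trigger when one exists, else the mouth trigger as is."""
    return other if other is not None else mouth
```

For example, a mouth opening detected at 10.0 s yields a mouth trigger starting at 8.0 s, and that trigger is used unless a gaze or face-orientation trigger is also present.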

2. The speech recognition device of claim 1, wherein the speech recognition section is configured to perform the speech recognition based on the condition of present-absent of opening of the mouth, using the trigger signal in advance and, when an outcome of the speech recognition by the speech recognition section indicates an error, corrects the trigger signal, and the corrected trigger signal comprises a trigger signal that is generated for the predetermined duration of time retroactively from the time point at which the condition of present-absent of opening of the mouth is present.

3. The speech recognition device of claim 1, wherein the predetermined time is a period of 2-3 seconds.

4. A speech recognition device comprising: a microcomputer configured to generate a trigger signal based on at least a condition of present-absent of opening of a mouth, and in response to the trigger signal, receive audio signals and start speech recognition relative to the received audio signals, wherein, the microcomputer is further configured to generate another trigger signal based on present-absent of the change in a view direction of eyes and/or present-absent of the change in an orientation of the face, when the trigger signal based on the condition of the present-absent of opening of the mouth and the another trigger signal based on the condition of present-absent of the change in the view direction of eyes and/or present-absent of the change in an orientation of the face are generated, the microcomputer is configured to start speech recognition using the another trigger signal, and when an outcome of the speech recognition indicates an error, the microcomputer is configured to generate the trigger signal for a predetermined time duration retroactively from a time point at which the condition of present-absent of opening of the mouth is present, and restart the speech recognition with the trigger signal.
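The error-driven restart in claim 4 might look like the sketch below. It rests on assumptions: `recognize` is a hypothetical callable that takes a start time in seconds and returns recognized text, or `None` to signal an error; the 2.0 s default is simply one value inside the claimed 2-3 second range.

```python
from typing import Callable, Optional

RETRO_SECONDS = 2.0  # assumed; the dependent claims give 2-3 seconds


def recognize_with_fallback(
    recognize: Callable[[float], Optional[str]],  # hypothetical recognizer
    other_trigger_time: float,
    mouth_open_time: float,
    retro: float = RETRO_SECONDS,
) -> Optional[str]:
    """Start from the gaze/face trigger; on error, restart from a backdated mouth trigger."""
    result = recognize(other_trigger_time)
    if result is not None:
        return result
    # Recognition failed: regenerate the trigger `retro` seconds before
    # the mouth opening and run recognition again from that point.
    retro_start = max(0.0, mouth_open_time - retro)
    return recognize(retro_start)
```

The second pass only runs when the first one reports an error, so the cost of processing the earlier audio is paid only when the first trigger evidently started too late.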

5. The speech recognition device of claim 4, wherein the predetermined time is a period of 2-3 seconds.

6. A computer-implemented speech recognition method comprising the steps of: generating a trigger signal via a processor, said trigger signal based solely on motion of a first facial organ or based on the motion of the first facial organ and motion of a second facial organ different from the first facial organ; and starting speech recognition relative to audio signals via the processor in response to the trigger signal, wherein the first facial organ is a mouth, and when only the trigger signal based on the motion of the first facial organ is generated, the trigger signal, as is, is utilized to start speech recognition, and when the trigger signal based on the motion of the second facial organ is also generated, the trigger signal based on the motion of the second facial organ is utilized to start the speech recognition.

7. The computer-implemented speech recognition method of claim 6, wherein the second facial organ comprises an eye and/or a face.

8. The computer-implemented speech recognition method of claim 7, wherein the motion of the mouth is present-absent of opening of the mouth, the motion of the eye is present-absent of a change in a view direction of the eye, and the motion of the face is present-absent of a change in an orientation of the face.

9. A computer-implemented speech recognition method comprising the steps of: generating a trigger signal via a processor based on a condition of present-absent of motion of a mouth; taking in audio signals via the processor in response to the trigger signal and starting speech recognition relative to the audio signals taken in, wherein the trigger signal is generated from a predetermined time duration retroactively from a point in time at which the condition of present-absent of motion of the mouth is present, and when only the trigger signal based on the condition of the present-absent of opening of the mouth is generated, the speech recognition is started using the trigger signals as is, and when another trigger signal based on a condition of present-absent of change in a view direction of eyes and/or present-absent of change in an orientation of a face is also generated, the speech recognition is started using the another trigger signal.

10. The computer-implemented speech recognition method of claim 9, wherein, when the speech recognition encounters an error, the trigger signal is generated for the predetermined time duration retroactively from the time point at which the condition of present-absent of motion of the mouth is present.

11. The computer-implemented speech recognition method of claim 9, wherein the predetermined time duration is a period of 2-3 seconds.
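Starting recognition from a point 2-3 seconds before the mouth motion implies that recent audio is still available at that moment. The claims do not say how the audio is retained; one plausible approach, shown here purely as an assumption, is a rolling buffer of timestamped frames. The class `AudioHistory` and its methods are hypothetical names.

```python
from collections import deque
from typing import Deque, List, Tuple


class AudioHistory:
    """Keeps roughly the last `keep_seconds` of timestamped audio frames."""

    def __init__(self, keep_seconds: float = 3.0) -> None:
        self.keep_seconds = keep_seconds
        self._frames: Deque[Tuple[float, bytes]] = deque()

    def push(self, timestamp: float, frame: bytes) -> None:
        """Append a frame and drop frames older than the retention window."""
        self._frames.append((timestamp, frame))
        while self._frames and self._frames[0][0] < timestamp - self.keep_seconds:
            self._frames.popleft()

    def since(self, start_time: float) -> List[bytes]:
        """Return the buffered frames with timestamp >= start_time."""
        return [frame for t, frame in self._frames if t >= start_time]
```

With a 3-second window, a trigger backdated by 2 seconds can always be served from the buffer, since the requested start time never falls outside the retained history.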

12. A computer-implemented speech recognition method comprising the steps of: generating a trigger signal via a processor from a point in time at which a condition of present-absent of motion of a mouth is present; taking in audio signals via the processor in response to the trigger signal and starting speech recognition relative to the audio signals taken in; and judging via the processor whether an outcome of the speech recognition indicates an error, wherein, when the trigger signal based on the condition of the present-absent of opening of the mouth and another trigger signal based on a condition of present-absent of change in a view direction of eyes and/or present-absent of change in an orientation of a face are generated, the speech recognition is started using the another trigger signal, and when the outcome of the speech recognition indicates an error, the trigger signal is generated for a predetermined time duration retroactively from a point in time at which the condition of present-absent of motion of the mouth is present, and the speech recognition is restarted in response to the trigger signal.

13. The computer-implemented speech recognition method of claim 12, wherein the predetermined time duration is a period of 2-3 seconds.

Patent Metadata

Filing Date: Unknown
Publication Date: July 11, 2017
Inventors: Shiro Iwai

Citation & reuse

Analysis on this page is generated by Patentable — an AI-powered patent intelligence platform. AI-generated summaries, explanations, and analysis may be reused with attribution and a visible link back to the canonical URL below. Patent abstracts and claims are USPTO public domain.

Cite as: Patentable. “SPEECH RECOGNITION METHOD AND SPEECH RECOGNITION DEVICE” (9704484). https://patentable.app/patents/9704484
