(Original) Implementing human-computer voice interaction with iFlytek Voice

Last Update: 2015-07-08 Source: Internet

Author: User

I'm currently working on an in-car project. One of the requirements is a spoken dialogue between the driver and the phone while driving: the whole process has to be hands-free, controlled by voice alone.

This is much like a conversation between a person and a robot: the robot stays on standby in the background and responds whenever the user speaks.

However, the user's battery is precious, so we can't keep the recorder listening all the time; that is very resource-intensive. Instead, I use the voice wake-up feature provided by iFlytek Voice.

So how is it done, specifically?

Let's look at a flowchart. It uses most of iFlytek's technologies (voice wake-up, voice wake-up + command word recognition, semantic recognition, and speech synthesis). Without further ado, here is the picture:

[Figure: flowchart of the voice interaction process]

The flowchart is already quite clear, so here is just a brief walkthrough.

Wake-up listening starts when the program starts. If the user says the wake-up word at this point, the machine wakes up and listens for a command. But if a message arrives in the meantime, the message takes priority: wake-up is paused while the message is read out, and it is turned back on once the readout has finished. An important reason for this is that wake-up holds the recording resource the whole time, and speech played back while it is running comes out choppy. I've heard this can be configured, but in practice users generally won't say the wake-up word during a readout anyway.
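The pause-and-resume rule above can be sketched roughly as follows. This is only an illustration of the control flow, not the iFlytek SDK: `WakeupController`, `broadcast`, and the `speak` callable are hypothetical names, and `speak` is assumed to block until the readout is finished. With a real speech synthesizer that reports completion through a callback, wake-up would be restarted in that callback instead.

```python
class WakeupController:
    """Tracks whether the wake-up listener currently holds the microphone.

    Hypothetical helper for illustration; it does not call any real SDK.
    """

    def __init__(self):
        self.listening = False
        self.log = []

    def start_wakeup(self):
        self.listening = True
        self.log.append("wakeup started")

    def stop_wakeup(self):
        self.listening = False
        self.log.append("wakeup stopped")

    def broadcast(self, message, speak):
        """Pause wake-up, read the message aloud, then resume wake-up."""
        was_listening = self.listening
        if was_listening:
            self.stop_wakeup()
        try:
            # Assumed to block until speech playback is complete.
            speak(message)
        finally:
            # Resume wake-up even if playback raised an error.
            if was_listening:
                self.start_wakeup()


spoken = []
controller = WakeupController()
controller.start_wakeup()
controller.broadcast("New message received", spoken.append)
```

Restoring the listener in a `finally` block means a failed readout does not leave the app deaf to the wake-up word.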

There are two wake-up modes: simple wake-up, and wake-up + command word recognition.

Simple wake-up just fires a callback when wake-up succeeds.

Wake-up + command word recognition does more than wake the machine: if you say a command together with the wake-up word, it recognizes the command as well. You can simply take that command and execute it, without having to start semantic recognition first, which is also very pleasant for users.

However, command words have a limitation: the grammar must be built beforehand, so the content of the command words must be known in advance. But what if the user says "How do I get to Shihezi University"? That command is not in the grammar file you built for your command words! What then? At that point you have to prompt the user to give the input again for semantic recognition.

So my idea here is: the user says the wake-up word together with a command, wake-up is triggered, and then the command word is recognized. If the command word is recognized, the command is executed; if command word recognition fails, semantic recognition is started.

One drawback: if the user says the wake-up word followed by content meant for semantic recognition, that content is consumed by command word recognition, and the user has to repeat it before it can be recognized semantically.

To get around this, after the wake-up word is recognized, if no command word is recognized, we use speech synthesis to prompt the user with "What can I help you with?". What this really means is that the machine did not recognize what the user just said and the user needs to say it again. Pretty sneaky of me, isn't it? O(∩_∩)O
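As a rough sketch of that decision, the handler below executes a known command directly and otherwise speaks the prompt and hands over to semantic recognition. Everything here is an assumption made for illustration: the `KNOWN_COMMANDS` table, the `on_wakeup` function, and the `speak` and `start_semantic_recognition` callables stand in for whatever the real grammar file and SDK callbacks provide.

```python
# Hypothetical command table; in the real app this comes from the grammar
# file that is built in advance for command word recognition.
KNOWN_COMMANDS = {
    "navigate home": "starting navigation to home",
    "play music": "playing music",
}

PROMPT = "What can I help you with?"


def on_wakeup(command_text, speak, start_semantic_recognition):
    """Handle the result of wake-up + command word recognition.

    command_text is the recognized command word, or None when the words
    after the wake-up word did not match the grammar.
    Returns the action taken, or None if we fell back to semantic input.
    """
    action = KNOWN_COMMANDS.get(command_text) if command_text else None
    if action is not None:
        # Command word recognized: execute it directly.
        return action
    # Not recognized: ask the user to speak again, then listen semantically.
    speak(PROMPT)
    start_semantic_recognition()
    return None


events = []
recognized = on_wakeup("play music", events.append,
                       lambda: events.append("<semantic recognition>"))
fallback = on_wakeup(None, events.append,
                     lambda: events.append("<semantic recognition>"))
```

In the second call nothing matched the grammar, so the only visible effects are the spoken prompt followed by the start of semantic recognition.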

The next step is semantic recognition. There is not much to say about it; the main question is, if the user doesn't speak, do you keep it in the recording state? Of course not, think how much power that would burn! To save the user's battery, I also designed a step in which, if the user says nothing for 20 s, the app automatically goes back to waiting for wake-up. Where do the 20 s come from? A timestamp! Record a timestamp every time a user command is recognized successfully or wake-up succeeds. Then, before starting semantic recognition the next time, first check whether the difference between the current time and the timestamp is greater than 20 s: if it is less than 20 s, start semantic recognition again; if it is more than 20 s, start wake-up and wait for the user to say the wake-up word.
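The timestamp check can be sketched like this. The class and method names (`SessionTimer`, `mark_success`, `next_state`) are hypothetical, and the clock is injectable only so that the behaviour can be checked without actually waiting. The text does not say what happens at exactly 20 s; this sketch assumes that case goes back to wake-up.

```python
import time

IDLE_TIMEOUT_SECONDS = 20


class SessionTimer:
    """Decides whether to keep listening or fall back to wake-up mode."""

    def __init__(self, clock=time.monotonic):
        self._clock = clock
        self._last_success = None

    def mark_success(self):
        """Call when wake-up succeeds or a user command is recognized."""
        self._last_success = self._clock()

    def next_state(self):
        """Return 'semantic' to keep listening, or 'wakeup' to go idle."""
        if self._last_success is None:
            return "wakeup"
        elapsed = self._clock() - self._last_success
        # Assumption: exactly 20 s is treated as a timeout.
        return "semantic" if elapsed < IDLE_TIMEOUT_SECONDS else "wakeup"


# Simulated clock, so the 20 s rule can be exercised instantly.
now = [100.0]
timer = SessionTimer(clock=lambda: now[0])
timer.mark_success()        # wake-up succeeded at t = 100
now[0] = 115.0
after_15s = timer.next_state()
now[0] = 121.0
after_21s = timer.next_state()
```

A monotonic clock is used by default so that changes to the device's wall-clock time cannot shorten or stretch the idle window.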

All right, that's about it. Don't ask me why I don't just make the user say the wake-up word every time before executing a command. If you had to say the wake-up word before every single thing you did, I guess you would go crazy; and even if you didn't, other people would think you had. No offense meant, just joking, O(∩_∩)O haha

My GitHub address: https://github.com/dongweiq/study

Follows and stars are welcome O(∩_∩)O. If you have any questions, please contact [email protected] QQ: 714094450

This article is an English version of an article which is originally in the Chinese language on aliyun.com and is provided for information purposes only. This website makes no representation or warranty of any kind, either expressed or implied, as to the accuracy, completeness ownership or reliability of the article or any translations thereof. If you have any concerns or complaints relating to the article, please send an email, providing a detailed description of the concern or complaint, to info-contact@alibabacloud.com. A staff member will contact you within 5 working days. Once verified, infringing content will be removed immediately.
