Reference: http://blog.sina.com.cn/s/blog_923fdd9b0101flx1.html
Speech recognition via the Google Voice interface
A feature in a recent project required speech recognition, and it took a few days of tinkering to get it working. At first I had no clue where to start. The information I found online was a mess: either very old implementation methods or simple code snippets. So I decided to share my experience with everyone.
To implement the speech recognition process i
There is a good audio FAQ. If you have time, go and check it out.
Link: http://forum.doom9.org/showthread.php?t=68300
The following part is reproduced:
General Audio procedures FAQ
Cross-format techniques for encoding, decoding, and playback.
1. What is the difference between lossy, lossless, and uncompressed audio formats?
Uncompressed audio is simply audio without any compression applied to it. Uncompressed audio is commonly used in AV conversions in PCM or
Speech recognition through Google Voice interface
Recently, there is a need to implement the speech recognition function in the project. It took a few days to get it done. There was no clue at the beginning, and the information found on the Internet was messy, or very old implementation methods and some simple code snippets. So I decided to share my experience with you.
The process for implementing speech recognition in iOS is as follows:
Recording-> PCM format-> convert
Building on the last article, I added an interface for browsing local WAV files, with slight changes.
Program code
using System;
using System.Windows.Forms;
using System.Media;
namespace Alarmclock
{
    public partial class Form1 : Form
    {
        public Form1()
        {
            InitializeComponent();
        }
        SoundPlayer player = new SoundPlayer();
        private void Form1_Load(object sender, EventArgs e)
        {
            timer1.Start(); // Bind to Co
Calling winmm.dll from C# to play an audio file
The C# sample code below demonstrates how to use platform invoke services on the Windows platform to play a waveform sound file.
Example
This code example uses DllImport to import the PlaySound method entry point of winmm.dll as Form1 PlaySound(). The example has a simple Windows Form with a button. Clicking this button opens the standard Windows OpenFileDialog dialog box so that you can choose the file to be played. After the waveform file is selected, the PlaySound()
WAV: Lossless. It is a sound file format developed by Microsoft that conforms to the RIFF (Resource Interchange File Format) specification. It stores audio information resources for the Windows platform and is supported by Windows and its applications. The "*.WAV" format supports MS ADPCM, CCITT A-law, and other compression algorithms, as well as a variety of bit depths, sampling frequencies, and channel counts; the sta
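As a small illustration of the parameters a WAV file carries (channels, sample width, sampling frequency), the sketch below reads them back with Python's standard wave module. This is only a sketch under stated assumptions: the wave module handles uncompressed PCM data, not the compressed variants, and the helper names describe_wav and write_silence are hypothetical, introduced here for the example.

```python
import os
import tempfile
import wave


def describe_wav(path):
    """Return the basic PCM parameters stored in a WAV file's header."""
    with wave.open(path, "rb") as wav:
        return {
            "channels": wav.getnchannels(),
            "sample_width_bytes": wav.getsampwidth(),
            "sample_rate_hz": wav.getframerate(),
            "frames": wav.getnframes(),
        }


def write_silence(path, seconds=1, sample_rate=8000, channels=1, sample_width=2):
    """Write a PCM WAV file of silence so the example has something to read."""
    with wave.open(path, "wb") as wav:
        wav.setnchannels(channels)
        wav.setsampwidth(sample_width)
        wav.setframerate(sample_rate)
        # One frame holds one sample per channel; zero bytes give silence.
        wav.writeframes(b"\x00" * (seconds * sample_rate * channels * sample_width))


if __name__ == "__main__":
    demo_path = os.path.join(tempfile.mkdtemp(), "demo.wav")
    write_silence(demo_path)
    print(describe_wav(demo_path))
```

The number of frames multiplied by channels and sample width gives the size of the audio data in bytes.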
question mainly examines the examinee's proficiency in the standard library functions. When necessary, referencing the library functions can greatly simplify the workload of programming.
The most frequently used library functions include:
(1) strcpy
(2) memcpy
(3) memset
Question 6: The following table lists the known WAV file formats. Open a WAV file, organize the wav
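The table the question refers to is not reproduced here, so the sketch below assumes the commonly documented canonical 44-byte header of a plain PCM WAV file (RIFF chunk, "fmt " chunk of 16 bytes, then the "data" chunk). The function names parse_wav_header and build_pcm_header are hypothetical, and real files may carry extra chunks before "data" that this sketch does not handle.

```python
import struct

# Assumed layout: canonical 44-byte PCM WAV header, little-endian, no padding.
_HEADER = struct.Struct("<4sI4s4sIHHIIHH4sI")


def parse_wav_header(data):
    """Unpack the first 44 bytes of a PCM WAV file into a dictionary."""
    if len(data) < _HEADER.size:
        raise ValueError("need at least 44 bytes")
    (riff, riff_size, wave_id, fmt_id, fmt_size, audio_format, channels,
     sample_rate, byte_rate, block_align, bits_per_sample,
     data_id, data_size) = _HEADER.unpack_from(data)
    if riff != b"RIFF" or wave_id != b"WAVE" or fmt_id != b"fmt ":
        raise ValueError("not a RIFF/WAVE header")
    return {
        "audio_format": audio_format,      # 1 means uncompressed PCM
        "channels": channels,
        "sample_rate": sample_rate,
        "byte_rate": byte_rate,
        "block_align": block_align,
        "bits_per_sample": bits_per_sample,
        "data_id": data_id,
        "data_size": data_size,
    }


def build_pcm_header(channels, sample_rate, bits_per_sample, data_size):
    """Build a matching header, used here only to demonstrate the parser."""
    block_align = channels * bits_per_sample // 8
    return _HEADER.pack(
        b"RIFF", 36 + data_size, b"WAVE",
        b"fmt ", 16, 1, channels, sample_rate,
        sample_rate * block_align, block_align, bits_per_sample,
        b"data", data_size,
    )


if __name__ == "__main__":
    print(parse_wav_header(build_pcm_header(2, 44100, 16, 1000)))
```

In C, the same layout is usually expressed as a packed struct and filled with memcpy or a single read from the file.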
Basic APE knowledge:
1. Why APE? (In short, APE is a music file format that provides the highest quality.)
(1) CD file acquisition:
Several years ago, people listened to music on tape with a cassette recorder. Now there are many CDs and DVDs, and their sound quality is better than tape. However, the music on a CD cannot be recognized directly in Windows and copied straight to a hard disk (other formats can be recognized in Windows, such as data files, VCDs, and DVDs, but CD cannot be recognized d
using a large song database that has been labeled as a particular genre, but I want to keep this solution completely independent for the sake of simplicity. Finally, although there are many formats for digital music (MP3, WMA, M4A, OGG, etc.), in order to make it simple, here I just focus on the MP3 file. In fact, the algorithm developed in this paper can work well for other formats as long as the audio in this format can be converted to WAV format f
In the Silverlight 4 and WP7 programs, you need to use the XNA interface to play the bell sound in the background, instead of relying on a certain xaml page.
Add a reference to Microsoft.Xna.Framework.dll in the project.
After adding the reference, add the following two namespaces at the top of the code file with using directives:
using Microsoft.Xna.Framework.Audio;
using Microsoft.Xna.Framework;
This class only supports WAV files, so only music in
the dictionary, where each word is mapped to a corresponding sound sequence.
So, in addition to the voice data, you also need transcripts and two dictionaries. One is a table mapping each word to its pronunciation, and the other is a table of non-speech units, called the filler dictionary.
Training begins
The following two directories need to be prepared before training
etc
your_db.dic- Phonetic dictionary
your_db.phone- phoneset file
you
Note: The button images and audio files (*.wav) need to be downloaded online and substituted (they are not easy to upload here). If you run into any problem, leave a message ...
import java.applet.Applet;
import java.applet.AudioClip;
import java.awt.Color;
import java.awt.Font;
import java.awt.List;
import java.awt.event.ItemEvent;
import java.awt.event.ItemListener;
import java.awt.event.MouseEvent;
import java.awt.event.MouseListener;
import java.net.MalformedURL
Written in Python. The tool converts the two files INF.D and voice.d of a curl sound source into a UTAU sound source, and restores the WAV files and INF files as they were before packaging, so they can be analyzed with the curl source maker.
The unpacked folder can be dragged into the UTAU directory for use as a sound source, and it already includes oto. Generate FRQ files with UTAU's auto-generate FRQ function.
Tools and instructions for use: http://files.cnblogs.com/files/bdbmb/nn2utau.zip
In orde