- First of all, why read this paper?
- If I'm not mistaken, this paper does cross-lingual adaptation based on DNNs. DNNs are still very hot right now, so if DNNs can be used for cross-lingual adaptation, there is certainly a future in it.
- The paper mentions that training uses the Theano library, which I have touched a little before, training on GTX 690 GPUs; that is, they did not write the GPU code themselves.
- This paper does cross-lingual adaptation for ASR; let's see whether something can be borrowed from ASR for the synthesis side.
- Introduction
First paragraph:
In cross-lingual automatic speech recognition (ASR), models applied to a target language are enhanced using data from a different source language.
- I strongly feel that my previous reading was too narrow; I should read a wider range of papers. At least I now know that in cross-lingual adaptation for synthesis nobody has used DNNs yet, while in cross-lingual recognition many people have. If I can learn something from ASR, I can certainly produce a good paper.
- Suppose you now have 1000 sentences of Cantonese and train an ASR model to recognise Cantonese.
- If there is also a corpus of 1000 English sentences, English is called the source language and Cantonese the target language.
- Then this 1000-sentence English corpus is used to retrain the previously trained ASR model, which strengthens the model.
- In this scenario, the target language is typically low-resourced: transcribed acoustic training data for the target language may be difficult or expensive to acquire.
- There is very little target-language data,
- and it is very difficult to record training data in the target language.
- That is, target-language data is hard to obtain and only available in small amounts, while source-language data is very easy to obtain.
- The cross-lingual approach is motivated by the fact that the source language data, despite being mismatched to the target, may capture common properties of the acoustics of speech which are shared across languages, improving the generalisation of the final models to unseen speakers and conditions.
- What motivates the cross-lingual approach?
- The source-language data can capture common acoustic properties that are shared across languages.
- To put it another way, since that sentence reads awkwardly: although the source language (English) and the target language (Cantonese) are different languages, they still share some acoustic characteristics, while other acoustic characteristics are unique to each language.
- It is these acoustic characteristics shared between different languages that improve the generalisation of the final model.
Second paragraph:
- Cross-lingual ASR may be viewed as a form of adaptation.
- Cross-lingual ASR can be thought of as a form of adaptation.
- What does that mean?
- Adaptation is a broad concept, which includes:
- cross-lingual ASR
- cross-lingual synthesis
- .....
- In contrast to domain or speaker adaptation, the major problem with cross-lingual adaptation arises from the differences in phone sets between the source and target languages.
- Compared with domain adaptation or speaker adaptation,
- what causes the main problem in cross-lingual adaptation?
- It is caused by the differences between the phone sets of the source and target languages.
- Even when a universal phone set is used, it has been found that realisations of what are ostensibly the same phone still differ across languages [1].
- Even when a common phone set is used,
- realisations of what is ostensibly the same phone have still been found to differ across languages.
- In this paper, we focus on approaches where source and target languages are assumed not to share a phone set, which is probably a valid assumption when a small number of source languages is used, which is unlikely to provide complete phone coverage for an arbitrary target language.
- The authors' approach assumes that the source and target languages do not share a phone set.
- This is probably a valid assumption when only a small number of source languages is used, because in that case the source languages are unlikely to provide complete phone coverage for an arbitrary target language.
Third paragraph:
- Arguably the simplest approach to the problem of cross-lingual phone set mismatch is to define a deterministic mapping between source and target phone sets [2], which may be estimated in a data-driven fashion [3].
- There are several ways to solve the cross-lingual phone-set mismatch.
- One simple approach is to define a deterministic mapping between the source and target phone sets.
- This seems to be a commonly used method in synthesis too; isn't the state mapping I am doing now exactly this kind of thing?
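As a concrete illustration of what a data-driven deterministic mapping could look like (this is my own toy sketch, not the actual method of [2] or [3]; the phone labels and the counting scheme are invented): decode target-language speech with the source-language recogniser, then map each target phone to the source phone it is most often confused with.

```python
from collections import Counter, defaultdict

def estimate_phone_mapping(aligned_pairs):
    """Estimate a deterministic target->source phone mapping from
    frame-level (target_phone, source_phone) alignment pairs by picking,
    for each target phone, the most frequently co-occurring source phone.
    A toy data-driven scheme, only in the spirit of [3]."""
    counts = defaultdict(Counter)
    for tgt, src in aligned_pairs:
        counts[tgt][src] += 1
    return {tgt: c.most_common(1)[0][0] for tgt, c in counts.items()}

# Toy alignment: Cantonese-like target phones decoded with an English
# source-language recogniser (all labels here are made up).
pairs = [("oe", "er"), ("oe", "er"), ("oe", "ah"),
         ("ts", "ch"), ("ts", "ch"), ("ts", "s")]
mapping = estimate_phone_mapping(pairs)
print(mapping)  # {'oe': 'er', 'ts': 'ch'}
```

Once estimated, every target phone is simply relabelled as its mapped source phone, which is what makes the mapping "hard".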
However, this hard mapping leads to a loss of information from the target language acoustics that cannot be represented by a single source language phone.
- However, this forced mapping leads to a loss of information: some target-language acoustics cannot be represented by any single source-language phone.
An alternative is to learn a probabilistic mapping, in which the distribution of target phonemes is expressed over a feature space comprising source language phone posterior probability estimates, which may be formulated as a product-of-experts model [4] or as a KL-HMM [5].
- Another method is a probabilistic mapping:
- the distribution of each target phoneme is expressed over a feature space made up of source-language phone posterior probability estimates.
- The two concrete formulations are:
- the product-of-experts model
- the KL-HMM model
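A minimal sketch of the KL-HMM idea as I understand it (the function names and the toy numbers are my own, not from [5]): each target-language HMM state stores a categorical distribution over source-language phones, estimated here as the arithmetic mean of the source-phone posterior vectors aligned to that state, and incoming posterior frames are scored against it by KL divergence.

```python
import math

def estimate_state_distribution(posterior_frames):
    """KL-HMM style: the parameter of a target HMM state is a categorical
    distribution over source-language phones, estimated here as the
    arithmetic mean of the posterior vectors aligned to the state."""
    dim = len(posterior_frames[0])
    return [sum(f[d] for f in posterior_frames) / len(posterior_frames)
            for d in range(dim)]

def kl_divergence(p, q, eps=1e-10):
    """KL(p || q): the local score used to match an incoming posterior
    frame against a state's categorical distribution."""
    return sum(pi * math.log((pi + eps) / (qi + eps)) for pi, qi in zip(p, q))

# Toy setup: 3 source phones; two posterior frames aligned to one target state.
frames = [[0.7, 0.2, 0.1], [0.5, 0.3, 0.2]]
state_dist = estimate_state_distribution(frames)   # approx. [0.6, 0.25, 0.15]
close = kl_divergence([0.6, 0.25, 0.15], state_dist)
far = kl_divergence([0.1, 0.1, 0.8], state_dist)
assert close < far  # a matching frame scores a lower divergence
```

The appeal for the low-resource setting is that the source-language DNN supplies the posterior features, so only the small categorical distributions per target state need target-language data.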
Here, the source languages are viewed as defining a low-dimensional subspace in which to estimate target language models.
- That is, the source languages are viewed as defining a low-dimensional subspace, in which the models of the target language are estimated.
- This was the motivation behind the work of [6], where a subspace GMM (SGMM) was used, in which the source languages define a subspace of full covariance Gaussians.
- This was inspired by the work of [6], which used a subspace GMM (SGMM),
- where the source languages define a subspace of full-covariance Gaussians.
- There is a lot of mathematical detail involved here.
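To make "a subspace of full-covariance Gaussians" concrete, here is the standard SGMM parameterisation as I recall it from Povey et al.'s work (the notation is my assumption, not taken from this paper): each state $j$ has a low-dimensional state vector $\mathbf{v}_j$, and globally shared parameters $\mathbf{M}_i$, $\mathbf{w}_i$, $\boldsymbol{\Sigma}_i$, which can be trained on the source languages, map it to state-specific Gaussians; only the $\mathbf{v}_j$ then need target-language data.

```latex
% Standard SGMM parameterisation (symbols assumed, not this paper's notation)
\begin{align}
  p(\mathbf{x} \mid j) &= \sum_{i=1}^{I} w_{ji}\,
      \mathcal{N}\bigl(\mathbf{x};\ \boldsymbol{\mu}_{ji},\ \boldsymbol{\Sigma}_i\bigr), \\
  \boldsymbol{\mu}_{ji} &= \mathbf{M}_i \mathbf{v}_j, \qquad
  w_{ji} = \frac{\exp\bigl(\mathbf{w}_i^{\top} \mathbf{v}_j\bigr)}
                {\sum_{i'=1}^{I} \exp\bigl(\mathbf{w}_{i'}^{\top} \mathbf{v}_j\bigr)}.
\end{align}
```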
"Cross-lingual adaptation with multi-task adaptive networks" (1)