Xun Fei Reading voice re-engraving: let "black technology" preserve your voice

Source: Internet
Author: User

Summary: With artificial intelligence and the mobile internet developing rapidly, product voices are becoming increasingly homogeneous, so giving a product a distinctive voice, one with warmth, has become particularly important. To this end, iFlytek has launched a voice re-engraving feature in its products, which uses deep-learning-based speech synthesis to build a customized voice library for an individual.

The voice re-engraving feature in Xun Fei Reading is built on world-leading intelligent speech synthesis and personalization technology. A user only needs to record a short voice sample to obtain a complete voice library that can render any text in their own voice, for eventual use in Shuwang reading and other fields.

The first beta version of the Xun Fei Reading voice re-engraving feature was released in November 2017. It is the world's first personalized speech synthesis application aimed at a broad user base, bringing sophisticated speech synthesis technology to ordinary users as a truly "civilian" product for customizing a personal voice library. Coincidentally, Microsoft launched its custom voice synthesis platform in May 2018, also with the aim of making voice customization technology accessible to everyone. In intelligent speech innovation, iFlytek seems to be thinking along the same lines as Microsoft, Google and other technology giants, but its foresight and results deserve more attention: the voice re-engraving feature needs only 10 recorded sentences to re-engrave a personal voice, about 1% of the industry average (far below Microsoft's 500 sentences and the industry's typical thousand sentences).

Xun Fei Reading is the product of long-term technical accumulation. iFlytek has focused on intelligent speech technology for 20 years and has internationally leading results in speech synthesis, speech recognition, spoken-language evaluation, natural language processing and other technologies. Competing against international giants, iFlytek has won the international speech synthesis competition (Blizzard Challenge) for 13 consecutive years, and was selected as one of the "smartest 50 companies in the world" by MIT Technology Review. In working with enterprises and the media, iFlytek has built a number of well-known cases. Those most familiar to the public include the custom celebrity voice libraries used for navigation announcements, the custom voice library that "resurrected" the voice of Mr. Li Yi in "Innovation China", and the virtual presenter solutions used by CCTV and other radio and television broadcasters. This experience has made iFlytek familiar with voice library customization and application technology, allowing it to generalize its industrial solutions and let individuals customize their own personal voice library.

A personal voice library created with the Xun Fei Reading voice re-engraving feature can in future be used in car navigation, gaming and entertainment, smart homes, early education toys and other areas of daily life. Imagine that in the near future your girlfriend's voice in the smart speaker wakes you up every day, reads you the weather forecast, and reminds you to dress warmly and take an umbrella; in the in-vehicle navigation system, the voices of your wife and children accompany you, prompting you to drive safely and come home early; or you put your own voice into a smart toy and let it tell stories for you and keep your child company at bedtime, so that even when you are far away you can still be with your family!

iFlytek's vision is to build a better world with artificial intelligence. The ultimate purpose of artificial intelligence is to meet individuals' differentiated needs and deliver personalized services, providing a solid foundation for individual self-realization.

The voice re-engraving feature in Xun Fei Reading is currently available on a limited basis. As high-quality software and hardware that use personalized voices join in, a global ecosystem of personalized voice libraries will gradually take shape. Pop art leader Andy Warhol once said, "Everyone can be famous for 15 minutes." Xun Fei Reading can re-engrave a beautiful voice and turn your voice into your brand asset, so that the good times worth preserving stay with you forever!
