What does it mean to read lip tones? I can see where you are with a bag of potato chips.

Source: Internet
Author: User
Keywords What potato chips watch tone a bag
Tags abstract course environment high it is read speed video
Abstract: Some people with super ability can see the mouth to guess what the other said, of course, such people are very rare. But if someone can look at a bag of potato chips in a soundproof environment and restore what you're saying, do you believe it? What the hell are you talking about?

Some people with super ability can see the mouth and guess what the other person said, of course, such people are very rare. But if someone can look at a bag of potato chips in a soundproof environment and restore what you're saying, do you believe it?

What the hell are you talking about? Isn't it more outrageous than casting pearls on the wall?

Researchers at MIT, Microsoft and Adobe have done something seemingly super outrageous. The mystery is the study of vibrations. The researchers were able to restore the sound signals in the environment by analyzing the tiny vibrations in the video produced by the sound in the body. In one group of experiments, the researchers, in the case of soundproofing, made a speech sound by shooting the vibration of a potato bag from a high-speed camera 15 feet away. In addition to the potato bag, the researchers also conducted experiments on aluminum foil, water cups and even potted plants, all of which received good results.

The principle is that sound travels to an object to cause vibrations, and this vibrational motion creates a very subtle visual signal that is invisible to the naked eye. But the computer can capture the premise of capturing video at frequencies higher than the audio--in the experiment, the researchers used the high speed camera's FPS (the number of frames per second) reached 2000~6000 (average smartphone video fps is typically 60, High-end commercial high-speed cameras can reach 100000FPS.

Of course, this high-speed camera is not what ordinary people can have. But the researchers then experimented with ordinary digital cameras. Using a bizarre design of most camera sensors, the researchers succeeded in inferring high-frequency vibration information at 60FPS frequencies. Although this reduction is not as good as a high-speed camera, it is enough to identify how many people speak, male or female, and even have enough accurate speakers ' acoustic characteristics.

Clearly, this ability has extensive use in legal forensics and criminal investigation. In turn, given the different vibrational modes of different objects/objects to sound, this characteristic can give birth to a new kind of imaging technology. The interesting thing about science is that the first thing you study about it is cool, but others keep coming up with new uses.


Related Article

Contact Us

The content source of this page is from Internet, which doesn't represent Alibaba Cloud's opinion; products and services mentioned on that page don't have any relationship with Alibaba Cloud. If the content of the page makes you feel confusing, please write us an email, we will handle the problem within 5 days after receiving your email.

If you find any instances of plagiarism from the community, please send an email to: info-contact@alibabacloud.com and provide relevant evidence. A staff member will contact you within 5 working days.

A Free Trial That Lets You Build Big!

Start building with 50+ products and up to 12 months usage for Elastic Compute Service

  • Sales Support

    1 on 1 presale consultation

  • After-Sales Support

    24/7 Technical Support 6 Free Tickets per Quarter Faster Response

  • Alibaba Cloud offers highly flexible support services tailored to meet your exact needs.