Input method and Big data: Can you relate?

Source: Internet
Author: User

In the current information age, it is facing a potential load tendency. This tendency reflects the industry's so-called "big Data"-that is, the amount of data that is so large that traditional databases are hard to digest or even load. Then, the word "big data" has derived the meaning of a kind of technology that needs to use a large number of data operations.

So whenever we focus on some sort of data that contains billions of (or even trillions of) of records from networks and other sources, we're talking about "big data." In many cases, however, we have overlooked how much the word "big data" is associated with the use of the term "large" in the daily contact with technology, such as automatic error correction on mobile devices, word processing software, and email clients.

Automatic error correction and word recommendation

Automatic error correction is sometimes wrong, so many sites are dedicated to stickers that show "modified results" that are funny (and often violent). Nonetheless, the ability to automatically correct misspellings on mobile devices and even predict the next word is still more useful than expected, after all, more often than not, it avoids many awkward situations that may be caused by finger typing.

But sometimes I get goose bumps when I think about it. The smart machine can list all the word results that may be generated, based on any alphabetical order you type. Considering the feasibility of foreign language programming and the sliding input function of most smartphones, it can be said that the almost limitless number of combinations of words produced is really "big data".

Word recommendation and Auto-error correction are based on a smart phone's own algorithm. Basically, it can compare your typing to a built-in dictionary and find out what's different from the spelling. And its built-in dictionaries tend to cover multiple languages. For example, every time I type a foreign letter, the phone will provide the corresponding dictionary of the Association and error correction function.

When the type of content matches the dictionary, the hand displays the words that might match it. If the suggested words are correct, this feature will greatly shorten the typing time and improve communication efficiency. If the results are not available, the mobile phone program will continue to provide more words to choose, some of the grammar is correct, some are reasonable, some even you have not thought, the rest of the basic is online those "phone input method funny pictures" material.

Find the perfect match

How to type what kind of recommendation, this is one of the many challenges programmers face, including the following two points:

1, the system to make a comprehensive thesaurus-the thesaurus of the function can not be discounted due to the development of the times, but should be with the times, with high practical, such as to include easy to appear in text messages in the network popular language.

2. Finalize a language model without obvious flaws-the model should be able to verify the words entered by the user and give reasonable advice on possible errors.

That is to say, if you type "Taxos", then is "taxis" or "tacos"? Your input keyboard will provide both recommendations. But if you want to enter "taxes", then you need the input method known context meaning, such as "there's nothing sure but death and ..." to correctly give taxes. If there is no contextual hint, but simply wrong to lose the Taxos, unless the use of very mature input method, generally can only get "taxis", "tacos", or "Taxos" recommendation. Of course, the vast majority of people who use automatic error correction and Word association will still be amazed by its accuracy.

How input methods know

The Google search engine's spell checker Records learning user preferences and corrects search results accordingly. However, the vast majority of phone keyboards are not so smart, in part because collecting user typing habits and generating databases is likely to violate personal privacy.

A thesaurus used for automatic error correction generally obtains vocabulary from a collection of patent-restricted essays. Programmers have devised a series of algorithms for identifying commonly used syntax, the main usage and repetition of a word, spelling, and possible letter strings, and, of course, alphabetic errors that often occur due to keyboard layout.

Even so, your phone will learn how to fix your words after automatic correction. This is more common when entering proper nouns such as business terminology and newborn words.

Without big data on the management of large-volume word combinations, the smart input method of business intelligence platform is irrelevant. However, big data can also make the keyboard more intelligent than the existing features. As technology matures, mobile phones can store more information, and mobile-phone thesaurus will become larger and more refined.

Input method and Big data: Can you relate?

Related Article

Contact Us

The content source of this page is from Internet, which doesn't represent Alibaba Cloud's opinion; products and services mentioned on that page don't have any relationship with Alibaba Cloud. If the content of the page makes you feel confusing, please write us an email, we will handle the problem within 5 days after receiving your email.

If you find any instances of plagiarism from the community, please send an email to: info-contact@alibabacloud.com and provide relevant evidence. A staff member will contact you within 5 working days.

A Free Trial That Lets You Build Big!

Start building with 50+ products and up to 12 months usage for Elastic Compute Service

  • Sales Support

    1 on 1 presale consultation

  • After-Sales Support

    24/7 Technical Support 6 Free Tickets per Quarter Faster Response

  • Alibaba Cloud offers highly flexible support services tailored to meet your exact needs.