Remark entry: Latent Semantic analysis (LSA)

Source: Internet
Author: User

The Latent Semantic Analysis (LSA) is also called latent Semantic indexing (LSI), which is to discover the potential meanings and concepts of these documents by analyzing the documents, latent That is, to establish the relationship between semantic (lexical family) and document potential meaning, It maps words and documents into a ' concept ' space and compares them within this space (note: a dimensionality reduction technique).

Latent Semantic Analysis (latent Semantic analyses), is a new branch of semantics. Traditional semantics usually study the meaning of words and words, as well as the relationship between word and word, like righteousness, synonyms, anti-righteousness and so on. Latent semantic analysis explores a relationship that is hidden behind words, which is not based on the definition of a dictionary, but rather as the most basic reference to the use of the word environment. This idea comes from a psychologist. They believe that hundreds of languages in the world should have a common, simple mechanism that allows anyone to master that language as long as they are grown up in a particular language environment. Under the guidance of this idea, people have found a simple mathematical model, the input of which is a library composed of documents written in any language, and the output is a mathematical expression (vector) of the words and words in the language. The comparison between words, the relationship between words, and even the meaning of any piece of article is generated by the operation between the vectors.

The concept of latent semantics is also applied to information retrieval, so sometimes latent semantics is also called implied semantic index (latent Semantic indexing,lsi).

Based on the SVD decomposition of lexical-document relation matrix, the dimensionality reduction processing of the data can be further realized, and the relation degree of lexical-document subject is revealed.


Reference:

http://blog.csdn.net/bob007/article/details/30496559

http://www.csdn.net/article/2015-02-05/2823865


Remark entry: Latent Semantic analysis (LSA)

Contact Us

The content source of this page is from Internet, which doesn't represent Alibaba Cloud's opinion; products and services mentioned on that page don't have any relationship with Alibaba Cloud. If the content of the page makes you feel confusing, please write us an email, we will handle the problem within 5 days after receiving your email.

If you find any instances of plagiarism from the community, please send an email to: info-contact@alibabacloud.com and provide relevant evidence. A staff member will contact you within 5 working days.

A Free Trial That Lets You Build Big!

Start building with 50+ products and up to 12 months usage for Elastic Compute Service

  • Sales Support

    1 on 1 presale consultation

  • After-Sales Support

    24/7 Technical Support 6 Free Tickets per Quarter Faster Response

  • Alibaba Cloud offers highly flexible support services tailored to meet your exact needs.