Use Python to do natural language processing must know eight tools "reprint"

Source: Internet
Author: User
Tags nltk

Python is loved by developers for its clear, concise syntax, ease-of-use and extensibility, and its vast library of libraries. Its built-in, very powerful machine learning code base and math library make Python a Natural language processing tool.

Then using Python for natural language processing, if you do not know the 8 tools are really out.

NLTK

NLTK is the leading platform for processing language data using Python. It provides a simple and easy-to-use interface for vocabulary resources like WordNet. It also has a text classification (classification), text markers (tokenization), stemming (stemming), part-of-speech tagging (tagging), semantic analysis (parsing), and semantic inference (semantic reasoning ) to prepare the text processing library.

Pattern

Pattern has a range of tools for POS tagging (Part-of-speech taggers), N-gram Search, sentiment analysis, and WordNet. It also supports vector space modeling, cluster analysis, and support vector machines.

Textblob

Textblob is a python library that processes text data. It provides an easy-to-use API for digging into conventional natural language processing, such as POS tagging (part-of-speech tagging), noun phrase extraction (noun phrase extraction), sentiment analysis, text categorization, machine translation, and more.

Gensim

Gensim is a similarity search for topic modeling, document indexing, and using large-scale corpus data. It can handle more input data than RAM. The authors call it "the most robust, effective, and reassuring software for unsupervised modeling based on plain text."

Pynlpl

Pynlpl:python Natural Language Processing library (pronounced: pineapple) is a Python library for natural language processing. It consists of a series of independent or loosely separate modules that are used to handle routine or less conventional NLP tasks. PYNLPL can be used for n-gram calculations, frequency lists and distributions, and language modeling. In addition, there are more complex data models, such as priority queues, and search engines such as beam search.

Spacy

Spacy is a commercially available open source software that uses Python and Cython for industrial-grade natural language processing. It is currently the fastest and highest level natural language processing tool.

Polyglot

Polyglot is a natural language processing tool that supports a huge number of languages. It supports text markers in up to 165 languages, language detection in 196 languages, named entity recognition in 40 languages, POS tagging in 16 languages, sentiment analysis in 136 languages, root embedding in 137 languages, morphological analysis of 135 languages, and transliteration of 69 languages.

Montylingua

Montylingua is a free, common-sense, end-to-end English natural language understanding software. Users only need to input the original English text into Montylingua, it can output the semantic interpretation of the text. The software is ideally suited for information extraction, demand processing, and quiz. From the given English text, it can extract the subject/verb/adjective object tuple, noun phrase and verb phrase, and extract the person's name, place, event, date and time, and other semantic information.

Dong Zhinan

Source: NLP Natural Language Processing

Reprinted from: http://mp.weixin.qq.com/s?__biz=MzAwNDc0MTUxMw==&mid=2649638920&idx=1&sn= 1b62850ea3dec2401a6a3f61e946de38&scene=0#rd

Use Python to do natural language processing must know eight tools "reprint"

Related Article

Contact Us

The content source of this page is from Internet, which doesn't represent Alibaba Cloud's opinion; products and services mentioned on that page don't have any relationship with Alibaba Cloud. If the content of the page makes you feel confusing, please write us an email, we will handle the problem within 5 days after receiving your email.

If you find any instances of plagiarism from the community, please send an email to: info-contact@alibabacloud.com and provide relevant evidence. A staff member will contact you within 5 working days.

A Free Trial That Lets You Build Big!

Start building with 50+ products and up to 12 months usage for Elastic Compute Service

  • Sales Support

    1 on 1 presale consultation

  • After-Sales Support

    24/7 Technical Support 6 Free Tickets per Quarter Faster Response

  • Alibaba Cloud offers highly flexible support services tailored to meet your exact needs.