Python is loved by developers for its clear, concise syntax, ease of use, extensibility, and vast collection of libraries. Its powerful machine learning and math libraries make Python a natural choice for natural language processing.
So if you use Python for natural language processing and do not know these 8 tools, you are really missing out.
NLTK
NLTK is the leading platform for processing language data using Python. It provides simple, easy-to-use interfaces to lexical resources such as WordNet. It also comes with a suite of text processing libraries for classification, tokenization, stemming, part-of-speech tagging, parsing, and semantic reasoning.
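As a minimal sketch of the tokenization and stemming steps mentioned above, the snippet below uses NLTK's `wordpunct_tokenize` and `PorterStemmer`. These two were picked here because they run without downloading any extra NLTK data. The helper name `stem_tokens` and the sample sentence are illustrative assumptions, not part of the original article.

```python
from nltk.stem import PorterStemmer
from nltk.tokenize import wordpunct_tokenize


def stem_tokens(text):
    """Split text into tokens, then reduce each alphabetic token to its stem.

    `stem_tokens` is a hypothetical helper written for this example.
    """
    stemmer = PorterStemmer()
    tokens = wordpunct_tokenize(text)  # regex-based tokenizer, no corpus download needed
    # Keep only alphabetic tokens so punctuation is not passed to the stemmer.
    return [stemmer.stem(tok) for tok in tokens if tok.isalpha()]


if __name__ == "__main__":
    print(stem_tokens("The cats were running quickly."))
```

Other NLTK features, such as part-of-speech tagging or the WordNet interface, typically need the matching data packages to be fetched first with `nltk.download`.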
Pattern
Pattern has a range of tools for part-of-speech tagging, n-gram search, sentiment analysis, and WordNet. It also supports vector space modeling, clustering, and support vector machines.
TextBlob
TextBlob is a Python library for processing text data. It provides an easy-to-use API for common natural language processing tasks, such as part-of-speech tagging, noun phrase extraction, sentiment analysis, text classification, machine translation, and more.
Gensim
Gensim is a library for topic modeling, document indexing, and similarity retrieval with large corpora. It can handle input data larger than RAM. The authors call it "the most robust, efficient and hassle-free piece of software to realize unsupervised semantic modelling from plain text."
PyNLPl
PyNLPl, the Python Natural Language Processing Library (pronounced "pineapple"), is a Python library for natural language processing. It consists of a series of independent or loosely coupled modules that handle both common and less common NLP tasks. PyNLPl can be used for n-gram computation, frequency lists and distributions, and language modeling. In addition, it offers more complex data types, such as priority queues, and search algorithms, such as beam search.
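To make the n-gram and frequency list ideas concrete, here is a sketch written with the Python standard library only. It does not use the library's own API, and the function names `ngrams` and `frequency_list` are assumptions for this example.

```python
from collections import Counter


def ngrams(tokens, n):
    """Yield each run of n consecutive tokens as a tuple."""
    for i in range(len(tokens) - n + 1):
        yield tuple(tokens[i:i + n])


def frequency_list(tokens, n=1):
    """Count n-grams and return (ngram, count) pairs, most frequent first."""
    return Counter(ngrams(tokens, n)).most_common()


if __name__ == "__main__":
    words = "to be or not to be".split()
    for gram, count in frequency_list(words, n=2):
        print(" ".join(gram), count)
```

A frequency list like this is also the raw material for a simple count-based language model, where n-gram counts are turned into probabilities.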
spaCy
spaCy is commercial open-source software for industrial-strength natural language processing, written in Python and Cython. It is regarded as one of the fastest and most advanced natural language processing tools currently available.
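As a small sketch of what using the library looks like, the snippet below tokenizes a sentence with a blank English pipeline, which needs no trained model. It assumes a reasonably recent spaCy release that provides `spacy.blank`. The helper name `tokenize` is made up for this example.

```python
import spacy


def tokenize(text):
    """Tokenize text with a blank English pipeline (tokenizer only).

    `tokenize` is a hypothetical helper written for this example.
    """
    nlp = spacy.blank("en")  # no trained components, so no model download needed
    return [token.text for token in nlp(text)]


if __name__ == "__main__":
    print(tokenize("The library is fast."))
```

Part-of-speech tagging, parsing, and named entity recognition require a trained pipeline (for example one loaded with `spacy.load`), which has to be installed separately.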
Polyglot
Polyglot is a natural language processing tool that supports a huge number of languages. It supports tokenization in 165 languages, language detection in 196 languages, named entity recognition in 40 languages, part-of-speech tagging in 16 languages, sentiment analysis in 136 languages, word embeddings in 137 languages, morphological analysis in 135 languages, and transliteration in 69 languages.
MontyLingua
MontyLingua is a free, commonsense-enriched, end-to-end natural language understander for English. Users only need to feed raw English text into MontyLingua, and it outputs a semantic interpretation of that text. The software is well suited to information extraction, request processing, and question answering. From the given English text, it can extract subject/verb/object tuples, adjectives, noun phrases and verb phrases, as well as people's names, places, events, dates and times, and other semantic information.
Dong Zhinan
Source: NLP Natural Language Processing
Reprinted from: http://mp.weixin.qq.com/s?__biz=MzAwNDc0MTUxMw==&mid=2649638920&idx=1&sn=1b62850ea3dec2401a6a3f61e946de38&scene=0#rd
Eight must-know tools for natural language processing in Python (reprint)