This article is a pre-sale recommendation for a featured computer book: Mastering Text: Discovery, Organization, and Processing of Text
Editor's recommendation
This book is suitable for developers working in Internet text content processing, as well as for students and practitioners interested in entering the field. Even for people who have worked in text processing research and development for many years, it is a useful supplementary read.
Content Introduction
Text processing is currently a key technology for Internet content applications such as search engines and recommendation engines. This book covers the concepts and techniques of several areas of text processing, including text preprocessing, search, string matching, information extraction, named entity recognition, classification, clustering, tag generation, summarization, and question answering. Its distinguishing feature is that it uses examples to explain these concepts and techniques, and readers can implement the examples themselves with existing open source tools.
Partial table of contents
Chapter 1 Getting started with mastering text
1.1 Why mastering text is important
1.2 Preview: A fact-based question answering system
1.2.1 Hello, Dr. Frankenstein
1.3 Understanding text is hard
1.4 Mastering the text
1.5 Text and smart apps: Search and beyond
1.5.1 Searching and matching
1.5.2 Extracting information
1.5.3 Grouping information
1.5.4 A smart app
1.6 Summary
1.7 Related resources
Chapter 2 Foundations of mastering text
2.1 Basic language knowledge
2.1.1 Words and their categories
2.1.2 Phrases and clauses
2.1.3 Morphology
2.2 Common tools for text processing
2.2.1 String processing tools
2.2.2 Tokens and tokenization
2.2.3 Part-of-speech tagging
2.2.4 Stemming
2.2.5 Sentence detection
2.2.6 Parsing and grammar
2.2.7 Sequence modeling
2.3 Extracting content from common file formats and preprocessing
2.3.1 The importance of preprocessing
2.3.2 Extracting content using Apache Tika
2.4 Summary
2.5 Related resources
Copyright notice: This is an original article by the blogger and may not be reproduced without the blogger's permission.
Mastering Text: Discovery, Organization, and Processing of Text (Interactive Publishing Network)