Mastering text: The discovery, organization and processing of text--Interactive publishing network

Source: Internet
Author: User

This article is a computer Quality Pre-sale recommendation >>>>Mastering text: Discovery, organization and processing of text


Editor's recommendation

This book is suitable for the Internet Text content processing field of developers to read, but also for those who are interested in joining the field of students, practitioners read. Even for people who have been working on research and development for many years of text processing, this book is a useful complementary reading.


Content Introduction

Text processing is the key technology of Internet content application (such as search engine, recommendation engine) at present. This book covers several aspects of text processing concepts and techniques, including text preprocessing, search, string matching, information extraction, named entity recognition, classification, clustering, label generation, summarization, quiz, and so on. This book is characterized by an example of how these concepts and techniques of text processing are understood, and readers can implement these instances themselves using existing open source tools.


Partial catalogs

The 1th chapter begins with the mastery of the text ......... .............. ............................. 1
1.1 Driving the text important reason ............................................................................................... 2
1.2 preview: A fact-based question-and-answer system ......... ........... ............................. 4
1.2.1: Hi, Dr. Frankenstein ......... .................... ....... ..... ....................... 5
1.3 Understanding text is difficult ....................................................................................................... 8
1.4 Navigating the text ..... ..... ..... ..... ..... ..... ..... ..... ..... ...... ....... ....... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ....... ....... 11
1.5 text and Smart apps: Search and other ...... ..... ................ ........ ................... ....... -
1.5.1 Search and match ................................................................................................. -
1.5.2 Extracting information ..................................................................................................... -
1.5.3 Grouping of information ................................................................................................. the
1.5.41 Smart apps ..... ..... ..... ....... .................. ..... ..... ..... ..... ..... .............. .......... the
1.6 Summary ......................................................................................................................... the
1.7 Related Resources ................................................................................................................. -
the 2nd chapter is the foundation of the mastery of the text ......... .......... ....................... ........ -
2.1 Basic Language knowledge ......................................................................................................... -
2.1.1 words and their categories ...... ..... ....................... ..... ..... ..... ..... ..... .............. .......... +
2.1.2 phrases and clauses ................................................................................................. -
2.1.3 Morphology ..... ..... ..... ..... ..... ..... ..... ..... ..... ...... ....... ....... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ....... ....... 21
2.2 Common tools for text processing ................................................................................................. at
2.2.1 String Processing tool ..... ..... ......................... ..... ..... ..... ................. .......... at

2.2.2 Entry and word-cutting ...... ..... ..... ........................ ..... ..... ..... ..... ..... ............... ......... 23
2.2.3 part-of-speech tagging ..... ..... ..... ..... ..................... ..... ..... ..... ..... ..... ..... ............ ........... 25
2.2.4 stemming ..... ..... ..... ..... ..... ....................... ..... ..... ..... ..... ..... ..... ............ ........... 27
2.2.5 sentence detection ..... ..... .... ..... ..... ....................... ..... ..... ..... ..... ..... ..... ............ ........... 29
2.2.6 syntactic analysis and grammar ....... ..... ...................... ..... ..... ..... ..... .............. ........... 31
2.2.7 sequence modeling ..... ..... .... ..... ..... ....................... ..... ..... ..... ..... ..... ..... ..... ....... .............. 33
2.3 Extract content from common format files and do preprocessing ..... ... ..... ... ..... ..... ..... ..... ..... ..... ..... ..... ..... ............. 34
The importance of 2.3.1 pretreatment ....... ......................... ..... ..... ..... ................. .......... 35
2.3.2 uses Apache Tika to extract the content ..... ..... ... .... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ..... ......... 37
2.4 Summary ......................................................................................................................... 39
2.5 Related Resources ................................................................................................................. 40



Copyright NOTICE: This article for Bo Master original article, without Bo Master permission not reproduced.

Mastering text: The discovery, organization and processing of text--Interactive publishing network

Contact Us

The content source of this page is from Internet, which doesn't represent Alibaba Cloud's opinion; products and services mentioned on that page don't have any relationship with Alibaba Cloud. If the content of the page makes you feel confusing, please write us an email, we will handle the problem within 5 days after receiving your email.

If you find any instances of plagiarism from the community, please send an email to: info-contact@alibabacloud.com and provide relevant evidence. A staff member will contact you within 5 working days.

A Free Trial That Lets You Build Big!

Start building with 50+ products and up to 12 months usage for Elastic Compute Service

  • Sales Support

    1 on 1 presale consultation

  • After-Sales Support

    24/7 Technical Support 6 Free Tickets per Quarter Faster Response

  • Alibaba Cloud offers highly flexible support services tailored to meet your exact needs.