This version contains some new functions, such as parsing HTML, Dom cloning, and The: Not () pseudo CSS selector from the input stream. It also fixes some bugs and improves stability; improved HTTP connection processing and enhanced document standardization.
JsoupIs a Java HTML Parser that can directly parse a URL address and HTML text content. It provides a set of very labor-saving APIs that can be used to retrieve and manipulate data through DOM, CSS, and operations similar to jquery.
The main functions of jsoup are as follows:
- Parse HTML from a URL, file, or string;
- Use the Dom or CSS selector to find and retrieve data;
- HTML elements, attributes, and text can be operated;
Jsoup is released based on the MIT protocol and can be safely used in commercial projects.
From: http://www.oschina.net/news/13310/jsoup-1-4-1-html-parser
This article is the use of b3log solo from the simple design of the art of the original article: http://88250.b3log.org/jsoup-1-4-1-html-parser