Efficient Java sensitive words, Keyword Filtering Toolkit _ filter Illegal words

Source: Internet
Author: User
Tool Javahtmljar

Instructions for use:
1, the toolkit by the Beijing Normal University computer department Zhang Jay Development and production based on multi-fork tree Search, any questions please contact:
[Email protected]
2, the toolkit comes with the word library of sensitive words, the first call to read into the thesaurus, so the first call time may be longer, in the class load after the ordinary PC HTML Filter 5000 words in 80 milliseconds, plain text 35 milliseconds or so.
3, if you need to customize the thesaurus, the jar package into the Web-inf Project Lib directory, in the Web-inf/classes directory to build a utf-8 words.dict text file, in the file in the "keyword = level" way to write, such as:
China *gongchandang=4
Chinese =1
0 is the lowest level, filtered back to the highest level appearing in the original string
Calling method: Wordfilterutil.filterhtml (str, ' * ');

: Http://download.csdn.net/user/ranjio_z

Efficient Java sensitive words, Keyword Filtering Toolkit _ filter Illegal words

Contact Us

The content source of this page is from Internet, which doesn't represent Alibaba Cloud's opinion; products and services mentioned on that page don't have any relationship with Alibaba Cloud. If the content of the page makes you feel confusing, please write us an email, we will handle the problem within 5 days after receiving your email.

If you find any instances of plagiarism from the community, please send an email to: info-contact@alibabacloud.com and provide relevant evidence. A staff member will contact you within 5 working days.

A Free Trial That Lets You Build Big!

Start building with 50+ products and up to 12 months usage for Elastic Compute Service

  • Sales Support

    1 on 1 presale consultation

  • After-Sales Support

    24/7 Technical Support 6 Free Tickets per Quarter Faster Response

  • Alibaba Cloud offers highly flexible support services tailored to meet your exact needs.