Bayesian Forum junk post shielding Demonstration System Beta 1

Source: Internet
Author: User

Demonstration System of Bayesian Forum junk post shielding

Introduction:

As Forum moderators, one of the tasks is to maintain the quality of Forum speeches, delete advertising posts, add fake posts, and so on.
The purpose of the system development is to reduce the workload of the moderator and automatically identify a spam Demonstration System.
The theoretical basis is the naive Bayes principle.

The procedure is as follows:
1. First, log on to the system by registering an account with zule.
2. input the raw data of the training system. There are two types of spam and non-spam.
3. Enter the posts to be checked and view the percentage of spam posts.

 

 

Welcome to discuss and improve this program.
 

Microsoft Asia Research Institute-natural language computing group

Thesis

  1. Dependency Language Model for Information Retrieval
    Jianfeng Gao, Jian-yun nie, Guangyuan Wu and guihong Cao. "dependence language model for information retrieval", in SIGIR-2004. Sheffield, UK, July 25-29,200 4.
  2. A New Method for English-Chinese naming object alignment
    Dong-hui Feng, ya-Juan LV, Ming Zhou, "a new approach for English-Chinese Named Entity alignment", 2004 Conference on empirical methods in natural language processing, Barcelona, Spain, jul. 2004.
  3. Automatic Acquisition of paired Translation Based on Single-language corpus
    Ya-Juan LV, Ming Zhou, "collocation translation acquisition using monolingual injection a", 42nd Annual Meeting of the Association for computational linguistics, Barcelona, Spain, Jul. 2004.
  4. Adaptive Chinese Word Segmentation
    Jianfeng Gao, andI Wu, Mu Li, Chang-ning Huang, Hongqiao Li, xinsong Xia and haowei Qin. "Adaptive Chinese word segmentation", 42nd Annual Meeting of the Association for computational linguistics, Barcelona, Spain, Jul. 2004.
  5. Use SVM to recognize Chinese New Words
    Hongqiao Li, Chang-ning Huang, Jianfeng Gao and xiaozhong fan, "the use of SVM for Chinese New Word identification", in IJCNLP-04. sanya City, Hainan Island, China, March 22-24,200 4.
  6. Experience in obtaining long-distance dependency in language models
    Jianfeng Gao and hisami Suzuki, "capturing long distance dependency for Language Modeling: An Empirical Study", in IJCNLP-04. Sanya City, Hainan Island, China, March 22-24,200 4.
  7. Word Translation Disambiguation Using bilingual bootstrapping
    Hang Li and Cong Li, "Word Translation Disambiguation Using bilingual bootstrapping", computational linguistics 30 (1), 1-22,200 4.
  8. Text Classification Using stochastic keyword generation
    Cong Li, Ji-rong Wen, and hang Li, "text classification using stochastic keyword generation", Proc. Of icml '2017-03,464.
  9. Uncertainty ction in collaborative bootstrapping: Measure and Algorithm
    Yunbo Cao, hang Li, and Li Lian, "Uncertainty functions in collaborative bootstrapping: Measure and algorithm", Proc. of ACL '2017-03,327.
  10. Application of Improved source-channel model in Chinese Word Segmentation
    Ya-jjianfeng Gao, Mu Li and Chang-ning Huang, "improved source-channel models for Chinese word segmentation", 41nd Annual Meeting of the Association for computational linguistics. sapporo. japan, July 7-12,200 3.
  11. Topic analysis using a Finite Mixture Model
    Hang Li and Kenji yamanishi, "topic analysis using a finite mixture model", Information Processing & Management, 39 (4), 521-541, (2003 ).
  12. Using bilingual web data to mine and rank translations
    Hang Li, Yunbo Cao, and Cong Li, "using bilingual web data to mine and rank translations", IEEE intelligent systems, vol. 18 (4), 54-59, (2003)

Contact Us

The content source of this page is from Internet, which doesn't represent Alibaba Cloud's opinion; products and services mentioned on that page don't have any relationship with Alibaba Cloud. If the content of the page makes you feel confusing, please write us an email, we will handle the problem within 5 days after receiving your email.

If you find any instances of plagiarism from the community, please send an email to: info-contact@alibabacloud.com and provide relevant evidence. A staff member will contact you within 5 working days.

A Free Trial That Lets You Build Big!

Start building with 50+ products and up to 12 months usage for Elastic Compute Service

  • Sales Support

    1 on 1 presale consultation

  • After-Sales Support

    24/7 Technical Support 6 Free Tickets per Quarter Faster Response

  • Alibaba Cloud offers highly flexible support services tailored to meet your exact needs.