当前位置:首页  新闻公告  新闻资讯
ITNLP实验室将于本周三(12月20日)上午9:00在新技术楼618#举行学术活动,会上由王强博士生为大家作报告。
发布人:test  发布时间:2006-12-19   浏览次数:17
题目: Research on Spam Filtering Task
摘要:A realistic classification model for spam filtering should not only take account of the fact that spam evolves over time, but also that labeling a large number of examples for initial training can be expensive in terms of both time and money. This paper address the problem of separating legitimate emails from unsolicited ones with active and online learning algorithm, using a Support Vector Machines (SVM) as the base classifier. We evaluate its effectiveness using a set of goodness criteria on TREC2006 spam filtering benchmark datasets, and promising results are reported.