Reducing Classification Times for Email Spam Using Incremental Multiple Instance Classifiers

Moh, Teng-Sheng; Lee, Nicholas

doi:10.1007/978-3-642-19423-8_20

Teng-Sheng Moh⁴ &
Nicholas Lee⁴

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 141))

Included in the following conference series:

International Conference on Information Intelligence, Systems, Technology and Management

1133 Accesses

Abstract

Combating spam emails is both costly and time consuming. This paper presents a spam classification algorithm that utilizes both majority voting and multiple instance approaches to determine the resulting classification type. By utilizing multiple sub-classifiers, the classifier can be updated by replacing an individual sub-classifier. Furthermore, each sub-classifier represents a small fraction of a typical classifier, so it can be trained in less time with less data as well. The TREC 2007 spam corpus was used to conduct the experiments.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

¥17,985 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: JPY 3498; Price includes VAT (Japan)

eBook: JPY 5719; Price includes VAT (Japan)

Softcover Book: JPY 7149; Price includes VAT (Japan)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Gradient Correlation: Are Ensemble Classifiers More Robust Against Evasion Attacks in Practical Settings?

An Optimized Approach for Detection and Classification of Spam Email’s Using Ensemble Methods

Article Open access 13 November 2024

Spam Mail Classification Using Ensemble and Non-Ensemble Machine Learning Algorithms

References

Hoanca, B.: How good are our weapons in the spam wars? IEEE Technology and Society Magazine 25(1), 22–30 (2006)
Article Google Scholar
Carpinter, J., Hunt, R.: Tightening the net: a review of current and next generation spam filtering tools. Computers & Security 25(8), 566–578 (2006)
Article Google Scholar
Islam, M.R., Zhou, W.: An Innovative Analyser for email classification based on grey list analysis. In: 2007 IFIP International Conference on Network and Parallel Computing Workshops, pp. 176–182. IEEE Computer Society, Washington, DC (2007)
Chapter Google Scholar
Islam, M.R., Zhou, W., Chowdhury, M.U.: MVGL Analyser for Multi-Classifier Based Spam Filtering System. In: The Eighth IEEE/ACIS International Conference on Computer and Information Science (ICIS), pp. 394–399. IEEE Computer Society, Washington, DC (2009)
Google Scholar
Kang, F., Naphade, M.R.: A generalized multiple instance learning algorithm with multiple selection strategies for cross granular learning. In: 2006 IEEE International Conference on Image Processing, pp. 3213–3216. IEEE Press, New York (2006)
Chapter Google Scholar
Zhou, Y., Jorgensen, Z., Inge, M.: Combating good word attacks on statistical spam filters with multiple instance learning. In: Nineteenth IEEE International Conference on Tools with Artificial Intelligence (ICTAI), pp. 298–305. IEEE Computer Society, Washington, DC (2007)
Chapter Google Scholar
Sirisanyalak, B., Sornil, O.: An artificial immunity-based spam detection system. In: 2007 IEEE Congress on Evolutionary Computation (CEC), pp. 3392–3398. IEEE Press, New York (2007)
Chapter Google Scholar
Yeh, C.-C., Chiang, S.-J.: Revisit Bayesian approaches for spam detection. In: Ninth International Conference for Young Computer Scientists (ICYCS), pp. 659–664. IEEE Computer Society, Washington, DC (2008)
Google Scholar
SPAM Track Guidelines - TREC 2005-2007, http://plg.uwaterloo.ca/~gvcormac/spam/
Islam, R., Zhou, W., Xiang, Y., Mahmood, A.N.: Spam filtering for network traffic security on a multi-core environment. Concurrency and Computation: Practice and Experience 21(10), 1307–1320 (2009)
Article Google Scholar
Tran, D., Ma, W., Sharma, D., Nguyen, T.: Possibility theory-based approach to spam email detection. In: 2007 IEEE International Conference on Granular Computing (GRC), p. 571. IEEE Computer Society, Washington, DC (2007)
Chapter Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer Science, San Jose State University, San Jose, CA, U.S.A.
Teng-Sheng Moh & Nicholas Lee

Authors

Teng-Sheng Moh
View author publications
You can also search for this author in PubMed Google Scholar
Nicholas Lee
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Computer Science, College of Engineering and Science, Louisiana Tech University, 71272, Ruston, LA, USA
Sumeet Dua
CISE Department, CSE 301, University of Florida, 32611, Gainesville, FL, USA
Sartaj Sahni
Management Development Institute, 122 007, Sukhrali, Gurgaon, India
D. P. Goyal

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Moh, TS., Lee, N. (2011). Reducing Classification Times for Email Spam Using Incremental Multiple Instance Classifiers. In: Dua, S., Sahni, S., Goyal, D.P. (eds) Information Intelligence, Systems, Technology and Management. ICISTM 2011. Communications in Computer and Information Science, vol 141. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-19423-8_20

Download citation

DOI: https://doi.org/10.1007/978-3-642-19423-8_20
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-19422-1
Online ISBN: 978-3-642-19423-8
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics