Abstract
Combating spam emails is both costly and time consuming. This paper presents a spam classification algorithm that utilizes both majority voting and multiple instance approaches to determine the resulting classification type. By utilizing multiple sub-classifiers, the classifier can be updated by replacing an individual sub-classifier. Furthermore, each sub-classifier represents a small fraction of a typical classifier, so it can be trained in less time with less data as well. The TREC 2007 spam corpus was used to conduct the experiments.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Hoanca, B.: How good are our weapons in the spam wars? IEEE Technology and Society Magazine 25(1), 22–30 (2006)
Carpinter, J., Hunt, R.: Tightening the net: a review of current and next generation spam filtering tools. Computers & Security 25(8), 566–578 (2006)
Islam, M.R., Zhou, W.: An Innovative Analyser for email classification based on grey list analysis. In: 2007 IFIP International Conference on Network and Parallel Computing Workshops, pp. 176–182. IEEE Computer Society, Washington, DC (2007)
Islam, M.R., Zhou, W., Chowdhury, M.U.: MVGL Analyser for Multi-Classifier Based Spam Filtering System. In: The Eighth IEEE/ACIS International Conference on Computer and Information Science (ICIS), pp. 394–399. IEEE Computer Society, Washington, DC (2009)
Kang, F., Naphade, M.R.: A generalized multiple instance learning algorithm with multiple selection strategies for cross granular learning. In: 2006 IEEE International Conference on Image Processing, pp. 3213–3216. IEEE Press, New York (2006)
Zhou, Y., Jorgensen, Z., Inge, M.: Combating good word attacks on statistical spam filters with multiple instance learning. In: Nineteenth IEEE International Conference on Tools with Artificial Intelligence (ICTAI), pp. 298–305. IEEE Computer Society, Washington, DC (2007)
Sirisanyalak, B., Sornil, O.: An artificial immunity-based spam detection system. In: 2007 IEEE Congress on Evolutionary Computation (CEC), pp. 3392–3398. IEEE Press, New York (2007)
Yeh, C.-C., Chiang, S.-J.: Revisit Bayesian approaches for spam detection. In: Ninth International Conference for Young Computer Scientists (ICYCS), pp. 659–664. IEEE Computer Society, Washington, DC (2008)
SPAM Track Guidelines - TREC 2005-2007, http://plg.uwaterloo.ca/~gvcormac/spam/
Islam, R., Zhou, W., Xiang, Y., Mahmood, A.N.: Spam filtering for network traffic security on a multi-core environment. Concurrency and Computation: Practice and Experience 21(10), 1307–1320 (2009)
Tran, D., Ma, W., Sharma, D., Nguyen, T.: Possibility theory-based approach to spam email detection. In: 2007 IEEE International Conference on Granular Computing (GRC), p. 571. IEEE Computer Society, Washington, DC (2007)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Moh, TS., Lee, N. (2011). Reducing Classification Times for Email Spam Using Incremental Multiple Instance Classifiers. In: Dua, S., Sahni, S., Goyal, D.P. (eds) Information Intelligence, Systems, Technology and Management. ICISTM 2011. Communications in Computer and Information Science, vol 141. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-19423-8_20
Download citation
DOI: https://doi.org/10.1007/978-3-642-19423-8_20
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-19422-1
Online ISBN: 978-3-642-19423-8
eBook Packages: Computer ScienceComputer Science (R0)