Dimension Reduction Using Clustering Algorithm and Rough Set Theory

Sengupta, Shampa; Das, Asit Kumar

doi:10.1007/978-3-642-35380-2_82

Part of the book series: Lecture Notes in Computer Science (LNTCS, volume 7677)

Included in the following conference series:

International Conference on Swarm, Evolutionary, and Memetic Computing

Abstract

Real-world datasets often have a large number of attributes, of which only a few are needed to describe them properly. This paper proposes a novel dimension reduction algorithm for real-valued datasets that combines Rough Set Theory with a clustering algorithm to generate the reduct. The dataset is projected onto each pair of conditional attributes \(C_i\) and \(C_j\), and the K-means clustering algorithm is applied to the projection with K equal to the number of distinct values of the decision attribute \(D\), yielding K clusters. The dataset is also partitioned into K groups by applying the indiscernibility relation to the decision attribute \(D\). The connecting factor \(k\) of the combined conditional attributes \((C_i C_j)\) with respect to \(D\) is then calculated from these two cluster sets, and the attribute connecting set \(ACS = \{(C_i C_j \stackrel{k}{\rightarrow} D) : C_i, C_j \in C\}\) is formed, where \(C\) is the conditional attribute set and \(D\) is the decision attribute. Each element \((C_i C_j \stackrel{k}{\rightarrow} D) \in ACS\) implies that \(C_i\) and \(C_j\), taken together, partition the objects in a way that is \((k \times 100)\%\) similar to the partition induced by \(D\). An undirected weighted graph is then constructed from ACS, with the connecting factors \(k\) as edge weights. Finally, the important attributes, called the reduct, are generated based on the weights associated with the edges. Experimental results demonstrate the efficiency of the proposed method.
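The pairwise step described in the abstract can be sketched as follows. This is a minimal illustration, not the authors' implementation: the abstract does not say how the connecting factor is computed, so the sketch assumes it is the fraction of objects on which the K-means clusters and the decision classes agree under the best one-to-one matching of cluster ids to classes. The function names (`kmeans`, `connecting_factor`, `attribute_connecting_set`), the farthest-point seeding, and the toy data are all hypothetical.

```python
from itertools import combinations, permutations


def dist2(p, q):
    """Squared Euclidean distance between two equal-length tuples."""
    return sum((a - b) ** 2 for a, b in zip(p, q))


def kmeans(points, n_clusters, iters=100):
    """Plain Lloyd's K-means with deterministic farthest-point seeding (an assumption)."""
    centers = [points[0]]
    while len(centers) < n_clusters:
        # Next seed: the point farthest from all centers chosen so far.
        centers.append(max(points, key=lambda p: min(dist2(p, c) for c in centers)))
    labels = []
    for _ in range(iters):
        labels = [min(range(n_clusters), key=lambda c: dist2(p, centers[c]))
                  for p in points]
        new_centers = []
        for c in range(n_clusters):
            members = [p for p, lab in zip(points, labels) if lab == c]
            # An empty cluster keeps its previous center.
            new_centers.append(
                tuple(sum(x) / len(members) for x in zip(*members))
                if members else centers[c]
            )
        if new_centers == centers:
            break
        centers = new_centers
    return labels


def connecting_factor(labels, decision):
    """Assumed definition: best agreement rate over all cluster-to-class matchings.

    Tries every permutation of the decision classes, so it is only practical
    for a small number of classes.
    """
    classes = sorted(set(decision))
    best = 0
    for perm in permutations(classes):
        hits = sum(1 for lab, d in zip(labels, decision) if perm[lab] == d)
        best = max(best, hits)
    return best / len(decision)


def attribute_connecting_set(rows, decision):
    """Map each attribute pair (i, j) to its connecting factor with respect to D."""
    n_clusters = len(set(decision))  # K = number of distinct decision values
    acs = {}
    for i, j in combinations(range(len(rows[0])), 2):
        projected = [(r[i], r[j]) for r in rows]  # projection onto (C_i, C_j)
        acs[(i, j)] = connecting_factor(kmeans(projected, n_clusters), decision)
    return acs


# Toy data (invented for illustration): three real-valued conditional
# attributes and a two-valued decision attribute.
rows = [
    (1.0, 1.1, 5.0),
    (1.2, 0.9, 1.0),
    (0.9, 1.0, 3.0),
    (5.0, 5.2, 1.2),
    (5.1, 4.9, 5.1),
    (4.8, 5.0, 3.1),
]
decision = ["a", "a", "a", "b", "b", "b"]

acs = attribute_connecting_set(rows, decision)
for (i, j), k in sorted(acs.items()):
    print(f"C{i} C{j} -> D with connecting factor {k:.2f}")
```

The resulting dictionary can be read directly as the edge list of the undirected weighted graph, with attribute indices as vertices and connecting factors as weights. How the reduct is then selected from those weights is not specified in the abstract, so it is left out here.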

Author information

Authors and Affiliations

Department of Information Technology, MCKV Institute of Engineering, Liluah, Howrah, 711 204, West Bengal, India
Shampa Sengupta
Dept. of Computer Sc. & Tech., Bengal Engineering & Science University, Shibpur, Howrah, 711 103, West Bengal, India
Asit Kumar Das

Editor information

Editors and Affiliations

Department of Electrical Engineering, Indian Institute of Technology, 110016, Delhi, India
Bijaya Ketan Panigrahi
Electronics and Communication Sciences Unit, Indian Statistical Institute, 700108, Kolkata, India
Swagatam Das
School of Electrical and Electronic Engineering, Nanyang Technological University, Block N4, 2b-39, Nanyang Avenue, 639798, Singapore, Singapore
Ponnuthurai Nagaratnam Suganthan
Department of Electronics and Telecom. Engineering, Institute of Technical Education & Research, Siksha ’O’ Anusandhan University, 751030, Bhubaneswar, Odisha, India
Pradipta Kumar Nanda

About this paper

Cite this paper

Sengupta, S., Das, A.K. (2012). Dimension Reduction Using Clustering Algorithm and Rough Set Theory. In: Panigrahi, B.K., Das, S., Suganthan, P.N., Nanda, P.K. (eds) Swarm, Evolutionary, and Memetic Computing. SEMCCO 2012. Lecture Notes in Computer Science, vol 7677. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-35380-2_82

DOI: https://doi.org/10.1007/978-3-642-35380-2_82
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-35379-6
Online ISBN: 978-3-642-35380-2
eBook Packages: Computer Science, Computer Science (R0)
