Scalable Model for Extensional and Intensional Descriptions of Unclassified Data

Prado, Hércules A.; Hirtle, Stephen C.; Engel, Paulo M.

doi:10.1007/3-540-45591-4_53

Hércules A. Prado^2,3,
Stephen C. Hirtle³ &
Paulo M. Engel²

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 1800))

Included in the following conference series:

International Parallel and Distributed Processing Symposium

925 Accesses

Abstract

Knowledge discovery from unlabeled data comprises two main tasks: identification of “natural groups” and analysis of these groups in order to interpret their meaning. These tasks are accomplished by unsupervised and supervised learning, respectively, and correspond to the taxonomy and explanation phases of the discovery process described by Langley [9]. The efforts of Knowledge Discovery from Databases (KDD) research field has addressed these two processes into two main dimensions: (1) scaling up the learning algorithms to very large databases, and (2) improving the efficiency of the knowledge discovery process. In this paper we argue that the advances achieved in scaling up supervised and unsupervised learning algorithms allow us to combine these two processes in just one model, providing extensional (who belongs to each group) and intensional (what features best describe each group) descriptions of unlabeled data. To explore this idea we present an artificial neural network (ANN) architecture, using as building blocks two well-know models: the ART1 network, from the Adaptive Resonance Theory family of ANNs [4], and the Combinatorial Neural Model (CNM), proposed by Machado ([11] and [12])). Both models satisfy one important desiderata for data mining, learning in just one pass of the database. Moreover, CNM, the intensional part of the architecture, allows one to obtain rules directly from its structure. These rules represent the insights on the groups. The architecture can be extended to other supervised/unsupervised learning algorithms that comply with the same desiderata.

Researcher at EMBRAPA — Brazilian Enterprise for Agricultural Research and lecturer at Catholic University of Brasilia (Supported by CAPES - Coordenaçao de Aperfeiçoamento de Pessoal de Nivel Superior, grant nr. BEX1041/98-3)

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

¥17,985 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: JPY 3498; Price includes VAT (Japan)

eBook: JPY 5719; Price includes VAT (Japan)

Softcover Book: JPY 7149; Price includes VAT (Japan)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Inference Algorithm for Knowledge Bases with Rule Cluster Structure

Advances in Natural Computation, Fuzzy Systems and Knowledge Discovery

References

Agrawal, R., Gehrke, J., Gunopulos, D., Raghanavan, P. Automatic Subspace Clustering of High-Dimensional Data for Data Mining Applications. In: Proceedings of ACM SIGMOD98 International Conference on Management of Data, Seattle, Washington, 1998.
Google Scholar
Beckenkamp, F. G., Feldens, M. A., Pree, W.: Optimizations of the Combinatorial Neural Model. IN: Vth Brazilian Symposium on Neural Networks. (SBRN’98), Belo Horizonte, Brazil.
Google Scholar
Bigus, J. P. Data Mining with Neural Networks. [S.l.]: McGraw-Hill, 1996. p.3–42.
Google Scholar
Carpenter, G. and Grossberg, S. Neural Dynamics of Category Learning and Recognition: Attention, Memory, Consolidation, and Amnesia. In: Joel L. Davis (ed.), Brain structure, learning, and memory. AAAS Symposia Series, Boulder, CO: Westview Press, 1988. p.233–287.
Google Scholar
Easterlin, J.D., Langley, P.: A Framework for Concept Formation. In: Seventh Annual Conference of the Cognitive Science Society, Irvine, CA, 1985.
Google Scholar
Engel, P. M. Lecture Notes. Universidade Federal do Rio Grande do Sul. Porto Alegre-RS, Brazil: CPGCC da UFRGS, 1997.
Google Scholar
Freeman, J. A., Skapura, D. M.: Neural Networks, Algorithms, Applications, and Program Techniques. [S.l.]: Addison-Wesley, 1992. p.292–339.
Google Scholar
Guha, S., Rastogi, R., Shim, K. Cure: An Efficient Clustering Algorithm for Large Databases. In: Proceedings of ACM SIGMOD98 International Conference on Management of Data, Seattle, Washington, 1998.
Google Scholar
Langley, P. The Computer-Aided Discovery of Scientific Knowledge. In: Proc. of the First International Conference on Discovery Science, Fukuoka, Japan, 1998.
Google Scholar
Lippmann, D. An Introduction to Computing with Neural Nets, IEEE ASSP Magazine. April, 1987.
Google Scholar
Machado, R. J., Rocha, A. F.: Handling knowledge in high order neural networks: the combinatorial neural network. Rio de Janeiro: IBM Rio Scientific Center, Brazil, 1989. (Technical Report CCR076).
Google Scholar
Machado, R. J., Carneiro, W., Neves, P. A.: Learning in the combinatorial neural model, IEEE Transactions on Neural Networks, v.9, p.831–847. Sep.1998.
Article Google Scholar
Medin, D., Altom, M.W., Edelson, S.M. and Freko, D. Correlated symptoms and simulated medical classification. Journal of Experimental Psychology: Learning, Memory and Cognition, 8:37–50, 1983.
Google Scholar
Murphy, G. and Medin, D.: The Role of Theories in Conceptual Coherence. Psychological Review, 92(3):289–316, July, 1985.
Article Google Scholar
Pereira, W. C. de A. Resoluçao de Problemas Criativos: Ativaçao da Capacidade de Pensar. Departamento de Informaçao e Documentaçao/EMBRAPA, Brasilia-DF, 1980. 54pp.
Google Scholar
Polya, G. How to Solve It: A New Aspect of Mathematical Method. Princeton: Princeton University Press, 1972. 253pp.
MATH Google Scholar
Prado, H. A., Frigeri, S. R., Engel, P. M.: A Parsimonious Generation of Combinatorial Neural Model. IN: IV Congreso Argentino de Ciencias de la Computación (CACIC’98), Neuquén, Argentina, 1998.
Google Scholar
Prado, H. A. do; Machado, K.F.; Frigeri, S. R.; Engel, P. M. Accuracy Tuning in Combinatorial Neural Model. PAKDD’ 99-Pacific-Asia Conference on Knowledge Discovery and Data Mining. Proceedings... Beijing, China, 1999
Google Scholar
Wrobel, S. Concept Formation and Knowledge Revision. Dordrecht, The Netherlands: Kluwer, 1994. 240pp.
Book Google Scholar

Download references

Author information

Authors and Affiliations

Instituto de Informática, Universidade Federal do Rio Grande do Sul, Av. Bento Gonçalves, 9500 - Bairro Agronomia, Porto Alegre / RS, Brazil, Caixa Postal 15.064, CEP 91.501-970
Hércules A. Prado & Paulo M. Engel (Professor)
Department of Information Sciences and Telecommunications, University of Pittsburgh, 135 North Bellefield Ave., Pittsburgh, PA, 15.260, USA
Hércules A. Prado & Stephen C. Hirtle (Professor)

Authors

Hércules A. Prado
View author publications
You can also search for this author in PubMed Google Scholar
Stephen C. Hirtle
View author publications
You can also search for this author in PubMed Google Scholar
Paulo M. Engel
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Centre Universitaire d’Informatique, Université de Genève, 24, rue Général Dufour, CH-1211, Genève 4, Switzerland
José Rolim

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Prado, H.A., Hirtle, S.C., Engel, P.M. (2000). Scalable Model for Extensional and Intensional Descriptions of Unclassified Data. In: Rolim, J. (eds) Parallel and Distributed Processing. IPDPS 2000. Lecture Notes in Computer Science, vol 1800. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45591-4_53

Download citation

DOI: https://doi.org/10.1007/3-540-45591-4_53
Published: 25 May 2000
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-67442-9
Online ISBN: 978-3-540-45591-2
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics

Scalable Model for Extensional and Intensional Descriptions of Unclassified Data

Abstract

Access this chapter

Subscribe and save

Buy Now

Preview

Similar content being viewed by others

Inference Algorithm for Knowledge Bases with Rule Cluster Structure

Advances in Natural Computation, Fuzzy Systems and Knowledge Discovery

Advances in Natural Computation, Fuzzy Systems and Knowledge Discovery

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

Scalable Model for Extensional and Intensional Descriptions of Unclassified Data

Abstract

Access this chapter

Subscribe and save

Buy Now

Preview

Similar content being viewed by others

Inference Algorithm for Knowledge Bases with Rule Cluster Structure

Advances in Natural Computation, Fuzzy Systems and Knowledge Discovery

Advances in Natural Computation, Fuzzy Systems and Knowledge Discovery

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation