Abstract
Large and continuous growing knowledge bases (KBs) have been widely studied in recent years. A major challenge in this field is how to develop techniques to help populating such KBs and improve their coverage. In this context, this work proposes an “association rules”-base approach. We applied an association rule mining algorithm to discover new relations between the instances and categories, to populate a KB. Considering that automatically constructed KBs are often incomplete, we modified traditional support criteria, creating the MSC measure, to deal with missing values. Experiments showed that an association rule mining algorithm, with and without the modified support calculation, brings relevant rules and can play an interesting role in the process of increasing a large growing knowledge base.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Galágarra, L., Teflioudi, C., Hose, K., Suchanek, F.M.: AMIE: Association Rule Mining under Incomplete Evidence in Ontological Knowledge Bases. In: Proc. of the 22nd Int. Conf. of World Wide Web, pp. 413–422. ACM, New York (2013)
Suchanek, F.M., Kasneci, G., Weikum, G.: Yago: a core of semantic web. In: 16th Proc. of the Int. Conf. of World Wide Web, pp. 697–706. ACM, New York (2007)
Matuszek, C., Cabral, J., Witbrock, M., DeOliveira, J.: An Introduction to the Syntax and Content of Cyc. In: Proc. of 2006 AAAI Spring Symp. on Formalizing and Compiling Background Knowledge and Its Applications to Knowledge Representation and Question Answering, pp. 44–49 (2006)
Bizer, C., Lehmann, J., Kobilarov, G., Auer, S., Becker, C., Cyganiak, R., Hellmann, S.: DBpedia – a crystallization point for the web of data. J. of Web Seman. 7, 154–165 (2009)
Carlson, A., Betteridge, J., Kisiel, B., Settles, B., Hruschka Jr., E.R., Mitchell, T.M.: Toward an Architecture for Never-Ending Language Learning. In: 24th Proc. of the Conf. on Artificial Intelligence, pp. 1306–1313. AAAI Press, Atlanta (2010)
Carlson, A., Betteridge, J., Hruschka Jr., E.R., Mitchell, T.M.: Coupling Semi-Supervised Learning of Categories and Relations. In: Proc. of the NAACL HLT 2009 Work. on Semi-supervised Learning for Natural Language Processing, pp. 1–9. ACL, New Jersey (2009)
Carlson, A., Betteridge, J., Wang, R.C., Hruschka Jr., E.R., Mitchell, T.M.: Coupled Semi-Supervised Learning for Information Extraction. In: 3rd Int. Conf. on Web Search and Data Mining (WSDM), pp. 101–110. ACM, New York (2010)
Appel, A.P., Hruschka Jr., E.R.: Prophet – a link-predictor to learn new rules on NELL. In: 11th Proc. of the Int. Conf. on Data Mining Work., pp. 917–924. IEEE (2011)
Mohamed, T.P., Hruschka Jr., E.R., Mitchell, T.M.: Discovering Relations between Noun Categories. In: Proc. of the 8th Conf. on Emp. Methods in Natural Language Processing, pp. 1447–1455. Association for Computational Linguistics, Stroudsburg (2011)
Pedro, S.D., Hruschka Jr., E.R.: Conversing Learning: Active Learning and Active Social Interaction for Human Supervision in Never-Ending Learning Systems. In: Pavón, J., Duque-Méndez, N.D., Fuentes-Fernández, R. (eds.) IBERAMIA 2012. LNCS, vol. 7637, pp. 231–240. Springer, Heidelberg (2012)
Curran, J.R., Murphy, T., Scholz, B.: Minimising semantic drift with mutual exclusion bootstrapping. In: Proc. of the 10th Conf. of the Pacific Association for Computational Linguistics, pp. 172–180 (2007)
Gardner, M., Talukdar, P.P., Kisiel, B., Mitchell, T.: Improving Learning and Inference in a Large Knowledge-base using Latent Syntactic Cues. In: Proc. of the Conference on Empirical Methods in Natural Language Processing, pp. 833–838 (2013)
Agrawal, R., Imielinski, T., Swami, A.M.: Mining Association Rules between Sets of Items in Large Databases. In: 19th ACM SIGMOD Annual Conference on Management of Data, pp. 207–216 ACM, New York (1993)
Miani, R.G., Yaguinuma, C.A., Santos, M.T., Biajiz, M.: NARFO Algorithm: Mining Non-redundant and Generalized Association Rules Based on Fuzzy Ontologies. In: Filipe, J., Cordeiro, J. (eds.) Enterprise Information Systems. LNBIP, vol. 24, pp. 415–426. Springer, Heidelberg (2009)
Hoffart, J., Suchanek, F.M., Berberich, K., Weikum, G.: YAGO2: A spatially and temporally enhanced knowledge base from Wikipedia. J. Art. Intelligence. 196, 28–61 (2012)
Etzioni, O., Fader, A., Christensen, J., Soderland, S., Mausam, M.: Open Information Extraction: the Second Generation. In: 22nd Proc. of the IJCAI, pp. 3–10 (2011)
Navigli, R., Ponzetto, S.P.: BabelNet: building a very large multilingual semantic network. In: 48th Proc. of the Annual Meeting of the Assoc. for Comp. Ling., pp. 216–225 (2010)
Wang, Z., Wang, Z., Li, J., Pan, J.Z.: Building a Large Scale Knowledge Base from Chinese Wiki Encyclopedia. In: Pan, J.Z., Chen, H., Kim, H.-G., Li, J., Wu, Z., Horrocks, I., Mizoguchi, R., Wu, Z. (eds.) JIST 2011. LNCS, vol. 7185, pp. 80–95. Springer, Heidelberg (2012)
Niu, F., Zhang, C., Ré, C., Shavlik, J.: Elementary: Large-scale Knowledge-base Construction via Machine Learning and Statistical Inference. J. on Sem. Web and Inf. Sys. 8, 42–73 (2012)
Kiddon, C., Domingos, P.: Knowledge Extraction and Joint Inference Using Tractable Markov Logic. In: Proc. of the 2nd Joint Work. on Automatic Knowledge Base Construction and Web-scale Knowledge Extraction, pp. 79–83. ACL (2012)
Schoenmackers, S., Etzioni, O., Weld, D.S.: Learning First-Order Horn Clauses from Web Text. In: Proc. of the Conf. on Empirical Methods in Natural Language Processing, pp. 1088–1098 (2010)
Ji, H., Grishman, R.: Knowledge Base Population: Successful Approaches and Challenges. In: Proc. of the 49th Annual Meet. of the Assoc. for Comp. Ling., pp. 1148–1158 (2011)
Lao, N., Mitchell, T., Cohen, W.W.: Random Walk Inference and Learning in a Large Scale Knowledge Base. In: 8th Proc. of the Conf. on Empirical Methods in Natural Language Processing, pp. 529–539. Assoc. for Computational Linguistics, Stroudsburg (2011)
Srikant, R., Agrawal, R.: Mining Generalized Association Rules. In: Proceedings of the International Conference of Very Large Knowledge Bases, pp. 407–419 (1995)
Kauppinen, T., Kuittinen, H., Seppälä, K., Tuominen, J., Hyvönen, E.: Extending an Ontology by Analyzing Annotation Co-occurrences in a Semantic Cultural Heritage Portal. In: Proc. of the 3rd Work. on Collective Int. at the Asian Sem. Web Conf., Bangkok (2008)
Hyvönen, E., Viljanen, K., Tuominen, J., Seppälä, K.: Building a national semantic web ontology and ontology service infrastructure –the FinnONTO approach. In: Bechhofer, S., Hauswirth, M., Hoffmann, J., Koubarakis, M. (eds.) ESWC 2008. LNCS, vol. 5021, pp. 95–109. Springer, Heidelberg (2008)
Ragel, A., Cremilleux, B.: Treatment of Missing Values for Association Rules. In: 2nd Pacific-Asia Conf. on Knowledge Disc. and Data Mining, Melbourne, pp. 258–270 (1998)
Hahsler, M., Grün, B., Hornik, K.: arules – A Computational Environment for Mining Association Rules and Frequent Item Sets. J. of Statistics Software 14, 1–25 (2005)
Nayak, J.R., Cook, D.J.: Approximate Association Rule Mining. In: Proceedings of 14th FLAIRS Conference, pp. 259–263. AAAI, Key West (2001)
Calders, T., Goethals, B., Mampaey, M.: Mining Itemsets in the Presence of Missing Values. In: Proc. of the 22nd Annual ACM Symposium on Applied Computing, pp. 404–408 (2007)
Chen, R.H., Fan, C.M.: Treatment of Missing Values for Association Rule-Based Tool Commonality Analysis in Semiconductor Manufacturing. In: Proc. of the 8th IEEE Int. Conf. on Automation Science and Engineering, pp. 886–891. IEEE, Seoul (2012)
Hong, T.P., Wu, C.W.: Mining rules from incomplete dataset with high missing rate. J. of Expert Systems with Applications: An International Journal 38, 3931–3936 (2011)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Miani, R.G.L., de S. Pedro, S.D., Hruschla, E.R. (2014). Association Rules to Help Populating a Never-Ending Growing Knowledge Base. In: Bazzan, A., Pichara, K. (eds) Advances in Artificial Intelligence -- IBERAMIA 2014. IBERAMIA 2014. Lecture Notes in Computer Science(), vol 8864. Springer, Cham. https://doi.org/10.1007/978-3-319-12027-0_14
Download citation
DOI: https://doi.org/10.1007/978-3-319-12027-0_14
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-12026-3
Online ISBN: 978-3-319-12027-0
eBook Packages: Computer ScienceComputer Science (R0)