Keyword Enhanced Web Structure Mining for Business Intelligence | SpringerLink
Skip to main content

Keyword Enhanced Web Structure Mining for Business Intelligence

  • Conference paper
Advanced Internet Based Systems and Applications (SITIS 2006)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 4879))

Abstract

The study proposed the method of keyword enhanced Web structure mining which combines the ideas of Web content mining with Web structure mining. The method was used to mine data on business competition among a group of DSLAM companies. Specifically, the keyword DSLAM was incorporated into queries that searched for co-links between pairs of company Websites. The resulting co-link matrix was analyzed using multidimensional scaling (MDS) to map business competition positions. The study shows that the proposed method improves upon the previous method of Web structure mining alone by producing a more accurate map of business competition in the DSLAM sector.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Similar content being viewed by others

References

  1. Madria, S.K., Bhowmick, S.S., Ng, W.-K., Lim, E.-P.: Research issues in web data mining. In: Mohania, M., Tjoa, A.M. (eds.) DaWaK 1999. LNCS, vol. 1676, pp. 303–312. Springer, Heidelberg (1999)

    Google Scholar 

  2. Lu, Z., Yao, Y., Zhong, N.: Web Log Mining. In: Zhong, N., Liu, J., Yao, Y. (eds.) Web Intelligence, pp. 173–194. Springer, Berlin (2003)

    Chapter  Google Scholar 

  3. Thuraisingham, B.: Web Data Mining and Applications in Business Intelligence and Counter-Terrorism. CRC Press, Boca Raton (2003)

    Book  Google Scholar 

  4. Liu, B., Ma, Y., Yu, P.S.: Discovering Unexpected Information from Your Competitors’ Web Sites. In: Proceedings of the Seventh ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, San Francisco, U.S.A, August 26-29 (2001), www.cs.buffalo.edu/~sbraynov/seminar/unexpected_information.pdf

  5. Vaughan, L., You, J.: Mining Web hyperlink data for business information: The case of telecommunications equipment companies. In: Proceedings of the First IEEE International Conference on Signal-Image Technology and Internet-Based Systems, Yaoundé, Cameroon, November 27–December 1, pp. 190–195 (2005)

    Google Scholar 

  6. Björneborn, L., Ingwersen, P.: Towards a basic framework of webometrics. Journal of the American Society for Information Science and Technology 55(14), 1216–1227 (2004)

    Article  Google Scholar 

  7. Vaughan, L., Gao, Y., Kipp, M.: Why are hyperlinks to business Websites created? A content analysis. Scientometrics 67(2), 291–300 (2006)

    Article  Google Scholar 

  8. Beniston, G.: IP DSLAMs, A Heavy Reading Competitive Analysis. Heavy Reading report series 3(15) (August 2005), http://www.heavyreading.com/details.asp?sku_id=836&skuitem_itemid=793&promo_code=&aff_code=&next_url=%2Flist%2Easp%3Fpage%5Ftype%3Dall%5Freports

  9. Google (2006). Google SOAP Search API Reference (Retrieved August 18, 2006), http://www.google.com/apis/reference.html#2_2

  10. Vaughan, L., Kipp, M.E.I., Gao, Y.: Are colinked business Web Sites really related? A qualitative study. Paper under review in Online Information Review (2006)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2009 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Vaughan, L., You, J. (2009). Keyword Enhanced Web Structure Mining for Business Intelligence. In: Damiani, E., Yetongnon, K., Chbeir, R., Dipanda, A. (eds) Advanced Internet Based Systems and Applications. SITIS 2006. Lecture Notes in Computer Science, vol 4879. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-01350-8_15

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-01350-8_15

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-01349-2

  • Online ISBN: 978-3-642-01350-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics