Cooperative CG-Wrappers for Web Content Extraction | SpringerLink
Skip to main content

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 4604))

Included in the following conference series:

  • 750 Accesses

Abstract

We use Conceptual Graphs (CGs) to model web content extraction rules (CG-Wrappers). The approach presented incorporates all major existing extraction techniques and allows the definition of synergies of cooperative wrappers for handling complex extraction task, without requiring programming.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Similar content being viewed by others

References

  1. CoGXML: http://cogitant.sourceforge.net/cogitant_html/cogxml.html

  2. Document Object Model (DOM) Level 3 Core Specification (2002), http://www.w3.org/TR/2004/REC-DOM-Level-3-Core-20040407/

  3. Flesca, S., Manco, G., Masciari, E., Rende, E., Tagarelli, A.: Web wrapper induction: a brief survey. In: AI Communications, vol. 17, pp. 57–61. IOS Press, Amsterdam (2004)

    Google Scholar 

  4. Kauchak, D., Smarr, J., Elkan, C.: Sources of Success for Boosted Wrapper Induction. Journal of Machine Learning 5, 499–527 (2004)

    MathSciNet  Google Scholar 

  5. Kokkoras, F., Bassiliades, N., Vlahavas, I.: Aggregator: A Knowledge based Comparison Chart Builder for e-Shopping. Intelligent Knowledge-Based Systems: Business and Technology in the New Millennium. In: Leondes, C.T. (ed.) Knowledge-Based Systems, vol.1, ch. 6, pp. 140–163. Kluwer Academic Publishers (2005)

    Google Scholar 

  6. Laender, A., Ribeiro-Neto, B., da Silva, A.S., Teixeira, J.: A Brief Survey of Web Data Extrac-tion Tools. ACM SIGMOD Record 31(2) (2002)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Uta Priss Simon Polovina Richard Hill

Rights and permissions

Reprints and permissions

Copyright information

© 2007 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Kokkoras, F., Bassiliades, N., Vlahavas, I. (2007). Cooperative CG-Wrappers for Web Content Extraction. In: Priss, U., Polovina, S., Hill, R. (eds) Conceptual Structures: Knowledge Architectures for Smart Applications. ICCS 2007. Lecture Notes in Computer Science(), vol 4604. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-73681-3_38

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-73681-3_38

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-73680-6

  • Online ISBN: 978-3-540-73681-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics