Abstract
RSS is the XML-based format for syndication of Web contents and users aggregate RSS feeds with RSS feed aggregators. As the usage of RSS service has been diffused, it is crucial to have a good aggregation policy that enables users to efficiently aggregate postings that are generated. Aggregation policies may determine not only the number of aggregations for each RSS feed, but also schedule when aggregations take place. In this paper, we first propose the algorithms of minimum missing aggregation policy which reduces the number of missing postings during aggregations. Second, we compare and analyze the experimental results of ours with the existing minimum delay aggregation policy. Our analysis shows that the minimum missing aggregation policy can reduce approximately 29% of the posts the existing minimum delay aggregation policy would miss.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Allblog, http://www.allblog.net.
Bloglines, http://www.bloglines.com.
B. Brewington and G. Cybenko, ‘How Dynamic is the Web?’, Proceedings of the 9th International World Wide Web Conference, pages 257–276, 2000.
J. Cho and H. Garcia-Molina, ‘Synchronizing a Database to Improve Freshness’, Proceedings of the 2000 ACM SIGMOD International Conference on Management of Data, pages 117-128, 2000.
J. Cho and H. Garcia-Molina, ‘The Evolution of the Web and Implications for an Incremental Crawler’, Proceedings of the 26th International Conference on Very Large Data Bases, pages 200–209, 2000.
K.E. Gill, ‘Blogging, RSS and the Information Landscape: A Look At Online News’, WWW 2005 2nd Annual Workshop on the Weblogging Ecosystem: Aggregation, Analysis and Dynamics, 2005.
Y.G. Han, S.H. Lee, J.H. Kim, and Y. Kim, ‘A New Aggregation Policy for RSS Services’, Proceedings of International Workshop on Context Enabled Source and Service Selection, Integration and Adaptation, 2008.
D. Johnson, ‘RSS and Atom In Action’, Manning, Greenwich, 2006.
S.J. Kim and S.H. Lee, ‘An Empirical Study on the Change of Web Pages’, Proceedings of the Seventh Asia-Pacific Web Conference, pages 632–642, 2005.
S.J. Kim and S.H. Lee, ‘Estimating the Change of Web Pages’, Proceedings of the International Conference on Computational Science 2007, pages 798–805, 2007.
S. Lawrence and C.L. Giles, ‘Accessibility of Information on the Web’, Nature, 400(6740), pages 107–109, 1999.
A. Ntoulas, J. Cho, and C. Olston, ‘What’s New on the Web? The Evolution of the Web from a Search Engine Perspective’, Proceedings of the 13th International World Wide Web Conference, pages 1–12, 2004.
Pew Internet and American Life, ‘The State of Blogging’, http://www.pewinternet.org/pdfs/PIP_blogging_data.pdf.
M. Rosenblum and J.K. Ousterhout, ‘The Design and Implementation of a Log-Structured File System’, ACM Transactions on Computer Systems, 10(1):26–52, February 1992.
RSS 2.0 Specification. http://blogs.law.harvard.edu/tech/rss.
K.C. Sia, J. Cho and H.K Cho, ‘Efficient Monitoring Algorithm for Fast News Alert’, IEEE Transaction on Knowledge and Data Engineering, 19(7):950–961, July 2007.
K.C. Sia, J. Cho, K. Hino, Y. Chi, S. Zhu and B.L. Tseng, ‘Monitoring RSS Feeds Based on User Browsing Pattern’, Proceedings of the International Conference on Weblogs and Social Media, 2007.
What is RSS?, http://www.xml.com/pub/a/2002/12/18/dive-into-xml.html.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2009 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Kim, J.H., Lee, S.H., Han, Y.G. (2009). An Effective Aggregation Policy for RSS Services. In: King, I., Baeza-Yates, R. (eds) Weaving Services and People on the World Wide Web. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-00570-1_4
Download citation
DOI: https://doi.org/10.1007/978-3-642-00570-1_4
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-00569-5
Online ISBN: 978-3-642-00570-1
eBook Packages: Computer ScienceComputer Science (R0)