News datasets are among the most abundant data sources for recording events happening around people. For news event detection, people usually need to collect related news articles and explore major events manually. Exploring major events in large news datasets is difficult because the amount of data grows quickly with the rapid development of the Web and because news articles consist of unstructured data. How to discover events from such unstructured articles has become an important problem. In this paper, we propose an event detection algorithm based on a five-dimensional named-entity feature and random walk with restart to detect events in news articles with unstructured data. The first part of the algorithm categorizes news terms into five predefined named-entity types by exploring Wikipedia pages, in order to generate more distinctive features for each news article. The second part aggregates news articles by their pairwise similarity using a random-walk-with-restart clustering algorithm. The experimental results show that the proposed algorithm is effective. In particular, they demonstrate that the algorithm provides better event detection quality than other approaches in its ability to handle multi-event news articles.
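To make the second step more concrete, the following is a minimal sketch of random walk with restart over an article similarity graph. It is not the paper's implementation: the function names, the restart probability of 0.15, the convergence tolerance, and the toy similarity matrix are all assumptions chosen for illustration, and the abstract does not specify how the similarities or the final clusters are computed.

```python
def column_normalize(sim):
    """Scale each column of a square similarity matrix so it sums to 1."""
    n = len(sim)
    col_sums = [sum(sim[i][j] for i in range(n)) for j in range(n)]
    return [
        [sim[i][j] / col_sums[j] if col_sums[j] else 0.0 for j in range(n)]
        for i in range(n)
    ]


def random_walk_with_restart(sim, seed, restart=0.15, tol=1e-10, max_iter=1000):
    """Return the affinity of every node to `seed`.

    Iterates r = (1 - restart) * W r + restart * e until the L1 change
    drops below `tol`, where W is the column-normalized similarity matrix
    and e is the indicator vector of the seed node.
    """
    n = len(sim)
    w = column_normalize(sim)
    e = [1.0 if i == seed else 0.0 for i in range(n)]
    r = e[:]
    for _ in range(max_iter):
        nxt = [
            (1 - restart) * sum(w[i][j] * r[j] for j in range(n)) + restart * e[i]
            for i in range(n)
        ]
        if sum(abs(a - b) for a, b in zip(nxt, r)) < tol:
            return nxt
        r = nxt
    return r


# Hypothetical pairwise similarities for four articles: articles 0 and 1
# are strongly related, as are articles 2 and 3 (illustrative values only).
sim = [
    [0.0, 0.9, 0.1, 0.0],
    [0.9, 0.0, 0.0, 0.1],
    [0.1, 0.0, 0.0, 0.8],
    [0.0, 0.1, 0.8, 0.0],
]
scores = random_walk_with_restart(sim, seed=0)
```

In this toy graph, article 1 receives a higher affinity to the seed article 0 than articles 2 and 3 do, which is the kind of signal a clustering step could use to group articles describing the same event. How the resulting affinity vectors are turned into clusters is left open here.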