Abstract
Faced with huge amounts of information to realize the accurate retrieval under the network environment, the first step is indexing words cannot appear ambiguity word. Because Chinese’s the basic unit is Chinese characters, Chinese characters form words, Word is divided into monosyllabic word and compound word, and there’s no space between Chinese keywords and there are a lot of ambiguous concept. Therefore a lot of ambiguity in the indexing process will be produced. The result detected information of irrelevant or mistakenly identified. The paper focuses on a method to eliminating the crossed meanings ambiguous words in the automatic indexing. The paper puts forward a method to eliminating ambiguous words combined algorithm of exhaustive method and disambiguation rules. Experiments show that it can avoid a great lot segmenting ambiguities with better segmenting results.
Chapter PDF
Similar content being viewed by others
References
Li, D., Cao, Y., Wan, Y.: New Security Feature Extraction Method Based on Association Rules. Computer Engineering and Applications (S1), 105–107 (2006)
Xiao, H., Xu, S.-H.: A Method of Automatic Keyword Extraction based on Co-occurrence Model. Transactions of Shenyang Ligong University (5), 38–41 (2009)
Su, X., Liu, X., Shao, P.: The Word-indexand Position Retrievalforthe Document TitlesIn Chinese. Journal of Nanjing University(Natural Sciences Edition) (2), 329–333 (1990)
Weng, H.: Comparison Studies on Inconsistencies and Ambiguity Automatic Identification Method in Chinese Information Processing. Language Applied Research (12), 93–94 (2006)
Li, G., Liu, K., Zhang, Y.: Segmentating Chinese Word and Processing Different Meanings Structure. Journal of Chinese Information Processing (3), 27–32 (1988)
Yao, J.-W., Zhao, D.: Disambiguation Method in Chinese Word Segmentation Based on Phrase Match. Journal of Jilin University(Science Edition) 48(3), 427–432 (2010)
Bai, S.: Chinese word segmentation and POS integrated approach to automatic annotation. In: Advances in Computational Linguistics and Applied, Beijing, pp. 56–61. Tsinghua University Press (1995)
Cai, J.: "Chinese Library Classification" professional classification "Agricultural Professional Classification". Beijing. Library Press (October 1999)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 IFIP International Federation for Information Processing
About this paper
Cite this paper
Dan, W., Xiaorong, Y., Jie, Z. (2014). Elimination Method Study of Ambiguous Words in Chinese Automatic Indexing. In: Li, D., Chen, Y. (eds) Computer and Computing Technologies in Agriculture VII. CCTA 2013. IFIP Advances in Information and Communication Technology, vol 420. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-54341-8_9
Download citation
DOI: https://doi.org/10.1007/978-3-642-54341-8_9
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-54340-1
Online ISBN: 978-3-642-54341-8
eBook Packages: Computer ScienceComputer Science (R0)