Abstract
In this paper we present a concept as well as a prototype of a tool pipeline to utilize the abundant information available on the World Wide Web for contextual, user driven creation and display of language learning material. The approach is to capture Wikipedia articles of the user’s choice by crawling, to analyze the linguistic aspects of the text via natural language processing and to compile the gathered information into a visually appealing presentation of enriched language information. The tool is designed to address the Japanese language, with a focus on kanji, the pictographic characters used in Japanese scripture.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Wentland, W., et al.: Building a multilingual lexical resource for named entity disambiguation, translation and transliteration. In: Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC 2008). European Language Resources Association (ELRA), Marrakech (2008)
Judea, A., Nastase, V., Strube, M.: Concept-based selectional preferences and distributional representations from wikipedia articles. In: Proceedings of the Eight International Conference on Language Resources and Evaluation (LREC 2012). European Language Resources Association (ELRA), Istanbul (2012)
Fujii, A., Fujii, Y., Tokunaga, T.: Effects of document clustering in modeling wikipedia-style term descriptions. In: Proceedings of the Eight International Conference on Language Resources and Evaluation (LREC 2012). European Language Resources Association (ELRA), Istanbul (2012)
Lin, C.Y., Hovy, E.: The automated acquisition of topic signatures for text summarization. In: Proceedings of the 18th Conference on Computational Linguistics, COLING 2000, vol. 1, pp. 495–501. Association for Computational Linguistics, Stroudsburg (2000)
Bond, F., et al.: Enhancing the Japanese WordNet. In: Proceedings of the 7th Workshop on Asian Language Resources, ALR7, pp. 1–8. Association for Computational Linguistics, Stroudsburg (2009)
Strube, M., Ponzetto, S.P.: Wikirelate! computing semantic relatedness using wikipedia. In: Proceedings of the 21st National Conference on Artificial Intelligence, pp. 1419–1424. AAAI Press (2006)
McClain, Y.: Handbook of Modern Japanese Grammar. Hokuseido Press (1981)
Kudo, T.: Mecab: Yet another part-of-speech and morphological analyzer, http://mecab.googlecode.com/svn/trunk/mecab/doc/index.html (last accessed: April 28, 2013)
Denis, A., et al.: Representation of linguistic and domain knowledge for second language learning in virtual worlds. In: Proceedings of the Eight International Conference on Language Resources and Evaluation (LREC 2012). European Language Resources Association (ELRA), Istanbul (2012)
Moneglia, M., et al.: The IMAGACT cross-linguistic ontology of action. a new infrastructure for natural language disambiguation. In: Proceedings of the Eight International Conference on Language Resources and Evaluation (LREC 2012). European Language Resources Association (ELRA), Istanbul (2012)
Lefever, E., Hoste, V., Cock, M.D.: Discovering missing wikipedia inter-language links by means of cross-lingual word sense disambiguation. In: Proceedings of the Eight International Conference on Language Resources and Evaluation (LREC 2012). European Language Resources Association (ELRA), Istanbul (2012)
Breen, J.: Multiple indexing in an electronic kanji dictionary. In: Enhancing and Using Electronic Dictionaries, COLING, Geneva, Switzerland, pp. 1–7 (2004)
Saravanan, K., et al.: An empirical study of the occurrence and co-occurrence of named entities in natural language corpora. In: Proceedings of the Eight International Conference on Language Resources and Evaluation (LREC 2012). European Language Resources Association (ELRA), Istanbul (2012)
Sweller, J., van Merrienboer, J.J., Paas, F.G.: Cognitive architecture and instructional design. Educational Psychology Review 10(3), 251–296 (1998)
Suchanek, F.M., et al.: Yago2s: Modular high-quality information extraction with an application to flight planning. In: BTW, pp. 515–518 (2013)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Wloka, B., Winiwarter, W. (2013). COLLEAP – COntextual Language LEArning Pipeline. In: Wang, JF., Lau, R. (eds) Advances in Web-Based Learning – ICWL 2013. ICWL 2013. Lecture Notes in Computer Science, vol 8167. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-41175-5_34
Download citation
DOI: https://doi.org/10.1007/978-3-642-41175-5_34
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-41174-8
Online ISBN: 978-3-642-41175-5
eBook Packages: Computer ScienceComputer Science (R0)