Websom for Textual Data Mining
作者:Krista Lagus, Timo Honkela, Samuel Kaski, Teuvo Kohonen
摘要
New methods that are user-friendly and efficient are needed for guidanceamong the masses of textual information available in the Internet and theWorld Wide Web. We have developed a method and a tool called the WEBSOMwhich utilizes the self-organizing map algorithm (SOM) for organizing largecollections of text documents onto visual document maps. The approach toprocessing text is statistically oriented, computationally feasible, andscalable – over a million text documents have been ordered on a single map.In the article we consider different kinds of information needs and tasksregarding organizing, visualizing, searching, categorizing and filteringtextual data. Furthermore, we discuss and illustrate with examples howdocument maps can aid in these situations. An example is presented wherea document map is utilized as a tool for visualizing and filtering a stream ofincoming electronic mail messages.
论文关键词:data mining, document filtering, exploratory data analysis, information retrieval, self-organizing map, SOM, text document collection, WEBSOM
论文评审过程:
论文官网地址:https://doi.org/10.1023/A:1006586221250