This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License
|
||||||||
|
Paper Details
Paper Title
Efficient Text Mining Using Side Information of Documents
Authors
  Rosemary Tripura,  P.Selvaraj
Abstract
Due to the increasing availability of digital data, text document continue to grow as well hence the need of text mining. These digital documents comprise of the normal body text as well as side information. The side information will be in different formats for example hyperlinks and may contain useful information for mining. It is of utmost importance that the value of the side information be ascertained before consideration in the data selected for the text mining process as it may give an adverse impact on the quality of text mined. A principled way to perform the mining process is therefore required so as to maximize on the benefits of side information. In this paper, we use the Naive Bayes model to create an effective text mining approach.
Keywords- data mining, text mining, Stop word, word stemming, NLP.
Publication Details
Unique Identification Number - IJEDR1501075Page Number(s) - 409-414Pubished in - Volume 3 | Issue 1 | Jan 2015DOI (Digital Object Identifier) -    Publisher - IJEDR (ISSN - 2321-9939)
Cite this Article
  Rosemary Tripura,  P.Selvaraj,   "Efficient Text Mining Using Side Information of Documents", International Journal of Engineering Development and Research (IJEDR), ISSN:2321-9939, Volume.3, Issue 1, pp.409-414, Jan 2015, Available at :http://www.ijedr.org/papers/IJEDR1501075.pdf
Article Preview
|
|
||||||
|