Volume : IV, Issue : VIII, August - 2015

Determine Word Relevance in Document Queries Using TF–IDF

Ashok Koujalagi

Abstract :

In this paper, we examine the results of applying Term Frequency Inverse Document Frequency (TF–IDF) to determine what words in a corpus of documents might be more favorable to use in a query. As the term implies, TF–IDF calculates values for each word in a document through an inverse proportion of the frequency of the word in a particular document to the percentage of documents the word appears in. Words with high TF–IDF numbers imply a strong relationship with the document they appear in, suggesting that if that word were to appear in a query, the document could be of interest to the user. We provide evidence that this simple algorithm efficiently categorizes relevant words that can enhance query retrieval.
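
The weighting described above can be illustrated with a minimal sketch. It assumes one common formulation, in which a word's score is its relative frequency in the document multiplied by the logarithm of the total number of documents divided by the number of documents containing the word. The abstract does not state the exact formula, so this choice, the whitespace tokenization, the function name tf_idf, and the toy corpus are all assumptions made for illustration only.

```python
import math
from collections import Counter


def tf_idf(docs):
    """Return one {word: score} dict per document (illustrative helper)."""
    # Assumption: lowercase whitespace tokenization, no stemming or stop words.
    tokenized = [doc.lower().split() for doc in docs]
    n_docs = len(tokenized)

    # Document frequency: the number of documents that contain each word.
    df = Counter(word for tokens in tokenized for word in set(tokens))

    scores = []
    for tokens in tokenized:
        counts = Counter(tokens)
        total = len(tokens)
        # Assumed formulation: tf = count / total, idf = log(N / df).
        scores.append({
            word: (count / total) * math.log(n_docs / df[word])
            for word, count in counts.items()
        })
    return scores


# Hypothetical toy corpus, not taken from the paper.
corpus = [
    "the cat sat on the mat",
    "the dog chased the cat",
    "the bird sang at dawn",
]
scores = tf_idf(corpus)

# Rank the words of the first document from most to least distinctive.
for word, score in sorted(scores[0].items(), key=lambda kv: kv[1], reverse=True):
    print(f"{word}: {score:.3f}")
```

Under this formulation a word that occurs in every document, such as "the" in the toy corpus, scores zero no matter how often it appears, while a word confined to a single document scores highest there. Such words are the ones the abstract suggests are favorable query terms.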

DOI : 10.36106/ijsr

Cite This Article:

Ashok Koujalagi, Determine Word Relevance in Document Queries Using TF–IDF, International Journal of Scientific Research, Vol : 4, Issue : 8, August 2015


