Table 2 Techniques Adopted in Existing and Proposed Algorithms

advertisement
Table 2 Techniques Adopted in Existing and Proposed Algorithms
Algorithm
Document
representation
Similarity Measure
Data set
Existing Algorithms
SHC
( Gad, Kamel 2010)
Term weight (word/phrase
relationship)
Semantic Similarity
ESHC-IntraCVS
(Shaw and Xu 2009)
Term frequency
Cosine Similarity
Verb argument Structure
Concept similarity
Measure
Term occurrence
Jaccard coefficient
CBA
( Shehata ; Shehata et
al. 2010)
ICA( Liu et al. 2008)
Reuters-21578 and 20Newsgroups
UW-CAN dataset, 314 web
pages from University of
Waterloo
ACM abstract articles,
Reuters, Brown corpus,
Usenet newsgroups
20NewsGroup corpus
Proposed Algorithms
TMARDC
Term frequency
MARDL, Sentence
Similarity
CCMARC
Correlated Terms
Semantic Similarity
CCFICA
Correlated Terms
Semantic Similarity
ACM abstract articles,
20Newsgroup
ACM abstract articles,
20Newsgroup
ACM abstract articles,
20Newsgroup
Download