Table 2 Techniques Adopted in Existing and Proposed Algorithms Algorithm Document representation Similarity Measure Data set Existing Algorithms SHC ( Gad, Kamel 2010) Term weight (word/phrase relationship) Semantic Similarity ESHC-IntraCVS (Shaw and Xu 2009) Term frequency Cosine Similarity Verb argument Structure Concept similarity Measure Term occurrence Jaccard coefficient CBA ( Shehata ; Shehata et al. 2010) ICA( Liu et al. 2008) Reuters-21578 and 20Newsgroups UW-CAN dataset, 314 web pages from University of Waterloo ACM abstract articles, Reuters, Brown corpus, Usenet newsgroups 20NewsGroup corpus Proposed Algorithms TMARDC Term frequency MARDL, Sentence Similarity CCMARC Correlated Terms Semantic Similarity CCFICA Correlated Terms Semantic Similarity ACM abstract articles, 20Newsgroup ACM abstract articles, 20Newsgroup ACM abstract articles, 20Newsgroup