Opinion Digger: An Unsupervised Opinion Miner from Unstructured Product Reviews
Samaneh Moghaddam, Martin Ester
CopyRight@luzhonghao
Lab for Internet Software Technologies

Abstract
• Propose an unsupervised method for extracting aspects from unstructured reviews, using a set of known aspects.
• Introduce an unsupervised method for rating each aspect (on a scale from 1 to 5) based on the rating guideline.

Problem Definition
[Figure: known aspects, output aspects]

Aspect Extraction
• Finding frequent noun phrases
• Mining opinion patterns
• Filtering out non-aspects

Finding Frequent Noun Phrases
• Hypothesis: frequent nouns and noun phrases form the set of potential aspects.
• Apply the Apriori algorithm to find all frequent multi-part noun phrases (minimum support = 1%).

Mining Opinion Patterns
• Find matching phrases for each known aspect: search for the known aspect in the reviews and take the nearest adjective in the same sentence segment as its sentiment.
• Save each matching phrase and take the POS tags of all its words as a pattern.
• After mining all POS patterns, use GSP (Generalized Sequential Pattern) to find the frequent patterns (minimum support = 1%).

Filtering out non-aspects
• pnum: the number of opinion patterns that a potential aspect matches at least once.
• If pnum < 2, eliminate the potential aspect.

Compute Aspect Rating
• For each aspect, extract the nearest adjective as its sentiment.
• For each sentiment of every aspect of a product, search the WordNet synonymy graph to find two rated synonyms from the rating guideline (available on the website Epinions.com).
• Compute the rating of each sentiment.

Sentiment rating space

Compute each sentiment

Experimental results

Evaluation of Aspect Extraction

Thank you!
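
Appendix: a sketch of the filtering step
The nearest-adjective rule and the pnum filter from the aspect extraction slides can be illustrated with a minimal Python sketch. Several things here are assumptions made for illustration and are not taken from the slides: sentence segments arrive already POS-tagged as (word, tag) pairs, an opinion pattern is a tuple of POS tags in which the placeholder "ASP" marks the aspect position, a pattern matches only as a contiguous run of tags, and all function names are hypothetical.

```python
from typing import List, Optional, Sequence, Tuple

Segment = List[Tuple[str, str]]   # one sentence segment as (word, POS tag) pairs
Pattern = Tuple[str, ...]         # POS tags, with "ASP" standing for the aspect


def nearest_adjective(segment: Segment, aspect_idx: int) -> Optional[str]:
    """Return the adjective (tag starting with 'JJ') closest to aspect_idx."""
    best, best_dist = None, None
    for i, (word, tag) in enumerate(segment):
        if tag.startswith("JJ"):
            dist = abs(i - aspect_idx)
            if best_dist is None or dist < best_dist:
                best, best_dist = word, dist
    return best


def tag_sequence(segment: Segment, aspect: Tuple[str, ...]) -> Optional[List[str]]:
    """Replace the first occurrence of the aspect words by one 'ASP' tag."""
    words = [w.lower() for w, _ in segment]
    tags = [t for _, t in segment]
    n = len(aspect)
    for i in range(len(words) - n + 1):
        if tuple(words[i:i + n]) == aspect:
            return tags[:i] + ["ASP"] + tags[i + n:]
    return None  # the aspect does not occur in this segment


def contains(seq: Sequence[str], pattern: Pattern) -> bool:
    """True if pattern occurs as a contiguous run inside seq (an assumption)."""
    m = len(pattern)
    return any(tuple(seq[i:i + m]) == pattern for i in range(len(seq) - m + 1))


def pnum(aspect: Tuple[str, ...], segments: List[Segment],
         patterns: List[Pattern]) -> int:
    """Count the distinct opinion patterns the aspect matches at least once."""
    matched = set()
    for segment in segments:
        seq = tag_sequence(segment, aspect)
        if seq is None:
            continue
        matched.update(p for p in patterns if contains(seq, p))
    return len(matched)


def filter_aspects(candidates, segments, patterns, min_patterns: int = 2):
    """Keep candidates with pnum >= min_patterns (the slides drop pnum < 2)."""
    return [a for a in candidates if pnum(a, segments, patterns) >= min_patterns]


# Toy data, invented for this example.
segments = [
    [("great", "JJ"), ("battery", "NN"), ("life", "NN")],
    [("the", "DT"), ("battery", "NN"), ("life", "NN"), ("is", "VBZ"), ("short", "JJ")],
    [("great", "JJ"), ("day", "NN")],
]
patterns = [("JJ", "ASP"), ("ASP", "VBZ", "JJ")]
candidates = [("battery", "life"), ("day",)]

kept = filter_aspects(candidates, segments, patterns)
sentiment = nearest_adjective(segments[1], 1)
```

In this toy run, "battery life" matches both patterns and is kept, while "day" matches only one and is eliminated. A real pipeline would obtain the tags from a POS tagger and the patterns from a sequential pattern miner such as GSP.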