Lecture Notes in Computer Science 6184
Commenced Publication in 1973
Founding and Former Series Editors: Gerhard Goos, Juris Hartmanis, and Jan van Leeuwen

Editorial Board
David Hutchison    Lancaster University, UK
Takeo Kanade    Carnegie Mellon University, Pittsburgh, PA, USA
Josef Kittler    University of Surrey, Guildford, UK
Jon M. Kleinberg    Cornell University, Ithaca, NY, USA
Alfred Kobsa    University of California, Irvine, CA, USA
Friedemann Mattern    ETH Zurich, Switzerland
John C. Mitchell    Stanford University, CA, USA
Moni Naor    Weizmann Institute of Science, Rehovot, Israel
Oscar Nierstrasz    University of Bern, Switzerland
C. Pandu Rangan    Indian Institute of Technology, Madras, India
Bernhard Steffen    TU Dortmund University, Germany
Madhu Sudan    Microsoft Research, Cambridge, MA, USA
Demetri Terzopoulos    University of California, Los Angeles, CA, USA
Doug Tygar    University of California, Berkeley, CA, USA
Gerhard Weikum    Max-Planck Institute of Computer Science, Saarbruecken, Germany

Lei Chen, Changjie Tang, Jun Yang, Yunjun Gao (Eds.)

Web-Age Information Management
11th International Conference, WAIM 2010
Jiuzhaigou, China, July 15-17, 2010
Proceedings

Volume Editors

Lei Chen
Hong Kong University of Science and Technology
Department of Computer Science
Clear Water Bay, Kowloon, Hong Kong, China
E-mail: leichen@cs.ust.hk

Changjie Tang
Sichuan University, Computer Department
Chengdu 610064, China
E-mail: cjtang@scu.edu.cn

Jun Yang
Duke University, Department of Computer Science
Box 90129, Durham, NC 27708-0129, USA
E-mail: junyang@cs.duke.edu

Yunjun Gao
Zhejiang University, College of Computer Science
388 Yuhangtang Road, Hangzhou 310058, China
E-mail: gaoyj@zju.edu.cn

Library of Congress Control Number: 2010929625
CR Subject Classification (1998): H.3, H.4, I.2, C.2, H.2, H.5
LNCS Sublibrary: SL 3 – Information Systems and Application, incl. Internet/Web and HCI

ISSN 0302-9743
ISBN-10 3-642-14245-1 Springer Berlin Heidelberg New York
ISBN-13 978-3-642-14245-1 Springer Berlin Heidelberg New York

This work is subject to copyright. All rights are reserved, whether the whole or part of the material is concerned, specifically the rights of translation, reprinting, re-use of illustrations, recitation, broadcasting, reproduction on microfilms or in any other way, and storage in data banks. Duplication of this publication or parts thereof is permitted only under the provisions of the German Copyright Law of September 9, 1965, in its current version, and permission for use must always be obtained from Springer. Violations are liable to prosecution under the German Copyright Law.

springer.com
© Springer-Verlag Berlin Heidelberg 2010
Printed in Germany
Typesetting: Camera-ready by author, data conversion by Scientific Publishing Services, Chennai, India
Printed on acid-free paper 06/3180

Preface

WAIM is a leading international conference on research, development, and applications of Web technologies, database systems, and information management. Traditionally, WAIM has drawn the strongest participation from the Asia-Pacific region. The previous WAIM conferences were held in Shanghai (2000), Xi'an (2001), Beijing (2002), Chengdu (2003), Dalian (2004), Hangzhou (2005), Hong Kong (2006), Huangshan (2007), Zhangjiajie (2008), and Suzhou (2009). In 2010, WAIM was held in Jiuzhaigou, Sichuan, China.

This high-quality program would not have been possible without the authors who chose WAIM for disseminating their contributions. Out of 205 submissions from 16 countries and regions, including Australia, Canada, France, Germany, Hong Kong, Japan, Korea, Macau, Malaysia, Mainland China, Saudi Arabia, Singapore, Taiwan, Thailand, UK, and USA, we selected 58 full papers and 11 short papers for publication. The acceptance rate for regular full papers was 28%.
The contributed papers addressed a wide range of topics, including Web, XML, and multimedia data, data processing in the cloud or on new hardware, data mining and knowledge discovery, information integration and extraction, networked data and social networks, graph and stream processing, and similarity search. We are also grateful to our distinguished keynote speakers Prof. Jianzhong Li, Dr. Divesh Srivastava, Prof. Katsumi Tanaka, and Prof. Xiaofang Zhou. A conference like WAIM can only succeed as a team effort. We want to thank the Program Committee members and the reviewers for their invaluable efforts. Special thanks go to the local Organizing Committee headed by Changjie Tang, Aoying Zhou, and Lei Duan. Many thanks also go to our Workshop Co-chairs (Jian Pei and Hengtao Shen), Tutorial Co-chairs (Liu Wenyin and Jian Yang), Publicity Co-chairs (Hua Wang and Shuigeng Zhou), Industrial Co-chairs (Qiming Chen and Haixun Wang), Registration Chair (Chuan Li), and Finance Co-chairs (Howard Leung and Yu Chen). Last but not least, we wish to express our gratitude for the hard work of our webmaster Jie Zuo, and to our sponsors, who generously supported the smooth running of our conference.

Lei Chen
Changjie Tang
Jun Yang
Masaru Kitsuregawa
Qing Li

WAIM 2010 Conference Organization

Honorary Chair
Yi Zhang    Sichuan University, China

Conference Co-chairs
Masaru Kitsuregawa    University of Tokyo, Japan
Qing Li    City University of Hong Kong, Hong Kong

Program Committee Co-chairs
Lei Chen    Hong Kong University of Science and Technology, Hong Kong
Changjie Tang    Sichuan University, China
Jun Yang    Duke University, USA

Local Organization Co-chairs
Aoying Zhou    East China Normal University, China
Lei Duan    Sichuan University, China

Workshops Co-chairs
Jian Pei    Simon Fraser University, Canada
Hengtao Shen    University of Queensland, Australia

Tutorial/Panel Co-chairs
Wenyin Liu    City University of Hong Kong, Hong Kong
Jian Yang    Macquarie University, Australia

Industrial Co-chairs
Qiming Chen    HP Labs, Palo Alto, USA
Haixun Wang    Microsoft Research Asia, China

Publication Chair
Yunjun Gao    Zhejiang University, China

Publicity Co-chairs
Hua Wang    University of Southern Queensland, Australia
Shuigeng Zhou    Fudan University, China

Finance Co-chairs
Howard Leung    Hong Kong Web Society, Hong Kong
Yu Chen    Sichuan University, China

Registration Chair
Chuan Li    Sichuan University, China

CCF DB Society Liaison
Xiaofeng Meng    Renmin University of China, China

Steering Committee Liaison
Zhiyong Peng    Wuhan University, China

Web Master
Jie Zuo    Sichuan University, China

Program Committee
James Bailey    University of Melbourne, Australia
Gang Chen    Zhejiang University, China
Hong Chen    Chinese University of Hong Kong, Hong Kong
Yu Chen    Sichuan University, China
Reynold Cheng    The University of Hong Kong, Hong Kong
David Cheung    The University of Hong Kong, Hong Kong
Dickson Chiu    Dickson Computer Systems, Hong Kong
Byron Choi    Hong Kong Baptist University, Hong Kong
Bin Cui    Peking University, China
Alfredo Cuzzocrea    University of Calabria, Italy
Guozhu Dong    Wright State University, USA
Xiaoyong Du    Renmin University of China, China
Lei Duan    Sichuan University, China
Ling Feng    Tsinghua University, China
Johann Gamper    Free University of Bozen-Bolzano, Italy
Bryon Gao    Texas State University at San Marcos, USA
Yong Gao    University of British Columbia, Canada
Jihong Guan    Tongji University, China
Giovanna Guerrini    Università di Genova, Italy
Bingsheng He    Chinese University of Hong Kong, Hong Kong
Jimmy Huang    York University, Canada
Seung-won Hwang    Pohang University of Science and Technology, Korea
Wee Hyong    Microsoft
Yoshiharu Ishikawa    Nagoya University, Japan
Yan Jia    National University of Defence Technology, China
Ruoming Jin    Kent State University, USA
Ning Jing    National University of Defence Technology, China
Ben Kao    The University of Hong Kong, Hong Kong
Yong Kim    Korea Education & Research Information Service, Korea
Nick Koudas    University of Toronto, Canada
Wu Kui    Victoria University, Canada
Carson Leung    University of Manitoba, Canada
Chengkai Li    University of Texas at Arlington, USA
Chuan Li    Sichuan University, China
Feifei Li    Florida State University, USA
Tao Li    Florida International University, USA
Tianrui Li    Southwest Jiaotong University, China
Zhanhuai Li    Northwestern Polytechnical University, China
Zhoujun Li    Beihang University, China
Xiang Lian    Hong Kong University of Science and Technology, Hong Kong
Lipeow Lim    University of Hawaii at Manoa, USA
Xuemin Lin    University of New South Wales, Australia
Huan Liu    Arizona State University, USA
Lianfang Liu    Computing Center of Guangxi, China
Qizhi Liu    Nanjing University, China
Weiyi Liu    Yunnan University, China
Wenyin Liu    City University of Hong Kong, Hong Kong
Eric Lo    Hong Kong Polytechnic University, Hong Kong
Zongmin Ma    Northeastern University, China
Weiyi Meng    State University of New York at Binghamton, USA
Mohamed Mokbel    University of Minnesota, USA
Yang-Sae Moon    Kangwon National University, Korea
Akiyo Nadamoto    Konan University, Japan
Miyuki Nakano    University of Tokyo, Japan
Raymond Ng    University of British Columbia, Canada
Anne Ngu    Texas State University at San Marcos, USA
Tadashi Ohmori    University of Electro Communications, Japan
Olga Papaemmanouil    Brandeis University, USA
Zhiyong Peng    Wuhan University, China
Evaggelia Pitoura    University of Ioannina, Greece
Tieyun Qian    Wuhan University, China
Shaojie Qiao    Southwest Jiaotong University, China
Markus Schneider    University of Florida, USA
Hengtao Shen    University of Queensland, Australia
Yong Tang    Sun Yat-sen University, China
David Taniar    Monash University, Australia
Maguelonne Teisseire    University Montpellier 2, France
Anthony Tung    National University of Singapore, Singapore
Shunsuke Uemura    Nara Sangyo University, Japan
Jianyong Wang    Tsinghua University, China
Ke Wang    Simon Fraser University, Canada
Tengjiao Wang    Peking University, China
Wei Wang    University of New South Wales, Australia
Raymond Wong    University of New South Wales, Australia
Raymond Chi-Wing Wong    Hong Kong University of Science and Technology, Hong Kong
Xintao Wu    University of North Carolina at Charlotte, USA
Yuqing Wu    Indiana University at Bloomington, USA
Junyi Xie    Oracle Corp., USA
Li Xiong    Emory University, USA
Jianliang Xu    Hong Kong Baptist University, Hong Kong
Jian Yang    Macquarie University, Australia
Xiaochun Yang    Northeastern University, China
Ke Yi    Hong Kong University of Science and Technology, Hong Kong
Hwanjo Yu    Pohang University of Science and Technology, Korea
Jeffrey Yu    Chinese University of Hong Kong, Hong Kong
Lei Yu    State University of New York at Binghamton, USA
Philip Yu    University of Illinois at Chicago, USA
Ting Yu    North Carolina State University, USA
Xiaohui Yu    York University, Canada
Demetris Zeinalipour    University of Cyprus, Cyprus
Donghui Zhang    Microsoft Jim Gray Systems Lab, USA
Ji Zhang    University of Southern Queensland, Australia
Baihua Zheng    Singapore Management University, Singapore
Aoying Zhou    East China Normal University, China
Shuigeng Zhou    Fudan University, China
Xiangmin Zhou    CSIRO, Australia
Qiang Zhu    University of Michigan at Dearborn, USA
Lei Zou    Peking University, China

Organized by
Sichuan University

Sponsored by
华东师范大学 East China Normal University

Table of Contents

Analyzing Data Quality Using Data Auditor (Keynote Abstract)
    Divesh Srivastava
Rebuilding the World from Views (Keynote Abstract)
    Xiaofang Zhou and Henning Köhler
Approximate Query Processing in Sensor Networks (Keynote Abstract)
    Jianzhong Li

Web Data I

Duplicate Identification in Deep Web Data Integration
    Wei Liu, Xiaofeng Meng, Jianwu Yang, and Jianguo Xiao
Learning to Detect Web Spam by Genetic Programming
    Xiaofei Niu, Jun Ma, Qiang He, Shuaiqiang Wang, and Dongmei Zhang
Semantic Annotation of Web Objects Using Constrained Conditional Random Fields
    Yongquan Dong, Qingzhong Li, Yongqing Zheng, Xiaoyang Xu, and Yongxin Zhang
Time Graph Pattern Mining for Web Analysis and Information Retrieval
    Taihei Oshino, Yasuhito Asano, and Masatoshi Yoshikawa

Networked Data

FISH: A Novel Peer-to-Peer Overlay Network Based on Hyper-deBruijn
    Ye Yuan, Guoren Wang, and Yongjiao Sun
Continuous Summarization of Co-evolving Data in Large Water Distribution Network
    Hongmei Xiao, Xiuli Ma, Shiwei Tang, and Chunhua Tian
Proactive Replication and Search for Rare Objects in Unstructured Peer-to-Peer Networks
    Guoqiang Gao, Ruixuan Li, Kunmei Wen, Xiwu Gu, and Zhengding Lu
SWORDS: Improving Sensor Networks Immunity under Worm Attacks
    Nike Gui, Ennan Zhai, Jianbin Hu, and Zhong Chen
Efficient Multiple Objects-Oriented Event Detection over RFID Data Streams
    Shanglian Peng, Zhanhuai Li, Qiang Li, Qun Chen, Hailong Liu, Yanming Nie, and Wei Pan

Social Networks

CW2I: Community Data Indexing for Complex Query Processing
    Mei Hui, Panagiotis Karras, and Beng Chin Ooi
Clustering Coefficient Queries on Massive Dynamic Social Networks
    Zhiyu Liu, Chen Wang, Qiong Zou, and Huayong Wang
Predicting Best Answerers for New Questions in Community Question Answering
    Mingrong Liu, Yicen Liu, and Qing Yang
Semantic Grounding of Hybridization for Tag Recommendation
    Yan’an Jin, Ruixuan Li, Yi Cai, Qing Li, Ali Daud, and Yuhua Li
Rich Ontology Extraction and Wikipedia Expansion Using Language Resources
    Christian Schönberg, Helmuth Pree, and Burkhard Freitag

Cloud Computing

Fine-Grained Cloud DB Damage Examination Based on Bloom Filters
    Min Zhang, Ke Cai, and Dengguo Feng
XML Structural Similarity Search Using MapReduce
    Peisen Yuan, Chaofeng Sha, Xiaoling Wang, Bin Yang, Aoying Zhou, and Su Yang
Comparing Hadoop and Fat-Btree Based Access Method for Small File I/O Applications
    Min Luo and Haruo Yokota

Data Mining I

Mining Contrast Inequalities in Numeric Dataset
    Lei Duan, Jie Zuo, Tianqing Zhang, Jing Peng, and Jie Gong
Users’ Book-Loan Behaviors Analysis and Knowledge Dependency Mining
    Fei Yan, Ming Zhang, Jian Tang, Tao Sun, Zhihong Deng, and Long Xiao
An Extended Predictive Model Markup Language for Data Mining
    Xiaodong Zhu and Jianzheng Yang
A Cross-Media Method of Stakeholder Extraction for News Contents Analysis
    Ling Xu, Qiang Ma, and Masatoshi Yoshikawa

Stream Processing

An Efficient Approach for Mining Segment-Wise Intervention Rules in Time-Series Streams
    Yue Wang, Jie Zuo, Ning Yang, Lei Duan, Hong-Jun Li, and Jun Zhu
Automated Recognition of Sequential Patterns in Captured Motion Streams
    Liqun Deng, Howard Leung, Naijie Gu, and Yang Yang
Online Pattern Aggregation over RFID Data Streams
    Hailong Liu, Zhanhuai Li, Qun Chen, and Shanglian Peng
Cleaning Uncertain Streams by Parallelized Probabilistic Graphical Models
    Qian Zhang, Shan Wang, and Biao Qin

Graph Processing

Taming Computational Complexity: Efficient and Parallel SimRank Optimizations on Undirected Graphs
    Weiren Yu, Xuemin Lin, and Jiajin Le
DSI: A Method for Indexing Large Graphs Using Distance Set
    Yubo Kou, Yukun Li, and Xiaofeng Meng
K-Radius Subgraph Comparison for RDF Data Cleansing
    Hai Jin, Li Huang, and Pingpeng Yuan

Query Processing

A Novel Framework for Processing Continuous Queries on Moving Objects
    Liang Zhao, Ning Jing, Luo Chen, and Zhinong Zhong
Group Visible Nearest Neighbor Queries in Spatial Databases
    Hu Xu, Zhicheng Li, Yansheng Lu, Ke Deng, and Xiaofang Zhou
iPoc: A Polar Coordinate Based Indexing Method for Nearest Neighbor Search in High Dimensional Space
    Zhang Liu, Chaokun Wang, Peng Zou, Wei Zheng, and Jianmin Wang
Join Directly on Heavy-Weight Compressed Data in Column-Oriented Database
    Gan Liang, Li RunHeng, Jia Yan, and Jin Xin

Potpourri

Exploiting Service Context for Web Service Search Engine
    Rong Zhang, Koji Zettsu, Yutaka Kidawara, and Yasushi Kiyoki
Building Business Intelligence Applications Having Prescriptive and Predictive Capabilities
    Chen Jiang, David L. Jensen, Heng Cao, and Tarun Kumar
FileSearchCube: A File Grouping Tool Combining Multiple Types of Interfile-Relationships
    Yousuke Watanabe, Kenichi Otagiri, and Haruo Yokota
Trustworthy Information: Concepts and Mechanisms
    Shouhuai Xu, Haifeng Qian, Fengying Wang, Zhenxin Zhan, Elisa Bertino, and Ravi Sandhu

Web Data II

How to Design Kansei Retrieval Systems?
    Yaokai Feng and Seiichi Uchida
Detecting Hot Events from Web Search Logs
    Yingqin Gu, Jianwei Cui, Hongyan Liu, Xuan Jiang, Jun He, Xiaoyong Du, and Zhixu Li
Evaluating Truthfulness of Modifiers Attached to Web Entity Names
    Ryohei Takahashi, Satoshi Oyama, Hiroaki Ohshima, and Katsumi Tanaka
Searching the Web for Alternative Answers to Questions on WebQA Sites
    Natsuki Takata, Hiroaki Ohshima, Satoshi Oyama, and Katsumi Tanaka
Domain-Independent Classification for Deep Web Interfaces
    Yingjun Li, Siwei Wang, Derong Shen, Tiezheng Nie, and Ge Yu

Data Mining II

Data Selection for Exact Value Acquisition to Improve Uncertain Clustering
    Yu-Chieh Lin, De-Nian Yang, and Ming-Syan Chen
Exploring the Sentiment Strength of User Reviews
    Yao Lu, Xiangfei Kong, Xiaojun Quan, Wenyin Liu, and Yinlong Xu
Semantic Entity Detection by Integrating CRF and SVM
    Peng Cai, Hangzai Luo, and Aoying Zhou
An Incremental Method for Causal Network Construction
    Hiroshi Ishii, Qiang Ma, and Masatoshi Yoshikawa
DCUBE: CUBE on Dirty Databases
    Guohua Jiang, Hongzhi Wang, Shouxu Jiang, Jianzhong Li, and Hong Gao

XML and Images

An Algorithm for Incremental Maintenance of Materialized XPath View
    Xueyun Jin and Husheng Liao
Query Processing in INM Database System
    Jie Hu, Qingchuan Fu, and Mengchi Liu
Fragile Watermarking for Color Image Recovery Based on Color Filter Array Interpolation
    Zhenxing Qian, Guorui Feng, and Yanli Ren
A Hybrid-Feature-Based Efficient Retrieval over Chinese Calligraphic Manuscript Image Repository
    Yi Zhuang and Chengxiang Yuan
Efficient Filtering of XML Documents with XPath Expressions Containing Ancestor Axis
    Bo Ning, Chengfei Liu, and Guoren Wang

New Hardware

ACAR: An Adaptive Cost Aware Cache Replacement Approach for Flash Memory
    Yanfei Lv, Xuexuan Chen, and Bin Cui
GPU-Accelerated Predicate Evaluation on Column Store
    Ren Wu, Bin Zhang, Meichun Hsu, and Qiming Chen
MOSS-DB: A Hardware-Aware OLAP Database
    Yansong Zhang, Wei Hu, and Shan Wang

Similarity Search

Efficient Duplicate Record Detection Based on Similarity Estimation
    Mohan Li, Hongzhi Wang, Jianzhong Li, and Hong Gao
A Novel Composite Kernel for Finding Similar Questions in CQA Services
    Jun Wang, Zhoujun Li, Xia Hu, and Biyun Hu
Efficient Similarity Query in RFID Trajectory Databases
    Yanqiu Wang, Ge Yu, Yu Gu, Dejun Yue, and Tiancheng Zhang

Information Extraction

Context-Aware Basic Level Concepts Detection in Folksonomies
    Wen-hao Chen, Yi Cai, Ho-fung Leung, and Qing Li
Extracting 5W1H Event Semantic Elements from Chinese Online News
    Wei Wang, Dongyan Zhao, Lei Zou, Dong Wang, and Weiguo Zheng
Automatic Domain Terminology Extraction Using Graph Mutual Reinforcement
    Jingjing Kang, Xiaoyong Du, Tao Liu, and He Hu

Knowledge Discovery

Semi-supervised Learning from Only Positive and Unlabeled Data Using Entropy
    Xiaoling Wang, Zhen Xu, Chaofeng Sha, Martin Ester, and Aoying Zhou
Margin Based Sample Weighting for Stable Feature Selection
    Yue Han and Lei Yu
Associative Classifier for Uncertain Data
    Xiangju Qin, Yang Zhang, Xue Li, and Yong Wang

Information Integration

Automatic Multi-schema Integration Based on User Preference
    Guohui Ding, Guoren Wang, Junchang Xin, and Huichao Geng
EIF: A Framework of Effective Entity Identification
    Lingli Li, Hongzhi Wang, Hong Gao, and Jianzhong Li
A Multilevel and Domain-Independent Duplicate Detection Model for Scientific Database
    Jie Song, Yubin Bao, and Ge Yu

Extending Databases

Generalized UDF for Analytics Inside Database Engine
    Meichun Hsu, Qiming Chen, Ren Wu, Bin Zhang, and Hans Zeller
Efficient Continuous Top-k Keyword Search in Relational Databases
    Yanwei Xu, Yoshiharu Ishikawa, and Jihong Guan
V Locking Protocol for Materialized Aggregate Join Views on B-Tree Indices
    Gang Luo

Web Information Credibility (Keynote Abstract)
    Katsumi Tanaka

Author Index