Yiwei Zhou Email: Yiwei.Zhou@warwick.ac.uk Department of Computer Science Phone: +44(0) 7410511235 University of Warwick, Coventry, UK CV4 7AL EDUCATION Apr.2014-present PhD candidate in Data Mining and Machine Learning, University of Warwick, UK Nov.2014-Dec.2014 Visiting researcher in L3S research centre, University of Hannover, Germany Sept.2011-Mar.2014 M.S. in Information Engineering, Beijing University of Aeronautics and Astronautics, China Jan.2013-June.2013 Exchange Student in School of Compute, Technical University of Denmark, Denmark Sept.2007-July.2011 B.Eng. in Electrical Engineering, Beijing University of Aeronautics and Astronautics, China Honours and Awards: Chancellor's International Scholarship for PhD study; rank No.1 out of 232 students in undergraduate degree National Scholarship (highest honour in China, for all-round ability, awards to 1 out of 100 students) 2009 Model Student of Academic Records 2009, 2010, 2011; 1st Prize scholarship of Academic Performance 2009, 2010, 2011 RESEARCH EXPERIENCE Sept.2015-Jan.2016 Online Timeline Generation for Real-world Events in Twitter Applying online incremental clustering to cluster tweets describing the same sub-events of “Ebola outbreak”. Summarise each sub-event tweets cluster to generate real-time timeline. Jun.2014-Sept.2015 Language-specific Bias Analysis of Multilingual Wikipedia Built entity-centric graph employing Wikipedia’s link structure to extract relevant articles. Analyse language-specific bias in multilingual Wikipedia for events and named entities. Extracted language-specific contexts to support multilingual entity-centric information retrieval. Nov.2014-Dec.2014 Twitter Message-level Sentiment Analysis Employed a combination of various features, such as lexical features, LDA-based topical features, and word embedding features. Employed a two-step binary classification method for ternary classification. INTERNSHIP Aug.2013-Oct.2013 Alibaba Database Technology Department Software Engineering Intern Programmed in Perl to build status-monitoring tools of MySQL databases servers. Programmed in C to build a MySQL stress-testing tool. Jun.2013-Aug.2013 Baidu Search Engine Advertising Department Software Engineering Intern Programmed in Python utilising Hadoop to process China’s biggest search engine’s advertisement daily log. Analysed Key Accounts’ consumption trend, provided advice to improve their return on investment. Participated in the design of next generation online advertising trading platform. ACADEMIC WORKS Apr.2013-May.2013 New York Times Data Mining Project Team Leader Developed methods to automatically categorise New York Times articles. Employed a lexicon-based method to generate sentiment score time series of articles from different categories, to analyse the real-word events’ influence on the sentiment of New York Times. Built a web application using Jinja2 framework. July.2012-Aug.2012 Massive Data Processing/Cloud Computing Peking University Summer School Programmed in Java utilising Hadoop to build an inverted index of news collection, perform PageRank analysis on Wikipedia and perform canopy clustering to recommend films for Netflix users. PUBLICATIONS Y. Zhou, E. Demidova, and A. I. Cristea. Who likes me more? Analysing entity-centric language-specific bias in multilingual Wikipedia. In Proceedings of the 31th Annual ACM Symposium on Applied Computing, SAC ’16, 2016. Y. Zhou, A. I. Cristea, and Z. Roberts. Is Wikipedia really neutral? A sentiment perspective study of war-related Wikipedia articles since 1945. In Proceedings of the 29th Pacific Asia Conference on Language, Information and Computation, PACLIC 29, 2015. Y. Zhou, E. Demidova, and A. I. Cristea. Analysing entity context in multilingual Wikipedia to support entity-centric retrieval applications. In Proceedings of the 1st International KEYSTONE Conference, IKC 2015, 2015. R. Townsend, A. Tsakalidis, Y. Zhou, B. Wang,M. Liakata, A. Zubiaga, A. Cristea, and R. Procter. Warwickdcs: From phrase-based to target-specific sentiment recognition. In Proceedings of the 9th International Workshop on Semantic Evaluation, SemEval 2015, 2015. EXTRACURRICULAR ACTIVITIES Aug.2012-Sept.2012 China Electronics Technology Group Corporation International (Intern) Mar.2010-Mar.2011 Electronic Science and Technology Association (President) Sept.2008-Sept.2009 Student Union in Electrical Engineering Department (Vice-President) PROGRAMMING SKILLS Programming languages: Python, C, C++, Matlab, Java, Perl Programming environments: Linux, OS X, Windows