2013 CRA-W Graduate Cohort Workshop Finding a Research Topic Carla Brodley Professor and Chair, Department of Computer Science Tufts University (with credits to Lori Pollack and Padma Raghavan) Academic History • • • • • • Started graduate school, UMASS…………….Fall 1988 Ph.D. awarded………………………………….Aug 1994 Started as Assistant Professor, Purdue….….Nov 1994 Promoted to Assoc. Prof. w/ tenure ………Spring 2000 Started as a Full Professor, Tufts ..………..…Fall 2004 Department Chair, Tufts……………………….Fall 2010 The Thesis Equation Topic + Advisor = Dissertation What is (CS) Research? • the systematic investigation into and study of materials, sources, etc., in order to establish facts and reach new conclusions Oxford dictionary – Experimental scientific research: • Observe a problem • Formulate a hypothesis • Develop a strategy to solve problem based on hypothesis • Perform experiments and demonstrate conclusive evidence • Interpret results What is (CS) Research? • the systematic investigation into and study of materials, sources, etc., in order to establish facts and reach new conclusions Oxford dictionary – Theoretical scientific research: • Identify an open question • Formulate a hypothesis • Prove hypothesis Research is not knowing the answer or how to get it What is CS Research? Example from Machine Learning Classification k-Nearest Neighbor oo o oooo o oo xxx x x x xx x ? Classification k-Nearest Neighbor oo o oooo o ? oo xxx x x x xx x Classification k-Nearest Neighbor o o oooo o ?x o xx x x x xx x oo Assign majority class of the k nearest neighbors What is CS Research? Example from Machine Learning • Observe a problem: Performance of k-NN is little better than random guessing for a particular dataset • Hypothesis: Classification accuracy will improve if I can find and eliminate irrelevant and noisy features. • Strategy: Develop a feature selection algorithm: eliminate features with low correlation with the class label • Evaluation/Evidence: Implement and compare accuracy of original k-NN to new feature selection kNN across a large number of data sets. • Interpret results: Feature selection improves performance in M of the N datasets, …next steps????? So, what isn’t PhD research? How do I choose a topic area for my research? • Whose interest do you need to grab? – You – Your advisor – Your research community • Love your topic! – Sets the course for your next 2-3 years – Determines, in part, opportunities offered to you upon graduation – May work in same/related area for years More Things to Consider • What are your strengths? weaknesses? – Programming, design, data analysis, proofs – Key insights versus long/detailed verification/simulation • What drives you? bores you? – Technology, puzzles, applications, interdisciplinary • Do you (i.e., your advisor) have funding for you to work in the area? – Working as a TA – Working as an RA – Having university/college, government, industry, etc… fellowship/scholarship/grant Which comes first? Advisor or Topic Area? • For many people “advisor before topic” – Meet faculty member with compelling research interests • For some people “topic before advisor” – Need a guide in an area already of great interest to you • Want an advisor – Knowledgeable about your topic • Interdisciplinary topics may require >1 advisor – With compatible working style (e.g., solo vs team) – With lots of research ideas – With strong interest in working with PhD students Focusing from Area to Topic • Area = subfield – architecture, theory, AI, high performance computing, or interdiscplinary – Is it important? Timely? Jobs in the area? • Topic = specific open problems in subfield – – – – – Theory: provably better algorithm AI: Improving a machine learning algorithm Architecture: multicore cache design HPC: parallel algorithm, scheduling scheme Interdisciplinary: computer simulation of tumor growth Topic Scale and Scope • Scale – Should have more than one open problem, or solving one should lead to another – Should lead to more than one result/finding, some big, some smaller • Scope – Too narrow, e.g., just analysis no experiment, many not leave enough room – Too broad, e.g., data mining, for what? why? too open ended Selecting a Topic • Moving from coursework to picking a topic is often a low point – Even for the most successful students • Why? – Going from what you knowcoursework, to something newresearch – It is very important – There is no *one* ideal way, but many good ways 7 Ways to Identify a Good Research Problem 1) Flash of Brillance • You wake up one day with a new insight/idea • New approach to solve an important open problem • Warnings: – This rarely happens if at all – Even if it does, you may not be able to find an advisor who agrees 2) The Apprentice • Your advisor has a list of topics • Suggests one (or more!) that you can work on • Can save you a lot of time/anxiety • Warnings: – Don’t work on something you find boring, fruitless, badly-motivated,… – Several students may be working on the same/related problem 3) The Extended Course Project • You take a project course that gives you a new perspective • The project/paper combines your research project with the course project – One (and ½) project does double duty • Warnings: – This can distract from your research if you can’t find a related project/paper 4) Redo … Reinvent • You work on some projects – Re-implement or re-do; Evaluate – Identify an improvement, algorithm, proof • You have now discovered a topic • Warnings: – You may be without “a topic” for a long time – It may not be a topic worthy of a doctoral thesis 5) Analyze Data • You participate in more senior student’s evaluation study: – Help with data collection and analysis – Identify open challenges • You have now discovered a topic • Warnings: – You will have to agree on who works on identified open challenges – It may not be a topic worthy of a doctoral thesis 6) The Stapler • You work on a number of small topics that turn into a series of conference papers • You figure out somehow how to tie it all together, create a chapter from each paper, and put a BIG staple through it • Warning: – May be hard/impossible to find the tie 7) The Synthesis Model • You read papers from other subfields in computer science or a related field • Look for places to apply insight from another (sub)field to your own – E.g., machine learning to compiler optimizations • Warnings: – You can read a lot of papers and not find a connection – Or realize someone has done it already! – Or you have not made a significant impact in either field Tips and Suggestions • Topic + advisor are both important • Keep a research ideas “journal” (wiki) • Keep an annotated bibliography (bibtex) • Follow your interests and passion – Key driver for success and impact • Are you eager to get to work, continue working? • If not really interested, adapt – Tedium or actual lack of interest and motivation? When you’re stuck at the start • Read/present papers regularly to find open research issues – Practice summarizing, synthesizing & comparing sets of papers – Write your own slides for presentations • Work with a senior PhD student on their research • Try something…. • Get feedback and ideas from others: conferences, research internships, advisor’s idea When you’re still stuck… • Read a PhD thesis in your area – Often contain an ‘open problems’ or ‘future work’ section • Read your advisor’s grant proposals • Attend PhD oral exams and thesis defenses – Understand how to formulate problems – Understand what constitutes a problem solution • Assess your progress, with your advisor – Set goals per semester - Have you ruled out an area? converged on an area? Chosen a topic for an exploratory research project? When you’re stuck again • Divide your topic into milestones, and develop a plan to work on them one-byone – Reward yourself when you finish a milestone – Publications and/or posters as milestones – Vary what you do during the day, but set aside blocks of time for each activity • Assess your progress regularly, with your advisor – Have you submitted a workshop paper? A term project with documentation? A poster at a conference? A talk at a regional conf? When you’re really really stuck • Change research topics? – May move you out of your advisor’s comfort zone of expertise – Starting from “scratch” (e.g., need to learn the related work in a new area) • Change research advisor? – May go through ‘shakedown’ period again – May or may not be better off • But change can be invigorating – What’s hard? Need to recognize when things are not working out and take action – Weigh consequences of changing and not changing The Six Questions…. (from Paul Utgoff) • What research issue(s) interest you most? Why? • Who else has worked in this vein? What did they accomplish? What can't they do? • What kind of progress would you like to see? Why? • Do you have an idea for making some such progress? Explain. • What do you expect to discover from your investigation? • How will your expected result(s) affect the research community? So how did I find my topic? • At ICML1990, I was irritated by – “Yet Another Learning Algorithm (YALA)” – Strategic selection of UCI benchmark datasets to show YALA’s superiority • My idea: Given a dataset, select the “best” algorithm automatically for that dataset • My next observation: Why should we assume all parts of the data space have the same bias? • My next idea: Recursive automatic bias selection Identify a research topic and get started! Great relevant article in ACM Crossroads, “How to Succeed in Graduate School: A Guide for Students and Advisors”, (part I, Dec 1994; part II, Feb 1995), available in ACM Digital Library