phd in db

advertisement
PhD in Databases
introduction
• why PhD in databases?
– everyone should have a reason!
• which topic should I choose?
– usually the supervisor does it for you, when you’re young
– when you grow up, you get to choose the things you get involved
with
• what are the requirements? how long does it take?
– typically, in NTUA it takes at least 3 years and a refereed
international journal publication is required
– in reality, they are more strict and are set by your supervisor
• what is a good PhD?
– it should cover in detail an interesting research area and offer
novel technical insight
– as a result, a PhD student should have a good publication record
example publication venues
• conferences
– Very Large Databases (VLDB)
• journals
– ACM Transactions on Database Systems (TODS)
• workshops
– International Workshop on the Web and Databases (WebDB)
• national events
– Hellenic Data Management Symposium (HDMS)
• other? symposia
conferences
• a conference attracts the most recent, state-of-the-art, work
on a broad range of topics
– (usually) annual event, organized around the world
– highly competitive, great visibility/impact
• submission process
– first an abstract is submitted
– a week later the full version is due
• 12 double column pages, strict formatting
– around 2 months later the decisions are announced
• review process
– three referees summarize the contributions and give marks on
relevance, novelty, presentation, impact, technical depth, overall
recommendation
• scores: reject, weak reject, neutral, weak accept, accept, strong accept
– discussion phase among referees, before final decision
conferences
types of papers
• full paper
– requires presentation: 25 mins + 5 mins questions
• short paper / poster paper
– a reduced printed version of the paper, e.g., 6 pages
– shorter (20 mins) or no presentation
• demo paper
– describe a system implementation
– a demonstration and poster is required in special session
• all papers appear in the proceedings of the conference
– published by major publishers (ACM, IEEE press, Springer)
conferences
some terms
• Call for Papers (CFP) contains all information, defines topics
• Important Dates specify the submission deadlines and event
dates
• Organizing Committee: people responsible for the local
arrangements
• Program Committee (PC): people that do the review process
• Organizing / PC Chair: person in charge
• Acceptance Rate: the ratio of accepted vs. submitted papers
– not always representative of the quality of the venue
• Tracks/Areas: target group separation
– research vs. industrial track
– separate committees
• Sessions: accepted papers are thematically organized into
parallel sessions
conference ranking
• a very old (partially subjective) list (from NUS 1999)
http://www.cc.gatech.edu/~guofei/CS_ConfRank.htm#db
AREA: Data Bases
Rank 1:
SIGMOD: ACM SIGMOD Conf on Management of Data
PODS: ACM SIGMOD Conf on Principles of DB Systems
VLDB: Very Large Data Bases
ICDE: Intl Conf on Data Engineering
ICDT: Intl Conf on Database Theory
Rank 2:
SSD: Intl Symp on Large Spatial Databases
DEXA: Database and Expert System Applications
FODO: Intl Conf on Foundation on Data Organization
EDBT: Extending DB Technology
DOOD: Deductive and Object-Oriented Databases
DASFAA: Database Systems for Advanced Applications
CIKM: Intl. Conf on Information and Knowledge Management
SSDBM: Intl Conf on Scientific and Statistical DB Mgmt
CoopIS - Conference on Cooperative Information Systems
ER - Intl Conf on Conceptual Modeling (ER)
Rank 3: …
workshops
• like conferences, but with a more focused subject
– they are organized in parallel with a conference
– some have unofficial proceedings, other full published proceedings
• how to recognize good workshops?
– long-running, well-established
– the good ones are always in collaboration with good conferences
• what should go in a workshop?
– dblab diploma theses are excellent candidates
• somewhat limited impact
– however, some very good papers have appeared in workshops
journals
• what are the differences with conferences?
– no call (except for special issues) --- send anytime
– longer, more detailed reviewing phase --- can take a year
• revisions, answer letters
– longer page limits
– often high acceptance rates
– only a few good but a lot of bad journals
• which paper should you send there?
– extended version of conference papers, with only 30% new material
– not previously accepted papers are also good candidates
• why bother with journals?
– in all scientific disciplines, except in CS, journals are more
important/prestigious than conferences
– people not familiar with CS and DB usually judge a CV by #journals
journal ranking
a rather subjective list
•
•
•
•
•
•
ACM Transaction on Database Systems (TODS)
VLDB Journal (VLDBJ)
IEEE Transactions on Knowledge and Data Engineering (TKDE)
Elsevier Information Systems
Elsevier Data and Knowledge Engineering (DKE)
…
reading papers
• is it important?
– yes! sometimes, more important than reading text books
• what do I read?
– read recent bibliography on your topic
– find a good survey paper
– read good papers, even if they are not directly related to your
topic
• where do I find them?
–
–
–
–
look for them in the most recent and relevant good conferences
ask your supervisor!
search the web
get them from DBLP, google (scholar), etc.
writing papers
• what goes in a paper?
– research paper: it presents an interesting solution to an interesting
problem
– survey paper: reviews all relevant literature for an interesting
topic
• structure of a paper
– abstract, (1) introduction, (2) related work, (3) problem statement,
(4) solution, (5) experiments, (6) conclusion, references
• a lot of advice available on the web
– http://people.csail.mit.edu/mernst/advice/write-technicalpaper.html
– http://www.cs.uoi.gr/~pvassil/linx/localCopies4grads/Survey_guidel
ines_byPV.pdf
submitting papers
• choose carefully the right venue
– based on the topic
– based on the quality of your work
• be objective, compare your work with others
– your supervisor knows better
• how to handle rejection
– don’t despair!
– reviews can be noisy/unfair, resubmit…
– if you get consistently bad reviews  you should lower your
expectations
– consider journals, where you can answer the reviewers’ comments
• how to handle acception
– don’t cheer!
– reviews can be noisy/unfair, you’re lucky
– consider creating an extended version and submit it to a journal
web sources
• DBLP (computer science archive)
– http://www.informatik.uni-trier.de/~ley/db/
• DBWorld (db announcements mailing list)
– http://www.cs.wisc.edu/dbworld/
• citeseer (digital library)
– http://citeseer.ist.psu.edu/
• Google Scholar (search engine for papers)
– http://scholar.google.com
Download