Project Bioinformatics MOL075 2013-2014 First sit down and think about the research project. What is the meaning of the data? What is the research question that you are going to answer? Where in this project are you going to transfer information? Your task We expect you to write, in teams of three students, a report of max 7 A4 (excluding appendices). You will make the project report in this Word file. Do not use 2 columns text on a page, use the full width of the page. Hand in your report digitally at C.vanGelder@umcn.nl as a doc file (not a pdf!) Deadline for report is 15 nov 2013, 9:00 hr. Remember that the report needs to be printed to be corrected and graded, so check that it can be printed! General hints & remarks for the report: General remarks: o Choose carefully what information is relevant to show, and how you want to show it. Use figures, tables, pictures etc. to illustrate your results. You can for instance in the Introduction paragraph show schematically the domains in a protein sequence. o Don’t make your text a “bullet list” of results where the user has to draw his own conclusions. You will take the reader by the hand and lead him through your text in a story. o Avoid constructs like “We ran BLAST to…and then we did this and then we did that, etc. Write neutral texts: “Searches for homologs were performed with BLAST (version etc)” o Try to be as clear and specific as possible. When talking about a certain amino acid residue or mutation mention the protein involved, the residue name and the residue number. Do not use words like “mutation number 1 has no effect”, but rather “the A127P mutation has no effect”. o In the text, when talking about specific amino acids, use the amino acid numbering belonging to to the protein under study and not amino acid numbering related to homologues. . Figures and tables: o should be functional, comprehensive, and nearly self-explanatory o should be numbered and you must refer to them in the main text, otherwise no one will read them. o should always have a title and a legend explaining what is shown in those figures/tables. o Should not contain things that have nothing to do with the goal of the figure o Should contain labels when relevant Copyright: It is allowed to copy maximally 200 words literally from anywhere, provided you put double quotes around it and you provide a reference to the source from where you copied it. This is the only hard rule in this list of suggestions and the only rule that, if broken, results in 1 as your final project grade. 1 Title The report has a good title describing the topic of the study. Authors List the authors here including student numbers. Abstract The abstract will consist of a few sentences (max 7) where you summarise the main aspects of the project: Sentence 1 summarises the research question being solved in this project. Sentence 2 summarises your approach. Sentence 3 lists the main results/conclusion. Be specific here, do not use words like “mutation number 1” but use the amino acid name and number to indicate a mutation And that should normally be enough. If really needed, you can add a fourth sentence summarising some discussion points. Do not put literature references in the Abstract, nor tables or figures Do not use the names of bioinformatics tools in the abstract, describe the approach/method not the specific tools that you used. Do not use swissprot and PDB codes in the abstract. Describe the protein and/or gene with its proper name & species not by its code from a database. Often, you will write the abstract after you have written the complete report, because it can be easily distilled from the report text. Introduction (maximally1,5 A4) The introduction should explain the question and provide the background information needed for somebody of your own skill-level to understand what you have done to answer that question. The things really needed in the introduction are: 1. The molecule & and its role in biology: what is it , where does it do what etc. Also describe the important functional sites of the protein sequence and structure, like active site residues, ligands, protein domain structure, known mutations and anything else you find worth mentioning 2. The mutations investigated in this study, how were they found, what is their effect. You may want to put in a table with the mutations, but this can also be part of the Results & Discussion section. Please use at least one Figure to illustrate an aspect of the biological function that you think is relevant to show .. You may want to use one or two pictures to illustrate the Introduction. Be careful to choose a functional picture, not only a nice colourful image. Methods ( ~ 0,5 A4) The report has a short Methods section where you describe the methods and tools used. In principle, others should find enough information in the Methods section to allow them to repeat your studies Always start running your sequence with BLAST against the SwissProt database to discover if the sequence is already known or to find close homologues of the protein. Study the information of the proteins in the hit-list. You can use this info for your Introduction. Do NOT put results or discussion in the methods section. You also do not discuss here WHY you use certain tools, only HOW you use(d) them. The following sentence is an example of a good sentence that you can use: “Searches for homologs in SwissProt (version…) and PDB (has no version, so you cannot list it ) were performed with MRS BLAST. The PDB file 1ABC was used for all studies, except blabla”. 2 Put references to the tool and databases in the References section (and not in the Methods text!). So do not put URLs in the Methods section. In the case of tools, you can use the website of the tool as reference. Please include the version numbers of the databases and tools. Use the right tool for the job. Use BLAST to search for homologues, use CLUSTAL for alignments. So make sure not to use BLAST (which is a local alignment tool for fast database searching) but CLUSTAL to make the crucial alignment in the Results & Discussion section.. When analyzing structure(s) use the WHAT IF servers (http://swift.cmbi.ru.nl/servers/html/) to: 1. List the sequence of a PDB file (Under Administration) 2. Analyze protein-cofactor contacts (Under Protein Analysis) 3. Analyze hydrogen bonds Results & Discussion ( maximally 4 A4) In these sections the results of your study have to be described, including an explanation of the strategy you used, the steps you took, etc. They also include the final results, i.e. the answers to the biological question(s). Run BLAST against the PDB database to find out if the structure of this protein is already known. If the structure is not known, see if there is a file in the PDB that looks a lot like your protein and that you can use to transfer information. If you have found one: Look at the structure of the protein and try to understand what is going on Important: If you are going to transfer information from one sequence to another: Indicate the % identity and length of the alignment, and make clear if you are allowed to transfer information. Also describe carefully from which sequence your are transferring information and which sequence you are transferring information to. Include a Figure showing the crucial alignment used for transfer of information! This figure belongs here and not in the Appendix. When showing an alignment: 1. you should use a proportional font, e.g. Courier, otherwise the amino acids will not line up correctly. 2. make sure to have the names of the proteins in the alignment and not names like “sequence 1” etc. 3. Think of ways to illustrate important amino acids in your alignment. Use colours, numbers, boxes, labels, arrows etc. Be creative. Colour them and/or put a box around them. Also put the residue number of important amino acids in the alignment. 4. skip useless data. If an alignment has been made to only find out if two sequences are homologs, then you should not put the alignment in the report. But if an alignment was made to transfer information from one sequence to another, an alignment is needed. Be systematic. If you describe the mutations, name and describe them in the same order as you mentioned them in the introduction (if you did). Feel free to add small but clear 3D pictures for the mutations. If possible, combine two mutations in one picture. When showing structures/ parts of structures: o Don’t make pictures because they are nice, but because they make something clear to the reader that you think is important to get clear o Remove from pictures everything that does not add information useful for the point you want to make o In detailed pictures make sure to display the side chain atoms of the amino acids, not only the backbone. o When making pictures of molecules, use YASARA with a white background. o Remove hydrogens if showing them is not relevant. o Only label atoms or residues that you discuss in the text. o Do not zoom in too much, do not zoom out too much, think about what you want to show o Use colours only when needed to show something or to help the reader find a residue, atom, interaction, etc., and not because they make the picture nicer. Conclusion ( maximally1/2 A4) This is a short section containing the overall conclusion of your research. You can also talk about things like potential weaknesses in your study, suggestions for follow-up research and anything else you think should be discussed 3 Acknowledgements If you want you can add Acknowledgements. References Here you give a list of references. These are typically: 1. research papers 2. hyperlinks (to tools, databases & webservers). For a research paper: Cite the literature reference. You may want to include the URL (e.g. of PubMed) for the paper, but it should be preceded by the full literature reference!! Example for reference to a research paper: 1. Joosten RP, Joosten K, Cohen SX, Vriend G, Perrakis A. Automatic rebuilding and optimization of crystallographic structures in the Protein Data Bank. Bioinformatics 27:3392-8 (2011). Make sure to reference to a paper in case you borrowed a picture from that paper. Make sure that you quote the references in the text of the report Please choose one method of referencing and use this method for all references. Example of reference to tool or database: 1. Models were analyzed and results were visualized using YASARA View version 12.1 (http://www.yasara.org) There is more out there than Wikipedia! Use more sources to gather information otherwise it will cost you points. Appendices Feel free to use Appendices but choose carefully what to put in the main text, what to put in an Appendix, and what not to put in at all because you can also describe it in two sentences instead of showing a large computer output. Appendices contain additional information that is not required to follow the flow of the story you tell in the report. If you use Appendices make sure to refer to them in the main text, otherwise they will not be read at all. 4