LOYOLA COLLEGE (AUTONOMOUS), CHENNAI – 600 034 M.Sc. DEGREE EXAMINATION – BIO TECHNOLOGY SUPPLEMENTARY EXAMINATION – JUNE 2007 BT 3814 - BIOSTATISTICS AND BIOINFORMATICS Date & Time: 27/06/2007 / 9:00 - 12:00 Dept. No. Max. : 100 Marks PART A Answer all the following questions (20 marks) I. Choose the correct answer (5 x 1 = 5 ) 1. OMIM is a _________ database. a) disease b) metabolic pathways c) gene d) protein 2. BLOSUM is _________. a) scoring matrix b) prediction tool c) algorithm d) smithwaterman algorithm 3. _________ is used for database searching. a) fasta b) PAM c) clustalw d) PDB 4. The central value of the variable in a set of observations arranged in ascending or descending order is a. Mean b. Median c. Mode d. Variance 5. Uncertainty is numerically expressed as a. Binomial distribution c. Normal distribution b. Poison distribution d. Frequency distribution II. State whether the following statements are True or False 6. (5 x 1 = 5 ) GC% is high near gene regions. 7. Blocks is a protein secondary database. 8. CATH is a secondary structure prediction tool. 9. Amount of rainfall and yield of crop is a negative correlation. 10. Pie diagram is a semi-circular graph whose area is subdivided into sectors by radii. III. Complete the following (5 x 1 = 5) 11. _________ is the algorithm used for local alignment. 12. Restriction sites can be identified using _________. 13. ------------------ are measurable characteristics which can be numerically expressed in terms of units. 14. ------------- is used to compare an observed ratio with an expected ratio. 15. The value in a series which occurs most frequently is -----------------. IV. Answer the following, each within 50 words.. (5 x 1 = 5) 16. scoring matrix. 17. back translation 18. Co-efficient of variation 19. Kurtosis 20. Regression curve PART B Answer any FIVE of the following, each not exceeding 350 words. Choose not more than three in each section. (5 x 8 = 40) Section - I 21. Explain BLAST and its applications. 22. Explain GC% and the software for the prediction of CpG islands. 23. Discuss the steps in the smithwaterman algorithm for local alignment. 24. Discuss the rules in a pattern. Section - II 25. Calculate the arithmetic mean and standard error for the following data. Class interval 10 – 20 Frequency 3 20 -30 5 30-40 10 40-50 15 50-60 5 60-70 12 26. Calculate the correlation coefficient between X and Y for the following data. X Y 1 10 2 11 3 12 4 14 5 13 6 15 7 16 8 17 9 18 27. In a cross between black and white coat coloured mice, the F2 indvidual segregated into 787 black and 277 white coat coloured individuals. If you have to test these results agree with the expected ratio 3 : 1, then apply chi square P = 5%. 28. Bring out the salient features of normal distribution. PART C Answer any two of the questions in about 1500 words. Draw necessary diagrams (2 x 20 = 40) 29. a) Explain the procedure for multiple sequence alignment and its applications and output format. OR b) Discuss the procedures for the discovery of a drug. . 30. (a) In order to find the effect of Azolla growth on the rice field and experimentally grown Azolla in 10 similar field plots before rice planting and other 10 similar plots were taken as control without Azolla growth. Rice was grown in all these plots and yields were noted. Plot 1 No. With 15.3 Azolla Without 14.5 Azolla 2 3 4 5 6 7 8 9 10 15.8 16.1 17.0 15.5 16.5 16.2 15.5 17.1 16.3 13.8 15.9 13.9 14.8 14.9 15.2 15.0 14.1 13.7 (b) The following data give the yields on 12 plots of land in three samples under three varieties of fertilizers. A B C 25 20 24 22 17 26 24 16 30 21 19 20 Is there any significant difference in the average yields of land under the three varieties of fertilizers? ****************