CSCI/BIOL 4931- INTRODUCTION TO BIOINFORMATICS

advertisement
BIOL-4534/CSCI-4931
INTRODUCTION TO BIOINFORMATICS
Room- B3324
Wednesdays 4:00-6:50 pm
Course Goals
This course introduces the students to Bioinformatics, which uses computer databases to
store, retrieve and assist in understanding biological information. Genome-scale sequencing
projects have led to an explosion of genetic sequences available for automated analysis. These
gene sequences are the codes, which direct the production of proteins that in turn regulate all life
processes. The students will be shown how these sequences can lead to a much fuller
understanding of many biological processes allowing pharmaceutical and biotechnology
companies to determine for example new drug targets. Students will be introduced to the basic
concepts behind Bioinformatics and Computational Biology tools. Hands-on sessions will
familiarize students with the details and use of the most commonly used online tools and
resources. The course will cover the use of NCBI's Entrez, BLAST, PSI-BLAST, FASTA,
ClustalW, Pfam, PRINTS, BLOCKS, Prosite and the PDB. An introduction to basic molecular
biology, database design and the principles of programming languages will be provided.
Course Objectives:
At the end of the course the students should be able to:
1) Understand the basic terminologies in Bioinformatics
2) Learn some basic algorithms used in sequence similarity analysis.
3) Use biological databases available through Internet.
4) Write small programs in Bioinformatics using these databases.
Course Prerequisite:
Students should have basic skills for using the computer.
Course Methodology:
The course is designed for students from both biology and computer science background.
Initial lectures will introduce students with basic concepts in computer science and biology.
Learning algorithms and database concepts in Bioinformatics will follow this.
Appraisal:
Exams (Mid term and Final)
Assignments
Quizzes
Group presentations
Attendance
50%
20%
10%
15%
5%
Grading Scale
93+ = A;
73+ = C;
90+ = A-;
70+ = C-;
87+ = B+;
67+ = D+;
83+ = B;
63+ = D;
80+ = B-;
60+ = D-;
77+ = C+;
0+ = F
Required Textbook:
“Introduction to Bioinformatics: A Theoretical and Practical Approach”
By Stephen A. Karwetz and David D. Womble. Humanna Press.
Reference Book:
“Bioinformatics Computer Skill” by Gibas and Jambeck. O’Reilly press.
“Bioinformatics – Sequence and Genome Analysis” by David W. Mount, Cold spring Harbor
Laboratory Press.
“Bioinformatics- A Practical Guide to the Analysis of Genes and Proteins” by
Baxevanis and Ouellette.
Office Hours.
Monday (3:00 PM to 4.00 PM) or based on appointment.
Contact information:
Email to M. Bazlur Rashid: rashid@cl.uh.edu
Or call 281-283-3756
Tentative Syllabus
Fundamentals of Bioinformatics and Information Technology





Introduction to Bioinformatics- What is it? Why needed? Potential?
Experimental sources of biological data
Publicly available databases and servers
Operating systems.
Internet tools
Biology Students
Intro to Programming and Databases
Goal: Introduce concepts of programming and DB
Computer Science Students
Intro to Molecular Biology
Goal: Explain the basic concept of molecular biology
technology.
 Fundamental principles of programming
 Object-oriented programming using Java
Introduction to databases
 SQL
to computer science students.

Cell, Molecule, Gene, Chromosome, DNA,
RNA

Protein, Connection DNA-RNA-Protein,
Protein structures and functions

Bio-chemical properties of amino acids, Motif,
Domain

Protein Families, Evolution, Similarity and
Homology.
Assignment I
Bioinformatics Tools for Genome Analysis (Both groups converge)
Goal: To introduce the most important Bioinformatics software tools, and explains the principles and
algorithms that underpins them
 High throughput sequencing and sequence analysis
 Pair wise alignment and database searching
 Gene Finders and Feature analysis
 Algorithms for finding Transcriptional Regulatory Signals
 Comparative Genomics and phylogenetic analysis
Assignment II
Midterm Exam!!
Computational analysis of protein structure and functions
Goal: To understand the computational approach of protein classification and structure predication





Multiple alignments of protein sequences
Methods for pair wise alignment and motif discovery
Searching for homology in protein sequence databases.
Algorithms for predication of secondary and 3D structure or proteins.
Application of Neural Networks in protein structure prediction
Assignment III
Applications and Commercial Aspects of Bioinformatics
Goal: To outline the current and potential applications of bioinformatics and the legal, ethical and
commercial aspects of using biological data.
 Drug discovery
 Genetic basis of disease
 Personalized medicine and gene-based diagnostics
 Legal, ethical and commercial ramifications of bioinformatics
PERL Programming to Facilitate Biological Analysis
Goal: To provide students with the ability to program in PERL, the most popular programming language
in the bioinformatics community.
PERL programming for:
 Data manipulation
 File maintenance
 Regular Expression
 Application of Perl in Bioinformatics
Assignment IV
Computational Tools for Expression Analysis
Goal: To provide an understanding of Microarray technology and its implications in the health care
industry.
 Microarray Technology
 DNA and Protein microarrays
 Analysis of expression array data using available software
 Gene expression informatics
Bioinformatics Research
Goal: To equip students with the skills and understanding required beginning a career in bioinformatics
research.
 How to carry out research - with a particular emphasis on computer-based research
 Introduction of the general research topics in Bioinformatics
 Recent discoveries using Bioinformatics.
Intro to Bioinformatics
Tentative Schedule
Spring 2007
Date
Topic
January 17
Lecture 1
Introduction
January 24
Lecture 2
Chapter 2
January 31
Lecture 3
Chapter 4
February 07
Lecture 4
Chapter 5
February 14
Lecture 5
NCBI databases and programs
Assignment #1 due
February 21
Lecture 6
Sequence alignment algorithm
Chapter 27 and handouts
February 28
Lecture 7
BLAST algorithm
March 07
Mid-term
Assignment #2 due
March 14
Spring Break
March 21
Lecture 8
FASTA algorithm
March 28
Lecture 9
Phylogenetic analysis
Assignment #3 due
April 04
Lecture 10
Phylogenetic analysis
April 11
Lecture 11
Chapter 34
DNA micro-array analysis
April 18
Lecture 12
Predicting structure of proteins
April 25
May 02
Review
Assignment #4 due
FINAL
Download