PowerPoint - Mathematical Sciences

advertisement
College of Science
Bioinformatics
The University of Texas at El Paso
www.bioinformatics.utep.edu
Bioinformatics Research and
Training at UTEP
Ming-Ying Leung, Ph.D.
Professor, Department of Mathematical Sciences and
Director, Bioinformatics and Computational Science Programs
Director, Bioinformatics Core Facility
Border Biomedical Research Center (BBRC)
The University of Texas at El Paso, El Paso, Texas
College of Science
Bioinformatics
The University of Texas at El Paso
www.bioinformatics.utep.edu
Bioinformatics Program
Interdisciplinary program
supported by departments of
Biological Sciences
Chemistry
Computer Science
Mathematical Sciences
With 24 faculty and staff
members from the Colleges of
Science, Engineering, and
Health Sciences.
Bioinformatics Faculty and Staff
 Biological Sciences: Steve Aley, Igor Almeida, Sid Das,
Krinstine Garza, Kyle Johnson, Carl Lieb, Vanessa
Lougheed, Elizabeth Walsh, Jianying Zhang
 Chemistry: James Becvar, Ricardo Bernal, Mahesh
Narayan, Chuan Xiao
 Clinical Laboratory Science: Delfina Dominguez
 Computer Science: Martine Ceberio, Olac Fuentes,
Vladik Kreinovich,
 Electrical and Computer Engineering: Wei Qian
 Mathematical Sciences: Art Duval, Hamide Dogan,
Ming-Ying Leung, Naijun Sha
 Staff: Oliana Alikaj, Gerardo Cardenas
3
Program Goals
1. Training
NSF-Funded Undergraduate Participation
in Bioinformatics Training (UPBiT)
Professional Science Master (PSM) degree
in Bioinformatics
Computational Bioscience for Ph.D. in
Computational Science
Program Goals
2. Research
Biomolecular Sequence Analysis
Ecoinformatics and Phylogenetics
Medical Imaging Informatics
Molecular Structure and Dynamics
Proteomics Data Analysis
Students and Graduates
Current students and trainees:
16 Undergrad trainees in UPBiT, 16 M.S. , and 4 Ph.D. students
in computational bioscience.
Students and Graduates
Over 50 graduates working at:
•
•
Academic Institutions (e.g., Stanford, Vanderbilt, Texas Tech., UTEP)
Research Labs (e.g., Broad Inst., Whitehead Inst., M.D. Anderson)
Government agencies (e.g., NIH, USDA, USAMRIID)
•
Industry (Pfizer, Monsanto)
•
Biomolecular
Sequence Analysis
Ecoinformatics
and Phylogenetics
Research
Rotations
M
Molecular
Structure and
2 Dynamics
C i
Microarray and
Proteomics Data
Analysis
i1
Q(w)
|| w ||2
Undergraduate Participation in Bioinformatics Training (UPBiT)
NIH-Funded Bioinformatics Core Facility in the
Border Biomedical Research Center (BBRC)
Analytical
Cytology
Cell
Culture
Study Design &
Data Analysis
DNA
Analysis
Integrate Computational &
Experimental Approaches
Statistical
Bioinformatics
Statistical
Consulting Lab
Biomolecule
Analysis
e.g., RNA structure
prediction
e.g., data
mining
Bioinformatics Computing Lab
BBRC Bioinformatics Core Facility
• Bioinformatics Computing Lab
• Translational Bioinformatics Lab
• Structural Bioinformatics Lab
Specific Aims for
Bioinformatics Core Facility
Provide bioinformatics support to biomedical research:
 Supporting BBRC research clusters through computational
modeling, data management and interpretations, to final
production of and dissemination bioinformatics tools.
 Interfacing with other BBRC core facilities by establishing
protocols and operation manuals integrating computational
components in the experimental procedure.
 Providing consulting services for non-BBRC researchers,
strengthening collaborative efforts for interdisciplinary
research in the border region.
11
Computing Resources Development Plan
12
Connection with Other BBRC Units
13
Software Development
 RNAVLab: Virtual laboratory for RNA sequence analysis and
secondary structure prediction (with DNA sequencing core)




RNASSA: RNA Secondary Structure Analysis
InversFinder: Finding inversions
Segmenta: RNA Segmentation
Grid and Cloud Computing
 Post-translational modification studies (with Biomolecular
Characterization Core)
 GPI and GPI-anchored protein structure prediction based on mass
spectrometry data (in collaboration with Dr. Clemente Aguilar,
National Institute of Mathematical and Biological Synthesis)
 ISOGlyP (Isoform Specific O-Glycosylation Prediction)
(with Dr. Thomas Gerken as PI, submitted R01 in January 2014)
14
Glycosylation Site Prediction Software
(isoglyp.utep.edu)
15
Protein Structure, Dynamics
and Interactions
 Establish a standard toolset on Bioinformatics Grid
to facilitate the workflow for computational prediction
of protein structures, interactions, and docking models.
 Current BBRC projects include:
 Searching for cancer-related interactors for tumorassociated antigens from Human Protein Reference Database
(HPRD) and other sources.
 In silico design of nociceptin agonists for Post-Traumatic
Stress Disorder (PTSD) through halogen bonding. Poster to
be presented by Dr. Mahesh Narayan at the Advancing
Computational Biology @ Howard University Symposium.
16
17
Sequence Analysis Projects
 Motif detection to identify potential enhancer
elements critical for learning and memory (led
by Dr. K. Han)
 GPCR prediction and vaccine development
(Collaboration with Dr. Felix Guerrero of the USDAARS and Dr. Kyle Johnson, funded by the USDA-HSI
Program)
 Functional genomic analysis of high-risk
childhood leukemia (BBRC pilot project, led by
Dr. Jeremy Ross)
18
19
20
21
Pilot Project: Functional Genomic Analysis of
High-Risk Childhood Leukemia
Jeremy A. Ross, Ph.D., BBRC Research Assistant Professor
Ming-Ying Leung, Ph.D. Director, BBRC Bioinformatics Core
Joseph Knapka, M.S. Student, Bioinformatics
Derrick Oaxaca, M.S. Student, Biological Sciences
Harry Wilson, M.D., Chief Pathologist, El Paso Children’s Hospital
Benjamin Carcamo, M.D., Pediatric Oncologist, El Paso Children’s Hospital
Relevance to Health Disparities: Hispanic children have the highest rates of incidence and death from acute
lymphoblastic leukemia (ALL) in the U.S.
Major Objective: Provide significant insight into the genetic abnormalities driving ALL tumor cell growth and drug
resistance in the Hispanic population.
Approach: 1) Identify protein-encoding somatic mutations using whole-exome sequencing of DNA isolated from
an Hispanic pediatric leukemia patient cohort of new onset or relapsed ALL. 2) Develop and employ a pathwaybased computational pipeline to prioritize functionally relevant sequence variants and aid biological interpretation.
Project Highlight - BBRC “OncoMiner” Pipeline
Read
Alignment
and Variant
Calling
Map reads to
reference
genome and
indentifies
SNP/indels
QC Check
and Data
Cleanup
Remove low
quality reads and
variants located
in introns
Variant
Selection
Select nonsynonymous
mutations for
analysis
Variant
Filtering
Remove dbSNP /
1000 Genomes
redundancies
PROVEAN
Scoring
Prioritize
variants based
upon predicted
functional impact
Gene
Ontology
Group by
biological
process and
molecular
function
Literature
Search
Provide link to
available
PubMed / OMIM
data
Future Plan
 Continue to strengthen bioinformatics support to genomics
and proteomics data analysis.
 Expand software development capabilities for biomedical
informatics and translational bioinformatics research.
 Create portals from Bioinformatics Grid to high
performance computing platforms at UTEP and TACC, and
the RTRN Data Coordinating Center.
 Explore further training and professional opportunities
for students and graduates of the program
23
College of Science
Bioinformatics
The University of Texas at El Paso
www.bioinformatics.utep.edu
Contact Us
Email: bioinformatics@utep.edu
Web: www.bioinformatics.utep.edu
Facebook: www.facebook.com/Bioinformatics.UTEP
Phone: 915.747.8484
Download