Pingzhao Hu

Department of Biochemistry and Medical Genetics, University of Manitoba

“Machine Learning Approaches for Predicting Protein Functions and Disease Outcomes Using Omics Data”

Date: Thursday, November 20, 2014

Early detection of diseases such as cancer is known to be key to successful treatment. Classifiers based on clinical information alone typically have low prediction accuracy, and molecular classifiers are expected to be among the most promising tools for improving it. Building such classifiers requires identifying protein and gene biomarkers that are both clinically significant and relevant to biological function. However, although many genomes have been sequenced, almost half of the genes in these genomes have no functional annotation.

To address these issues, we developed network-based machine learning frameworks. For predicting sample disease status, we proposed identifying subnetwork-based biomarkers from a co-expression network and developed a modular linear discriminant analysis approach that integrates the ‘essential’ correlation structure among genes into the predictor, rather than either considering all types of correlations (e.g., strong, weak, and noise correlations) or ignoring correlations altogether. As a result, the correlated gene clusters related to the diagnostic classes of interest lend themselves to functional interpretation. For predicting protein functions, we devised an iterative relaxation labeling procedure that finds the most likely labeling of the proteins in a protein network. In contrast to traditional methods, which treat Gene Ontology (GO) terms as a flat structure, we addressed the multi-label, multi-class classification of protein functions by taking into account the correlations among GO terms within the GO hierarchy.
