Pingzhao Hu

Department of Biochemistry and Medical Genetics, University of Manitoba

“Machine Learning Approaches for Predicting Protein Functions and Disease Outcomes Using Omics Data ”

Date: Thursday, November 20, 2014

It has been known that earlier detection of diseases (e.g., cancer) is the key for treatments. Traditionally, clinical information-based classifier usually has low prediction accuracy. It has been expected that molecular classifier is one of the most promising tools to improve the accuracy. To do this, it is vital to identify clinically-significant while biological function relevant protein/gene biomarkers. However, although many genomes have been sequenced, almost half of genes in these sequenced genomes have no function information.

To solve the issues, we developed network-based machine learning frameworks. For predicting sample disease status, we proposed to identify subnetwork-based biomarkers from co-expression network and developed a modular-based linear discriminant analysis approach by integrating ‘essential’ correlation structure among genes into the predictor rather than considering all types of correlations (e.g., strong, weak and noise correlations) or ignoring all these correlations. Hence, the correlated gene clusters, which are related to the diagnostic classes we look for, can have potential functional interpretation. For predicting protein functions, we devised an iterative relaxation labeling procedure to find its maximally likely labeling on protein network. Contrary to the traditional methods, which treated gene ontology (GO) terms as a flat structure, we addressed the problem of multi-label multi-class classification of protein functions by taking into account the inter-correlation of GO terms in a hierarchy structure.


Congratulations to our October, 2018 graduates: Jenna Tichon (Ph.D.), Kanika Grover (M.Sc.), Mohammed Kamso (M.Sc.), Inesh Munaweera Arachchilage (M.Sc.), Joynob Siddiqua (M.Sc.), Shamsia Sobhan (M.Sc.).

Important Date

October 3: Fee Payment Deadline Fall Term

Upcoming Seminars

Statistics seminar: Bo Li: “Spatially Varying Autoregressive Models for Prediction of New HIV Diagnoses” — Thursday, September 20 at 2:45 p.m., P230 Duff Roblin.

Statistics seminar: Mohammad Jafari Jozani: “Quantile Regression with Nominated Samples: An example in Osteoporosis Analysis” — Thursday, September 27 at 2:45 a.m., P230 Duff Roblin.

Statistics seminar: Yuliya Martsynyuk — Thursday, October 4 at 2:45 a.m., P230 Duff Roblin.

Statistics seminar: Darja Barr: “Evidence Driven Best Practices for Teaching the Mathematical Sciences” — Thursday, October 11 at 2:45 a.m., P230 Duff Roblin.

Where are they now?

Taraneh Abarin, Ph.D. (2009)

Yanqing Yi, Ph.D. (2008)