Back
to top

Patents | Books | Journals | Refereed Conferences | Symposia | Thesis | Technical Reports | Workshops | Tutorials | Supplementary Data

Publications

Patents and Disclosures

  • Zhu, C.Q., Jurisica, I., Aviel-Ronen, S., Coe, B., Lam, W., Tsao, M.-S., D. Der, S. Compositions and methods for classifying lung cancer and prognosing lung cancer survival. 5 June 2009, TDC #2007-020-02.


  • Tsao, M.-S., Craddock, K., Lam, W., Buys, T., Jurisica, I., Shepherd, F.A. Methods and compositions for lung cancer prognosis. US61-171687, TDC# 2009-043-02, Filed 2009-04-21.


  • Jurisica, I., Shepherd, F. A., Tsao, M.-S., Zhu, C.Q., Der, S.D. Prognostic gene expression signature for squamous cell carcinoma of the lung. US61170743, TDC # 2009-007-01. Filed 2009-04-20.


  • D. Otasek, M. Ali, W. Xie, M. McGuffin, B. Devani, K. R. Brown, Jurisica, I. NAViGaTOR: A scalable tool for protein-protein interaction network analysis and visualization, Invention Disclosure, March 25, 2009. 2009-032.


  • M.-S. Tsao, C.-Q. Zhu, Jurisica, I., S. D. Der, F. A. Shepherd. A 12-gene prognostic gene signature for squamous cell carcinoma of lung. January 2009.


  • M.-S. Tsao, P. C. Boutros, S. Lau, F. A. Shepherd, L. Z. Penn, Jurisica, I., S. D. Der. Methods for biomarker identification and biomarker for non-small cell lung cancer. US61119936, TDC #2008-024-01, December, 4, 2008.


  • P. C. Boutros, S. Lau, F. A. Shepherd, S. D. Der, M.-S. Tsao, L. Z. Penn, Jurisica, I. A method to find all prognostic signatures from a microarray dataset. Invention Disclosure. July 2008.


  • P. C. Boutros, S. K. Lau, F. A. Shepherd, S. D. Der, M.-S. Tsao, L. Z. Penn, Jurisica, I. 2048 Novel Six-Gene Prognostic Markers for Non-Small Cell Lung Cancer. Invention Disclosure. July 2008.


  • M.-S. Tsao, S. A.-Ronen, Jurisica, I., C.-Q. Zhu, W. Lam, B. P. Coe. Gene markers of invasive cancer in lung bronchiole-alveolar carcinoma. IPD 2007-020-02. INV 08-011, US Provisional patent 61/059,085, B&P 10723-271, filed June 5, 2008.


  • M.-S. Tsao, F. A. Shepherd, Jurisica, I., C.-Q. Zhu, D. Strumpf, K. Ding, L. Seymour. A 15-gene prognostic signature for non-small cell lung cancer. June 2008.


  • S. A.-Ronen, B. P. Coe, S. K. Lau, G. C. Santos, C. Q. Zhu, D. Strumpf, Jurisica, I., W. L. Lam, M.-S. Tsao. Compositions and Methods for Classifying Lung Cancer and Prognosing Lung Cancer Survival. USP 61-059085, July 2008.


  • K. R. Brown and Jurisica, I. Interologous interaction database, Invention Disclosure, June 18, 2007. 2007-034


  • T. Kislinger and Jurisica, I. Detection of ovarian cancer biomarkers by proteomics and bioinformatics, US Provisional Patent 60/929, 861, May 22, 2007.


  • M. Sound-Tsao, S. Der, P. Boutros, S. Lau, M. Pintilie, F. Shepherd, Jurisica, I. A minimal set of prognostic marker genes for early stage Non-small cell lung cancer - Materials and Methods for Prognosing Lung Cancer - 6-gene classifier, 2007.


  • M. Sound-Tsao, S. Der, P. Boutros, S. Lau, M. Pintilie, F. Shepherd, Jurisica, I., L. Penn. A minimal set of prognostic marker genes for early stage Non-small cell lung cancer - Materials and Methods for Prognosing Lung Cancer - 3-gene classifier, US 11/940,707, 2007.


  • E. Xia., Jurisica, I., J. Waterhouse, E. Cialini. Sca-min-min: A Scalable Min-min Scheduling Heuristic for Heterogeneous Systems. IBM Invention Disclosure, IP&L Disclosure Evaluation: CA8-2007-0054, 2007. Patent filed February 21, 2008.


  • E. Xia, Jurisica, I., J. Waterhouse, V. Sloan. Dynamic selection of scheduling heuristics in heterogeneous systems. IBM Invention Disclosure, http://www.priorartdatabase.com/IPCOM/000148770/, 2006.


  • E. Xia, Jurisica, I., J. Waterhouse, V. Sloan. Run time estimation using TA3 case-based reasoning system for scheduling in a grid environment. IBM Invention Disclosure, http://www.priorartdatabase.com/IPCOM//000148769D, 2006.


  • M. Sound-Tsao, Jurisica, I., I. Seiden-Long, K. Brown. Potential new markers for colorectal cancer diagnosis and targeting. Invention Disclosure, 2005.


  • H. Dayani-Fard, Jurisica, I. Dynamic semi-structured repository for mining software and software-related information.
    US Patent US 6,339,776 B2, January 15, 2002;
    Canadian Patent 2,284,949, October 14, 2003.


Books

Cancer Informatics in Post Genomic Era

Jurisica, I., D. A. Wigle, and B. Wong. Cancer Informatics in the Post Genomic Era; Toward Information-Based Medicine
Series: Cancer Treatment and Research, Volume 137, Springer Verlag, July 2007.


Less than 50% of diagnosed cancers are cured using current treatment modalities. Many common cancers can already be fractionated into such therapeutic subsets with unique prognostic outcomes based on characteristic molecular phenotypes. It is widely expected that treatment approaches of complex cancer will soon be revolutionized by combining molecular profiling and computational analysis, which will result in the introduction of novel therapeutics and treatment decision algorithms that target the underlying molecular mechanisms of cancer.

The sequencing of the human genome was the first step in understanding the ways in which we are wired.  However, this genetic blueprint provides only a “parts list”, and neither information about how the human organism is actually working, nor insight into function or interactions among the ~30 thousand constitutive parts that comprise our genome. Considering that the 30 years of worldwide molecular biology efforts have only annotated about 10% of this gene set, and we know even less about proteins, it is comforting to know that high-throughput data generation and analysis is now widely available.

By arraying tens of thousands of genes and analyzing abundance of and interaction among proteins, it is now possible to measure the relative activity of genes and proteins in normal and diseased tissue. The technology and datasets of such profiling-based analyses will be described along with the mathematical challenges that face the mining of the resulting datasets.  We describe the issues related to using this information in the clinical setting, and the future steps that will lead to drug design and development to cure complex diseases such as cancer.

Knowledge Discovery in Proteomics

Jurisica, I. and D. Wigle. Knowledge Discovery in Proteomics. Mathematical & Computational Biology Series, Volume 8, Chapman & Hall/CRC Press, 2006.

Who knows useful things, not many things, is wise. Aeschylus (ca. 525-456 BC)

The nascent fields of bioinformatics and computational biology are currently an odd amalgam of everything from biologists with a computational bent, through physicists and mathematicians, to computer scientists and engineers sifting through the myriad of data and grappling with biological questions. Much of the excitement comes from a collective sense that there is something truly new evolving. Hardware and software limitations are declaring themselves as major challenges to managing and interpreting the avalanche of data from high-throughput biological platforms. This drinking from the fire hydrant'' sensation continues to spark interest and draw technical skill from other domains. As we move forward to true systems biology experimentation, it is increasingly obvious that experts in robotics, engineering, mathematics, physics, and computer science have become key players alongside traditional molecular biology.

Life sciences applications are typically characterized by multimodal representations, lack of complete and consistent domain theories, rapid evolution of domain knowledge, high dimensionality, and large amounts of missing information. Data in these domains require robust approaches to deal with missing and noisy information. Modern proteomics is no exception. As our understanding of protein structure and function becomes ever more complicated, we have reached a point in time where the actual management of data is a major hurdle to knowledge discovery. Many of the browse-through applications of yesterday are clearly not useful for computational manipulation. If the data was not created having data mining and decision support in mind, how well can it serve that purpose?

We felt this book was a timely discussion of some of the key issues in the field. In subsequent chapters we discuss a number of examples from our own experience that represent some of the challenges of knowledge discovery in high-throughput proteomics. This discussion is by no means comprehensive, and does not attempt to highlight all relevant domains. However, we hope to provide the reader with an overview of what we envision as an important and emerging field in its own right by discussing the challenges and potential solutions to the problems presented. We have selected five specific domains to discuss: (1) Mass spectrometry based protein analysis; (2) Protein--protein interaction network analysis; (3) Systematic high-throughput protein crystallization; (4) A systematic and integrated analysis of multiple data repositories using a diverse set of algorithms and tools; and (5) Systems biology. In each of these areas, we describe the challenges created by the type of data produced, and potential solutions to the problem of data mining within the domain. We hope this stimulates even more discussion, and newer and better ways to deal with the problems at hand.

 

  • Chaudhri, V.K., Jurisica, I., Koubarakis, M., Plexousakis, D., Topaloglou, T. The KBMS project and beyond. In Borgida, A. T. et al., (Eds.), Conceptual Modeling: Foundations and Applications, LNCS 5600, Springer, 466-483, 2009.

Journals

  • Niu, Y., Otasek, D., Jurisica, I. Evaluation of linguistic features useful in extraction of interactions from PubMed; Application to annotating known, high-throughput and predicted interactions in I2D. Bioinformatics, 2009. doi: 10.1093/bioinformatics/btp602.

  • Brown, K.R., Otasek, D., Ali, M., McGuffin, M., Xie, W., Devani, B., van Toch, I. L., Jurisica, I. NAViGaTOR: Network analysis, visualization & graphing Toronto. Bioinformatics, 2009. doi: 10.1093/bioinformatics/btp595.

  • McGuffin, M, and Jurisica, I. Interaction techniques for selecting and manipulating subgraphs in network visualizations. IEEE Transactions on Visualization and Computer Graphics (TVCG), 15 (6): 937-944, 2009.[Honorable Mention at InfoVis'09]

  • Cervigne, N. K., Reis, P. P., Machado, J., Sadikovic, B., Bradley, G., Galloni, N. N., Pintilie, M., Jurisica, I., Gilbert, R., Gullane, P., Irish, J., and Kamel-Reid, S. Identification of a microRNA signature associated with progression of leukoplakia to oral carcinoma, Hum Mol Genet, 2009. In press.

  • Mills, G. B., Jurisica, I., Yarden, Y., Norman, J. C. Genomic amplicons target vesicle recycling in breast cancer. J Clin Invest, 119, 2123-7, 2009.


  • Agarwal R., Jurisica, I., Cheng K.W., Mills G.B. The emerging role of the RAB25 small GTPase in cancer. Traffic, 10(11): 1561-8, 2009.


  • Cox, B., Kotlyar, M., Evangelou, A., Ignatchenko, V., Ignatchenko, A., Whiteley, K., Jurisica, I., Adamson, L., Rossant, J., Kislinger, T., Comparative systems biology of human and mouse as a tool for modeling human placental pathology, Mol Syst Biol 5, 279, 2009.


  • Hui, A. B., Shi, W., Boutros, P. C., Miller, N., Pintilie, M., Fyles, T., McCready, D., Wong, D., Gerster, K., Waldron, L., Jurisica, I., Penn, L. Z., Liu, F. F. Robust global micro-RNA profiling with formalin-fixed paraffin-embedded breast cancer tissues. Lab Invest 89, 597-606, 2009.


  • Savas, S., Geraci, J., Jurisica, I., Liu, G. A comprehensive catalogue of functional genetic variations in the EGFR pathway: Protein-protein interaction analysis reveals novel genes and polymorphisms important for cancer research. Int J Cancer 125, 1257-65., 2009.


  • Boutros, P.C., Lau, S.K., Liu, N., Shepherd, F.A., Der, S.D., Tsao, M.-S., Penn, L.Z., Jurisica, I. Prognostic gene signatures for non-small cell lung cancer. PNAS, 106(8): 2824-8, 2009.


  • Zhu, C.Q., Pintilie, M., John, T., Strumpf, D., Shepherd, F.A., Der, S.D., Jurisica, I., Tsao, M.-S., Understanding Prognostic Gene Expression Signatures in Lung Cancer, Clin Lung Cancer, 10(4), 2009.

  • Dong, J., Kislinger, T., Jurisica, I., Wigle, D. A. Lung cancer: Developmental networks gone awry? Cancer Biol Ther, 8(4), 2009.


  • Ponzielli, R., Boutros, P., Katz, S., Stojanova, A., Hanley, A., Khosravi, F., Bros, C., Jurisica, I., Penn, L. Optimization of experimental design parameters for high-throughput chromatin immunoprecipitation studies, Nucl Acid Res, 36(21): e144, 2008.


  • Tomasini, R., Tsuchihara, K., Wilhelm, M., Fujitani, M., Rufini, A., Cheung, C.C., Khan, F., Itie-Youten, A., Wakeham, A., Tsao, M.-S., Iovanna J. L., Squire, J., Jurisica, I., Kaplan, D., Melino, G., Jurisicova, A. and Mak, T. W., TAp73 knockout shows genomic instability with tumor suppressor, Genes Dev, 22(19): 2677-91, 2008. ePub 2008/09/23.


  • Snell, E.H., Lauricella, A.M., Potter, S.A., Luft, J.R., Gulde, S.M, Collins, R.J., Franks, G., Malkowski, M.G., Cumbaa, C., Jurisica, I. and DeTitta, G. T., Establishing a training set through the visual analysis of crystallization trials part II: Crystal examples, Acta Crystallogr D, 64(pt11): 1123-30, 2008.


  • Snell, E.H., Luft, J.R., Potter, S.A., Lauricella, A.M., Gulde, S.M, Malkowski, M.G., Koszelak-Rosenblum, M., Said, M.I., Smith, J.L., Veatch, C.K., Collins, R.J., Franks, G., Thayer, M., Cumbaa, C., Jurisica, I. and DeTitta, G. T., Establishing a training set through the visual analysis of crystallization trials part I: ~150,000 images. Acta Crystallogr D, 64(pt11): 1131-7, 2008.


  • Zavareh, R. B., K. S. Lau, Lau, K. S., Hurren, R., Datti, A., Ashline, D. J., Gronda, M., Cheung, P., Simpson, C. D., Liu, W., Wasylishen, A. R., Boutros, P. C., Shi, H., Vengopal, A., Jurisica, I., Penn, L. Z., Reinhold, V. N., Ezzat, S., Wrana, J., Rose, D. R., Schachter, H. , Dennis, J. W., Schimmer, A. D. Inhibition of the sodium/potassium ATPase impairs N-glycan expression and function, Cancer Res, 68(16): 6688-97, 2008.


  • Director’s Challenge Consortium for the Molecular Classification of Lung Adenocarcinoma, Gene expression-based survival prediction in lung adenocarcinoma: A multi-site, blinded validation study, Nature Medicine, 14(8): 822-827, 2008. ePub 2008/07/22.


  • Aviel-Ronen, S., Coe, B. P., Lau, S., Santos, G. C., Zhu, C, Q., Strumpf, D., Jurisica, I., Lam, W. L., Tsao, M.S. Genomic markers for malignant progression in pulmonary adenocarcinoma, PNAS, 105(29): 10155-10160, 2008. ePub 2008/07/18.


  • Sodek K.L., Evangelou A.I., Ignatchenko A., Brown T.J., Ringuette M., Jurisica I., & Kislinger T. Identification of pathways associated with invasive behavior by ovarian cancer cells using multidimensional protein identification technology (MudPIT). Mol Biosyst, 4(7):762-773, 2008.


  • Jurisicova, A., Jurisica, I., T. Kislinger. Advances in ovarian cancer proteomics: The quest for biomarkers and improved therapeutic interventions, Expert Review of Proteomics, 5(4): 551-560, 2008.


  • Gortzak-Uzan, L., Ignatchenko, A., Evangelou, A., Agochiya, M., Brown, K. St.Onge, P., Kireeva, I., Schmitt-Ulms, G., Brown, T., Murphy, J., Rosen, B., Shaw, P., Jurisica, I., Kislinger, T. A proteome resource of ovarian cancer ascites: Integrated proteomic and bioinformatic analyses to identify putative biomarkers. Journal of Proteome Research. 7(1): 339-351, 2008.


  • Ghavidel, A., T. Kislinger, O. Pogoutse, R. Sopko, Jurisica, I., and A. Emili. Regulated tRNA export mediates the execution of G1 checkpoint in response to DNA damage. Cell, 131(5):915-26, 2007..


  • Kim S.S., Shago M., Kaustov L., Boutros P.C., Clendening J.W., Sheng Y., Trentin G.A., Barsyte-Lovejoy D., Mao D.Y., Kay R., Jurisica I., Arrowsmith C., Penn L.Z. CUL7 is a novel anti-apoptotic oncogene, Cancer Research, 67(20): 9616-9622, 2007.


  • Lau, S.K., P. C. Boutros, M. Pintilie, F. H. Blackhall, C.-Q. Zhu, D. Strumpf, M. R. Johnston, G. Darling, S. Keshavjee, T. K. Waddell, N. Liu, D. Lau, L. Z. Penn, F. A. Shepherd, Jurisica, I., S. D. Der, M.-S. Tsao. A three-gene prognostic classifier for early-stage non-small cell lung cancer. J Clinical Oncology, 25(35): 5562-5569, 2007.


  • Zhu, C.Q., S. Popova, E. R S Brown, D. Barsyte-Lovejoy, R. Navab, W. Shih, M. Li, M. Lu, Jurisica, I., L. Penn, D. Gullberg and M.-S. Tsao. Integrin a11 regulates IGF-2 expression in fibroblasts to enhance tumorigenicity of human non-small cell lung cancer cells, PNAS, 104(28): 11754-9, 2007.


  • Barrios-Rodilesm M., A. Viloria-Petit, K. R. Brown, Jurisica, I., and J. L. Wrana. High-throughput screening of protein interaction networks in the TGFb interactome: understanding the signaling mechanisms driving tumor progression. Cancer Drug Discovery and Development: Transforming Growth Factor-b in Cancer Therapy, Vol 2: Cancer Treatment and Therapy Edited by: Sonia B. Jakowlew, Humana Press Inc., Totowa, N.J., 2007.


  • Brown, K. and Jurisica, I.. Unequal evolutionary conservation of human protein interactions in interologous networks. Genome Biology,8(5), 2007. Interologous Interaction Database:  I2D server


  • Wu, C., Ma, M. H., Brown, K. R., Geisler, M., Li, L., Tzeng, E., Jia, C. Y., Jurisica, I., Li, S. S. Systematic identification of SH3 domain-mediated human protein-protein interactions by peptide array target screening. Proteomics. 7(11):1775-85, 2007.


  • Cox, B., T. Kislinger, D. A. Wigle, A. Kannan, K. Brown, T. Okubo, B. Hogan, Jurisica, I., B. Frey, J. Rossant and A. Emili. Integrated proteomic and transcriptomic profiling of mouse lung development and Nmyc target genes, Molecular Systems Biology, 3:109, 2007.


  • Wei-Lynn Wong, W., J. W. Clendening, A. Martirosyan, P. C. Boutros, C. Bros, F. Khosravi, Jurisica, I., K. Stewart, P. L. Bergsagel, and L. Z. Penn. Determinants of sensitivity to lovastatin-induced apoptosis in multiple myeloma, Molecular Cancer Therapeutics, 6(6):1886-97, 2007.


  • Wigle, D.A. and Jurisica, I.. Cancer as a system failure. Cancer Informatics. Systems Biology Special Issue editorial. 3(2):10-18, 2007.


  • Bachtiary, B., P. Boutros, M. Pintilie, W. Shi, C. Bastianutto, J.-H. Li, J. Schwock, L. Z. Penn, Jurisica, I., A. Fyles, F.-F. Liu. Gene expression profiling in cervical cancer . an exploration of intra-tumor heterogeneity. Clinical Cancer Research, 12(19):5632-5640, 2006.


  • A. Evangelou, L. Gortzak-Uzan, Jurisica, I. and T. Kislinger. Mass spectrometry, proteomics, data mining and their applications in infectious disease research, Anti-Infective Agents in Medicinal Chemistry, 6(2):89-105, 2007.


  • Motamed-Khorasani, A., Jurisica, I., M. Letarte, P.A. Shaw, R.K. Parkes, X. Zhang, A. Evangelou, B. Rosen, K.J. Murphy, and T.J. Brown. Differentially Androgen-Modulated Genes in Ovarian Epithelial Cells from BRCA Mutation Carriers and Control Patients Predict Ovarian Cancer Survival and Disease Progression. Oncogene, 26:(2):198-214, 2007.
  • Epidemiological studies have implicated androgens in the etiology and progression of epithelial ovarian cancer. We previously reported that some androgen responses were dysregulated in malignant ovarian epithelial cells relative to control, non-malignant ovarian surface epithelial (OSE) cells. Moreover, dysregulated androgen responses were observed in OSE cells derived from patients with germline BRCA-1 or -2 mutations (OSEb), which account for the majority of familial ovarian cancer predisposition, and such altered responses may be involved in ovarian carcinogenesis or progression. In the present study, gene expression profiling using cDNA microarrays identified 17 genes differentially expressed in response to continuous androgen exposure in OSEb cells and ovarian cancer cells as compared to OSE cells derived from control patients. A subset of these differentially affected genes was selected and verified by quantitative real-time RT-PCR. Six of the gene products mapped to the OPHID protein-protein interaction database, and five were networked within two interacting partners. Basic leucine zipper transcription factor 2 (BACH2) and acetylcholinesterase (ACHE), which were up-regulated by androgen in OSEb cells relative to OSE cells, were further investigated using an ovarian cancer tissue microarray from a separate set of 149 clinical samples. Cytoplasmic ACHE and BACH2 immunostaining were both significantly increased in ovarian cancer relative to benign cases. High levels of cytoplasmic ACHE staining correlated with decreased survival, whereas nuclear BACH2 staining correlated with decreased time to disease recurrence. The finding that products of genes differentially responsive to androgen in OSEb cells may predict survival and disease progression supports a role for altered androgen effects in ovarian cancer. In addition to BACH2 and ACHE, this study highlights a set of potentially functionally related genes for further investigation in ovarian cancer.

  • Shi, W., C. Bastianutto, A. Li, B. Perez-Ordonez, R. Ng, K.-Y. Chow, W. Zhang, Jurisica, I., A. Bayley, J. Kim, B. O'Sullivan, L. Siu, E. Chen, F.-F. Liu. Multiple dysregulated pathways in nasopharyngeal carcinoma revealed by gene expression profiling, Int J Cancer, 119(10):2467-2475, 2006.
  • Gene expression profiling was conducted using primary human nasopharyngeal carcinoma (NPC) biopsy samples to improve the understanding of the molecular pathways defining NPC and to identify novel potential therapeutic targets. RNA samples were extracted from 36 patients suspected to have NPC and hybridized onto the Affymetrix U133A chip. NPC was diagnosed in 19 patients, 11 had lymphoid hyperplasia (LH), and 6 were .normal. biopsies. Clinical stages for these NPC patients ranged from I.IV, including one M1. All NPC patients (except the M1) were treated with curative intent, which included radiotherapy alone (4 patients), or combined with chemotherapy (14 patients). Unsupervised clustering demonstrated a distinct NPC expression pattern, compared to normal biopsies. Subsequent Significance Analysis of Microarrays (SAM) derived from 14 NPC and 6 normal samples discovered 1089 differentially regulated genes. Pathway analyses revealed novel insights into the mechanisms leading to NPC, whereby up-regulation of NFkB2 and survivin play central roles in increasing resistance to apoptosis, and changes in integrin and WNT/b-catenin signaling leading to uncontrolled proliferation. The role of survivin in resisting apoptosis in NPC was confirmed by RNA interference. Our data provide novel insights into the development and progression of NPC, and suggest survivin as a novel therapeutic target for NPC.

  • Barsyte-Lovejoy, D., Lau, S.K., Boutros, P.C., Khosravi, F., Jurisica, I., Andrulis, I.L., Tsao, M.S., Penn, L.Z. The c-Myc oncogene directly induces H19 non-coding RNA by allele specific binding to potentiate tumorigenesis. Cancer Research, 66(10):1-8, 2006.
  • The product of the MYC oncogene is widely deregulated in cancer and functions as a regulator of gene transcription. Despite an extensive profile of regulated genes, the transcriptional targets of c-Myc essential for transformation remain unclear. In this study we show that c-Myc significantly induces the expression of the H19 non-coding RNA in several diverse cell types including breast epithelial, glioblastoma and fibroblast cells. C-Myc binds to evolutionary conserved E boxes in the imprinting control region to facilitate histone acetylation and transcriptional initiation of the H19 promoter. In addition, c-Myc downregulates the expression of the IGF2, the reciprocally imprinted gene at the H19/IGF2 locus. Evidence shows that c-Myc regulates these two genes independently and does not affect the imprinting of H19. Indeed, allele-specific chromatin immunoprecipitation and expression analyses indicate that c-Myc binds and drives the expression of only the maternal H19 allele. The role of H19 in transformation is addressed using a knockdown approach and shows that downregulation of H19 significantly decreases breast and lung cancer cell clonogenicity and anchorage independent growth. In addition, c-Myc and H19 expression shows strong association in primary breast and lung carcinomas. This work indicates that c-Myc induction of the H19 gene product holds an important role in transformation.

  • Brierley, M., K. L. Marchington, Jurisica, I., E. N. Fish. The role of STAT2 in interferon inducible GAS-mediated gene transcription, FEBS J, 273(7):1569-1581, 2006.
  • The role of STAT2 in interferon inducible GAS-mediated gene transcription

    STAT2 is a critical component of interferon-. (IFN) signaling. To identify genes regulated by IFN-inducible STAT2-DNA binding, cDNA from IFN-treated cells expressing intact STAT2 or a DNA-binding mutant STAT2 were analyzed by Affymetrix microarrays. IFN-inducible expression of genes regulated by IFN-stimulated gene factor 3 (ISGF3), wherein STAT2 functions as a transactivator, 2 5. OAS, Mx, ISG15, 9-27, MHC-I, is similar in both cell types. Nineteen genes were identified whose expression was higher in IFN-treated cells expressing intact STAT2 compared with cells expressing the mutant STAT2. Using quantitative PCR, we confirmed that ISGF3-dependent gene transcription is unaffected in cells expressing mutant STAT2 but that a subset of IFN-inducible genes is differentially regulated in these cells: CLDN4, BF, DGFK, MSR1 and TLR3, containing .-activated sequence (GAS)-like elements in their 5. flanking sequences. Our data indicate that the DNA binding domain of STAT2 is required for full IFN-inducible activation of (GAS)-regulated target genes.

  • Kislinger, T. and Jurisica, I.. Proteomics and bioinformatics in biomedical research. Cancer Genomics and Proteomics, 3(1):11-28, 2006.
  • Cancer Genomics and Proteomics

    Proteomics, the science of globally detecting proteins in cells, tissues or organisms under defined conditions has highly benefited from recent developments in mass spectrometry (MS). It is now possible to detect hundreds to thousands of proteins with high confidence in a single experiment. In this review, we summarize the basic MS technologies currently used by laboratories around the world to identify proteins in complex biological samples. We further provide the reader with a short overview of useful separation strategies to minimize the initial complexity of biological samples, and the multitude of bioinformatics tools essential to manage large-scale proteomics data to obtain meaningful biological insight. Finally, we summarize recent advances in three main areas of medical proteomics; proteomics in cancer research, proteomics of the heart, and proteomics in diabetes research.

  • Przulj, N, D. G. Corneil, Jurisica, I.. Efficient estimation of graphlet frequency distributions in protein-protein interaction networks. Bioinformatics, 22(8):974-980, 2006. Advance Access published on February 1, 2006; doi: doi:10.1093/bioinformatics/btl030
  • Algorithmic and modeling advances in the area of protein-protein interaction (PPI) network analysis could contribute to the understanding of biological processes. Local structure of networks can be measured by the frequency distribution of graphlets, small connected non-isomorphic induced subgraphs. This measure of local structure has been used to show that high-confidence PPI networks have local structure of geometric random graphs. Finding graphlets exhaustively in a large network is computationally intensive. More complete PPI networks, as well as PPI networks of higher organisms, will thus require efficient heuristic approaches.
    We propose two efficient and scalable heuristics for finding graphlets in high-confidence PPI networks. We show that both PPI and their model geometric random networks, have defined boundaries that are sparser than the "inner parts" of the networks. In addition, these networks exhibit "uniformity" of local structure inside the networks. Our first heuristic exploits these two structural properties of PPI and geometric random networks to find good estimates of graphlet frequency distributions in these networks up to 690 times faster than the exhaustive searches. Our second heuristic is a variant of a more standard sampling technique and it produces accurate approximate results up to 377 times faster than the exhaustive searches. We indicate how the combination of these approaches may result in an even better heuristic.

  • Kotlyar, M. and Jurisica, I.. Predicting protein-protein interactions by association mining. Information Systems Frontiers, 8: 37-47, 2006.
  • Identifying protein-protein interactions is a key problem in molecular biology. Currently, interactions cannot be reliably predicted on a proteome-wide scale but direct and indirect evidence for interactions is increasingly available from high-throughput interaction detection methods, gene expression microarrays, and protein annotation projects. In this paper we propose an association mining approach to integrating these diverse types of evidence. We apply this approach to a number of datasets consisting of interacting and non-interacting protein pairs annotated with different types of evidence. We identify patterns that distinguish interacting and non-interacting protein pairs, and use these patterns to assign a confidence level to proposed interactions.

  • Seiden-Long, I. M., K. Brown, W. Shih, D. A. Wigle, N. Radulovich, Jurisica, I., M.-S. Tsao. Transcriptional targets of Hepatocyte Growth Factor signaling and Ki-ras oncogene activation in colorectal cancer, Oncogene, 25(1): 91-102, 2006.
  • Both Ki-ras mutation and Hepatocyte Growth Factor (HGF) receptor Met overexpression occur at high frequency in colon cancer. This study investigated the transcriptional changes induced by Ki-ras oncogene and HGF-Met signaling activation in colon cancer cell lines in vitro and in vivo. The microarray global transcriptional profiling data demonstrate that changes induced by Met receptor activation overlap with those induced by Ki-ras oncogene. However, in the presence of Ki-ras mutation, the magnitude of transcriptional alterations in response to HGF-Met signaling in vitro and in vivo was attenuated. Overlapping genes between in vitro and in vivo microarray datasets were selected as a subset of HGF/Met and Ki-ras oncogene regulated targets, and were investigated further for validation. Using the Online Predicted Human Interaction Database (OPHID), we identified novel Met and Ki-ras regulated proteins and other functionally linked targets. . The novel proteins comprised histone acetyltransferase 1 (HAT1), phosphoribosyl pyrophosphate synthetase 2 (PRPS2), chaperonin containing TCP1, subunit 8 (CCT8), CSE1 chromosome segregation 1-like (yeast)/cellular apoptosis susceptibility (mammals) (CSE1L/CAS) and Cyclin H. The results demonstrate a strategy that may reveal novel pathways or mechanisms by which HGF/Met and Ki-ras oncogene signaling affects the biology of colon cancer cells.

  • Heisler L.E., Torti D., Boutros P.C., Watson J., Chan C., Winegarden N., Takahashi M., Yau P., Huang T.H., Farnham P.J., Jurisica I., Woodgett J.R., Bremner R., Penn L.Z., Der S.D. CpG Island microarray probe sequences derived from a physical library are representative of CpG Islands annotated on the human genome. Nucl. Acids Res. 33(9):2952-2961, 2005.
  • An effective tool for the global analysis of both DNA methylation status and protein-chromatin interactions is a microarray constructed with sequences containing regulatory elements. One type of array suited for this purpose takes advantage of the strong association between CpG Islands (CGIs) and gene regulatory regions. We have obtained 20,736 clones from a CGI Library and used these to construct CGI arrays. The utility of this library requires proper annotation and assessment of the clones, including CpG content, genomic origin and proximity to neighboring genes. Alignment of clone sequences to the human genome (UCSC hg17) identified 9595 distinct genomic loci; 64% were defined by a single clone while the remaining 36% were represented by multiple, redundant clones. Approximately 68% of the loci were located near a transcription start site. The distribution of these loci covered all 23 chromosomes, with 63% overlapping a bioinformatically identified CGI. The high representation of genomic CGI in this rich collection of clones supports the utilization of microarrays produced with this library for the study of global epigenetic mechanisms and protein-chromatin interactions. A browsable database is available on-line to facilitate exploration of the CGIs in this library and their association with annotated genes or promoter elements.

  • M.Trus, R. L. Yang,. F. Suarez-Saiz, L. Bordeleau, Jurisica I. and M.D. Minden. The histone deacetylase inhibitor valproic acid alters sensitivity towards all trans retinoic acid in acute myeloblastic leukemia cells, Leukemia, 19(7):1161-1168, 2005.
  • Acute myeloblastic leukemia (AML) may be classified in a number of ways. Using the French American British classification, the M3 form of the disease or acute promyelocytic leukemia (APL) has been found to be sensitive in vitro and in vivo to the retinoid all trans retinoic acid (ATRA). The mechanism for this is by restoration of normal gene expression through the release of histone deacetylase complexes (HDACs). In contrast to APL, other forms of AML are either nonresponsive or show blunted responses to ATRA. We evaluated if the inhibitor of HDAC activity, valproic acid (VPA), could mimic or enhance retinoid sensitivity in the AML cell line, OCI/AML-2, and clinical samples derived from patients with AML. An Affymetrix GeneChip experiment demonstrated that VPA modulated the expression of numerous genes in OCI/AML-2 cells that were not affected by ATRA including p21, a retinoid responsive gene in APL. VPA induced p21 expression in OCI/AML-2 cells and the majority of the AML samples tested; this was associated with cell cycle arrest and apoptosis not seen with ATRA alone. The addition of ATRA to VPA accentuated many of these responses, supporting the potential beneficial combination of these drugs in the treatment of AML.Leukemia advance online publication, 5 May 2005; doi:10.1038/sj.leu.2403773.

  • Soleymanlou, N., Jurisica, I., Nevo, O., Ietta, F., Zhang, X., Zamudio, S., Post, M. and Caniggia, I. Molecular evidence of placental hypoxia in preeclampsia.J Clin Endocr Metab, 907:(4299-308), 2005.
  • Oxygen plays a central role in human placental pathologies including preeclampsia, a leading cause of fetal and maternal death and morbidity. Insufficient utero-placental oxygenation in preeclampsia is believed to be responsible for the molecular events leading to the clinical manifestations of this disease. Using high-throughput functional genomics, we determined the global gene expression profiles of placentae from high altitude pregnancies, a natural in vivo model of chronic hypoxia, as well as that of first trimester explants under 3% and 20% oxygen, an in vitro organ culture model. We next compared the genomic profile from these two models to that obtained from pregnancies complicated by preeclampsia. Microarray data was analyzed using the Binary Tree-Structured Vector Quantization (BTSVQ) algorithm, which is capable of generating global gene expression maps. Our data highlight a striking global gene expression similarity between 3% O2-treated explants, high altitude placentae and importantly placentae from preeclamptic pregnancies. We demonstrate herein the utility of explant culture and high altitude placenta as biologically-relevant and powerful models for studying the oxygen-mediated events in preeclampsia. Our results provide the first molecular evidence that aberrant global placental gene expression changes in preeclampsia are due to reduced oxygenation and that these events can successfully be mimicked in vivo and in vitro models of placental hypoxia.

  • Barrios-Rodiles, M., K. R. Brown, B. Ozdamar, R. Bose, Z. Liu, R. S. Donovan, F. Shinjo, Y. Liu, J. Dembowy, I. W. Taylor, V. Luga, N. Przulj, M. Robinson, H. Suzuki, Y. Hayashizaki, Jurisica, I., and J. L. Wrana. High-Throughput Mapping of a Dynamic Signaling Network in Mammalian Cells , Science 307:(5715): 1621-1625, 2005.
  • Signaling pathways transmit information through protein interaction networks that are dynamically regulated by complex extracellular cues. We developed LUMIER (for luminescence-based mammalian interactome mapping), an automated high-throughput technology, to map protein-protein interaction networks systematically in mammalian cells and applied it to the transforming growth factor -B (TGFB) pathway. Analysis using self-organizing maps and k-means clustering identified links of the TGFB pathway to the p21-activated kinase (PAK) network, to the polarity complex, and to Occludin, a structural component of tight junctions. We show that Occludin regulates TGFB type I receptor localization for efficient TGFB-dependent dissolution of tight junctions during epithelial-to-mesenchymal transitions.

  • Arshadi, N. and Jurisica, I.. Integrating case-based reasoning systems with data mining techniques for discovering and using disease biomarkers. IEEE Transactions on Knowledge and Data Engineering. Special Issue-Mining Biological Data. 17(8): 1127-1137, 2005. e-pub June 17, 2005.
  • Case-based reasoning (CBR) is a suitable paradigm for class discovery in molecular biology, where the rules that define the domain knowledge are difficult to obtain, and the number and the complexity of the rules affecting the problem are too large for formal knowledge representation. To extend the capabilities of CBR, we propose mixture of experts for case-based reasoning (MOE4CBR), a method that combines an ensemble of CBR classifiers with spectral clustering and Logistic Regression. Our approach not only achieves higher prediction accuracy, but also leads to the selection of a subset of features that have meaningful relationships with their class labels.
    We evaluate MOE4CBR by applying the method to a CBR system called TA3 -- a computational framework for CBR systems. For two mass spectrometry data sets, the prediction accuracy improves from 80% to 93% and from 90% to 98.4%, respectively. We also apply the method to leukemia and lung microarray data sets with prediction accuracy improving from 65% to 74% and from 60% to 70%, respectively. Finally, we compare our list of discovered biomarkers with the lists of selected biomarkers from other studies for the mass spectrometry data sets.

  • Cumbaa, C. A. and Jurisica, I.. Automatic classification and pattern discovery in high-throughput protein crystallization trials, Journal of Structural and Functional Genomics,6(2-3):195-202, 2005.
  • Conceptually, protein crystallization can be divided into two phases: search and optimization. Robotic protein crystallization screening can speed up the search phase, and has a potential to increase process quality.

    Automated image classification helps to increase throughput and consistently generate objective results. Although the classification accuracy can always be improved, our image analysis system can classify images from 1536-well plates with high classification accuracy (85%) and ROC score (0.87), as evaluated on 127 human-classified protein screens` containing 5600 crystal images and 189472 non-crystal images.

    Data mining can integrate results from high-throughput screens with information about crystallizing conditions, intrinsic protein properties, and results from crystallization optimization. We apply association mining, a data mining approach that identifies frequently occurring patterns among variables and their values. This approach segregates proteins into groups based on how they react in a broad range of conditions, and clusters cocktails to reflect their potential to achieve crystallization. These results may lead to crystallization screen optimization, and reveal associations between protein properties and crystallization conditions. We also postulate that past experience may lead us to the identification of initial conditions favorable to crystallization for novel proteins.

  • Brown, K. and Jurisica, I.. Online Predicted Human Interaction Database OPHID, Bioinformatics, 21(9):2076-2082, 2005. Advance Access published on January 18, 2005. doi:10.1093/bioinformatics/bti273.
  • Motivation: High-throughput experiments are being performed at an ever-increasing rate to systematically elucidate protein-protein interaction (PPI) networks for model organisms, while complexities of higher eukaryotes have prevented these experiments for humans.
    Results: The Online Predicted Human Interaction Database (OPHID) is a web-based database of predicted interactions between human proteins. It combines the literature-derived human PPI from BIND, HPRD and MINT, with predictions made from S. cerevisiae, C. elegans, D. melanogaster, and M. musculus. The 23,889 predicted interactions currently listed in OPHID are evaluated using protein domains, gene co-expression and Gene Ontology terms. OPHID can be queried using single or multiple IDs, and results can be visualized using our custom graph visualization program.
    Availability: Freely available to academic users at http://ophid.utoronto.ca, both in tab-delimited and PSI-MI formats. Commercial users, please contact I.J.

  • Blackhall FH, Pintilie M, Wigle DA, Jurisica I, Liu N, Radulovich N, Johnston MR, Keshavjee S, Tsao MS. Stability and heterogeneity of expression profiles in lung cancer specimens harvested following surgical resection. Neoplasia, 6(6):761-767, 2004.
  • One of the major concerns in microarray profiling studies of clinical samples is the effect of tissue sampling and RNA extraction on data. We analyzed gene expression in lung cancer specimens that were serially harvested from tumor mass and snap-frozen at several intervals up to 120 minutes after surgical resection. Global gene expression was profiled on cDNA microarrays, and selected stress and hypoxia-activated genes were evaluated using real-time reverse transcription polymerase chain reaction (RT-PCR). Remarkably, similar gene expression profiles were obtained for the majority of samples regardless of the time that had elapsed between resection and freezing. Real-time RT-PCR studies showed significant heterogeneity in the expression levels of stress and hypoxia-activated genes in samples obtained from different areas of a tumor specimen at one time point after resection. The variations between multiple samplings were significantly greater than those of elapsed time between sampling/freezing. Overall samples snap-frozen within 30 to 60 minutes of surgical resection are acceptable for gene expression studies, thus making sampling and snap-freezing of tumor samples in a routine surgical pathology laboratory setting feasible. However, sampling and pooling from multiple sites of each tumor may be necessary for expression profiling studies to overcome the molecular heterogeneity present in tumor specimens.

  • Przulj, N., Corneil, D., Jurisica, I. Modeling interactome: Scale-free or geometric?, Bioinformatics, 20(18):3508-3515, 2004. Bioinformatics Advance Access published on July 29, 2004 Bioinformatics 2004; doi:10.1093/bioinformatics/bth436.
  • Motivation: Networks have been used to model many real-world phenomena to better understand the phenomena and to guide experiments in order to predict their behavior. Since incorrect models lead to incorrect predictions, it is vital to have an improved model. As a result, new techniques and models for analyzing and modeling real-world networks have recently been introduced.

    Results: One example of large and complex networks involves protein-protein interaction (PPI) networks. We analyze PPI networks of yeast \emph{S. cerevisiae} and fruitfly \emph{D. melanogaster} using a newly introduced measure of local network structure as well as the standardly used measures of global network structure. We examine the fit of four different network models, including Erd\"{o}s-R\'{e}nyi, scale-free, and geometric random network models, to these PPI networks with respect to the measures of local and global network structure. We demonstrate that the currently popular scale-free model of PPI networks fails to fit the data in several respects and show that a random geometric model provides a much more accurate model of the PPI data. We hypothesize that only the noise in these networks is scale-free. Conclusions: We systematically evaluate how well different network models fit the PPI networks. We show that the structure of PPI networks is better modeled by a geometric random graph than by a scale-free model.
    Supplementary data

  • King, A. D., N. Przulj, Jurisica, I. Protein complex prediction via cost-based clustering. Bioinformatics,, 20(17):3013-3020, 2004. Bioinformatics Advance Access published on June 4, 2004 Bioinformatics 2004; doi:10.1093/bioinformatics/bth351.
  • Motivation: When studying the workings of a biological cell, it is useful to be able to detect known and predict still undiscovered protein complexes within the cell's protein-protein interaction (PPI) network. Such predictions may be used as an inexpensive tool to direct biological experiments. The increasing amount of available PPI data necessitates a fast, accurate approach to protein complex identification.
    Results: We have developed the Restricted Neighbourhood Search Clustering Algorithm (RNSC) to efficiently partition networks into clusters using a cost function. We applied this cost-based clustering algorithm to PPI networks of S. cerevisiae, D. melanogaster, and C. elegans to identify and predict protein complexes. We also investigated functional and graph-theoretical properties of known complexes in the MIPS database, and by filtering clusters based on these properties, we attained a high matching rate between filtered clusters and true protein complexes.
    Conclusions: Our application of the cost-based clustering algorithm provides a scalable, accurate, and efficient method of detecting and predicting protein complexes within a PPI network.
    Supplementary data

  • Jiang Liu, Fiona Blackhall, Isolde Seiden-Long, Igor Jurisica, Roya Navab, Ni Liu, Nikolina Radulovich, Dennis Wigle, Muhajid Sultan, Jim Hu, Ming-Sound Tsao, and Michael R. Johnston. Modeling of lung cancer by an orthotopically growing H460SM variant cell line reveals novel candidate genes for systemic metastasis, Oncogene,23(37): 6316-6324, 2004.
  • Endobronchial implantation of NCI-H460 cells into the nude rat generates a primary lung tumor with mediastinal lymph node spread, but rarely systemic metastases. We isolated tumor cells from mediastinal nodes, orthotopically reimplanted the cells into nude rats and repeated this four times to derive a cell line, designated H460SM, that spontaneously metastasizes to bone, kidney, brain, soft tissue and contralateral lung. H460SM cells demonstrated higher invasive activity in vitro than parental NCI-H460 cells. Spectral karyotyping revealed a new inversion within 17q and loss of an extra normal copy of chromosome 14 present in parental NCI-H460 cells. Expression profiling of orthotopic primary tumors revealed differential expression of 360 genes. Of these, 173 were represented in the probe set of a 19.2K OCI cDNA microarray previously used to profile the gene expression of surgically resected lung cancer specimens. We have computationally validated clinical importance of these genes by using in silico analysis of 18 cases of pulmonary adenocarcinoma, which were split into two patient groups with markedly different clinical outcome. The model identifies additional novel candidate genes for the progression of lung cancer to systemic metastases and poor prognosis.

  • Fiona H. Blackhall, Dennis Wigle, Igor Jurisica, Melania Pintilie, Ni Liu, Gail Darling, Michael R. Johnston, Shaf Keshavjee, Thomas Waddel, Frances A. Shepherd and Ming-Sound Tsao. Validating the prognostic value of marker genes derived from a non-small cell lung cancer microarray study. Lung Cancer 46(2): 197-204. 2004.
  • We previously reported that our cDNA microarray analysis of primary non-small cell lung carcinoma (NSCLC) could predict for patients at increased risk of cancer recurrence. From the result of this analysis, we selected 11 genes that were considered candidate prognostic marker genes and used the realtime reverse transcription polymerase chain reaction (RT-PCR) to investigate their expression in the same set of NSCLC cases used in the microarray study. Cluster analysis of the realtime RT-PCR data separated these patients into two groups with significantly different disease-free survivals (log-rank test, [Formula: see text] ). In contrast, cluster analysis failed to confirm the prognostic significance of the realtime RT-PCR results for these 11 genes in a validation series of 92 NSCLC cases. In univariate analysis, hypoxia inducible factor 1alpha, Rho-GDP dissociation inhibitor (GDI) alpha (RhoGDI) and Citron/rho-interacting serine-threonine kinase 21 (Citron K21) were significant prognostic factors for disease-free survival in the entire cohort of 130 NSCLC patients, but none were significant in multivariate analysis. The results demonstrate that the prognostic significance of microarray (SAM) results can be partially validated using realtime RT-PCR, but secondary validation using larger and independent series of tumors is necessary to identify true prognostic marker genes.

  • Dennis A. Wigle, Ming Tsao, Igor Jurisica. Making sense of lung cancer gene expression profiles, Genome Biology, 5, 309.1-309.3, 2004.


  • Giles C. Warner, Patricia P. Reis, Igor Jurisica, Mujahid Sultan, Christina Macmillan, Nigel Beasley, Antti A. Makitie, Shilpi Arora, Mahadeo Sukhai, Reidar Grénman, Richard A. Wells, Dale Brown, Ralph Gilbert, Patrick Gullane, Jonathan Irish, Suzanne Kamel-Reid. Molecular classification of oral cancer by cDNA microarrays identifies overexpressed genes correlated with nodal metastasis. Inernational Journal of Cancer, 110:857-868, 2004.
  • Our purpose was to classify OSCCs based on their gene expression profiles, to identify differentially expressed genes in these cancers and to correlate genetic deregulation with clinical and histopathologic data and patient outcome. After conducting proof-of-principle experiments utilizing 6 HNSCC cell lines, the gene expression profiles of 20 OSCCs were determined using cDNA microarrays containing 19,200 sequences and the BTSVQ method of data analysis. We identified 2 sample clusters that correlated with the T3-T4 category of disease (p = 0.035) and nodal metastasis (p = 0.035). BTSVQ analysis identified a subset of 23 differentially expressed genes with the lowest QE scores in the cluster containing more advanced-stage tumors. Expression of 6 of these differentially expressed genes was validated by quantitative real-time RT-PCR. Statistical analysis of quantitative real-time RT-PCR data was performed and, after Bonferroni correction, CLDN1 overexpression was significantly correlated with the cluster containing more advanced-stage tumors (p = 0.007). Despite the clinical heterogeneity of OSCC, molecular subtyping by cDNA microarray analysis identified distinct patterns of gene expression associated with relevant clinical parameters. Application of this methodology represents an advance in the classification of oral cavity tumors and may ultimately aid in the development of more tailored therapies for oral carcinoma.

  • Acton, B.M., Jurisicova, A., Jurisica, I. and Casper, R.F. Alterations in mitochondrial membrane potential during preimplantation stages of murine and human embryo development. Molecular Human Reproduction, 10(1):23-32, 2004.
  • Molecular Human Reproduction

    Mitochondria are cellular organelles regulating metabolism and cell death pathways. This study examined changes in mitochondrial membrane potential (DYm) throughout the stages of preimplantation development in murine embryos conceived either in vivo or in vitro and human embryos donated to research from IVF. Embryos stained with the DYm sensitive dye (JC-1) were quantified for the ratio of highly to lowly polarized mitochondria using a deconvolution microscope. Overall, murine zygotes and early embryos contain a subset of highly polarized mitochondria with a progressive increase in the ratio of highly to lowly polarized mitochondria observed with increasing cleavage. A transient increase in the ratio of high to low DYm was observed in in vivo fertilized two-cell stage embryos, coincident with embryonic genome activation in the mouse, but not in two-cell embryos obtained through IVF. We further observed that arrested murine two-cell embryos possessed an increased ratio of highly to lowly polarized mitochondria compared to non-arrested embryos. In human eight cell embryos we observed an increased ratio of highly to lowly polarized mitochondria with increasing degrees of embryo fragmentation. We concluded that the pattern of DYm progressively changes throughout preimplantation development, and that an aberrant shift in DYm could contribute to or is associated with embryo abnormalities.

  • Przulj, N., Wigle, D., Jurisica, I. Functional topology in a network of protein interactions. Bioinformatics, 20(3):340-348, 2004.
  • The building blocks of biological networks are individual protein-protein interactions (PPI). The cumulative PPI dataset in S. cerevisiae now exceeds 78,000. Studying the network of these interactions will provide valuable insight into the inner workings of cells.
    Results: We performed a systematic graph theory based analysis of this PPI network to construct computational models for describing and predicting the properties of lethal mutations and proteins participating in genetic interactions, functional groups, protein complexes, and signaling pathways. Our analysis suggests that lethal mutations are not only highly connected within the network, but they also satisfy an additional property: the ir removal causes a disruption in network structure. We also provide evidence for the existence of alternate paths that bypass viable proteins in PPI networks, while such paths do not exist for lethal mutations. In addition, we show that distinct functional classes of proteins have differing network properties. We also demonstrate a way to extract and iteratively predict protein complexes and signaling pathways. We evaluate the power of predictions by comparing them to a random model, and assess accuracy of predictions by analyzing their overlap with MIPS database.
    Conclusions: Our models provide a means for understanding the complex wiring underlying cellular function, and enable us to predict essentiality, genetic interaction, function, protein complexes and cellular pathways. This analysis uncovers structure-function relationships observable in a large PPI network.
    Supplementary information
    Supplementary data

  • Cumbaa, C., Lauricella, A., Fehrman, N., Veatch, C., Collins, R., Luft, J., DeTitta, G., Jurisica, I. Automatic classification of sub-microlitre protein crystallization trials in 1536-well plates, Acta Crystallographica Section D-Biological Crystallography D59(9):1619-1627, 2003.
  • A technique for automatically evaluating microbatch (400 nL) protein crystallization trials is described. This method addresses analysis problems introduced at the sub-microlitre scale, including non-uniform lighting and irregular droplet boundaries. The droplet is segmented from the well using a loopy probabilistic graphical model with a two-layered grid topology. A vector of 23 features is extracted from the droplet image using the Radon transform for straight-edge features and a bank of correlation filters for microcrystalline features. Image classification is achieved by linear discriminant analysis of its feature vector. The results of the automatic method are compared to those of a human expert on 32 1536-well plates. Using the human-labeled images as ground truth, this method classifies images with 85% accuracy and a ROC score of 0.84. This result compares well with the experimental repeatability rate assessed at 87%. Images falsely classified as crystal-positive variously contain speckled precipitate resembling microcrystals, skin effects, or genuine crystals falsely labeled by the human expert. Many images falsely classified as crystal-negative variously contain very fine crystal features or dendrites lacking straight edges. A characterization of these misclassifications suggests directions for improving the method.

  • Breitkreutz, A., Boucher, L., Breitkreutz, B.J., Sultan, M., Jurisica, I., Tyers, M. Phenotypic and transcriptional plasticity directed by a yeast MAPK network. Genetics,165(3):997-1015, 2003.
  • The yeast pheromone/filamentous growth MAPK pathway mediates both mating and invasive-growth responses. The interface between this MAPK module and the transcriptional machinery consists of a network of two MAPKs, Fus3 and Kss1, two regulators, Rst1 and Rst2 (a.k.a. Dig1 and Dig2) and two transcription factors, Ste12 and Tec1. Of sixteen possible combinations of gene deletions in FUS3, KSS1, RST1, and RST2 in the S1278 background, ten exhibited constitutive invasive-growth. Rst1 was the primary negative regulator of invasive growth, while other components either attenuated or enhanced invasive growth, depending on the genetic context. Despite activation of the invasive response by lesions at the same level in the MAPK pathway, transcriptional profiles of different invasive mutant combinations did not exhibit a unified program of gene expression. The distal MAPK regulatory network is thus capable of generating phenotypically similar invasive-growth states (an attractor) from different molecular architectures (trajectories) that can functionally compensate for one another. This systems level robustness may also account for the observed diversity of signals that trigger invasive-growth.

  • Janice Glasgow, Igor Jurisica and Burkhard Rost. Introduction to Special Issue on AI and Bioinformatics, Artificial Intelligence Magazine, 25(1):7-8, 2004.


  • Jurisica, I. and J. Glasgow, Application of case-based reasoning in molecular biology. Artificial Intelligence Magazine, Special issue on Bioinformatics. 25(1):85-95, 2004.
  • Case-Based Reasoning (CBR) is a computational reasoning paradigm that involves the storage and retrieval of past experiences to solve novel problems. It is an approach that is particularly relevant in scientific domains, where there is a wealth of data, but often a lack of theories or general principles. This paper describes several CBR systems that have been developed to carry out planning, analysis and prediction in the domain of molecular biology.

  • Jurisica, I., J. Mylopoulos, E. Yu. Ontologies for knowledge management: An information systems perspective. An International Journal of Knowledge and Information Systems, Special issue on Ontologies, 6(4):380-401, 2004.
  • Knowledge management research focuses on concepts, methods, and tools supporting the management of human knowledge. The main objective of this paper is to survey basic concepts that have been used in Com-puter Science for the representation of knowledge and summarize some of their advantages and drawbacks. A secondary objective is to relate these techniques to Information Science theory and practice.
    The survey classifies the concepts used for knowledge representation into four broad ontological categories. Static ontologies describe static aspects of the world, i.e., what things exist, their attributes and relationships. A dynamic ontology, on the other hand, describes the changing aspects of the world in terms of states, state transitions and processes. Intentional ontologies encompass the world of things agents believe in, want, prove or disprove, and argue about. Finally, social ontologies cover social settings - agents, positions, roles, authority, permanent organizational structures or shifting networks of alliances and interdependencies.

  • Evangelou, A, Letarte, M., Jurisica, I., Sultan, M., Murphy, K., Rosen, B., Brown, T. Loss of coordinated androgen regulation in non-malignant ovarian epithelial cells with BRCA1/2 mutations and ovarian cancer cells. Cancer Research, 63:2416-2424, 2003.
  • Epidemiological studies have implicated androgens in the etiology/ progression of epithelial ovarian cancer. Because normal and malignant ovarian epithelial cells are growth inhibited by transforming growth factor (TGF) beta, we tested the ability of 5alfa-dihydrotestosterone (DHT) to modulate this response and the expression of TGFbeta receptor types I and II. Cells derived from the ovarian surface epithelium of women undergoing oophorectomy (n = 7) for nonovarian indications or with a germ-line BRCA1 or 2 mutation (n = 9), and from the ascitic fluid of patients with primary ovarian cancer (n = 8) were cultured with and without DHT. Cell proliferation after TGF-beta1 or vehicle treatment was determined, and transcripts for TGF-beta receptors were measured by quantitative reverse transcription-PCR. As low levels of androgen receptor were observed in the cultures, we also measured transcript levels for steroid receptor coactivators SRC-1, ARA70, and AIB1. TGF-beta1 inhibited growth in 12 of 13 cultures tested, and DHT generally reversed this effect, demonstrating that androgens can block TGF-beta-induced growth inhibition in both malignant and nonmalignant ovarian epithelial cells. Transcripts for TGF-beta receptors, SRC-1, and ARA70 were found to be coordinately regulated by androgen in control cells, but not in either malignant or BRCA1/2-positive cell cultures. These findings raise the possibility that by modulating steroid receptor coactivator expression, androgen might affect other hormonal responses and contribute to the initiation of ovarian cancer.

  • Jurisica, I. and Wigle, D. Understanding biology through intelligent systems. Genome Biology, 3(11):Reports 4035.1-4035.4, 2002.
  • A report on the Tenth International Conference on Intelligent Systems for Molecular Biology (ISMB), Edmonton, Canada, 3-7 August 2002.

  • Wigle, D., Jurisica, I., N. Radulovich, M. Pintilie, J. Rossant, N. Liu, C. Lu, J. Woodgett, I. Seiden, M. Johnston, S. Keshavjee, G. Darling, T. Winton, B. Breitkreutz, P. Jorgenson, M. Tyers, F. A. Shepherd, M.S. Tsao. Molecular profiling of non-small cell lung cancer and correlation with disease-free survival Cancer Research, 62(11):3005-3008, 2002.
  • Recent studies have suggested that information from gene expression profiles could be used to develop molecular classifications of cancer. We hypothesized that expression levels of specific genes in operative specimens could be correlated to recurrence risk in non-small cell lung cancer (NSCLC). We performed expression profiling using 19.2K cDNA microarrays on tumor specimens from a total of 39 NSCLC patients with known clinical follow-up information. Statistical analysis and clustering approaches were used to determine patterns of gene expression segregating with clinical outcome. The results provide evidence that molecular subtyping of NSCLC can identify distinct profiles of gene expression correlating with disease-free survival.
    Supplementary Data

  • Sultan, M., Wigle, D., Cumbaa, C., Maziarz, M., Glasgow, J., M.-S. Tsao, Jurisica, I. Binary tree-structured vector quantization approach to clustering and visualizing microarray data. Bioinformatics. Special Issue of ISMB'02, 18(Suppl. 1):S111-S119. 2002.
  • Motivation: With the increasing number of gene expression databases, the need for more powerful analysis and visualization tools is growing. Many techniques have successfully been applied to unravel latent similarities among genes and/or experiments. Most of the current systems for microarray data analysis use statistical methods, hierarchical clustering, self-organizing maps, support vector machines, or k-means clustering to organize genes or experiments into meaningful groups. Without prior explicit bias almost all of these clustering methods applied to gene expression data not only produce different results, but may also produce clusters with little or no biological relevance. Of these methods, agglomerative hierarchical clustering has been the most widely applied, although many limitations have been identified.
    Results: Starting with a systematic comparison of the underlying theories behind clustering approaches, we have devised a technique that combines tree-structured vector quantization and partitive k-means clustering (BTSVQ). This hybrid technique has revealed clinically relevant clusters in three large publicly available data sets. In contrast to existing systems, our approach is less sensitive to data preprocessing and data normalization. In addition, the clustering results produced by the technique have strong similarities to those of self-organizing maps (SOMs). We discuss the advantages and the mathematical reasoning behind our approach.
    Availability: The BTSVQ system is implemented in Matlab R12 using the SOM toolbox for the visualization and preprocessing of the data. BTSVQ is available for non-commercial use (http://www.uhnres.utoronto.ca/ta3/BTSVQ).
    Supplementary Data

  • Luft, J., Wolfley, J., Jurisica, I., Glasgow, J., Fortier, S., DeTitta, G.T. Macromolecular crystallization in a high throughput laboratory - the search phase. Journal of Crystal Growth, 232: 591-595, 2001.
  • Macromolecular crystallization efforts are frequently divided into a search phase, during which approximate conditions are sought, and an optimization phase, when the approximate conditions are optimized to yield crystals of sufficient quality for diffraction work. Faced with the possibility that, on a yearly basis, many hundreds of proteins might be generated, both in our laboratories and at the laboratories of our collaborators, we have recently designed and commissioned a high throughput robotics lab designed for the search phase. The lab is capable of setting up and photographically evaluating over 60,000 microbatch crystallization experiments per week. In the first four months of operation we have set up crystallization experiments for more than one hundred proteins.

  • Jurisica, I., Rogers, P., Glasgow, J., Collins, R., Wolfley, J., Luft, J., DeTitta, G.T. Improving Objectivity and Scalability in Protein Crystallization: Integrating Image Analysis With Knowledge Discovery. IEEE Intelligent Systems Journal, Special issue on Intelligent Systems in Biology, 16(6): 26-34, 2001.
  • This paper describes issues related to integrating image analysis techniques with knowledge discovery and case-based reasoning. Although the work is applicable to a number of problem domains, here we focus on the problem of analyzing and classifying outcomes of protein crystallization experiments in high-throughput structural genomics. We apply fast Fourier transform to analyze image content in order to extract important features of the spectrum. A combination of these features is used to classify crystallization experiments' outcomes. Although humans can analyze images more flexibly, a computational approach makes the process scalable and more objective. We evaluate the classification process and present results on how the automatically-extracted features can be combined to discover important crystallographic knowledge.

  • Wigle, D., Rossant, J., Jurisica, I. Mining mouse microarray data, Genome Biology, 2(7): 1019.1-1019.4, 2001.
  • Microarrays of mouse genes are now available from several sources, and they have so far given new insights into gene expression in embryonic development, regions of the brain and during apoptosis. Microarray data posted on the internet can be reanalyzed to study a range of questions.

  • Jurisica, I., Rogers, P., Glasgow, J., Fortier, S., Luft, J., Wolfley, J., Bianca, M., Weeks, D., DeTitta, G.T. Intelligent Decision Support for Protein Crystal Growth. IBM Systems Journal, Special issue on Deep Computing for Life Sciences, 40(2): 394-409, 2001.
  • Genomic projects are producing hundreds of proteins a year for structural analysis. The challenge of the research described in this paper is to remove crystal growth experiments as a rate-limiting step in the enterprise of structure determination of proteins. We meet this challenge by combining a high-throughput crystallization setup and evaluation in the wet lab with a sophisticated algorithmic analysis of the outcomes in the computer lab. Furthermore, we apply techniques from knowledge management and artificial intelligence to develop an automated system that assists expert crystallographers in planning and evaluating novel crystal growth experiments. Fundamental to our computational approach to crystallization is a comprehensive information repository for crystal growth experiments. This stored information will be used to discover general rules or principles underlying the growth process for crystals, as well as to guide the reasoning algorithm for planning experiments.
    The paper reports on the preliminary results in the wet lab and computation lab respectively. We define the problem, propose an architecture for intelligent decision support in the crystallization domain, and report on the status of the individual components of the architecture.

  • Jurisica, I., J. Glasgow, and J. Mylopoulos. Incremental Iterative Retrieval and Browsing for Efficient Conversational CBR Systems. International Journal of Applied Intelligence. 12(3): 251-268, 2000.
  • A case base is a repository of past experiences that can be used for problem solving. Given a new problem, expressed in the form of a query, the case base is browsed in search of "similar" or "relevant" cases. Conversational case-based reasoning (CBR) systems generally support user interaction during case retrieval and adaptation. Here we focus on case retrieval where users initiate problem solving by entering a partial problem description. During an interactive CBR session, a user may submit additional queries to provide a  "focus of attention". These queries may be obtained by relaxing or restricting the constraints specified for a prior query. Thus, case retrieval involves the iterative evaluation of a series of queries against the case base, where each query in the series is obtained by restricting or relaxing the preceding query.
    This paper considers alternative approaches for implementing iterative browsing in conversational CBR systems. First, we discuss a naive algorithm, which evaluates each query independent of earlier evaluations. Second, we introduce an incremental algorithm, which reuses the results of past query evaluations to minimize the computation required for subsequent queries. In partiular, the paper proposes an efficient algorithm for case base browsing and retrieval using database techniques for incremental view maintenance. In addition, the paper evaluates the performance of the proposed algorithm with respect to alternative approaches considering two perspectives: (i) experimental efficiency evaluation using diverse application domains, and (ii) scalability evaluation using the performance model of the proposed system.

  • Jurisica, I., Mylopoulos, J., Glasgow, J., Shapiro, H., and Casper, R. F. Case-based reasoning in IVF: Prediction and knowledge mining. Artificial Intelligence in Medicine, 12(1), 1-24, 1998.
  • In vitro fertilization (IVF) is a medically-assisted reproduction technique, enabling infertile couples to achieve successful pregnancy. Given the unpredictability of the task, we propose to use a case-based reasoning system that exploits past experiences to suggest possible modifications to an IVF treatment plan in order to improve overall success rates. Once the system's knowledge base is populated with a sufficient number of past cases, it can be used to explore and discover interesting relationships among data, thereby achieving a form of knowledge mining. The article describes the TA3IVF system -- a case-based reasoning system which relies on context-based relevance assessment to assist in knowledge visualization, interactive data exploration and discovery in this domain. The system can be used as an advisor to the physician during clinical work and during research to help determine what knowledge sources are relevant for a treatment plan.

  • Jurisica, I. and J. Glasgow. Improving performance of case-based classification using context-based relevance. International Journal of Artificial Intelligence Tools. Special Issue of IEEE International Conf. on Tools with AI (ICTAI-96) Best Papers. 6(4):511-536, 1997.
  • Classification involves associating instances with particular classes by maximizing intra-class similarities and minimizing inter-class similarities. Thus, the way similarity among instances is measured is crucial for the success of the system. In case-based reasoning, it is assumed that similar problems have similar solutions. The case-based approach to classification is founded on retrieving cases from the case base that are similar to a given problem, and associating the problem with the class containing the most similar cases.
    Similarity-based retrieval tools can advantageously be used in building flexible retrieval and classification systems. Case-based classification uses previously classified instances to label unknown instances with proper classes. Classification accuracy is affected by the retrieval process -- the more relevant the instances used for classification, the greater the accuracy.
    The paper presents a novel approach to case-based classification. The algorithm is based on a notion of similarity assessment and was developed for supporting flexible retrieval of relevant information. Case similarity is assessed with respect to a given context that defines constraints for matching. Context relaxation and restriction is used for controlling the classification accuracy. The validity of the proposed approach is tested on real-world domains, and the system's performance, in terms of accuracy and scalability, is compared to that of other machine learning algorithms.


Refereed Conferences

  • Niu, Y. and Jurisica, I., Detecting protein-protein interaction sentences using a mixture model, in Natural Language and Information Systems (NLDB'08), Lecture Notes in Computer Science, E. Kapetanios, V. Sugumaran, and M. Spiliopoulou, Editors, Springer Verlag, Berlin, 352-354, 2008.


  • Xia, E., Jurisica, I., J. Waterhouse, V. Sloan. The impact of runtime estimation in accuracy on scheduler performance, IASTED International Conference on Parallel and Distributed Computing and Systems (PDCS 2007), November 19-21, Cambridge, MA, 2007


  • Yan, R., P. C. Boutros, L.Z. Penn, Jurisica, I.. Comparison of machine learning and pattern discovery algorithms for the prediction of human single nucleotide polymorphisms. IEEE International Conference on Granular Computing, IEEE, Silicon Valley, USA, Nov 2-4, 2007.


  • Xia, E., Jurisica, I., J. Waterhouse. CasSim: a Top-level-simulator for Grid Scheduling and Applications, IBM Cascon Conference. 2006.


  • Otasek, D., K. Brown, Jurisica, I.. Confirming protein-protein interactions by te xt mining. SIAM Conference on Text Mining, Bethesda, Maryland, April 2006.


  • Xia, E., Jurisica, I., J. Waterhouse, V. Sloan. Scheduling functional regression tests in IBM DB2. IBM Cascon, 2005.


  • Arshadi, N. and Jurisica, I.. An ensemble of case-based classifiers for high-dime nsional biological domains. In ICCBR'05, Springer-Verlag Press, pp. 21-34, 2005.


  • Arshadi, N. and Jurisica, I.. Feature selection for improving case-based classifiers on high- dimensional data sets. In AAAI FLAIRS, AAAI Press, Menlo Park, pp. 99-104, 2005.


  • E. Xia and I . Jurisica. Effectiveness of grid configurations on application performance. Parallel and Distributed Computing and Systems (PDCS 2004), 2004.


  • Arshadi, N. and Jurisica, I.. Maintaining CBR systems: A machine learning approach. 7th Eu ropean Conference on Case-Based Reasoning (ECCBR'04), 2004.


  • Xia, E. and Jurisica, I.. Optimizing job scheduling in the grid environment. In Proceedings of The Seventh International Conference on Computer Science and Informatics, Predictive Modeling Techniques, pp. 447-451, Research Triangle Park, NC, 2003.


  • Jurisica, I., C. Cumbaa, A. Lauricella, N. Fehrman, C.Veatch, R. Collins, J. Luft, G. DeTitta. Automatic Classification of Protein Crystallization Screens on 1536-well Plates. The Annual Meeting of the American Crystallographic Association (ACA'03), Session on high-throughput crystallography, Cincinnati, OH, July 26-28, 2003.


  • Jurisica, I., Rogers, P., Glasgow, J., Fortier, S., Luft, J., Bianca, D., DeTitta, G.T. Image-Feature Extraction for Protein Crystallization: Integrating Image Analysis and Case-Based Reasoning Thirteenth Annual Conference on Innovative Applications of Artificial Intelligence (IAAI-2001), Seattle, WA, 2001, p. 73-80.

    This paper describes issues related to integrating image analysis techniques into case-based reasoning. Although the approach is generic, a high-throughput protein crystallization problem is used as an example. Our solution to the crystallization problem is to store outcomes of experiments as images, extract important image features, and use them to automatically recognize different crystallization outcomes. Subsequently, we use the outcomes of image classification to perform case-based planning of crystallization experiments for new proteins. Knowledge-discovery techniques are used to extract general principles for crystallization. Such principles are applicable to the adaptation phase of case-based reasoning. The motivation for automated image-feature extraction is twofold: \snum{1} the human interpretation/analysis of image content is subjective, and \snum{2} many problem domains require reasoning with large databases of uninterpreted images. In this paper we present the design and implementation of our integrated system, as well as some preliminary experimental results.

  • Jurisica, I. (2000). Building better decision-support systems by using knowledge discovery. Annual Conference of the American Society for Information Science, Chicago, IL, p. 281-291.


  • Luft, J. R., J. Wolfley, M. Bianca, D. Weeks, Jurisica, I., P. Rogers, J. Glasgow, S. Fortier, G. T. DeTitta. (2000). Gearing up for structural genomics: The challenge of hundreds of proteins and hundred of thousands of crystallizaiton experiments per year. The Annual Conference of the American Crystallographic Association (ACA'00), Saint Paul, MN.

    Structural genomics projects promise to produce hundreds of proteins a year for structural analysis.  The challenge to crystal growers is to make some other step in the structural biology enterprise rate-limiting.  Our approach is to combine high throughput (HTP) crystallization setup and evaluation in the wet lab with sophisticated algorithmic analyses of the HTP outcomes in the computer lab for the purposes of recipe prediction.

    In the wet lab we now have the capacity to prepare and evaluate the results of over sixty thousand (61.4K) crystallization experiments a workweek.  Each is a microbatch experiment conducted under paraffin oil.  Pipetting is performed with robots outfitted with 96 or 384 syringes and XYZ translation stages.  High density (1536 well) micro-assay plates hold the experiments.  1536 crystallization cocktails, covering a wide range of crystallizing agents, have been prepared.  Current pipetting protocols allow us to deploy 200 nanoL droplets of protein solution and crystallization cocktails (total drop size 400 nanoL).  Once a micro-assay plate is prepared with paraffin oil and crystallization cocktails it is possible to set protein solution into the wells in less than five minutes, allowing us to work quickly with unstable proteins.  Current total protein requirements are being assessed, but are likely to be in the 10 mg range.  After setup plates are placed on a computer controlled XY table with micron positioning accuracy.  The plates are translated under a megapixel digital camera where images are captured by a framegrabber.  The XY table can accommodate 28 plates (43K experiments) at a time and the camera can record 43K images in approximately twelve hours.

    In the computer lab the images are analyzed automatically to determine the outcomes of the crystallization experiments.  We are developing a standard vocabulary of outcomes that will describe the results:  clear drop, amorphous precipitate, phase separation, microcrystals, crystals, and uncertain outcome.  These outcomes, recorded as a function of time, are the cornerstone of a crystallization database that will contain physical information about individual proteins as well as results of crystallization experiments with those proteins.  Using case-based reasoning algorithms we will identify patterns of similar properties and crystallization outcomes relating two or more proteins in the database.  Our hypothesis is that, given a quantitative measure of similarity between proteins, recipes successfully employed for one protein will be useful starting points for crystallization experiments with similar proteins.  Future work will center upon the most predictive measures of similarity.

  • Luft, J. R., J. Wolfley, M. Bianca, D. Weeks, Jurisica, I., P. Rogers, J. Glasgow, S. Fortier, G. T. DeTitta. (2000). Gearing Up for ~40K Crystallization Experiments a Day: Meeting The Needs of HTP Structural Proteomics Projects. Eighth International Conference on the Crystallization of Biological Macromolecules (ICCBM-8), Sandestin, Florida.

    The medical potential of the various genome projects now underway will be realized when we know not only the sequences of the amino acids coded in open reading frames but also what these ORFs represent, both structurally and functionally.  Structural proteomics will challenge us to grow more and better crystals for diffraction studies.  Our labs are involved in two major aspects of that work:  getting the techniques and equipment in place to do large scale, high thruput crystallization experiments, and assembling the expertise to make sense of all the data that will come from those experiments.

  • Jurisica, I. (2000). Knowledge Organization by Systematic Knowledge Management and Discovery. International Conference of the International Society of Knowledge Organization (ISKO 6), Toronto, Ontario, p. 366-371.

    We need to use dynamic knowledge organization approaches in order to facilitate effective access and use of domain knowledge. Although there are many approaches to knowledge organization available, it is a challenge to systematically organize evolving domains, because it is not feasible to rely only on humans to create relationships among individual knowledge sources. Additional problems arise because knowledge may not be consistently and completely described, and quality control may not always be in place in distributed knowledge environments. In this article we describe a generic approach to knowledge organization by using systematic knowledge management and applying knowledge-discovery techniques. We use a case-based reasoning system, called TA3, as a core component for knowledge management. Application of symbolic knowledge-discovery component of TA3 supports three main tasks: system optimization, knowledge evolution and evidence creation. To explain advantages of this approach, we use our experience from biomedical domains.

  • Jurisica, I. and Glasgow, J. (2000). Extending case-based reasoning by discovering and using image features in IVF. ACM Symposium on Applied Computing (SAC'2000) Villa Olmo, Como, Italy, p. 52-59.

    This paper describes the application of automated image analysis to evaluate morphology and developmental features of oocytes and embryos in the domain of in vitro fertilization (IVF). Although humans can analyze images more flexibly, computer vision techniques make the proc-ess more objective and precise. We propose to use com-puter-based morphometry to precisely and objectively identify developmental features of oocytes and embryos. Extracted morphological information can be linked with symbolic information to better predict pregnancy outcome and suggest further medical procedures. Recognized fea-tures can then be used to support case-based reasoning and knowledge discovery. The combination of image analysis techniques and case-based reasoning can thus serve as: (1) a feature extraction technique; (2) an indexing approach; and (3) an analysis tool. A combination of symbolic and image information can then be used to identify morpho-logical features of oocytes and embryos that are vital for successful IVF. Extracting image features and analyzing them helps to perform knowledge discovery from images. 



  • Jurisica, I, J. Mylopoulos, E. Yu. (1999) Using Ontologies for Knowledge Management: A Computational Perspective. Annual Conference of the American Society for Information Science, Washington, DC, p. 482-496.

    Knowledge management research focuses on the development of concepts, methods, and tools supporting the management of human knowledge. To further this objective, researchers are studying the way organizations, groups and individuals use knowledge in the performance of daily tasks. They are also developing computer-based tools and techniques to support the acquisition, representation, organization, retrieval, analysis and evolution of knowledge in its many forms. The main objective of this paper is to survey some of the primitive concepts that have been used in computer science for the representation of knowledge and summarize some of their advantages and drawbacks. A secondary objective is to relate these techniques to information sciences theory and practice.

    Several research areas within computer science have developed techniques for representing knowledge so that it can be accessed and used by humans and software systems alike. In particular, Artificial Intelligence (AI) has developed techniques for representing knowledge so that it can be exploited by intelligent systems. Databases have focused on techniques, which allow the representation and management of large amounts of simple knowledge, using as vehicles relational databases and related technologies. Software Engineering and Information Systems have developed elaborate techniques for capturing knowledge that relates to the requirements, design decisions and rationale for a software system. We characterize all these techniques in terms of the primitive concepts they offer for representing knowledge within a given class of applications. 

  • Dayani-Fard, H. and Jurisica, I. (1998) Reverse Engineering by Mining Dynamic Repositories. InWorking Conference on Reverse Engineering (WCRE'98), Honolulu, Hawaii.

    This paper presents some preliminary results on applying information retrieval and knowledge-mining techniques to reverse engineering of legacy systems. In order to support a dynamic environment, we take an approach of integrating lightweight tools. Instead of forcing a user to use a fixed environment, our approach provides a basic information repository, which manages information extracted from the documentation and source code. The system stores this information in a graph structure, it supports navigation through the repository, and modification of its structure and annotation. Preliminary evaluation of the proposed approach on the small-size software system is encouraging. 

  • Jurisica, I. (1998) Asynchronous Telemedicine: A Case-Based Reasoning Approach to Knowledge Sharing. InInformation Technology in Community Health (ITCH*98) Conference, Victoria, BC.

    The health care industry faces constant demands to improve quality, extend services, and reduce cost. Telemedicine satisfies these demands by supporting distant consultations. In addition, knowledge-based systems may augment current synchronous telemedicine applications by storing and managing medical experience over time. By providing timely and efficient access to the knowledge repository, knowledge-based systems help to distribute experience, standardize procedures, lower cost, and increase quality of health care services. This facilitates asynchronous telemedicine.

    Our previous experience from using a case-based reasoning system to support specialists in in vitro fertilization domain shows that this paradigm is suitable for building medical knowledge repositories for knowledge sharing. We propose to extend the system to support tele-consultations: (1) between specialists (rare medical cases); (2) between general practitioners and specialists (standard practices); and (3) between health care professionals and patients (generic medical information). This will help to standardize patient examination and treatment practices. In addition, physicians will be able to share experience via remote knowledge repository.

    This paper focuses on extensions for specialists. We show how case-based reasoning can support evidence-based medicine, remote consultations, and improve knowledge sharing and domain understanding. 

  • Mylopoulos, J, Jurisica, I. and Yu, E. (1998) Computational mechanisms for knowledge organization. In 5th International Conference of the International Society of Knowledge Organization (ISKO 5), pages 125-132, Lille, France. ISKO'98.ps.Z

    This paper reviews several knowledge organization techniques used in Computer Science, in areas such as Artificial Intelligence, Databases and Software Engineering. Some of these computational mechanisms may assist in the organization and management of immense digital information resources. At the same time, the paper notes an increasing need for computer-based information systems to operate in open networked environments. This need requires knowledge organization principles, which are flexible and can be used with informally expressed knowledge. We expect to find such knowledge organization techniques in Library and Information Sciences, and hope to integrated them with the computational techniques described in this paper.  

  • Jurisica, I. and Glasgow, J. (1998). An efficient approach to iterative browsing and retrieval for case-based reasoning. Editor Angel Pasqual del Pobil, Jose Mira and Moonis Ali,Lecture Notes in Computer Science, IEA/AIE*98, pages 535-546, Springer-Verlag. IEA/AIE'98.ps.Z

    A case base is a repository of past experiences that can be used for problem solving. Given a new problem, expressed in the form of a query, the case base is browsed in search of "similar" or "relevant" cases. One way to perform this search involves the iterative evaluation of a series of queries against the case base, where each query in the series is obtained by restricting or relaxing the preceding query.

    The paper considers alternative approaches for implementing iterative browsing in case-based reasoning systems, including a naive algorithm, which evaluates each query independent of earlier evaluations, and an incremental algorithm, which reuses the results of past query evaluations to minimize the computation required for subsequent queries. In particular, the paper proposes an efficient algorithm for case base browsing and retrieval using database techniques for view maintenance. In addition, the paper evaluates the performance of the proposed algorithm with respect to alternative approaches considering two perspectives: (1) experimental efficiency evaluation using diverse application domains, and (2) scalability evaluation using the performance model of the proposed system.

  • Jurisica, I. and Nixon, B. (1998) Building quality into case-based reasoning systems. Lecture Notes in Computer Science, CAiSE*98,  pages 363-380, Springer-Verlag. CAiSE'98.ps.Z

    Complex decision-support information systems for diverse domains need advanced facilities, such as knowledge repositories, reasoning systems, and modeling for processing interrelated information. System development must satisfy functional requirements, but must also systematically meet global quality factors, such as performance, confidentiality and accuracy, called non-functional requirements (NFRs).

    Case-based reasoning (CBR) systems, an important class of decision support systems, require a design process that systematically produces high-quality applications. Beyond satisfying basic functional requirements for CBR, it is important to meet global quality factors, such as performance and confidentiality, called non-functional requirements (NFRs). This paper presents a goal-oriented, knowledge-based approach for aiding decision support system development and usage, namely, it proposes an approach for dealing with non-functional requirements (NFRs) for CBR systems. We show how quality can be built into a CBR system, using the "QualityCBR" approach, which integrates existing work on CBR and NFRs. We illustrate the use of the approach in a complex medical domain in vitro fertilization. In this domain, a CBR system is used for: (1) suggesting hormonal therapy for in-vitro fertilization patients, (2) predicting the probability of successful pregnancy, and (3) interactively determining important patient's characteristics that can improve pregnancy rate. The QualityCBR approach is used to address important NFRs, such as performance, accuracy and confidentiality. 

  • Jurisica, I. Similarity-Based retrieval for diverse Bookshelf software repository users In IBM CASCON Conference, pages 224-235, Toronto, Canada, 1997. CASCON'97.ps.Z

    The paper presents a similarity-based retrieval framework for a software repository that aids the process of maintaining, understanding, and migrating legacy software systems. Designing a software repository involves three issues: (1) information content; (2) information representation; and (3) strategies for accessing repository artifacts. Given the architecture of a Bookshelf software repository, we extend the retrieval system to support imprecise queries, iterative browsing, and diverse users. Because of repository size, complexity of queries and relations among artifacts, we take a performance approach to support a scalable implementation. We propose a retrieval system that uses numeric and semantically rich context-based similarity. Efficient iterative browsing is based on an incremental query evaluation algorithm from database management systems. Explicitly defined context supports various retrieval strategies and diverse user models. 

  • Jurisica, I. and Gupta, K. Knowledge-based systems for decision support in healthcare. In Digital Knowledge Conference II, Toronto, 1997. 

    This paper introduces a generic approach to knowledge-based decision-support in medicine. We review problems present in medical domains and introduce available solutions. We describe a case-based reasoning system called SpotLight and discuss its advantages when applied to complex medical domains, in vitro fertilization and nephrology.

  • Jurisica, I. and Glasgow, J. A case-based reasoning approach to learning control. In 5th International Conference on Data and Knowledge Systems for Manufacturing and Engineering, DKSME-96, Phoenix, AZ, 1996. DKSME'96.ps.Z


  • Jurisica, I. and Glasgow, J. Case-Based Classification Using Similarity-Based Retrieval. In 8th IEEE International Conference on Tools with Artificial Intelligence, Toulouse, France, p. 410-419. TAI'96.ps.Z


  • Jurisica, I. TA3: Case-Based Intelligent Retrieval and Advisory Tool. ACM Conference on Society and the Future of Computing. Durango, CO, 1995.


  • Jurisica, I. and Shapiro, H. Case-based reasoning system applied as an advisor for IVF practitioners 51st Annual Conference of the American Society for Reproductive Medicine, Seattle, WA, 1995. ASRM'95.ps.Z


  • Greiner, R. and Jurisica, I. A statistical approach to solving the EBL utility problem. In Proc. ofNational Conference on AI, AAAI-92, pages 241-247, San Jose, CA, 1992. AAAI'92.ps.Z



  • Symposia

    • Waldron, L, Jurisica, I., Pintilie, M. Assessment of strategies for the application of Lasso penalized likelihood regression to survival analysis of simulated high-dimensional gene expression data, Statistics in the Life Sciences, High-dimensional Inference Workshop, Groningen, the Netherlands, 23-25 November, 2009.

    • Luft, J.R., Cohen, A.E., Dumont, M.E., Grayhack, E.J., Gruner, S.M., Hodgson, K., Jurisica, I., McPherson, A., Phizicky, E.M., Snell, E.H., Soltis, S.M., Weeks, C.M., Malkowski, M.G., DeTitta, G.T. Cloning through diffraction technologies developed at the Center for High-throughput Structural Biology. Keystone Symposia. Structural Biology, 2010.


    • Craddock, K.J., Buys, T.P.H., Zhu, C.Q., Strumpf, D., Pintilie, M., Ding, K., Seymour, L., Jurisica, I., Shepherd, F.A., Lam, W.L., Tsao, M.S. High resolution genomic analysis of NSCLC reveals regions of DNA copy number gain that may be predictive of benefit from adjuvant chemotherapy, IASLC, 2009.


    • Elschenbroich, S., Ignatchenko, V., Shaw, P. A., Jurisica, I., Kislinger, T. Identification of putative biomarkers for ovarian cancer from human ascitic fluid. HUPO, Toronto, October, 2009.


    • Rosu, D., Ng, J., Nigul, L., Jurisica, I. Requirements for practical ontologies - Empirical and theoretical considerations, CASCON, 2009.


    • Rosu, D., Ng, J., Nigul, L., Jurisica, I., Frameworks for practical ontologies, CAS University Days, 2009.


    • Yan, R., Reidemeister, T., Ward, P., Iszlai, G., Jurisica, I. Identifying failures in a large-scale software systems using pattern discovery methods and other machine learning techniques, IBM CASCON, 2009.


    • Cumbaa, C. A., Jurisica, I. Crystallization image analysis on the World Community Grid. Annual Meeting of the American Crystallographic Association, Toronto, ON, July 25-30, 2009.


    • Jurisica, I. Rational prediction and analysis of protein interactions; Disease-specific interaction prediction, HUPO PSI Spring Meeting, Turku, Finland, April 25-30, 2009.


    • Jurisica, I. Predicting and analyzing cancer interaction networks. Bellairs Workshop on Biological and Computational Analysis of Protein-Protein Interaction networks, April 19-25, 2009.


    • Jurisica, I. PSI target prioritization, cancer target selection and interpretation using protein-protein interaction prediction and analysis. NIGMS 2009 Workshop on Enabling Technologies for Structural Biology, Natcher Conference Center, NIH, Washington DC, March 4-6, 2009.


    • Cumbaa, C. A. and Jurisica, I.. Crystallization image analysis – progress update. NIGMS 2009 Workshop on Enabling Technologies for Structural Biology, Natcher Conference Center, NIH, Washington DC, March 4-6, 2009.


    • Sharma, P., Ignatchenko, A., Bousette, N., Evangelou, A., Jurisica, I., Gramolini, A., Kislinger, T. Identification and characterization of cell-surface associated proteins in the heart: Applications of a proteomic strategy. Omics Meets Cell Biology, Keystone, CO, January 2009.


    • Yan, R., Reidemeister, T., Jurisica, I., Ward, P., Litani, E., Dancy, L., Labadie, E., Litoiu, M., Identifying Failures in a Large-Scale Software Systems using Pattern Discovery Methods and other Machine Learning Techniques, Technical Showcase, CASCON, 2008.


    • Yan, R., Jurisica, I., and Litoiu, M., Pattern Discovery on ITCAMfRTT, IBM Tivoli University Day, September 2008.


    • Sharma, P., Ignatchenko, A., Bousette, N., Evangelou, A., Jurisica, I., Gramolini, A., Kislinger, T., Proteomic profiling of the cardiomyocyte plasma membrane. HUPO, 7th World Congress, Amsterdam, August, 2008.


    • Kittanakom, S., Suter, B., Heisler, L., Smith, A., Nislow, C., Jurisica, I., Stagljar, I., Systematic and high-throughput screening for protein-protein interactions combining membrane yeast two-hybrid with DNA microarray technology. Yeast Genetics and Molecular Biology Meeting, Toronto, ON, July 22-27, 2008.


    • Jurisica, I. and M. Agochiya. Integrative computational biology approach to ovarian cancer. 20th World Cancer Congress. Geneva, Switzerland, 2008.


    • De Titta, G., C. A. Cumbaa, J., Luft, A. Lauricella, M. Malkowski, R. Nagel, E. Snell, Jurisica, I.. Reading the crystallization tea leaves with a little help from my web friends. ISMB, Toronto, July, 2008.


    • Boutros, P.C., L. Z. Penn, M.S. Tsao, Jurisica, I.. The network properties of prognostic markers for cancer. ISMB, Toronto, ON, 2008.


    • Cumbaa, C. A., and Jurisica, I.. Crystallization image analysis on the World Community Grid. ISMB, Toronto, ON, 2008.


    • Zhu, C.Q., D. Strumpf, A. Shepherd, Jurisica, I., M. S. Tsao. A 12-gene expression signature prognostic for squamous cell lung carcinoma. ISMB, Toronto, ON, 2008.


    • Strumpf, D., W. Xie, C. Q. Zhu, F. A. Shepherd, M. S. Tsao, Jurisica, I.. Systematically determining the key biological mechanisms underlying non-small cell lung cancer by integrative computational biology approach. ISMB, Toronto, ON, 2008.
    • Brown, K. R., Jurisica, I.. Unequal evolutionary conservation of human protein interactions in interologous networks. ISMB, Toronto, ON, 2008.


    • Fortney, K. and Jurisica, I.. Narrowing the search for novel aging genes with systems biology. ISMB, Toronto, ON, 2008.


    • Cox, B., Evangelou, A., Ignatchenko, V., Kotlyar, M., Ignatchenko, A., Whiteley, K., Jurisica, I., Rossant, J., Adamson, SL., and Kislinger, T. A proteome and transcriptome resource of the human and mouse feto-maternal exchange tissue, IVBM, Australia, 2008. Presentation.


    • Jurisica, I. and M. Agochiya. Systems biology approach to ovarian cancer: ovarian cancer data integration portal. 4th Canadian Conference on Ovarian Cancer Research. Montreal, QC, 2008. Presentation.


    • Cumbaa, C. A., and Jurisica, I.. Crystallization image analysis on the World Community Grid. NIH PSI Bottlenecks Meeting, Bethesda, MD, March 2008, Presentation.


    • Eppert, K., E. Lechman, K. Takenaka, J. Lu, Jurisica, I., A. J. Canty, M. Minden, T. R. Golub, B. L. Ebert, J. E. Dick. Identification and characterization of regulatory networks specific for leukemic stem cells. 73rd Symposium: Control & Regulation of Stem Cells, 2008.


    • Tsao, M.-S., Zhu, C.-Q., Ding, K., Strumpf, D., Pintilie, M., Meyerson, M., Seymour, L., Jurisica, I., Shepherd, F. A 15-gene expression signature prognostic for survival and predictive for adjuvant chemotherapy benefit in JBR.10 patients. ASCO, 2008. Presentation


    • May T., Sharma M., Jurisica I.Begley H., Brown T.J., Shaw P.A. Significant Genes and Pathways involved in Low Grade Serous Carcinogenesis. Modern Pathol (Supp.), 2008.


    • May, T., M. Sharma, Jurisica, I., B. Rosen, J. Murphy, T. Brown, P. Shaw, Significant genes involved in low grade ovarian carcinogenesis, SGI 55th Annual Meeting, San Diego March 2008.


    • May T., Sharma M., Jurisica I.,Begley H., B. Rosen, J. Murphy, Brown T.J., Shaw P.A. The genetic profile of ovarian low grade serous carcinoma is similar to serous low malignant potential tumors with micropapillary features and distinct from high grade serous carcinoma. The 1st Ovarian Cancer Action International Conference, London UK, March 6-8, 2008. Presentation.


    • Jurisica, I., H. Li, A. Jurisicova, T. Kislinger. Integrative computational biology approach leads to discovering novel putative biomarkers for early ovarian cancer detection. The 1st Ovarian Cancer Action International Conference, London UK, March 6-8, 2008. Presentation.


    • Craddock K.J., Strumpf D., Xie W., Buys T.P.H., Chi B., Lam W.L., Jurisica I.,Tsao M.S. Integrative genomic microarray analyses reveal novel molecular targets in non-small cell lung carcinoma. United States and Canadian Academy of Pathology Annual Meeting, Denver, Colorado, 2008.


    • Thayer, M., A. Lauricella, E.H. Snell, S. Potter, J. Wolfley, R. Nagel, M. Said, M.E. Snell, M. Rosenblum, M. Malkowski, T. Veatch, E. Cook, C. Cumbaa, Jurisica, I., J.R. Luft, and G.T. DeTitta. Creating and milking a macromolecular crystallization database: A view from the trench. Pittsburgh Diffraction Conference. Buffalo, NY, 2007. Presentation


    • Kotlyar, M., and Jurisica, I.. Predicting mouse essential genes from gene expression, protein-protein interactions and orthology data. CSH Interactome Networks, Hinxton, UK, 2007. (poster)


    • Yan, R., Jurisica, I., J. Waterhouse, E. Poulin. Analysis of patterns in web traffic using an improved Gibbs sampling algorithm. North-East Student Conference on Artificial Intelligence (NESCAI07), Cornell University, Ithaca, NY, 2007. presentation


    • L. Gortzak-Uzan, A. Ignatchenko, A. Evangelou, P. St.Onge, B. Rosen, P. Shaw, T. J. Brown, T. Kislinger, and Jurisica, I.. Integrated Proteomic and Bioinformatic Analyses to Identify Putative Biomarkers, ISMB-07, Vienna, Austria, July 2007. (poster)


    • C. A. Cumbaa and Jurisica, I.. Automated classification of crystallization images. NIH Bottlenecks Meeting, Bethesda, MD, March 2007. (poster)


    • Cumbaa, C. A. and Jurisica, I.. Automated classification of crystallization images. NIH Bottlenecks Meeting, Bethesda, MD, March 2007.


    • Guha, D., K. Warren, M. Agochiya, S. Alwaheeb, M. Curtis, J. Sweet, T. Brown, N. Fleshner, J. Trachtenberg, Jurisica, I., S. J. Done. Identification and characterization of genetic alterations at the stromal-epithelial interface in prostate cancer , AACR, 2007.


    • Gortzak-Uzan L., A. Ignatchenko, A. Evangelou, P. St.Onge, B. Rosen, P. Shaw, T. J. Brown, Jurisica, I., and T. Kislinger. A proteome resource of ovarian cancer ascites. Keystone Symposia, March 22, 2007.


    • Shaw PA, Agochiya M, Sharma M, Murphy J, Rosen B, Brown TJ, Begley H, Jurisica I. Gene expression profiles of ovarian serous carcinoma after neoadjuvant chemotherapy. Modern Pathology, 19:195A-196A, 2006.


    • Bachtiary B, Boutros P, Pintilie M., Shi W, Schwock J, Penn LZ , Jurisica I, Fyles A, Liu FF. Gene expression profiling in cervical cancer - an exploration of intra-tumor heterogeneity. Third International Conference on Translational Research and Pre-Clinical Strategies in Radiation Oncology, March 12-15, 2006.


    • (abstract) Motamed-Khorasani A, Letarte M, Jurisica I, Shaw PA, Murphy KJ, Brown TJ. Androgen-induced altered gene expression profiles in ovarian epithelial cells from BRCA1 and 2 mutation carriers as compared to cells from control patients. Fertility and Sterility, 84:S440, 2005.


    • Liu FF, Bastianutto C, Shi W, Li A, Perez-Ordonez B, Ng R, Chow KY, Zhang W, Jurisica I, Bayley A, Kim J, O'Sullivan B, Siu L, Chen E. Multiple dysregulated pathways in nasopharyngeal carcinoma revealed by gene expression profiling. Radiotherapy and Oncology, 76: S20, 2005.


    • Lau, S. K., C. Q. Zhu, P. C. Boutros, M. Pintilie, W. Zhang, F. Blackhall, N. Liu, L. Penn, Jurisica, I., F. A. Shepherd, S. D. Der, M.-S. Tsao. Quantitative PCR validation of putative non-small cell lung cancer (NSCLC) prognostic marker genes derived from multiple published microarray databases and reports. Lung Cancer, 49: S283, 2005.


    • Brown, K.R., I. W. Taylor, J. L. Wrana, Jurisica, I.. Exploring "date hubs" in higher eukaryotes through large-scale data integration and interolog-mapped protein interactions. Keystone symposium. 2005. (poster).


    • Shi, W., C. Bastianutto, A. Li, B. Perez-Ordonez, R. Ng, KY Chow, D. Huang, P. Busson, W. Zhang, Jurisica, I., A. Bayley, J. Kim, B. O'Sullivan, L. Siu, E. Chen, F.F. Liu. Gene expression profiling of human nasopharyngeal carcinoma. AACR, 2005. (poster)

    • Cumbaa, C. and Jurisica, I., Hidden Factors in Protein Crystallization, ISMB, 2005. (poster)

    • Shi, W. C. Bastianutto, A. Li, B. Perez-Ordonez, R. Ng, K.Y. Chow, D. Huang, P. Busson, W. Zhang, Jurisica, I., A. Bayley, J. Kim, B. O'Sullivan, L. Siu, E. Chen, Fei-Fei Liu, Gene expression profiling of human nasopharyngeal carcinoma, AACR, April 2005.

    • Jurisica, I., C. A. Cumbaa. Pattern discovery in high-throughput protein crystallization trials. NIGMS 2005 PSI Protein Production & Crystallization Workshop, Natcher Conference Center, NIH, Washington DC, January, 2005.


    • E. Xia and Jurisica, I.. Query Processing Using Grid Computing. IBM Cascon Conference, Toronto, ON, October 2004. Poster presentation.


    • C. A. Cumbaa and Jurisica, I.. Automatic Classification and Pattern Discovery in High-throughput Protein Crystallization Trials, International Conference on Structural Genomics, November 17-21, Washington DC, 2004. Poster presentation.


    • Kevin R. Brown and Jurisica, I.. Online Predicted Human Interaction Database [OPHID]: Exploring the Human Interactome. ISMB/ECCB'04, Glasgow, UK, July 31st - August 4th, 2004. Poster presentation.
      OPHID server


    • Kevin R. Brown, Afsaneh Motamed-Khorasani, Miriam Barrios-Rodiles, Ted Brown, Michelle Letarte, Igor Jurisica, I.ntegrated Analysis of Androgen-Altered Genes in Epithelial Ovarian Cancer Using a Predicted Human Protein-Protein Interaction Network. Poster #5751, AACR Annual Meeting, Orlando, Florida, 2004
      OPHID server


    • K. Takenaka, Jurisica, I., J. Dick. Global gene expression analysis of highly purified human hematopoietic stem cells, 45th Annual Meeting and Exposition of American Society of Hematology (ASH'03), San Diego, CA, December 6-9, 2003.


    • Xia, E. and Jurisica, I.. Effectiveness of grid configurations on application performance.  In Proceedings of IBM Cascon Conference, October 6-9, 2003. Technology showcase.


    • Przulj, N. and Jurisica, I.. A call graph analysis. In Proceedings of IBM Cascon Conference, October 6-9, 2003. Poster


    • Przulj, N; Wigle, D.; Jurisica, I. Functional topology in a network of protein interactions, BioPathways, Intelligent Systems for Molecular Biology (ISMB 2003), Brisbane, Australia, June 27, 2003.


    • Blackhall, F., Wigle, D., Pintilie, M., Jurisica, I., Liu, N., Johnston, N., Darling, G., Keshavjee, S., Winton, T., Shepherd, F., Tsao, M. Evaluation of novel prognostic markers detected by cDNA microarray in stage I-III non-small cell lung cancer by real-time RT-PCR. 10th World Congress on Lung Cancer, August 2003.


    • Przulj, N., Lee, G., Jurisica, I. Functional analysis of large software networks. IBM Academy Conference on Proactive Problem Prediction, Avoidance, and Diagnosis, IBM T. J. Watson Research Center, April 28-29, 2003.


    • Jurisicova, A., Oh, J., Acton, B., Jurisica, I., Tilly, J. 2-cell embryonic arrest is accompanied by dysregulated expression of cell death genes and alterations in mitochondrial potential. Keystone Symposia, Poster, February, 2003.


    • Sultan, M., Blackhall, F., Wigle, D., Tsao, M., Jurisica, I. Coupled two-way clustering of microarray data using a vector quantization algorithm. AACR Annual Meeting, Poster, Toronto, April 2003.


    • Giles C. Warner, Jurisica, I., N. Biesley, C. MacMillan, J. Irish, D. Brown, P. Gullane, R. Wells, S. Kamel-Reid. Molecular classification of head and neck cancer, 6th Research Workshop in the biology, prevention and treatment of head and neck cancer, Mclean, Virginia, Oct 2002. Abstract for oral presentation


    • Giles C. Warner, Jurisica, I., N. Biesley, C. MacMillan, J. Irish, D. Brown, P. Gullane, R. Wells, S. Kamel-Reid. Differential gene exression in head and neck cancer, American Academy of Otolaryngology-Head and Neck Surgery Annual Meeting San Diego, Sept. 2002. Abstract for oral presentation.


    • Giles C. Warner, Jurisica, I., N. Biesley, C. MacMillan, J. Irish, D. Brown, P. Gullane, R. Wells, S. Kamel-Reid. Differential gene exression in head and neck cancer, Canadian Society of Surgical Oncology, Ottawa, March 2002, Abstract for oral presentation. (Awarded the 1st prize)


    • Albert, M., Jurisica, I., Park, P., Squire, J., Macgregor, P. (2001). Comparative microarray study of T7-based and PCR-based RNA amplification approaches: A pilot study for the expression profiling of laser capture microdissected prostate cancer samples. Conference on Laser Capture Microdissection and Macromolecular Analysis of Normal Development and Pathology, NIH, Bethesda, MD, July 17-18. Abstract for oral presentation.


    • Mouka, J., Jurisica, I., Huner, O. (2001). MicroArray eXperiment (MAX): A collaborative on-line research environment for large-scale cDNA microarray projects. The Third International Meeting on Microarray Data Standards, Annotations, Ontologies and Databases (MGED-3), Stanford University, Palo Alto, CA. Poster.


    • Jurisica, I., Rogers, P., Glasgow, J., Fortier, S., Collins, R., Wolfley, J., Luft, J. and DeTitta, G. (2001). High Throughput Macromolecular Crystallization: An Application of Case-Based Reasoning and Data Mining, in Methods in Macromolecular Crystallography, L. Johnson and D. Turk (Eds.), Volume 325, NATO Science Series: Life Sciences, Kluwer Academic Press.


    • Luft, J. R., J. Wolfley, M. Bianca, D. Weeks, Jurisica, I., P. Rogers, J. Glasgow, S. Fortier, G. T. DeTitta. (2000). High throughput protein crystallization: Keeping up with the genomics. Gordon Conference on Diffraction Methods in Molecular Biology , Andover, NH.


    • Luft, J.R., Bianca, M., Owczarczak, L. M., Weeks, D. R., Jurisica, I., Rogers, P., Glasgow, J., Fortier, S. and DeTitta, G.T. The development of high througput methods for macromolecular microbatch crystallization. Recent Advances in Macromolecular Crystallization, San Diego, CA, 1999.


    • Jurisica, I., DeTitta, G.T., Luft, J., Glasgow, J., Fortier, S. Knowledge Management in Scientific Domains, AAAI-99 Workshop on Exploring  Synergies of Knowledge Management and Case-Based Reasoning, Orlando, FL, 1999.


    • Luft, J.R., Bianca, M., Jurisica, I., Rogers, P., Glasgow, J., Fortier, S. and DeTitta, G.T. An Opening Strategy for Macromolecular Crystallization: Case-Based Reasoning and the Exploitation of a Precipitation Reaction Outcome Database. Conference of the American Crystallography Association, Buffalo, NY, 1999.


    • Errico, B. and Jurisica, I.. Adaptive Agent-based Systems for the Web: An Application to the NECTAR Project. AAAI Spring Symposium on Intelligent Agents in Cyberspace, Stanford University, March 22 - 24, 1999.


    • Jurisica, I. Supporting evidence-based medicine by cooperative information systems. In Digital Knowledge Conference III, Toronto, 1999.


    • Jurisica, I. Library as a Knowledge Broker: Knowledge Management and Sharing. Ontario Library Association Super Conference, Toronto, January 21-23, 1999.


    • Glasgow, J. and Jurisica, I. Integration of case-based and image-based reasoning, American Association for Artificial Intelligence, Workshop on Case-Based Reasoning, Madison, WI, July 28, 1998. AAAI-CBRW'98.ps.Z


    • Jurisica, I. Supporting flexibility. A case-based reasoning approach. In The AAAI Fall Symposium. Flexible Computation in Intelligent Systems: Results, Issues, and Opportunities, Cambridge, Massachusetts, 1997.


    • Jurisica, I. Inductive learning and case-based reasoning, Canadian AI Conference, Workshop on What is Inductive Learning? Toronto, Ontario, 1996.


    • Jurisica, I. A Similarity-Based Retrieval Tool for Software Repositories The 3rd Workshop on AI and Software Engineering: Breaking the Mold. IJCAI-95, Montreal, Quebec, 1995


    • Jurisica, I. and Glasgow, J. Applying Case-Based Reasoning to Control in Robotics 3rd Robotics and Knowledge-Based Systems Workshop, St. Hubert, Quebec, 1995.


    • Jurisica, I. How to Retrieve Relevant Information? Proceedings of the AAAI Fall Symposium Series on Relevance. New Orleans, Louisiana, 1994.


    Thesis work

    • Jurisica, I. (1998). TA3: Theory, Implementation, and Applications of Similarity-Based Retrieval for Case-Based Reasoning. PhD thesis, Department of Computer Science, University of Toronto, Toronto, Ontario.

    • Jurisica, I. (1993). Query optimization for knowledge base management systems; A machine learning approach. MSc thesis. Department of Computer Science, University of Toronto, Toronto, Ontario.

    • Jurisica, I. (1991). Machine learning in expert systems. Dipl. Ing. thesis, Slovak Technical University in Bratislava, Slovakia.

    Technical reports

    1. Jurisica, I., J. Glasgow, R. Ng, H. Hoos. Integrative Computational Biology, The 3rd Canadian Working Conference on Computational Biology (CCCB'04), IBM Cascon Conference, IBM TR-74-203-8, 2005.

    2. Arshadi, N. and Jurisica, I., Maintaining case-based reasoning in high-dimensional domains using mixture of experts, Technical Report CSRG-490, Department of Computer Science, University of Toronto, Toronto, Ontario, June 2004.

    3. Przulj, N., Corneil, D., Jurisica, I. Modeling Interactome: Scale-Free or Geometric?, Technical Report 321/04, Department of Computer Science, University of Toronto, Toronto, Ontario, 2004.

    4. Jurisica, I.. Data Mining and Knowledge Discovery, IBM Technical Report 74.165-a, IBM Centre for Advanced Studies, Toronto, December 1, 1998.

    5. J. Glasgow and Jurisica, I.. Data Storage, Retrieval and Mining in Biomedical Applications. IBM Technical Report 74.165-b, IBM Centre for Advanced Studies, Toronto, December 1, 1998.

    6. Jurisica, I.. Context-based similarity applied to retrieval of relevant cases. Technical Report DKBS-TR-94-5, University of Toronto, Department of Computer Science, Toronto, 1994.

    7. R. Greiner and Jurisica, I.. An EBL system that (almost) always improve performance. Technical Report, Siemens Corporate Research, Princeton, NJ, 1992.


    Workshops

    1. Jurisica, I. and M. McGuffin, User interfaces for visualizing complex data. IBM Cascon, Toronto, Ontario, October 30, 2008.

    2. Jurisica, I. and D. Aldridge. The Fourth Canadian Working Conference on Computational Biology (CCCB’05) – Systems biology. Toronto, Ontario, October, 2005.

    3. Jurisica, I., J. Glasgow, R. Ng, H. Hoos. The Third Canadian Working Conference on Computational Biology (CCCB’04). Toronto, Ontario, October 4, 2004.

    4. Jurisica, I. and M. Hallett. The Second Canadian Working Conference on Computational Biology (CCCB’02). Toronto, Ontario, October 1, 2002.

    5. Jurisica, I. The First Canadian Working Conference on Computational Biology (CCCB’00). Toronto, Ontario, November 12, 2000.

    6. Jurisica, I. And Rigoutsos, I. Knowledge management: Moving from business to technical and scientific domains, IBM CASCON, 1999.

    7. Jurisica, I. Health care for the future, CITO, February 18, 1999.

    8. Glasgow, J. and Jurisica, I.. “Data storage, retrieval and mining in biomedical applications”. In IBM CASCON Conference, IBM CAS, IBM Technical Report 74.165-b, Toronto, 1998.

    9. Jurisica, I. Telemedicine: Where we are and where can we go? In IBM CASCON Conference, IBM CAS Technical Report 74.161, Toronto, Canada, November 10-13, 1997.

    10. Jurisica, I. and K. Gupta. A Framework for medical knowledge management systems. In Digital Knowledge II Conference, Toronto, Canada, October 20-21, 1997.


    Tutorial Presentations

    1. Jurisica, I. Interaction networks. The Canadian Bioinformatics Workshop Series, Ed. M. Hallett and M. Suderman, Systems and Network Biology. Toronto, ON June 27-28, 2008.

    2. Jurisica, I., Knowledge Discovery in High-Throughput Biological Domains. Introduction to computational biology, RSFDGrC, Regina, SA, September 1, 2005.

    3. Jurisica, I., I. Rigoutsos, A. Floratos. Knowledge Discovery in Biological Domains, ACM, Knowledge Discovery in Databases Conference, Boston, MA, 2000.

    4. Glasgow, J. and Jurisica, I.. Knowledge Mining and Discovery in Molecular Biology, Pacific Symposium on Biocomputing (PSB’99), Hawaii, January 4, 1999.

    5. Jurisica, I. Data Mining and Knowledge Discovery, IBM CASCON Conference, IBM Technical Report 74.165-a, Toronto, December 1, 1998.

    6. Mylopoulos, J., V. Chaundhri, Jurisica, I., D. Plexousakis, A. Shrufi, T. Topaloglou, and H. Wang. Development and Application of Knowledge Base Management Systems. Australian Joint Conference on AI (AJCAI’95), Canberra, Australia, November 1995.

    7. Mylopoulos, J., V. Chaundhri, Jurisica, I., D. Plexousakis, A. Shrufi, T. Topaloglou, and H. Wang. Knowledge Base Management Systems. International Joint Conference on AI (IJCAI’95), Montreal, Quebec, August 1995.

    8. Mylopoulos J., V. Chaundhri, Jurisica, I., D. Plexousakis, A. Shrufi, T. Topaloglou, and H. Wang. Knowledge Base Management and its Application. IEEE Conference on AI Applications (CAIA’94), IEEE Computer Society, San Antonio, TX, March 1994.

    9. Jurisica, I. Representation and management issues for case-based reasoning systems. TRIO/ITRC Research Retreat, Queen's University, Kingston, May 10-12 1994.

    10. Mylopoulos, J., V. Chaundhri, Jurisica, I., D. Plexousakis, A. Shrufi, T. Topaloglou, and H. Wang. Knowledge Base Management Systems. Database and Expert Systems Applications (DEXA’94), Athens, Greece, September 1994

    11. Mylopoulos, J., V. Chaundhri, Jurisica, I., D. Plexousakis, A. Shrufi, T. Topaloglou, and H. Wang. Information and Knowledge Base Management. Information Technology Research Center, University of Toronto, Department of Computer Science, February 1993.

    Supplementary Data

    • Wigle, D., Jurisica, I., N. Radulovich, M. Pintilie, J. Rossant, N. Liu, C. Lu, J. Woodgett, I. Seiden, M. Johnston, S. Keshavjee, G. Darling, T. Winton, B. Breitkreutz, P. Jorgenson, M. Tyers, F. A. Shepherd, M.S. Tsao. Molecular profiling of non-small cell lung cancer and correlation with disease-free survival Cancer Research, 62(11):3005-3008, June 1, 2002.
    • Brown, K. and Jurisica, I.. Unequal evolutionary conservation of human protein interactions in interologous networks. Genome Biology, 2007. In press.
    • Miriam Barrios-Rodiles, Kevin R. Brown, Barish Ozdamar, Rohit Bose, Zhong Liu, Robert S. Donovan, Fukiko Shinjo, Yongmei Liu, Joanna Dembowy, Ian W. Taylor, Valbona Luga, Natasa Przulj, Mark Robinson, Harukazu Suzuki, Yoshihide Hayashizaki, Igor Jurisica, and Jeffrey L. Wrana. High-Throughput Mapping of a Dynamic Signaling Network in Mammalian Cells , Science 307:(5715): 1621-1625, 2005.
    • Brown, K. and Jurisica, I.. Online Predicted Human Interaction Database OPHID, Bioinformatics, 2005. Advance Access published on January 18, 2005. doi:10.1093/bioinformatics/bti273. In press.

    • K. Brown and Jurisica, I.. Online Predicted Human Interaction Database (OPHID): Exploring the human interactome. ISMB/ECCB'04, Glasgow, UK, 2004. Poster.
    • Przulj, N., Corneil, D., Jurisica, I. Modeling interactome: Scale-free or geometric?, Bioinformatics, 20(18):3508-3515, 2004
    • King, A. D., N. Przulj, Jurisica, I. Protein complex prediction via cost-based clustering. Bioinformatics, 20(17):3013-3020, 2004.
    • Przulj, N., Wigle, D., Jurisica, I. Functional topology in a network of protein interactions. Bioinformatics 20(3):340-348, 2004.