The Role of Ontology in Big Cancer Data: Difference between revisions

From NCOR Wiki
Jump to navigationJump to search
 
(84 intermediate revisions by 2 users not shown)
Line 1: Line 1:
Date: May 12-13, 2015
Date: May 12-13, 2015
== Participation ==
'''Please note that for Day 1 of this meeting we are now at maximum capacity. Spaces are available for participants who wish to attend on Day 2 only.'''
Interested persons should write to [mailto:phismith@buffalo.edu Barry Smith]. There is no participation fee.
== Hotels ==
We have a block of rooms reserved near the NCI building which is our venue on Day 1 in:
Hilton Garden Inn Rockville-Gaithersburg
14975 Shady Grove Road
Rockville, MD
(240) 507-1900
The link for reservations is [http://hiltongardeninn.hilton.com/en/gi/groups/personalized/D/DCARVGI-ONT-20150511/index.jhtml?WT.mc_id=POG here]. Cut-off date is April 13th.
For other hotels in the area, see [[Hotels near NCI | this list.]]


== Venue ==
== Venue ==


*Day 1 (10am-5pm) will be in NCI's Shady Grove building (9609 Medical Center Dr, Rockville, MD, 20850), in Room 2W-32/34. 
*Day 1 [http://www.hhs.gov/ash/bloodsafety/advisorycommittee/nih-fisherslane-conferencecenter-directions.pdf FISHERS LANE CONFERENCE CENTER], 5635 Fishers Lane, Rockville, MD 20852 [[Directions to Fishers Lane Conference Center]]


*Day 2 (9am-3pm) will be a public session in Balcony A, Natcher Conference Center, NIH Building 45, Bethesda, MD 20892
*Day 2 [http://www.nih.gov/icd/od/ocpl/VIC/index.htm Balcony A, Natcher Conference Center, NIH Building 45, Bethesda, MD 20892]
 
'''Goal of the meeting (to be expanded)''': To better understand the challenges involved in using big data for cancer research, and to explore the utility of ontologies in addressing these challenges.


==Session 1: Addressing cancer big data challenges through imaging ontologies==
==Session 1: Addressing cancer big data challenges through imaging ontologies==


Tuesday, May 12 in NCI Shady Grove building (9609 Medical Center Dr, Rockville, MD, 20850), Room 2W-32/34
Tuesday, May 12 in FISHERS LANE CONFERENCE CENTER, 5635 Fisher Lane, Rockville, MD 20852


10:00-13:00
10:00 '''Warren Kibbe (NCI): Introductory Remarks'''


'''Barry Smith (Buffalo): The cancer research ontology space: An introduction'''
10:10 '''Barry Smith (Buffalo): The cancer research ontology space: An introduction''' [http://ncor.buffalo.edu/2015/NCI/Smith.pptx Slides]
:The goal of the meeting: to better understand the challenges involved in using big data for cancer research, and to explore the utility of ontologies in addressing these challenges.
:An introduction to existing ontology resources in the cancer domain, including NCI Thesaurus and the OBO Foundry; addressing opportunities and reasons for scepticism as concerns the use of ontologies in addressing cancer big data; topics to address will include the ontology of tissues, of tissue banks, and of image banks
:An introduction to existing ontology resources in the cancer domain, including NCI Thesaurus and the OBO Foundry; addressing opportunities and reasons for scepticism as concerns the use of ontologies in addressing cancer big data; topics to address will include the ontology of tissues, of tissue banks, and of image banks
:What are the desired outcomes from this meeting?


'''Ilya Goldberg (NIA): The role of imaging ontologies in cancer big data'''
10:50 '''Ilya Goldberg (NIA): The role of imaging ontologies in cancer big data''' [http://ncor.buffalo.edu/2015/NCI/Goldberg.pdf Slides]


:As image data accumulates descriptive metadata using controlled vocabularies (ontologies), two key applications form a key "Big Data" technology that will impact cancer research and medicine as a whole:  
:As image data accumulates descriptive metadata using controlled vocabularies (ontologies), two key applications form a key "Big Data" technology that will impact cancer research and medicine as a whole:  
Line 46: Line 24:
::2. automated annotation, which addresses the scaling problem in associating metadata with ever larger image collections by using ontological terms as categories and the images carrying them as sample data to train machine classifiers.  
::2. automated annotation, which addresses the scaling problem in associating metadata with ever larger image collections by using ontological terms as categories and the images carrying them as sample data to train machine classifiers.  


'''Metin Gurcan (Ohio State) and John Tomaszewski (University at Buffalo): How ontologies can help in addressing the big data challenges of pathology imaging'''
11:30 Break
11:45 '''John Tomaszewski (University at Buffalo) and Metin Gurcan (Ohio State): ​How ontologies can help in addressing the big data challenges of pathology imaging: perspectives from a collaborating pathologist and image scientist''' [http://ncor.buffalo.edu/2015/NCI/Tomaszewski.pptx Tomaszewski Slides] [http://ncor.buffalo.edu/2015/NCI/Gurcan.ppt Gurcan Slides]


:Widespread availability of whole slide scanners has a transformative effect on pathology, resulting in data that is "big” not only in terms of size but also in richness of information. Computational algorithms have been developed to tap into this data both to assist pathologists and to improve diagnosis, prognosis and treatment. Now, however, there is an urgent need to provide a set of terms and formal definitions necessary to characterize both the histopathological images and the algorithms that operate on them. We will present in this light our on-going work to create an ontology for histopathological imaging.
:Widespread availability of whole slide scanners has a transformative effect on pathology, resulting in data that is "big” not only in terms of size but also in richness of information. Computational algorithms have been developed to tap into this data both to assist pathologists and to improve diagnosis, prognosis and treatment. Now, however, there is an urgent need to provide a set of terms and formal definitions necessary to characterize both the histopathological images and the algorithms that operate on them. We will present in this light our on-going work to create an ontology for histopathological imaging.
Line 56: Line 35:
13:00 Lunch
13:00 Lunch


==Session 2: Addressing cancer big data challenges with the Ontology for Biomedical Investigations (OBI)==
==Session 2: Addressing cancer big data challenges by capturing metadata for data integration and analysis through ontologies ==
 
13:45 '''Christopher R. Kinsinger (NCI): Ontology Considerations for Cancer Proteomics''' [http://ncor.buffalo.edu/2015/NCI/Kinsinger.pptx Slides]
:NCI’s Clinical Proteomic Tumor Analysis Consortium (CPTAC) conducts proteomic and genomic analysis on clinical tissue samples.  While CPTAC has utilized controlled vocabularies for biospecimen resources, development and adoption of ontology tools for rapidly advancing technologies such as proteomics and genomics remains a challenge.
 
14:15 '''Anna Maria Masci (Duke): Understanding the tumor microenvironment: integration of multiscale cancer data''' [http://ncor.buffalo.edu/2015/NCI/Masci.pdf Slides]
:Tumor biology can be fully understood by considering the tumor as part of an ecosystem that includes tumor cells, stromal cells, matrix, fluids and gas. Real Time PCR, Tissue Micro Arrays, Plasma data and ontology are approaches used to describe the complex tumor ecology.
 
14:45 '''Jingshan Huang (South Alabama): The OMIT ontology'''  [http://ncor.buffalo.edu/2015/NCI/Huang.ppt Slides]


14:00-17:00
:The purpose of constructing OMIT is to establish data exchange standards and common data elements in miR and microgenomics domain. Cell biologists and bioinformaticians can make use of OMIT to leverage emerging semantic technologies in knowledge acquisition and discovery for more effective identification of important roles performed by miRs in human diseases including cancer.


'''Chris Stoeckert (Penn): Integration and alignment of ontologies for cancer metadata collection based on OBI'''
15:15 Break
:tasks: address the challenge that cancer research is multidisciplinary and requires standard terminology from multiple domains.
:briefly describe OBI and show how it has been used for collecting clinical and -omic metadata highlighting relevance to cancer data and integration of other ontologies for that purpose.


'''Gully Burns (UCSD): Applying OBI to cancer pathways via Knowledge Engineering from Experimental Design (KEfED)'''
15:30 '''Chris Stoeckert (Penn): Integration and alignment of ontologies for cancer metadata collection based on the Ontology for Biomedical Investigations (OBI)''' [http://ncor.buffalo.edu/2015/NCI/Stoeckert.pptx Slides]
:addresses the challenge that much of what is known about cancer is only available in publications and requires text mining including the experimental basis for that knowledge.
:Cancer research is multidisciplinary and requires standard terminology from multiple domains. OBI and interoperable OBO Foundry ontologies can be used for collecting and integrating clinical and -omic metadata relevant to cancer data. The Ontology for Biobanking is an example based on OBI.
:application of OBI as semantic base for text mining and knowledge engineering.


'''Philippe Rocca-Serra: How can OBI contribute to unraveling cancer etiology? Scope, Gaps and Future Development of an interoperable semantic resource
16:00 '''Gully Burns (ISI/ USC): Leveraging OBI to cancer pathways via Knowledge Engineering from Experimental Design (KEfED)''' [http://ncor.buffalo.edu/2015/NCI/Gully.pptx Slides]
:address the challenge that cancer big data is multi-scale and requires agents to analyze.  
:Our biomedical knowledge about cancer pathways is derived piecemeal from small-scale experiments in studies of molecular biology. A single study may consist of as many as thirty individual experiments to test small aspects of the underlying hypothesis of a paper. Typically 'pathway databases' interpret and summarize this information as model constructs that describe signaling events with little or no reference to the originating data that acts as evidence for the pathway constructs. As a part of the RUBICON project within the 'Big Mechanisms' DARPA program, I describe our work to model of the underlying design of molecular biology experiments as part of a text mining approach under development across the Big Mechanisms program called 'reading against a model'. This application serves as a valuable method within which OBI terminology and structure is providing essential structure and support.
:demonstrate use of OBI in annotation and data discovery. Address alternatives to OBI and pros and cons.


'''Jingshan Huang: The [http://bioportal.bioontology.org/ontologies/OMIT OMIT ontology]'''
16:30 Close of Day 1


== Session 3: Cancer big data and the Ontology of Disease: Addressing Cancer Big Data Challenges ==
== Session 3: Cancer big data and the Ontology of Disease: Addressing Cancer Big Data Challenges ==


Wednesday, May 13 in Balcony A, Natcher, 9:00-12:00
Wednesday, May 13 in Balcony A, Natcher


<u>'''9:00 Roles of Ontologies in Cancer Big Data'''</u>  
<u>'''Roles of Ontologies in Cancer Big Data'''</u>  


'''Larry Wright (NCI Enterprise Vocabulary Services):''' NCI Thesaurus and Enterprise Vocabulary Services: Some Paths and Lessons in Coding for Cancer Research
9:00 '''Larry Wright (NCI Enterprise Vocabulary Services):''' NCI Thesaurus and Enterprise Vocabulary Services: Resources for Cancer Research [http://ncor.buffalo.edu/2015/NCI/Wright.pptx Slides]


'''Lynn Schriml (University of Maryland Baltimore):''' The Human Disease Ontology: DO_cancer_slim - a unified representation of cancer disease terms
9:15 '''Lynn Schriml (University of Maryland Baltimore):''' The Human Disease Ontology: DO_cancer_slim - a unified representation of cancer disease terms [http://ncor.buffalo.edu/2015/NCI/Schriml.pptx Slides]


'''Ada Hamosh (Hopkins):''' OMIM
9:30 '''Ada Hamosh (Hopkins):''' OMIM [http://ncor.buffalo.edu/2015/NCI/Hamosh.pptx Slides]


'''Olivier Bodenreider (NLM):''' Oncology in SNOMED CT
9:45 '''Olivier Bodenreider (NLM):''' Oncology in SNOMED CT [http://ncor.buffalo.edu/2015/NCI/Bodenreider.pptx Slides]


<u>10:00 Break</u>
10:00 Break


'''<u>10:15 The Role of Ontology in Cancer Big Data Use Cases</u>'''
'''<u>The Role of Ontology in Cancer Big Data Use Cases</u>'''
   
   
'''Lindsay Cowell (UT Southwestern):''' HPV and cervical cancer data in Electronic Health Records -- A Big Data challenge
10:15 '''Lindsay Cowell (UT Southwestern):''' HPV and cervical cancer data in Electronic Health Records -- A Big Data challenge [http://ncor.buffalo.edu/2015/NCI/Cowell.pptx Slides]
:what types of data do we need to represent? HPV and cervical cancer and the Infectious Disease Ontology; challenges involved in keeping and using large collections of samples and of sample data
:what types of data do we need to represent? HPV and cervical cancer and the Infectious Disease Ontology; challenges involved in keeping and using large collections of samples and of sample data


'''Raja Mazumder (George Washington University):'''  The need for cancer disease ontology for pan-cancer data integration and analysis.
10:30 '''Raja Mazumder (George Washington University):'''  The need for cancer disease ontology for pan-cancer data integration and analysis. [http://ncor.buffalo.edu/2015/NCI/Maxumder.pptx Slides]
 
'''Susan Mockus (The Jackson Laboratory, Genomic Medicine):'''  Using the Disease Ontology to Translate Pathology Reports to NGS Clinical Reports.


'''Peter Elkin (University of Buffalo):'''  Ontology-based cancer biomarker discovery
10:45 '''Susan Mockus (The Jackson Laboratory, Genomic Medicine):'''  Using the Disease Ontology to Translate Pathology Reports to NGS Clinical Reports [http://ncor.buffalo.edu/2015/NCI/Mockus.pptx Slides]


11:00 '''Peter Elkin (University at Buffalo):'''  Ontology-based cancer biomarker discovery [http://ncor.buffalo.edu/2015/NCI/Elkin.pptx Slides]


<u>'''11:15 Discussion and Session Wrap Up'''</u>
<u>'''11:15 Discussion and Session Wrap Up'''</u>
Line 107: Line 89:
:Challenges, Needs, Action Item Solutions, short term steps, long term goals  
:Challenges, Needs, Action Item Solutions, short term steps, long term goals  


<u>12:00  Lunch</u>
12:00  Lunch


==Public Session: Cancer Big Data to Knowledge==
==Public Session: Cancer Big Data to Knowledge==


13:00-15:30
13:00 '''Mark Musen (Stanford / NCBO)''': [http://metadatacenter.org CEDAR]: Making it Easier to Use Ontologies to Author Clinical Metadata [http://ncor.buffalo.edu/2015/NCI/Musen.pdf Slides]


'''Barry Smith''' (Chair)
13:40 '''Judith Blake (GO / Jackson Lab)''': The Impact of Ontologies on Comparative Genomics for Cancer: The  Human-Mouse Connection [http://ncor.buffalo.edu/2015/NCI/Blake.pptx Slides]


'''Philip E. Bourne (NIH / ADDS)''': The NIH Big Data Strategy
14:20 '''Cathy Wu (University of Delaware / PRO)''': Ontology and the [http://www.nejm.org/doi/full/10.1056/NEJMp1500523#t=article Precision Medicine Initiative]: The Role of the Protein Ontology in Cancer Knowledge Network Discovery [http://ncor.buffalo.edu/2015/NCI/Wu.pptx Slides]


'''Cathy Wu (University of Delaware / PRO)''': Ontology and the [http://www.nejm.org/doi/full/10.1056/NEJMp1500523#t=article Precision Medicine Initiative]: The Role of OBO Foundry Ontologies in Protein-Centric Cancer Knowledge Network Discovery
14:50 '''Sherri de Coronado (NIH / NCI)''': The importance of Semantics in NCI Efforts to Achieve Precision Oncology [http://ncor.buffalo.edu/2015/NCI/de-Coronado.ppt Slides]
 
'''Mark Musen (Stanford / NCBO)''': [http://metadatacenter.org CEDAR]:  Making it Easier to Use Ontologies to Author Clinical Metadata
 
'''Judith Blake (GO / Jackson Lab)''': The Impact of Ontologies on Comparative Genomics for Cancer: The  Human:Mouse Connection
 
'''Warren Kibbe (NIH / NCI)''': TBD


== Sponsors ==
== Sponsors ==
Line 130: Line 106:
*National Center for Biomedical Ontology ([http://www.bioontology.org/ NCBO])
*National Center for Biomedical Ontology ([http://www.bioontology.org/ NCBO])
*National Center for Ontological Research ([http://ncor.buffalo.edu NCOR])
*National Center for Ontological Research ([http://ncor.buffalo.edu NCOR])
*Center for Expanded Data Annotation and Retrieval ([[http://bd2k.nih.gov/FY14/COE/Musen.pdf CEDAR])
*Center for Expanded Data Annotation and Retrieval ([http://bd2k.nih.gov/FY14/COE/Musen.pdf CEDAR])


----
----
Line 136: Line 112:
== Participants ==
== Participants ==


will include:
*Durga Addepalli (NIH / NCI)
 
*Praveen Arany (NIH / NIDCR)
*Nancy Beck (Reagan-Udall Foundation)
*Judith Blake (GO / PRO / The Jackson Laboratory)
*Judith Blake (GO / PRO / The Jackson Laboratory)
*Olivier Bodenreider (NIH / NLM)
*Evan Bolton (NIH / NLM / NCBI)
*Evan Bolton (NIH / NLM / NCBI)
*Jonathan Bona (PRO / Buffalo)
*Jonathan Bona (PRO / Buffalo)
*Philip E. Bourne (NIH / ADDS)
*William Bone (NIH/NHGRI)  
*Gully Burns (Information Sciences Institute, University of Southern California)
*Gully Burns (Information Sciences Institute, University of Southern California)
*Kisha Coa (NIH/NCI)  
*Kisha Coa (NIH/NCI)  
*Sherri de Coronado (National Cancer Institute)
*Sherri de Coronado (National Cancer Institute)
*Lindsay Cowell (UT Southwestern Medical Center)
*Lindsay Cowell (UT Southwestern Medical Center)
*Rina Das ((NIH/NIMHD)
*Valentina di Francesco (NIH/NHGRI)
*Valentina di Francesco (NIH/NHGRI)
*Rao L. Divi (NCI Division of Cancer Control and Population Sciences)
*Rao L. Divi (NCI Division of Cancer Control and Population Sciences)
*Mary E. Dolan (The Jackson Laboratory)
*Mary E. Dolan (The Jackson Laboratory)
*Peter Elkin (Department of Biomedical Informatics, University at Buffalo)
*Peter Elkin (Department of Biomedical Informatics, University at Buffalo)
*Luis Espinoza (NIH/CSR)
*Jianwen Fang (NIH/NCI)
*Jianwen Fang (NIH/NCI)
*Gilberto Fragoso (National Cancer Institute)
*Gilberto Fragoso (National Cancer Institute)
Line 156: Line 136:
*Sharmistha Ghosh-Janjigian (NIH/NCI)
*Sharmistha Ghosh-Janjigian (NIH/NCI)
*Metin Gurcan (College of Medicine, Ohio State University)
*Metin Gurcan (College of Medicine, Ohio State University)
*Ada Hamosh (OMIM / Johns Hopkins University)
*Lori Henderson (NIH/NCI)
*Jingshan Huang (University of South Alabama / ITCR)
*Jingshan Huang (University of South Alabama / ITCR)
*Rebecca Jacobson (University of Pittsburgh / ITCR)
*Rebecca Jacobson (University of Pittsburgh / ITCR)
*Sonia B Jakowlew (BPRB / NCI)
*Guoqian Jiang (Mayo Clinic)
*Guoqian Jiang (Mayo Clinic)
*Warren Kibbe (NIH / NCI / Disease Ontology)
*Warren Kibbe (NIH / NCI / Disease Ontology)
*Christopher ​Kinsinger (NIH/NCI)
*Christopher ​Kinsinger (NIH / NCI)
*Prasad Konka (NIH/NCI)  
*Prasad Konka (NIH / NCI)
*Raj Krishnaraju (NIH / CSR)
*Leandro Hermida (NIH/NCI)
*Jerry Li (NIH / NCI)
*Jerry Li (NIH / NCI)
*Lu, Le (NIH / CC /DRD)
*Zhengwu Lu (NIH / NCI)
*Diana Ma (Hippocampus Analytics)
*Diana Ma (Hippocampus Analytics)
*Hala R. Makhlouf (Cancer Diagnosis Program / NCI)
*Hala R. Makhlouf (Cancer Diagnosis Program / NCI)
*Cheryl Marks (NIH/NCI) 
*Anna Maria Masci (Duke)
*Raja Mazumder (Georgetown University / Protein Information Resource)
*Raja Mazumder (Georgetown University / Protein Information Resource)
*Elvira Mitraka (University of Maryland, Baltimore)
*Elvira Mitraka (University of Maryland, Baltimore)
Line 170: Line 159:
*Mark Musen (Stanford / National Center for Biomedical Ontology and Center for Expanded Data Annotation and Retrieval)
*Mark Musen (Stanford / National Center for Biomedical Ontology and Center for Expanded Data Annotation and Retrieval)
*Darren Natale (Georgetown University / Protein Ontology Consortium)
*Darren Natale (Georgetown University / Protein Ontology Consortium)
*Miguel Ossandon (NCI/NIH)
*Natsuko ​Miura (NIH / NCI)
*Lauren Neal (Booz Allen Hamilton)
*April Oh (NIH/NCI)
*Miguel Ossandon (NCI / NIH)
*Lisa Paradis (NIH / NCI)
*​Praveen Arany (NIH / NIDCR)
*Thomas Prince (NIH / NCI)
*Dani Przychodzin (CSC / NIH)
*Thomas C. Radman (National Institute on Drug Abuse/NIH)
*Thomas C. Radman (National Institute on Drug Abuse/NIH)
*Lyubov Remennik (NIH/CC/BTRIS) 
*Betsy Rolland (NIH/NCI)
*Lynn Schriml (University of Maryland, Baltimore / Disease Ontology)
*Lynn Schriml (University of Maryland, Baltimore / Disease Ontology)
*Payal Shah (C-Change)
*Mukul Sherekar (NIH/NCI)  
*Mukul Sherekar (NIH/NCI)  
*Hoo Chang Shin (NIH/CC/LDRR)
*Barry Smith (Buffalo / Open Biomedical Ontologies Foundry)
*Barry Smith (Buffalo / Open Biomedical Ontologies Foundry)
*Sriram Sridhar (Booz Allen Hamilton)
*Chris Stoeckert (University of Pennsylvania)c
*Shumei Sun (Virginia Commonwealth University)
*Svetlana Radaeva (NIH/NIAAA)
*Feng Tao (NIH/CSR)
*Elizabeth Thompson (President and CEO, C-Change)
*John Tomaszewski (Pathology and Anatomical Sciences, Buffalo)
*John Tomaszewski (Pathology and Anatomical Sciences, Buffalo)
*Eric Weitz (NIH/NCI)
*Eric Weitz (NIH/NCBI)
*Larry Wright (NIH/NCI)
*Larry Wright (NIH/NCI)
*Tsung-Jung Wu (NIH/NCI)
*Tsung-Jung Wu (NIH/NCI)
*Cathy Wu (Delaware / Protein Ontology)
*Cathy Wu (Delaware / Protein Ontology)
*Tingfen Yan (NIH/NIMHD)
*Hong Yu (University of Massachusetts, Worcester)
*Hong Yu (University of Massachusetts, Worcester)
*Wenjin J. Zheng (Center for Computational Biomedicine, University of Texas Health Science Center at Houston)
*Shadia Zaman (FDA)

Latest revision as of 15:37, 24 May 2015

Date: May 12-13, 2015

Venue

Session 1: Addressing cancer big data challenges through imaging ontologies

Tuesday, May 12 in FISHERS LANE CONFERENCE CENTER, 5635 Fisher Lane, Rockville, MD 20852

10:00 Warren Kibbe (NCI): Introductory Remarks

10:10 Barry Smith (Buffalo): The cancer research ontology space: An introduction Slides

The goal of the meeting: to better understand the challenges involved in using big data for cancer research, and to explore the utility of ontologies in addressing these challenges.
An introduction to existing ontology resources in the cancer domain, including NCI Thesaurus and the OBO Foundry; addressing opportunities and reasons for scepticism as concerns the use of ontologies in addressing cancer big data; topics to address will include the ontology of tissues, of tissue banks, and of image banks
What are the desired outcomes from this meeting?

10:50 Ilya Goldberg (NIA): The role of imaging ontologies in cancer big data Slides

As image data accumulates descriptive metadata using controlled vocabularies (ontologies), two key applications form a key "Big Data" technology that will impact cancer research and medicine as a whole:
1. semantic search, which allows image retrieval independently of image content, for example, retrieval of images spanning different imaging modalities and scales.
2. automated annotation, which addresses the scaling problem in associating metadata with ever larger image collections by using ontological terms as categories and the images carrying them as sample data to train machine classifiers.

11:30 Break 11:45 John Tomaszewski (University at Buffalo) and Metin Gurcan (Ohio State): ​How ontologies can help in addressing the big data challenges of pathology imaging: perspectives from a collaborating pathologist and image scientist Tomaszewski Slides Gurcan Slides

Widespread availability of whole slide scanners has a transformative effect on pathology, resulting in data that is "big” not only in terms of size but also in richness of information. Computational algorithms have been developed to tap into this data both to assist pathologists and to improve diagnosis, prognosis and treatment. Now, however, there is an urgent need to provide a set of terms and formal definitions necessary to characterize both the histopathological images and the algorithms that operate on them. We will present in this light our on-going work to create an ontology for histopathological imaging.
Goals:

1. Describe the transformative role of digital pathology

2. Explain the big data challenges in clinical research and computational algorithms

3. Outline how ontologies can provide solutions to address the big data challenges of pathology imaging

13:00 Lunch

Session 2: Addressing cancer big data challenges by capturing metadata for data integration and analysis through ontologies

13:45 Christopher R. Kinsinger (NCI): Ontology Considerations for Cancer Proteomics Slides

NCI’s Clinical Proteomic Tumor Analysis Consortium (CPTAC) conducts proteomic and genomic analysis on clinical tissue samples. While CPTAC has utilized controlled vocabularies for biospecimen resources, development and adoption of ontology tools for rapidly advancing technologies such as proteomics and genomics remains a challenge.

14:15 Anna Maria Masci (Duke): Understanding the tumor microenvironment: integration of multiscale cancer data Slides

Tumor biology can be fully understood by considering the tumor as part of an ecosystem that includes tumor cells, stromal cells, matrix, fluids and gas. Real Time PCR, Tissue Micro Arrays, Plasma data and ontology are approaches used to describe the complex tumor ecology.

14:45 Jingshan Huang (South Alabama): The OMIT ontology Slides

The purpose of constructing OMIT is to establish data exchange standards and common data elements in miR and microgenomics domain. Cell biologists and bioinformaticians can make use of OMIT to leverage emerging semantic technologies in knowledge acquisition and discovery for more effective identification of important roles performed by miRs in human diseases including cancer.

15:15 Break

15:30 Chris Stoeckert (Penn): Integration and alignment of ontologies for cancer metadata collection based on the Ontology for Biomedical Investigations (OBI) Slides

Cancer research is multidisciplinary and requires standard terminology from multiple domains. OBI and interoperable OBO Foundry ontologies can be used for collecting and integrating clinical and -omic metadata relevant to cancer data. The Ontology for Biobanking is an example based on OBI.

16:00 Gully Burns (ISI/ USC): Leveraging OBI to cancer pathways via Knowledge Engineering from Experimental Design (KEfED) Slides

Our biomedical knowledge about cancer pathways is derived piecemeal from small-scale experiments in studies of molecular biology. A single study may consist of as many as thirty individual experiments to test small aspects of the underlying hypothesis of a paper. Typically 'pathway databases' interpret and summarize this information as model constructs that describe signaling events with little or no reference to the originating data that acts as evidence for the pathway constructs. As a part of the RUBICON project within the 'Big Mechanisms' DARPA program, I describe our work to model of the underlying design of molecular biology experiments as part of a text mining approach under development across the Big Mechanisms program called 'reading against a model'. This application serves as a valuable method within which OBI terminology and structure is providing essential structure and support.

16:30 Close of Day 1

Session 3: Cancer big data and the Ontology of Disease: Addressing Cancer Big Data Challenges

Wednesday, May 13 in Balcony A, Natcher

Roles of Ontologies in Cancer Big Data

9:00 Larry Wright (NCI Enterprise Vocabulary Services): NCI Thesaurus and Enterprise Vocabulary Services: Resources for Cancer Research Slides

9:15 Lynn Schriml (University of Maryland Baltimore): The Human Disease Ontology: DO_cancer_slim - a unified representation of cancer disease terms Slides

9:30 Ada Hamosh (Hopkins): OMIM Slides

9:45 Olivier Bodenreider (NLM): Oncology in SNOMED CT Slides

10:00 Break

The Role of Ontology in Cancer Big Data Use Cases

10:15 Lindsay Cowell (UT Southwestern): HPV and cervical cancer data in Electronic Health Records -- A Big Data challenge Slides

what types of data do we need to represent? HPV and cervical cancer and the Infectious Disease Ontology; challenges involved in keeping and using large collections of samples and of sample data

10:30 Raja Mazumder (George Washington University): The need for cancer disease ontology for pan-cancer data integration and analysis. Slides

10:45 Susan Mockus (The Jackson Laboratory, Genomic Medicine): Using the Disease Ontology to Translate Pathology Reports to NGS Clinical Reports Slides

11:00 Peter Elkin (University at Buffalo): Ontology-based cancer biomarker discovery Slides

11:15 Discussion and Session Wrap Up

Ontology/vocabulary challenges, Use case challenges
Challenges, Needs, Action Item Solutions, short term steps, long term goals

12:00 Lunch

Public Session: Cancer Big Data to Knowledge

13:00 Mark Musen (Stanford / NCBO): CEDAR: Making it Easier to Use Ontologies to Author Clinical Metadata Slides

13:40 Judith Blake (GO / Jackson Lab): The Impact of Ontologies on Comparative Genomics for Cancer: The Human-Mouse Connection Slides

14:20 Cathy Wu (University of Delaware / PRO): Ontology and the Precision Medicine Initiative: The Role of the Protein Ontology in Cancer Knowledge Network Discovery Slides

14:50 Sherri de Coronado (NIH / NCI): The importance of Semantics in NCI Efforts to Achieve Precision Oncology Slides

Sponsors

  • National Cancer Institute Center for Biomedical Informatics and Information Technology (CBIIT)
  • National Center for Biomedical Ontology (NCBO)
  • National Center for Ontological Research (NCOR)
  • Center for Expanded Data Annotation and Retrieval (CEDAR)

Participants

  • Durga Addepalli (NIH / NCI)
  • Praveen Arany (NIH / NIDCR)
  • Nancy Beck (Reagan-Udall Foundation)
  • Judith Blake (GO / PRO / The Jackson Laboratory)
  • Olivier Bodenreider (NIH / NLM)
  • Evan Bolton (NIH / NLM / NCBI)
  • Jonathan Bona (PRO / Buffalo)
  • William Bone (NIH/NHGRI)
  • Gully Burns (Information Sciences Institute, University of Southern California)
  • Kisha Coa (NIH/NCI)
  • Sherri de Coronado (National Cancer Institute)
  • Lindsay Cowell (UT Southwestern Medical Center)
  • Rina Das ((NIH/NIMHD)
  • Valentina di Francesco (NIH/NHGRI)
  • Rao L. Divi (NCI Division of Cancer Control and Population Sciences)
  • Mary E. Dolan (The Jackson Laboratory)
  • Peter Elkin (Department of Biomedical Informatics, University at Buffalo)
  • Luis Espinoza (NIH/CSR)
  • Jianwen Fang (NIH/NCI)
  • Gilberto Fragoso (National Cancer Institute)
  • Gang Fu (NIH / NLM / NCBI)
  • Ilya Goldberg (Image Informatics and Computational Biology Unit, National Institute on Aging)
  • Sharmistha Ghosh-Janjigian (NIH/NCI)
  • Metin Gurcan (College of Medicine, Ohio State University)
  • Ada Hamosh (OMIM / Johns Hopkins University)
  • Lori Henderson (NIH/NCI)
  • Jingshan Huang (University of South Alabama / ITCR)
  • Rebecca Jacobson (University of Pittsburgh / ITCR)
  • Sonia B Jakowlew (BPRB / NCI)
  • Guoqian Jiang (Mayo Clinic)
  • Warren Kibbe (NIH / NCI / Disease Ontology)
  • Christopher ​Kinsinger (NIH / NCI)
  • Prasad Konka (NIH / NCI)
  • Raj Krishnaraju (NIH / CSR)
  • Leandro Hermida (NIH/NCI)
  • Jerry Li (NIH / NCI)
  • Lu, Le (NIH / CC /DRD)
  • Zhengwu Lu (NIH / NCI)
  • Diana Ma (Hippocampus Analytics)
  • Hala R. Makhlouf (Cancer Diagnosis Program / NCI)
  • Cheryl Marks (NIH/NCI)
  • Anna Maria Masci (Duke)
  • Raja Mazumder (Georgetown University / Protein Information Resource)
  • Elvira Mitraka (University of Maryland, Baltimore)
  • Susan Mockus (Jackson Laboratory for Genomic Medicine, Farmington, CT)
  • Mark Musen (Stanford / National Center for Biomedical Ontology and Center for Expanded Data Annotation and Retrieval)
  • Darren Natale (Georgetown University / Protein Ontology Consortium)
  • Natsuko ​Miura (NIH / NCI)
  • Lauren Neal (Booz Allen Hamilton)
  • April Oh (NIH/NCI)
  • Miguel Ossandon (NCI / NIH)
  • Lisa Paradis (NIH / NCI)
  • ​Praveen Arany (NIH / NIDCR)
  • Thomas Prince (NIH / NCI)
  • Dani Przychodzin (CSC / NIH)
  • Thomas C. Radman (National Institute on Drug Abuse/NIH)
  • Lyubov Remennik (NIH/CC/BTRIS)
  • Betsy Rolland (NIH/NCI)
  • Lynn Schriml (University of Maryland, Baltimore / Disease Ontology)
  • Payal Shah (C-Change)
  • Mukul Sherekar (NIH/NCI)
  • Hoo Chang Shin (NIH/CC/LDRR)
  • Barry Smith (Buffalo / Open Biomedical Ontologies Foundry)
  • Sriram Sridhar (Booz Allen Hamilton)
  • Chris Stoeckert (University of Pennsylvania)c
  • Shumei Sun (Virginia Commonwealth University)
  • Svetlana Radaeva (NIH/NIAAA)
  • Feng Tao (NIH/CSR)
  • Elizabeth Thompson (President and CEO, C-Change)
  • John Tomaszewski (Pathology and Anatomical Sciences, Buffalo)
  • Eric Weitz (NIH/NCBI)
  • Larry Wright (NIH/NCI)
  • Tsung-Jung Wu (NIH/NCI)
  • Cathy Wu (Delaware / Protein Ontology)
  • Tingfen Yan (NIH/NIMHD)
  • Hong Yu (University of Massachusetts, Worcester)
  • Shadia Zaman (FDA)