Data Science for HCPs

What Is Data Science: Addressing the Uncertainty Behind Addressing Uncertainty

Victor Gehman, PhD; Michael Becich, MS; Amal Katrib, PhD; Fatima Rubio da Costa, PhD; David Whewell; David Hughes, BSN

03/14/2020

What is Data Science? To answer this question, we will compare two games with which nearly all of us are familiar: chess and poker. Both games are extremely complex, have numerous strategies for players to employ and defend against, and require careful planning and anticipation. The fundamental difference between these games is the fact that in poker, most of the information about the state of the game and how the other players are engaging is hidden. Chess, on the other hand, is played entirely in the open. In chess, the information is all there where anyone can see it, but it may not be clear to all involved.

At its core, data science is the process of opening information up and showing the previously unknown state. In the terms of our analogy, it is turning poker into chess. Done well, data science not only reveals hidden information, but also takes the extra step to equip players with knowledge they can apply to current and future games. The transformation from a concealed, unpredictable game to one that is open and informed is carried out via the scientific method. In practice, this process cannot discover all of the hidden states, becoming an exercise of risk mitigation.

So why is defining "data science" important to us? We believe the term is often used broadly to encompass concepts, including machine learning, database mining, and business intelligence. While data science can touch all of these, it is important to first describe the unmet need that data science addresses. Having a shared understanding of what data science is and is not will help every contributor start from a common foundation to solving complex health care challenges.

The “Science” in Data Science

Data science, with its scientific milieu, inherits the same set of principles that revolutionized the sprawling intellectual and philosophical movement of the 18th century Enlightenment: the scientific method. Scientists of the modern world, now bequeathed with a treasure trove of data and advanced computational tools, are able to harness this newfound capability to support teams and stakeholders in addressing a wide range of complex problems. Staying true to the method, they first establish clear research project goals and accordingly identify hypotheses and construct strategic plans of action. Then, they leverage their domain expertise to meticulously gather and prepare the necessary data. They undergo an iterative process of formulating, testing, and refining hypotheses, using experimental observations to excavate insights along the way. This rich knowledge base can then be harnessed to develop theoretical models that are able to reliably and efficiently resolve real-world problems when deployed to production. This is what we believe data science is, or should look like, behind the scenes.

Modeling Information

Much of data science involves the construction of models that explain and predict phenomena we see in the world. A model is a mathematical way of formalizing our current understanding of a problem in a way that approximates reality. It typically describes patterns and trends in your data. The output of these models should guide intuition and facilitate decision-making or provide clinical or business intelligence.

Data Science in Practice

The connection between these three main topics shown in Figure 1 allows a data scientist to build stories based on what is revealed in the data, guiding them to decisions. It is, of course, quite difficult for any individual to be an expert in all three of these disciplines; it often takes interdisciplinary teamwork to be successful with complex data science challenges.

Figure 1. Data science is often represented as a Venn Diagram of three interlocking circles: computer science, math and statistics, and domain expertise. As shown in the intersections, software, data-driven research, machine learning/artificial intelligence (ML/AI), and cross-discipline communication skills are often necessary for the job at hand.

Data science teams have accelerated a variety of efforts around the world in research and industry. Some examples in health care include:

Personalizing treatments and diagnosis with precision medicine, like predicting outcomes for treatment in breast cancer
Automated diagnosis of serious medical conditions, like irregular heart rhythms
Novel drug discovery, including new antibiotics
Value-Based health care: Improving patient outcomes in community oncology practices

Next Blog Post Topic

The foundation of every data science effort is data. How data is acquired and identifying some of the challenges in the acquisition of data is the focus of our next blog post.

About David Hughes

David David Hughes is the Principal Machine Learning Data Engineer for Octave Bioscience. He develops cloud-based architectures and solutions for surfacing clinical intelligence from complex medical data. He leverages his interest in graph based data and population analytics to support data science efforts. David is using his experience leading clinical pathways initiatives in oncology to facilitate stakeholder engagement in the development of pathways in neurodegenerative diseases. With Octave, he is building a data driven platform for improving patient experience, mitigating cost, and advancing health care delivery for patients and families.

About Octave Bioscience

Octave The challenges for MS are significant, the issues are overwhelming, and the needs are mostly unmet. That is why Octave is creating a comprehensive, measurement driven Care Management Platform for MS. Our team is developing novel measurement tools that feed into structured analytical data models to improve patient management decisions, create better outcomes and lower costs. We are focused on neurodegenerative diseases starting with MS.

Current Issue

April 2025

Volume 11

Issue 2

Current Issue

Issue Archive

Special Reports

Special Report

Predictable Cost of Care Model for Treatment Decisions: Working Group Consensus Statements for Metastatic Non-Small Cell Lung Cancer

02/19/2025

Edward Arrowsmith, MD; Vishnukamal Golla, MD, MPH; Rhonda Henschel, MBA; Andrew Hertler, MD, FACP; David Jackman, MD; Gordon Kuntz; Olaf Lodbrok, MS, MBA; Carole Tremonti, RN, MBA; Lalan Wilfong, MD

The Predictable Cost of Care Working Group developed a model that can be used by various entities evaluating the impact of treatment on the total cost of care.

The Predictable Cost of Care...

02/19/2025

Journal of Clinical Pathways

Sponsored

Special Report

2024 NCCN Clinical Practice Guidelines in Oncology (NCCN Guidelines®) Update: Impact on NSCLC Landscape

11/15/2024

The updated 2024 NCCN Guidelines® recommend broad genomic testing and the need for multidisciplinary care to accurately diagnose and treat non-small cell lung cancer. View this special report to learn more.

The updated 2024 NCCN...

11/15/2024

Journal of Clinical Pathways

Sponsored

JCP Special Report

The Value of Tissue-Based Genomic Profiling in Oncology

09/17/2024

Innovations in precision oncology have helped healthcare providers to create more personalized treatment plans and improve patient outcomes. View this special report to learn more.

Innovations in precision...

09/17/2024

Journal of Clinical Pathways

Sponsored

Special Report

Shedding Light on Non-Small Cell Lung Cancer & Its Impact on Patients

04/03/2023

This supplement aims to raise awareness about non-small cell lung cancer (NSCLC) and its impact on patients by providing comprehensive information to help improve early detection and appropriate care.

This supplement aims to raise...

04/03/2023

Journal of Clinical Pathways

Sponsored

JCP Special Report

Brukinsa® (Zanubrutinib) for Chronic Lymphocytic Leukemia

03/31/2023

In this product monograph, read an interview with Jeff P. Sharman, MD, as he discusses important BRUKINSA® trial data including efficacy, safety, dosing, administration, and other relevant data. These key findings supported the Food and Drug...

In this product monograph, read...

03/31/2023

Journal of Clinical Pathways

Sponsored

JCP Special Report

Tumor Lysis Syndrome: Early Diagnosis and Management

01/04/2023

This review summarizes the diagnosis, pathophysiology, and evidence-based guidelines for the prevention and management of tumor lysis syndrome, a common, acute, life-threatening disease primarily in patients with hematologic cancers and solid...

This review summarizes the...

01/04/2023

Journal of Clinical Pathways

Sponsored

JCP Special Report

A Tumor Lysis Syndrome Risk Assessment and Its Impact on Patients

08/09/2022

In an interview with Journal of Clinical Pathways, Nicholas Short, MD, shares objectives on the design and benefits of MD Anderson’s Tumor Lysis Syndrome clinical assessment for patient risk and impact on patient care.

In an interview with Journal of...

08/09/2022

Journal of Clinical Pathways

Updated NCCN Guidelines on B-Cell Lymphomas

Sponsored

JCP Special Report

Overview of the Updated NCCN Guidelines on B-Cell Lymphomas

06/15/2022

Robert Fee

Updated multiple times in 2022, the National Comprehensive Cancer Network (NCCN) Guidelines for B-Cell Lymphomas provide recommendations for the prevention, diagnosis, and management of malignancies.

Updated multiple times in 2022,...

06/15/2022

Journal of Clinical Pathways

Sponsored

Special Report

Expanded Indication for Zanubrutinib: Marginal Zone Lymphoma and Waldenström’s Macroglobulinemia

06/10/2022

In an interview with Journal of Clinical Pathways, Mitul Gandhi, MD, reviews the clinical impact and treatment approaches and challenges for patients with marginal zone lymphoma and Waldenström’s macroglobulinemia.

In an interview with Journal of...

06/10/2022

Journal of Clinical Pathways

JCP Special Report

Recommendations for Creating an Oncology Clinical Pathways Framework Tool Based on Payer, Provider, and Patient Priorities: Findings From the 2021 Care Pathways Working Group

05/20/2022

Robin T. Zon, MD, FACP, FASCO; Gordon Kuntz; Winston Wong, PharmD

The Journal of Clinical Pathways convened the 2021 Care Pathways Working Group to identify and reconcile the different pathway drivers for each stakeholder now and five years into the future, creating a framework tool based on oncology care...

The Journal of Clinical Pathways...

05/20/2022

Journal of Clinical Pathways

Journal of Clinical Pathways Newsletter

Recent Stories

Videos

Why Pathways Matter: Editorial Advisory Board Testimonials

04/24/2025

Lalan Wilfong, MD

For the past 10 years, Journal of Clinical Pathways has been at the forefront of advancing evidence-based, value-driven care through clinical pathways. As we celebrate our 10th anniversary, we’re highlighting the voices of key thought leaders...

For the past 10 years, Journal...

04/24/2025

Journal of Clinical Pathways

Videos

Confronting Intersectional Stigmas Across the Cancer Care Continuum

04/24/2025

Gretchen McNally, PhD, MPH, discusses how intersectional stigmas—rooted in social biases around race, gender, socioeconomic status, and other identities—negatively affect cancer care across the continuum, influencing treatment access,...

Gretchen McNally, PhD, MPH,...

04/24/2025

Journal of Clinical Pathways

News

EHR-Based Model Accurately Identifies High-Risk Patients for Gastric Cancer

04/24/2025

Lisa Kuhns, PhD, MD

An electronic health record (EHR)-based logistic regression model demonstrated strong performance in identifying individuals at high risk for noncardia gastric cancer (NCGC), offering a potential tool to guide targeted screening in the US,...

An electronic health record...

04/24/2025

Journal of Clinical Pathways

Videos

Strengthening Pathway-Driven Care Through Payer Collaboration

04/23/2025

Gordon Kuntz; Lalan Wilfong, MD

In this episode of Oncology Innovations, Gordon Kuntz and Dr Lalan Wilfong discusses the evolving role of payers and value-based care intermediaries in oncology, emphasizing the importance of whole-person support, coordinated care, and...

In this episode of Oncology...

04/23/2025

Journal of Clinical Pathways

Videos

Evaluating the Performance of Biomarkers from Metastatic vs Primary Sites in Clear Cell Renal Cell Carcinoma

04/23/2025

Steven Monda, MD

Steven Monda, MD, shares his research on how the origin of tumor biopsies can impact biomarker performance and treatment strategies in metastatic clear cell renal cell carcinoma.

Steven Monda, MD, shares his...

04/23/2025

Journal of Clinical Pathways

What Is Data Science: Addressing the Uncertainty Behind Addressing Uncertainty

Current Issue

Special Reports

Subscribe

Recent Stories

Specialties

Events

Year Round Education

HMP Global Products