Skip to content

BIKRAMADITTYA/COVID-Clinical-Trials-EDA-Dashboard

Repository files navigation

COVID-19 Clinical Trials: EDA & Power BI Dashboard


Project Overview

This project provides an in-depth Exploratory Data Analysis (EDA) of global COVID-19 clinical trials data, complemented by an interactive Power BI dashboard. The primary goal is to uncover key trends, patterns, and insights related to trial statuses, phases, start dates, age demographics, and intervention types.


Key Features & Analysis

  • Comprehensive Data Cleaning & Preprocessing: Handled missing values, converted data types, and addressed duplicates to ensure data quality.
  • Temporal Analysis of Trial Starts: Visualized the evolution of trial initiations over time, highlighting periods of high research activity during the pandemic.
  • Trial Status Distribution: Explored the breakdown of trials by their current status (e.g., completed, recruiting, terminated), providing an overview of the research landscape.
  • Clinical Phase Breakdown: Analyzed the distribution of trials across different clinical phases (e.g., Phase 1, Phase 2, Phase 3), showing the progression of research.
  • Age Group Demographics: Investigated the most common age groups targeted in these trials.
  • Relationship Analysis: Examined the interplay between trial statuses and phases, and explored common outcome measures associated with various conditions.
  • Interactive Power BI Dashboard: A dynamic dashboard built to allow users to interactively explore the cleaned data, filter by various parameters, and drill down into specific insights.

Tools & Technologies

  • Python:
    • pandas: For robust data manipulation and analysis.
    • matplotlib.pyplot: For static data visualizations.
    • seaborn: For enhanced statistical data visualizations.
  • Power BI: For creating a comprehensive, interactive, and shareable data dashboard.

Data Source

The dataset used for this analysis is COVID clinical trials.csv, containing information on clinical trials related to COVID-19.

Project Structure


COVID-19 Clinical Trials: EDA & Power BI Dashboard/
├── data/
│   └── COVID clinical trials.csv
├── notebooks/
│   └── covid 19 eda.py
├── power_bi_dashboard/
│   └── Clinical Trials Overview Dashboard.pbix
├── output/
│   ├── final_cleaned_trials.csv
│   ├── conditions_outcomes_summary.csv
│   ├── status_vs_phases.csv
│   └── trial_counts_over_time.csv
├── images/
│   └── [screenshots key plots]


Power BI Dashboard

Explore the data interactively through the Power BI dashboard:


Visualizations/Screenshots

Distribution of Phases: Distribution of Phases

Bar chart showing the distribution of clinical trials across different phases.


Trials Started Over Time: Trial Started over Time

Bar chart showing the number of trials started each month, revealing trends in research activity.


Age Group Distribution: Age Group Distribution

A visualization illustrating the distribution of participants across different age groups in the clinical trials.


Status of Clinical Trial: Status of clinical Trial

A chart showing the overall status of clinical trials (e.g., Recruiting, Completed, Terminated).


Missing Value Count per Column: missing Values count per Column

A visual representation of the count of missing values for each column in the dataset, indicating data completeness.


Status vs. Phases of Clinical Trials: Status vs Phases of Clinical Trial

A plot demonstrating the relationship and distribution of trials across different statuses and clinical phases.*


How to Run the Project (Python EDA)

How replicate the Python EDA and generate the output files:

  1. Clone the repository:

    git clone [https://github.com/](https://github.com/)[YourGitHubUsername]/COVID-19-Clinical-Trials-EDA-Dashboard.git
    cd COVID-19-Clinical-Trials-EDA-Dashboard
  2. Create a virtual environment (recommended):

    python -m venv venv
    # On Windows:
    .\venv\Scripts\activate
    # On macOS/Linux:
    source venv/bin/activate
  3. Install the required Python packages:

    pip install -r [requirements.txt](https://github.com/BIKRAMADITTYA/COVID-Clinical-Trials-EDA-Dashboard/blob/main/requirements.txt)
  4. Run the analysis script:

    python [covid 19 eda.py](https://github.com/BIKRAMADITTYA/COVID-Clinical-Trials-EDA-Dashboard/blob/main/notebooks/covid%2019%20eda.py)
    

How to View the Power BI Dashboard

  1. Ensure you have Power BI Desktop installed on your system.
  2. Download the Clinical Trial Overview Dashboard.pbix file directly from the root of this repository.
  3. Open the downloaded .pbix file using Power BI Desktop.

Contact

Feel free to connect with me if you have any questions, feedback, or would like to discuss this project further!


About

This project delves into COVID-19 clinical trial data through an extensive Exploratory Data Analysis (EDA) in Python. It uncovers trends in trial statuses, phases, start dates, and more, complemented by an interactive Power BI dashboard for deeper insights.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages