This project provides an in-depth Exploratory Data Analysis (EDA) of global COVID-19 clinical trials data, complemented by an interactive Power BI dashboard. The primary goal is to uncover key trends, patterns, and insights related to trial statuses, phases, start dates, age demographics, and intervention types.
- Comprehensive Data Cleaning & Preprocessing: Handled missing values, converted data types, and addressed duplicates to ensure data quality.
- Temporal Analysis of Trial Starts: Visualized the evolution of trial initiations over time, highlighting periods of high research activity during the pandemic.
- Trial Status Distribution: Explored the breakdown of trials by their current status (e.g., completed, recruiting, terminated), providing an overview of the research landscape.
- Clinical Phase Breakdown: Analyzed the distribution of trials across different clinical phases (e.g., Phase 1, Phase 2, Phase 3), showing the progression of research.
- Age Group Demographics: Investigated the most common age groups targeted in these trials.
- Relationship Analysis: Examined the interplay between trial statuses and phases, and explored common outcome measures associated with various conditions.
- Interactive Power BI Dashboard: A dynamic dashboard built to allow users to interactively explore the cleaned data, filter by various parameters, and drill down into specific insights.
- Python:
pandas: For robust data manipulation and analysis.matplotlib.pyplot: For static data visualizations.seaborn: For enhanced statistical data visualizations.
- Power BI: For creating a comprehensive, interactive, and shareable data dashboard.
The dataset used for this analysis is COVID clinical trials.csv, containing information on clinical trials related to COVID-19.
- Local Copy: View Dataset File in this Repository
COVID-19 Clinical Trials: EDA & Power BI Dashboard/ ├── data/ │ └── COVID clinical trials.csv ├── notebooks/ │ └── covid 19 eda.py ├── power_bi_dashboard/ │ └── Clinical Trials Overview Dashboard.pbix ├── output/ │ ├── final_cleaned_trials.csv │ ├── conditions_outcomes_summary.csv │ ├── status_vs_phases.csv │ └── trial_counts_over_time.csv ├── images/ │ └── [screenshots key plots]
Explore the data interactively through the Power BI dashboard:
-
Dashboard File: Clinical Trials Overview Dashboard.pbix
-
- 📊An overview screenshot of the interactive Power BI dashboard, showing key metrics and visualizations.*
Bar chart showing the distribution of clinical trials across different phases.
Bar chart showing the number of trials started each month, revealing trends in research activity.
A visualization illustrating the distribution of participants across different age groups in the clinical trials.
A chart showing the overall status of clinical trials (e.g., Recruiting, Completed, Terminated).
Missing Value Count per Column:

A visual representation of the count of missing values for each column in the dataset, indicating data completeness.
Status vs. Phases of Clinical Trials:

A plot demonstrating the relationship and distribution of trials across different statuses and clinical phases.*
How replicate the Python EDA and generate the output files:
-
Clone the repository:
git clone [https://github.com/](https://github.com/)[YourGitHubUsername]/COVID-19-Clinical-Trials-EDA-Dashboard.git cd COVID-19-Clinical-Trials-EDA-Dashboard -
Create a virtual environment (recommended):
python -m venv venv # On Windows: .\venv\Scripts\activate # On macOS/Linux: source venv/bin/activate
-
Install the required Python packages:
pip install -r [requirements.txt](https://github.com/BIKRAMADITTYA/COVID-Clinical-Trials-EDA-Dashboard/blob/main/requirements.txt)
-
Run the analysis script:
python [covid 19 eda.py](https://github.com/BIKRAMADITTYA/COVID-Clinical-Trials-EDA-Dashboard/blob/main/notebooks/covid%2019%20eda.py)
- Ensure you have Power BI Desktop installed on your system.
- Download the Clinical Trial Overview Dashboard.pbix file directly from the root of this repository.
- Open the downloaded
.pbixfile using Power BI Desktop.
Feel free to connect with me if you have any questions, feedback, or would like to discuss this project further!
- GitHub: BIKRAMADITTYA's GitHub Profile
- LinkedIn: Bikramadittya Nandan on LinkedIn




