COVID19 analysis by using datamining

In this project, you will learn how to preprocess and merge datasets to calculate needed measures and prepare them for an Analysis. In this project, we are going to work with the COVID19 dataset, published by John Hopkins University, which consists of the data related to the cumulative number of con

2025-06-28 16:30:58 - Adil Khan

Project Title

COVID19 analysis by using datamining

Project Area of Specialization Cloud InfrastructureProject Summary

In this project, you will learn how to preprocess and merge datasets to calculate needed measures and prepare them for an Analysis. In this project, we are going to work with the COVID19 dataset, published by John Hopkins University, which consists of the data related to the cumulative number of confirmed cases, per day, in each Country. Also, we have another dataset consist of various life factors, scored by the people living in each country around the globe. We are going to merge these two datasets to see if there is any relationship between the spread of the virus in a country and how happy people are, living in that country.

Project Objectives
  1. Importing COVID19 dataset and preparing it for the analysis by dropping columns and aggregating rows.

  2. Deciding on and calculating a good measure for our analysis.

  3. Merging two datasets and finding correlations among our data.

  4. Visualizing our analysis results using Seaborn.

Project Implementation Method

Importing COVID19 dataset and preparing it for the analysis by dropping columns and aggregating rows.

Deciding on and calculating a good measure for our analysis.

Merging two datasets and finding correlations among our data.

Visualizing our analysis results using Seaborn.

Benefits of the Project
  1. Importing COVID19 dataset and preparing it for the analysis by dropping columns and aggregating rows.

  2. Deciding on and calculating a good measure for our analysis.

  3. Merging two datasets and finding correlations among our data.

  4. Visualizing our analysis results using Seaborn.

Technical Details of Final Deliverable

Coronaviruses are a large family of viruses that can cause severe illness to the human being. The first known severe epidemic is Severe Acute Respiratory Syndrome (SARS) occurred in 2003, whereas the second outbreak of severe illness began in 2012 in Saudi Arabia with the Middle East Respiratory Syndrome (MERS). The current outbreak of illness due to coronavirus is reported in late December 2019. This new virus is very contagious and has quickly spread globally. On January 30, 2020, the World Health Organization (WHO) declared this outbreak a Public Health Emergency of International Concern (PHEIC) as it had spread to 18 countries. On Feb 11, 2020, WHO named this “COVID-19”. On March 11, as the number of COVID-19 cases has increased thirteen times apart from China with more than 118,000 cases in 114 countries and over 4,000 deaths, WHO declared this a pandemic.

Final Deliverable of the Project HW/SW integrated systemCore Industry HealthOther Industries IT Core Technology Artificial Intelligence(AI)Other Technologies BlockchainSustainable Development Goals Life on LandRequired Resources
Item Name Type No. of Units Per Unit Cost (in Rs) Total (in Rs)
Total in (Rs) 36000
Machine learning softwares Equipment31200036000

More Posts