💡 For the best experience, set your browser zoom level to 75%

    ‎75%‎ للحصول على أفضل تجربة، اضبط مستوى تكبير المتصفح على 💡

Skip to main content
Completion requirements

Lesson 20 - Assignment

Title: Data Cleaning and Visualization Project

Objective:
 Analyze a CSV dataset by handling missing data, removing duplicates, converting data types, calculating correlations, and visualizing results with scatter and line plots.

Instructions:

1.    Load a CSV file using pandas.

2.    Handle missing data by:

o   Dropping rows with too many missing values.

o   Filling missing values in numerical columns with the mean.

o    

3.    Remove duplicates using drop_duplicates().

4.    Convert data types:

o   Convert a date column to datetime.

o   Convert a numeric column from string to float/int.

5.    Perform correlation analysis:

o   Use df.corr() to find correlation between numerical columns.

6.    Create a scatter plot:

o   Show the relationship between two numeric columns using plt.scatter().