Data we get from different sources may not be in the form or format for direct use in our applications. We need to correct these data by using various methods available in Pandas.
In Pandas missing data is represented by two value: None and NaN : NaN (an acronym for Not a Number) It is generally recommended to use NaN (np.nan) instead of None for missing or undefined values in numerical columns of a Pandas DataFrame. This allows Pandas to retain the original data type (such as int), while treating the NaN values as missing data. This ensures consistent data types and enables proper handling of missing values using Pandas' functions and methods designed for numerical data.