problem with regriding a netCDF file using nctoolkit

Question

I am using daily data to calculate monthly averages using ensemble_mean. Once I have the file with the monthly average, I regrid the file from 0.1 to 0.25 degrees using another file as the target grid. The ensemble mean goes well, but when trying to regrid the file I get the following error:

ValueError: CDO error: Error (cdf_put_vara_double): NetCDF: Numeric conversion not representable. Tip: check if missing values are incorrectly set to large actual values!

This happens only in certain months. For some others, the regridding process works perfectly.

The code I am using is:

import nctoolkit as nc

ds = nc.open_data("/home/omi_data/HCHO/data/2006/12/*.nc4")
ds1=nc.open_data("/home/omi_data/NO2/data/2006/07/OMI-Aura_L3-OMNO2d_2006m0702_v003-2019m1121t032327.he5.ncml.nc4")
ds.ensemble_mean('key_science_data_column_amount')
ds.regrid(ds1)
ds.to_nc('/home/omi_data/HCHO/data/2006/monthly_average/HCHO_0612.nc4')

Data link

Robert Wilson · Accepted Answer

This problem appears to be caused by issues in the raw data. The netCDF files say the data format is F32. However, one of the files actually has data values that are outside the maximum range accepted by F32. That's a mistake during file creation. This is causing problems in CDO when nctoolkit calls it. As the error said, you have data can cannot be represented with 32-bit. Essentially, what you will have to do is correct the raw data before processing it. Just set anything outside the valid range to NA. The following should work:

ds = nc.open_data("/home/omi_data/HCHO/data/2006/12/*.nc4")
ds.as_missing([3.40282347E+38, 1e50])
ds.as_missing([-1e50, -3.40282347E+38])
ds1=nc.open_data("/home/omi_data/NO2/data/2006/07/OMI-Aura_L3- 
OMNO2d_2006m0702_v003-2019m1121t032327.he5.ncml.nc4")
ds.ensemble_mean('key_science_data_column_amount')
ds.regrid(ds1)
ds.to_nc('/home/omi_data/HCHO/data/2006/monthly_average/HCHO_0612.nc4')

Alternatively, you could also change the units to something more sensible. You really don't want to work with files with such large values, as you will easily run into limits with whatever numerical precision you are working with. However, the units are slightly confusing. The file says "molecules/cm^2". However, you can get positive and negative values. I don't understand that, so I can't provide guidance on changing the units.

problem with regriding a netCDF file using nctoolkit

Answers (2)

Related Questions