Weather and climate data basics

Weather and climate data basics#

Before we get into how to choose, download, and work with specific weather and climate data, in this section, we will introduce a commonly-used data format for weather and climate data, and cover basic data loading and plotting skills.

Basic visualization of climate and weather data#

To diagnose your data or to illustrate the weather and climate data used in your model, you will likely want to create plots and maps. The following is a very high-level overview; more detailed guides include:

📚 An Introduction to Earth and Environmental Data Science: A great guide to working with xarray in general, but also to plotting geographic data with xarray and cartopy, especially the section on “Maps in Scientific Python”
📚 Visualizing and Processing Climate Data Within MATLAB: A guide to plotting climate data using MATLAB, created by the institute that publishes ERA5

2-dimensional plotting#

Assuming that your data is loaded and named as it is in the section above, the following example shows how to plot the time series of a single-pixel of your variable “variable”, or an average across all pixels.

Python (xarray)

# This will plot a time series of the first lat/lon pixel
ds.variable.isel(lon=0,lat=0).plot()

# This will plot a time series of the pixel closest to 23N, 125W
ds.variable.sel(lon=-125,lon=23,method='nearest').plot()

# This will plot the average time series over all lat/lon points
# Note that, if your data is on a normal rectangular grid (even
# lat/lon spacings), you will need to weight your data to account
# for the changing size of the pixels with latitude
weights = np.cos(np.deg2rad(ds.lat))
ds.variable.weighted(weights).mean(('lat','lon')).plot()

Matlab

As before, we’re assuming the variable variable is in the form lon,lat,time.

% This will plot a time series of the first lat/lon pixel
plot(squeeze(variable(1,1,:)))

% This will plot the average time series over all lat/lon points
% Note that, if your data is on a normal rectangular grid (even
% lat/lon spacings), you will need to weight your data to account
% for the changing size of the pixels with latitude
weights = cos(deg2rad(lat))   
% Now, first take the mean over longitude, then the weighted mean
% over latitude values, and plot the result, squeezing to get rid
% of size-1 dimensions
plot(squeeze((weights'*squeeze(mean(variable,1)))/sum(weights)))

Maps#

Weather and climate data is generally geographic in nature; you’re therefore likely to want or need to create maps of your variables. Maps can also offer an easy first-order check to see if your data subset correctly. Assuming that your data is loaded and named as it is in the section above, the following example shows how to plot a map of a single timestep of your variable “variable” or an average across all timesteps.

Note that which map projection you use will influence how you read the map. In the code examples below, we will use an equal-area projection, in which every grid cell in the gridded data is shown with its accurate relative area, to avoid visually overemphasizing data in regions with smaller geographic extent. To see which other projections are available, see the relevant parts of the documentations (here for cartopy/python, and here for Matlab)

Python (xarray)

## Example without geographic information: 
# To plot a heatmap of your 3-dimensional variable 
# at the first timestep of the data
ds.variable.isel(time=0).plot()
# To plot a heatmap of your variable, averaged across
# all timesteps
ds.variable.mean('time').plot()


## Example with geographic information:
import cartopy.crs as ccrs
from matplotlib import pyplot as plt    
# Create axis, setting the projection of the final map as the
# Eckert IV equal-area projection
ax = plt.axes(projection=ccrs.EckertIV()
# Plot data; specifying that the dimensions of the data should be
# interpreted as lat/lon values
ds.variable.isel(time=0).plot(transform=ccrs.PlateCarree()
# Add coastlines
ax.coastlines()
# (to plot the time mean, for example, use instead 
# ds.variable.mean('time').plot(transform=ccrs.PlateCarree())

Matlab

As before, we’re assuming the variable variable is in the form lon,lat,time.

% To plot a heatmap of your 3-dimensional variable 
% at the first timestep of the data
pcolor(squeeze(variable(:,:,1)).'); shading flat
%  To plot a heatmap of your variable, averaged across
% all timesteps
pcolor(squeeze(mean(variable,3)).'); shading flat

% Alternatively, with geographic information:
axesm('eckert4') % Set desired projection in the function call; i.e. 'eckert4'
pcolorm(lat,lon,squeeze(variable(:,:,1)).'); shading flat 
% coast.mat is included with Matlab installations; this will add coastlines. 
coasts=matfile('coast.mat')
geoshow(coasts.lat,coasts.long)

Moving forward#

Now that you know how to read NetCDF files and conduct basic operations and plotting with them, you can start downloading and using the weather and climate data you need for your projects. To take the first steps on this road, we’ll cover the basics of gridded data in the next section.

Weather and climate data basics

Contents

Weather and climate data basics#

The NetCDF data format#

Your code environment#

NetCDF contents#

NetCDF file organization#

The NetCDF header#

Attributes#

Reading NetCDF data#

Loading a subset of a NetCDF file#

Basic visualization of climate and weather data#

2-dimensional plotting#

Maps#

Moving forward#