Skip to content

Latest commit

 

History

History
273 lines (205 loc) · 10.6 KB

README.md

File metadata and controls

273 lines (205 loc) · 10.6 KB

What is it?

Ml2pl interpolates atmospheric data at model levels to pressure levels. Input and output are in NetCDF format.

Ml2pl was first written to process history files of LMDZ (files named hist....nc). It also works with atmospheric data from other sources. It is used in Climaf.

Installation

Note for the users of French supercomputing centers

If you want to use this program on the computers jean-zay at Idris, or irene at TGCC, or spirit at IPSL, the program is already installed at the following paths.

On jean-zay:

/gpfswork/rech/lmd/rdzt899/bin/ml2pl.py

On irene:

/ccc/work/cont003/gencmip6/guezl/bin/ml2pl.py

On spirit:

/data/guez/bin/ml2pl.py

Dependencies

  • CMake (version ≥ 3.16)1.

  • The NetCDF-C library (version ≥ 4.6).

  • The NetCDF-Fortran library (version ≥ 4.5).

  • The Fortran compiler that was used to compile your installed NetCDF-Fortran library.

  • Git (Git is used by CMake to fetch a dependency).

  • Git-annex (optional, to download the NetCDF test files).

Under Ubuntu ≥ 20.04 or Linux Mint ≥ 20, you can install all these dependencies with the following command:

sudo apt install libnetcdff-dev gfortran cmake git git-annex

Instructions

  1. Get Ml2pl from Github.

  2. Create a build subdirectory in the Ml2pl directory you have just downloaded:

    cd Ml2pl
    mkdir build
    cd build
    
  3. Decide in which directory you want to install Ml2pl after compilation and type the command below with your choice after -DCMAKE_INSTALL_PREFIX= (enter an absolute path). The installation process will install a Python script, ml2pl.py, in $CMAKE_INSTALL_PREFIX/bin. It is convenient for $CMAKE_INSTALL_PREFIX/bin to be in your PATH environment variable. For example:

    cmake .. -DFETCH=ON -DCMAKE_INSTALL_PREFIX=~/.local
    

    Note that this requires a network connection.

  4. Type:

    make install
    

You do not need to keep the downloaded directory Ml2pl (nor the build directory) after installation. Note that the installation process also installs a Fortran executable file, ml2pl, in $CMAKE_INSTALL_PREFIX/libexec. Do not remove this file.

Advanced instructions

Most users should not need these advanded instructions.

  • You can choose any name and any location for the build directory. You have to refer to the source directory when you run cmake from the build directory:

      mkdir /wherever/any/name
      cd /wherever/any/name
      cmake /where/I/downloaded/Ml2pl -DFETCH=ON -DCMAKE_INSTALL_PREFIX=~/.local
    
  • The option -DFETCH=ON instructs CMake to download, compile and install the libraries Jumble, NetCDF95 and Numer_Rec_95. If you have already installed these libraries, you can omit the FETCH option.

  • On some machines, you may have to choose a run-time environment for ml2pl.py. For example, if you have several compilers and you have selected one at build time with environment modules, you probably need to select it also at run time. ml2pl.py tries to source a file named ml2pl_runtime_env.py in the libexec subdirectory of CMAKE_INSTALL_PREFIX. So create this file there if you need it. There is a template for ml2pl_runtime_env.py.

  • After cloning the repository, the NetCDF entries in the Tests subdirectory are broken links. This saves network bandwidth and disk space. If you want to use the NetCDF files for tests, install git-annex (see Dependencies) and type:

      git annex get .
    

    This will download the NetCDF files to the right location inside the the .git subdirectory, such that the symlinks in Tests are fixed.

Troubleshooting

  • If your installation of NetCDF or NetCDF-Fortran is in a non-standard location, and CMake does not find it, then re-run cmake setting the variable CMAKE_PREFIX_PATH to the directory of installation of NetCDF or NetCDF-Fortran. CMake will then search ${CMAKE_PREFIX_PATH}/lib, ${CMAKE_PREFIX_PATH}/include, etc. For example:

      cmake . -DCMAKE_PREFIX_PATH:PATH=/path/to/my/favorite/installation
    
  • If you have several Fortran or C compilers on your machine, it is possible that CMake does not choose the ones you want. Note that when you run cmake, it prints lines telling which compilers it is going to use. For example :

      -- The Fortran compiler identification is GNU 11.3.0
      -- The C compiler identification is GNU 11.3.0
    

    So if you want other compilers, remove everything in the build directory and run cmake again setting the variables FC and CC to the compilers you want. For example:

      rm -r * # in the build directory!
      FC=ifort CC=icc cmake .. -DCMAKE_INSTALL_PREFIX=~/.local
    

Usage

Running the command with argument -h will produce a help message:

$ ml2pl.py -h
usage: ml2pl.py [-h] [-p VARIABLE] [-v VARIABLE] [-w VARIABLE] [-m VARIABLE]
                [--version]
                input_file output_file [pressure_file]

Interpolates NetCDF variables from model levels to pressure levels.

The target pressure levels should be in a file called
"press_levels.txt", in the same unit as input pressure field. For
further information, see https://github.com/lguez/Ml2pl.

Do not run several instances of this script in parallel in the same
directory. It creates temporary files, with names that are not made
different for different instances.

positional arguments:
  input_file
  output_file
  pressure_file

options:
  -h, --help            show this help message and exit
  -p VARIABLE, --pressure_var VARIABLE
                        name of 4-dimensional variable in the input file or in
                        the pressure file containing the pressure field at
                        model levels
  -v VARIABLE           name of variable you want to interpolate, or
                        extrapolate if target pressure level is below surface
  -w VARIABLE           name of variable you want to interpolate, or set to 0
                        if target pressure level is below surface
  -m VARIABLE           name of variable you want to interpolate, or set to
                        missing if target pressure level is below surface
  --version             show program's version number and exit

The interpolation is linear in logarithm of pressure. The input variables depend on longitude, latitude, vertical level and time. There is no constraint on dimension names nor dimensions lengths. The input fields may be defined on only part of the rectangular longitude-latitude domain at each time (as specified with the missing_value or _FillValue attribute).

At given longitude, latitude and time, if a target pressure level is lower than the lower bound of the input pressure field, then variables are extrapolated to this target pressure level. If a target pressure level is higher than the higher bound of the input pressure field, then each variable may be extrapolated or set to 0 or set to missing at this target pressure level. This is controlled by options -v, -w and -m.

All computations are done with single-precision real numbers and output is in single-precision. So if the input fields are in double precision, they are first converted to single-precision.

input-file, output-file and pressure-file are NetCDF files.

You must list the variables you want to interpolate, each variable listed after either -v, -w or -m. There must be at least one variable listed, following either -v, -w or -m. In the same command, you can have several options -v, -w or -m with associated variables: one variable for each -v, -w or -m.

The pressure field at model levels can be specified in input-file or pressure-file either through hybrid coefficients and surface pressure or directly from 4-dimensional pressure. In both cases, pressure must decrease when the index of model level increases. This is checked quickly in the program. If option -p is not used then the program will look for NetCDF variables ap, b (hybrid coefficients) and ps (surface pressure) in the input file or the pressure file. If the program does not find NetCDF variable ap then it looks for NetCDF variables a and p0 and computes ap = a * p0.

Let us call $n_\mathrm{mod}$ the number of model levels. $n_\mathrm{mod}$ is obtained by the program by looking at the size of the vertical dimension of variables to interpolate. The size of ap and b must be $n_\mathrm{mod}$ or $n_\mathrm{mod} + 1$. If the size of ap and b is $n_\mathrm{mod} + 1$ then the program uses mid-values of ap and b: (ap(l) + ap(l + 1)) / 2 and (b(l) + b(l + 1)) / 2.

The target pressure levels should be in a text file called press_levels.txt in the current directory at run-time. The first line of the file is skipped, assuming it is a title line. Target pressure levels should be in the same unit as input pressure at model levels, as given by the 4-dimensional pressure or hybrid coefficients and surface pressure. Target pressure levels can be in any order, one value per line. There should be at least one target pressure level. There is no other constraint on these values nor on the number of values.

There is an example for file press_levels.txt.

Main memory

The program loops on time index and does not use four-dimensional, space plus time, variables. So the amount of main memory used depends on spatial resolution of the fields but not on the number of dates.

If $n_\mathrm{lon}$ is the number of longitudes, $n_\mathrm{lat}$ the number of latitudes, $n_\mathrm{mod}$ the number of model vertical levels, $n_\mathrm{var}$ the number of variables to interpolate and $n_\mathrm{plev}$ the number of target pressure levels, then the amount of main memory used should be approximately :

$$n_\mathrm{lon} n_\mathrm{lat} [n_\mathrm{mod} (n_\mathrm{var} + 1) + n_\mathrm{plev} n_\mathrm{var}] \times 4 B$$

(B is for bytes).

Footnotes

  1. On Mac OS, after downloading the application from the CMake web site, run it, then click on "How to Install For Command Line Use" in the Tools menu.