Converting Chemkin Format Files#
Many existing reaction mechanism files are in CK format, by which we mean
the input file format developed for use with the Chemkin-II software package (and
subsequent releases) as specified in the report describing the Chemkin software
[Kee et al., 1989]. Cantera comes with a converter utility program ck2yaml
(or
ck2yaml.py
) that converts CK format into Cantera’s YAML format. If you want to convert
a Chemkin-format file to YAML format, or you’re having errors when you try to do so,
this section will help.
Converting gas-phase mechanisms#
See also
For documentation of all the command line options that can be used with ck2yaml
, see
its documentation in the reference section.
To convert a gas phase mechanism where the thermodynamic and transport data are provided as separate files, run the following command from a terminal (command prompt):
ck2yaml --input=chem.inp --thermo=therm.dat --transport=tran.dat
If the ck2yaml
script is not on your path but the Cantera Python module is,
ck2yaml
can also be used by running:
python -m cantera.ck2yaml --input=chem.inp --thermo=therm.dat --transport=tran.dat
Tip
If you’re using Cantera from IPython or a Jupyter Notebook, you can run the above shell command from Python by prefixing it with an exclamation point. For example:
!python -m cantera.ck2yaml --input=chem.inp --thermo=therm.dat --transport=tran.dat
Converting surface mechanism files#
For mechanisms involving both surface and gas phase species, an additional input file
is needed to define the surface species and reactions occurring on the surface. This
file is specified with the --surface
option:
python -m cantera.ck2yaml --input=gas.inp --thermo=therm.dat --surface=surf.inp
Converting standalone thermo data#
An input file containing only species definitions (which can be referenced from phase definitions in other input files) can be created by specifying only a thermo file. For example:
ck2yaml --thermo=therm.dat
Debugging common errors in CK files#
When ck2yaml
encounters an error, it attempts to print the surrounding information to
help you to locate the error. Many of the most common errors are due to an inconsistency
of the input files from their standard, as defined in the report for Chemkin referenced
above. Here, we describe some requirements of the CK format that are common sources of
errors.
Tip
Many existing CK format files cause errors in ck2yaml
when they are processed. Some of
these errors may be avoided by specifying the --permissive
option. This option allows
certain recoverable parsing errors (for example, duplicate transport or thermodynamic
data) to be ignored.
Input file sections#
Each section of a CK input file must start with a keyword representing that section and
end with the keyword END
. Keywords that may begin a section include:
ELEMENTS
orELEM
SPECIES
orSPEC
THERMO
orTHERMO ALL
REACTIONS
orREAC
TRANSPORT
Thermo data#
The thermodynamic data is read in a fixed format. This means that each column of the input has a particular meaning. The format for each thermodynamic entry should be as follows:
CH3OH L 8/88C 1H 4O 1 G 200.000 3500.000 1000.000 1
1.78970791E+00 1.40938292E-02-6.36500835E-06 1.38171085E-09-1.17060220E-13 2
-2.53748747E+04 1.45023623E+01 5.71539582E+00-1.52309129E-02 6.52441155E-05 3
-7.10806889E-08 2.61352698E-11-2.56427656E+04-1.50409823E+00 4
Caution
Many common errors are generated because information is missing or in the wrong column. Check thoroughly for extraneous or missing spaces. Also make sure that the input file does not contain tab characters.
The following table is adapted from the Chemkin manual [Kee et al., 1989] to describe the column positioning of each required part of the entry. Empty columns should be filled with spaces.
Line No. |
Columns |
Contents |
---|---|---|
1 |
1–18 |
Species Name |
1 |
19–24 |
Date (Optional) |
1 |
25–44 |
Atomic Symbols and formula |
1 |
45 |
Phase of species ( |
1 |
46–55 |
Low temperature |
1 |
56–65 |
High temperature |
1 |
66–73 |
Common temperature |
1 |
74–78 |
Additional Atomic Symbols |
1 |
80 |
The integer |
2 |
1–75 |
Coefficients \(a_1\) to \(a_5\) for the upper temperature interval |
2 |
80 |
The integer |
3 |
1–75 |
Coefficients \(a_6,\ a_7\) for the upper temperature interval, and \(a_1,\ a_2,\ a_3\) for the lower temperature interval |
3 |
80 |
The integer |
4 |
1–60 |
Coefficients \(a_4\) through \(a_7\) for the lower temperature interval |
4 |
80 |
The integer |
Line 1#
The first 18 columns are reserved for the species name. The name assigned to the
species in the thermodynamic data must be the same as the species name defined in the
SPECIES
section. If the species name is shorter than 18 characters, the rest of the
characters should be filled by spaces.
The next six columns (columns 19–24) are typically used to write a date; they are not used further.
The next 20 columns (25–44) are used to specify the elemental composition of the species. Five characters are used for each element: the first two specify the element symbol, while the remaining three specify the number of atoms of that element in the species. Up to four elements may be specified this way. For species with more than four elements, the extended composition format described below may be used.
In column 45, the phase of the species (S
, L
, or G
for solid, liquid, or gas
respectively) should be specified.
The next 28 columns are reserved for the temperatures that delimit the ranges of the polynomials specified on the next several lines. The first two temperatures have a width of 10 columns each (46–55 and 56–65), and represent the lowest temperature and highest temperature for which the polynomials are valid. The last temperature has a width of 8 columns (66–73) and is the common temperature, where the switch from low to high occurs.
The next 5 columns (74–78) are reserved for atomic symbols and are usually left blank for the default behavior.
Column 79 is blank; and finally, the row is ended in column 80 with the integer 1
.
Lines 2–4#
The next three lines of the thermodynamic entry have a similar format. They contain the coefficients of the 7-coefficient polynomial formulation in two temperature regions.
The second row of the thermo entry (the first after the information row) contains the
first five coefficients that apply to the temperature range between the midpoint and the
upper limit. 15 columns are alloted for each coefficient (for a total of 75 columns),
with no spaces between them. Although the entry above shows spaces between positive
coefficients, it is to be noted that this is done only for formatting consistency with
other lines that contain negative numbers. After the coefficients, four spaces in
columns 76–79 are followed by the integer 2
in column 80.
On the next line, the last two coefficients for the upper temperature range and the
first three coefficients for the lower temperature range are specified. Once again, this
takes up the first 75 columns, columns 76–79 are blank, and the integer 3
is in
column 80.
Finally, on the last line of a particular entry, the last four coefficients of the lower
temperature range are specified in columns 1–60, 19 blank spaces are present, and the
integer 4
is in column 80.
Extended composition format#
If the number of atoms of an element in a thermodynamic entry has more than 3 digits,
the standard format for the composition cannot be used. To allow for such species, the
following extended format is allowed. The element symbol should have a 0
in the first
line of the entry. An ampersand (&
) is added after the index of the first line, and
the element symbols and their amounts should be written on the next line as follows:
BIN6J PYRENEJ1 C 0H 0 0 0G 300.000 5000.000 1401.000 1&
C 778 H 263
3.63345177E+01 3.13968020E-02-1.09044660E-05 1.71125597E-09-1.00056355E-13 2
4.05143093E+04-1.77494305E+02-1.20603441E+01 1.59247554E-01-1.41562602E-04 3
6.26071650E-08-1.09305161E-11 5.56473533E+04 7.68451211E+01 4
or on separate lines with ampersand (&
) as the last character on the line:
BIN6 PYRENE C 0H 0 0 0G 300.000 5000.000 1401.000 1&
C 778&
H 264
3.65839677E+01 3.36764102E-02-1.16783938E-05 1.83077466E-09-1.06963777E-13 2
9.29809483E+03-1.81272070E+02-1.29758980E+01 1.63790064E-01-1.43851166E-04 3
6.31057915E-08-1.09568047E-11 2.48866399E+04 7.94950474E+01 4
Transport parameters#
The transport data file also has a specified format, as described in [Kee et al., 1986], although the format is not as strict as for the thermodynamic entries. In particular, the first 15 columns of a line are reserved for the species name. One common source of errors is a species that is present in the transport data file, but not in the thermodynamic data or in the species list; or a species that is present in the species list but not the transport data file. The rest of the columns on a given line have no particular format, but must be present in the following order:
Parameter Number |
Parameter Name |
---|---|
1 |
An integer with value 0, 1, or 2 indicating monatomic, linear, or non-linear molecular geometry. |
2 |
The Lennard-Jones potential well depth \(\varepsilon/k_B\) in Kelvin |
3 |
The Lennard-Jones collision diameter \(\sigma\) in Angstrom |
4 |
The dipole moment \(\mu\) in Debye |
5 |
The polarizability \(\alpha\) in Angstroms cubed |
6 |
The rotational relaxation collision number \(Z_{rot}\) at 298 K |
Another common error is if all six of these numbers are not present for every species.
Other formatting issues#
It may be the case that scientific formatted numbers are missing the E
. In this case,
numbers often show up as 1.1+01
, when they should be 1.1E+01
. You can fix this with
a Regular Expression “find and replace”:
Find: (\d+\.\d+)([+-]\d+)
Replace: $1E$2