Research data in the field of catalysis are generated in several stages:
Typically, the process begins with the synthesis of a catalyst material using various starting materials and synthesis methods (synthesis data). The synthesized catalyst is then tested in a reactor under a series of varying reaction conditions—such as temperature, pressure, or flow rate—while being exposed to reactants like gases or liquids. The resulting products are analyzed for their composition and quantity using analytical methods such as mass spectrometry or gas chromatography (reaction data).
Catalysis research aims to uncover relationships between the structure of a working catalyst and its activity. Ideally, this is achieved through operando measurements that directly observe catalysts under relevant reaction conditions, often supported by quantum chemical calculations. Structural properties are determined before, during, or after the catalytic reaction using materials analysis techniques such as spectroscopy, diffraction, or absorption methods (characterization data). The reaction data are used to calculate the performance of the catalyst, which can be expressed in various forms—such as concentrations, conversion rates, or selectivity (performance data).
These data feed into the modeling of reaction kinetics (molecular modeling), reactor simulations (multiscale modeling), and process simulations. Based on these simulations, catalytic reactors and processes can be designed. The success of this approach is driven by both experimental and computational methods, which are often carried out separately by different experts. While the overall workflow and fundamental concepts are similar across the different subdisciplines of catalysis, each tends to use slightly different approaches, nomenclature, experimental methods, and descriptors for properties and performance.
Currently, data management in catalysis is mostly organized at the level of individual institutes or research groups and relies on local conventions. Cross-cutting repositories and standards for how data and metadata should be stored exist only in rudimentary form—e.g., in biocatalysis or theoretical catalysis. As a result, data publication still plays a minimal role in the field.