Our current way to do biocatalytic research and development is still limited by low reproducibility of experimental results, limited scalability of experimentation, and limited access to data. Even if accessible, data and metadata cannot be easily exchanged between research groups because of missing interoperability, which needs additional efforts for reformatting and results in loss of information. Limited reproducibility and interoperability severely hampers scientific development and causes additional costs and delays in academic research and industrial development.
In a cooperation with project partners in Europe, USA, and South Africa, we currently establish EnzymeML as a novel standardized data exchange format for biocatalysis, which facilitates the application of the STRENDA Guidelines and thus makes data on enzyme-catalyzed reactions findable, accessible, interoperable, and reusable (FAIR). An Application Programming Interface enables the integration of applications such as electronic lab notebooks, modelling platforms, and databases. Documentation and software of the EnzymeML project are freely available for non-commercial and commercial users.
This research project is funded in the framework of the Stuttgart Cluster of Excellence SimTech.
- Range, J., Halupczok, C., Lohmann, J., Swainston, N., Kettner, C., Bergmann, F. T., Weidemann, A., Wittig, U., Schnell, S., & Pleiss, J. (2022). EnzymeML—a data exchange format for biocatalysis and enzymology. The FEBS Journal, 289(19), 5864--5874. https://doi.org/10.1111/febs.16318
- Pleiss, J. (2021). Standardized data, scalable documentation, sustainable storage – EnzymeML as a basis for FAIR data management in biocatalysis. ChemCatChem, 13, 3909–3913. https://doi.org/10.1002/cctc.202100822