With the continuing decrease in computational cost, machine learning has made its way into more and more applications. The driving factor for successful machine learning is the broad availability of well-described, high-quality data. This is one of many reasons why data has been described as the new oil of the digital economy.
Within CRC 1333 "Molecular Heterogeneous Catalysis in Confined Geometries", we apply the FAIR data principles (findable, accessible, interoperable, reusable) by establishing data management workflows in various fields of chemistry such as catalysis, organic synthesis, materials science, analytics, and computational chemistry. Drawing on our experience in developing the standardised data exchange format EnzymeML, we aim to develop and implement useful, novel, bottom-up solutions for research data management. We collaborate closely with the Cluster of Excellence SimTech and the NFDI consortia NFDI4Chem and NFDI4Cat.
Publications
- Windels, A., Franceus, J., Pleiss, J., Desmet, T.: CANDy: Automated analysis of domain architectures in carbohydrate-active enzymes. PLOS ONE. 19, 1–16 (2024). https://doi.org/10.1371/journal.pone.0306410.
Members
Max Häußler
Bioinformatics
Torsten Giess
Bioinformatics