# Mathematical Research Data Initiative (MaRDI consortium)

## Prof. Dr. Harald Sack

The volume of data and its rate of creation are increasing dynamically with the rapid unfolding of mathematics in the data sciences and the ever-increasing computational power. In various disciplines such as physics, chemistry, engineering, the humanities, and life sciences, this leads to increasingly complex mathematical models and mathematical research data. MaRDI's four research-motivated pillars include computer algebra, scientific computing, statistics and machine learning, and interdisciplinary mathematics.

Standardized formats and application programming interfaces will be developed for the resulting mathematical research data, and prototype services will be expanded into full services with added research value. MaRDI will develop enhanced information services that include mathematical models as research data, an interdisciplinary mathematical digital semantic atlas, ontologies, and metadata, such as an algorithm metadata library. In addition, a digital service portal will be established as a central point of contact for the scientific community. Sustained implementation of MaRDI's results requires a community based on a FAIR data culture and FAIR research processes. "FAIR" stands for Findable, Accessible, Interoperable, and Reusable. The goal is to optimally prepare research data for people and machines and make it accessible - unhindered and loss-free. To this end, appropriate cooperation platforms are to be developed for knowledge dissemination, scientific discourse and quality assurance.

## The consortium

The consortium consists of 15 research institutions of the German mathematical community. German and international associations, such as the European Mathematical Society (EMS), scientific societies, and consulting experts are also involved. It is coordinated by Weierstraß-Institut für Angewandte Analysis und Stochastik, a Leibniz Institute in the Berlin research association.

## FIZ Karlsruhe@ MaRDI

In the field of mathematics, FIZ Karlsruhe develops and maintains various information services that are recognized and used by the community worldwide. Their core functions focus on facilitating the indexing and retrieval of bibliographic information, reviews, free full texts, and software. In Task Area 5, FIZ Karlsruhe contributes to the development of the MaRDI portal, networking with its renowned international information services zbMATH Open and swMATH, and through the integration of standardized research data.

Under the umbrella of the MaRDI portal, a customized, cost-efficient and user-friendly service for archiving and publishing research data will be offered in the future. FIZ Karlsruhe‘s research unit Information Service Engineering (ISE) participates in the development and implementation of the MaRDI Knowledge Graph.

**Term:** October 01, 2021 – September 30, 2026

## Publications

H. S. Cohl and M. Schubotz, ‘The digital shadow of mathematics and its ramifications’, in 90 Years of zbMATH, 1st ed., K. Hulek, O. Paniagua Taboada, and O. Teschke, Eds. EMS Press, 2024, pp. 19–22. DOI: 10.4171/zbl90/11.

J. Stegmüller and M. Schubotz, ‘WikiTexVC: MediaWiki’s native LaTeX to MathML converter for Wikipedia’. arXiv, 2024, http://arxiv.org/abs/2401.16786

T. O. F. Conrad, E. Ferrer, D. Mietchen, L. Pusch, J. Stegmüller, and M. Schubotz, ‘Making Mathematical Research Data FAIR: Pathways to Improved Data Sharing’, Sci Data, vol. 11, no. 1, p. 676, 2024, DOI: 10.1038/s41597-024-03480-0.

M. Petrera, F. Müller, and M. Schubotz, ‘API solutions at zbMATH Open’, in 90 Years of zbMATH, 1st ed., K. Hulek, O. Paniagua Taboada, and O. Teschke, Eds. EMS Press, 2024, pp. 55–73. DOI: 10.4171/zbl90/16.

A. Satpute, N. Gießing, A. Greiner-Petter, M. Schubotz, O. Teschke, A. Aizawa, and B. Gipp, ‘Can LLMs Master Math? Investigating Large Language Models on Math Stack Exchange’, in Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval, Washington DC USA, 2024, pp. 2316–2320. DOI: 10.1145/3626772.3657945.

J. Stegmüller, A. Greiner-Petter, P. Sojka, O. Teschke, and M. Schubotz, ‘Examination of the state of the art of mathematical formula search for zbMATHOpen’, in 90 Years of zbMATH, 1st ed., K. Hulek, O. Paniagua Taboada, and O. Teschke, Eds. EMS Press, 2024, pp. 27–41. DOI: 10.4171/zbl90/13.

N. Gießing, M. Deb, A. Satpute, M. Schubotz, and O. Teschke, ‘Reducing the climate impact of data portals: a case study’. arXiv, 2024. http://arxiv.org/abs/2406.03858

A. Satpute, A. Greiner-Petter, N. Gießing, I. Beckenbach, M. Schubotz, O. Teschke, A. Aizawa, and B. Gipp, ‘Taxonomy of Mathematical Plagiarism’, in Advances in Information Retrieval, vol. 14611, N. Goharian, N. Tonellotto, Y. He, A. Lipani, G. McDonald, C. Macdonald, and I. Ounis, Eds. Cham: Springer Nature Switzerland, 2024, pp. 12–20. DOI: 10.1007/978-3-031-56066-8_2.

The MaRDI Consortium, ‘Research Data Management Planning in Mathematics’, 2023, DOI: 10.5281/zenodo.10018246.

T. Conrad, E. Ferrer, D. Mietchen, L. Pusch, J. Stegmüller, and M. Schubotz, ‘Making Mathematical Research Data FAIR: A Technology Overview’. arXiv, 2023. http://arxiv.org/abs/2309.11829

M. Schubotz, E. Ferrer, J. Stegmüller, D. Mietchen, O. Teschke, L. Pusch, and T. O. Conrad, ‘Bravo MaRDI: A Wikibase Powered Knowledge Graph on Mathematics’, Proceedings of the 4th Wikidata Workshop 2023 co-located with 22nd International Semantic Web Conference (ISWC 2023), vol. 3640, p. 13, 2023, DOI: 10.48550/ARXIV.2309.11484.

L. Rossenova, M. Schubotz, and R. Shigapov, ‘The Case for a Common, Reusable Knowledge Graph Infrastructure for NFDI’, Proc Conf Res Data Infrastr, vol. 1, 2023, DOI: 10.52825/cordi.v1i.266.

The MaRDI Consortium, ‘MaRDI: Mathematical Research Data Initiative Proposal’, 2022, DOI: 10.5281/zenodo.6552436.