Implementing More Effective FAIR Scientific Data Management With a Lakehouse

September 7, 2021

Data powers scientific discovery and innovation. But data is only as good as its data management strategy, the key factor in ensuring data quality, accessibility, and reproducibility of results, all requirements of reliable scientific evidence. As large datasets have become more important and accessible to scientists across disciplines, the big data problems of the past decade (unruly, untamed, uncontrolled, and unreproducible data workflows) have become increasingly relevant to scientific organizations. This led industry experts to develop a framework for “good data management and stewardship,” initially introduced in a 2016 article in Nature, with “long-term care of …

