Small Data, or: Bridging the Gap Between Smart and Dumb Research Repositories

Scientific research consists mostly of many tiny niches, a ‘long tail’ effect, with many thousands of small data sets. This gives us a ‘Small Data’ problem: how do we connect vastly different experimental results so that other scientists can use them? Currently, there are many ‘dumb’ repositories that require little metadata or informatics support but offer little opportunity for overarching analytics. Highly usable ‘smart’ repositories, by contrast, require detailed manual curation, which does not scale. This talk will present a proposal for bridging the chasm between these two approaches, to enable systems for interoperable results reporting.


Anita de Waard


