Matthew Sparkes – New research shows that 80 per cent of scientific data is no longer available 20 years after findings are published, as information is lost in old email addresses and obsolete storage formats.
Researchers at the University of British Columbia chose a random set of 516 studies published between 1991 and 2001 and found that all data from the two-year-old papers was still available, but that the chance of it still existing fell by 17 per cent for each year of age. The paper, published this week in Current Biology, warns that scientists are “poor stewards of their data” and calls for journals to begin uploading information onto public archives so it can be preserved for the future. Having access to the raw data of a study is vital for other scientists to assess, replicate or build on that work. Data was requested from the authors of each of the randomly chosen studies, but the researchers found that the odds of even finding a working email address declined by seven per cent for each year since publication.
Tim Vines, a visiting scholar at the University of British Columbia and one of the authors of the paper, said: “Publicly funded science generates an extraordinary amount of data each year. Much of these data are unique to a time and place, and is thus irreplaceable, and many other datasets are expensive to regenerate.”
- See also via Smithsonian magazine – The Vast Majority of Raw Data From Old Scientific Studies May Now Be Missing