What data should be kept over the long term? How does one track digital provenance? Should we track digital provenance? What constitutes a master/archival copy? What options are being evaluated for storage of large data sets “in perpetuity”? Cost/benefit analysis for storage solutions? What challenges are repositories facing with this data type?
http://www.dpconline.org/handbook/digital-preservation/why-digital-preservation-matters