James A Warren edited Data Repositories.tex  about 10 years ago

Commit id: 9336a9d70e47fd233856aedcc37f820d2b4050a6

deletions | additions      

       

\item Allow bi-directional linking between paper and dataset  \item Provide persistent digital identifier  \end{enumerate}  One tempting option might be to take advantage of the on-line storage capability several journals already offer for supplementary materials accompanying journal articles. However, as presently constructed these are not amenable to best practices for dataset storage as they generally are not searchable, separately citable, nor aggregated in one location. In fact, some publishers are reducing or eliminating supplementary file storage due to the haphazard structure and rules associated with their use. Further, new global government policies promoting open access to research works have the publishing industry in a state of flux with regard to their long-standing, subscription-based business model. Publishers have been extremely reticent in taking on a data archiving responsibility given the economic uncertainties in the publishing marketplace. marketplace.\cite{discussion}    As alluded to in the previous section, a fundamental consideration in repository design and/or selection is the level to which the repository will present structured versus unstructured data. Structured technical databases tend to be more useful to a technical community due their uniformity, as evidenced by their data reuse rate. rate.\cite{acharya}  A perfect construct would see the vast majority of materials data resident within structured repositories. A disciplined data structure provides enormous advantages to the researcher both in terms of data discoverability and confidence in its use. However, this structure must be enabled by the application of broader and deeper standards for data and metadata, standards that do not currently exist.   In all likelihood, like biology, MSE publications will be dependent on a collection of repositories that are tailored to specific materials data. For example, NIST is building and demonstrating a data file repository for CALPHAD and interatomic potentials. These may be expandable and largely sufficient for thematic journals such as those devoted to thermodynamics and diffusion. However, repositories such as this will only fill a relatively small niche need in MSE.