Charles H. Ward added missing citations  about 10 years ago

Commit id: 5911f996c1951ebf45fc84063023303bf52f213f

deletions | additions      

       

  As alluded to in the previous section, a fundamental consideration in repository design and/or selection is the level to which the repository will present structured versus unstructured data. Structured technical databases tend to be more useful to a technical community due their uniformity, as evidenced by their data reuse rate.\cite{acharya} A perfect construct would see the vast majority of materials data resident within structured repositories. A disciplined data structure provides enormous advantages to the researcher both in terms of data discoverability and confidence in its use. However, this structure must be enabled by the application of broader and deeper standards for data and metadata, standards that do not currently exist.    In all likelihood, like biology, MSE publications will be dependent on a collection of repositories that are tailored to specific materials data. For example, NIST is building and demonstrating a data file repository for CALPHAD and interatomic potentials.\cite{NISTMDR} These may be expandable and largely sufficient for thematic publications such as those devoted to thermodynamics and diffusion. However, repositories such as this will only fill a relatively small niche need in MSE. \textit{Integrating Materials and Manufacturing Innovation} is piloting an effort to link articles with their supporting data using the NIST repository according to the criteria outlined above.\cite{doi: 10.1186/2193-9772-2-5}\cite{Shade_data} above.\cite{Shade_2013}\cite{Shade_data}    Finally, a business model for sustainably archiving materials data is required. Other technical fields, such as earth sciences, can at least partially rely on government-provided repositories for large and complex datasets. Without these types of repositories to build on, MSE will need to establish viable repository solutions. In response to funding agency requirements for data management plans some universities, Johns Hopkins for example, are beginning to provide centrally-hosted data repositories, but these are not yet common.\cite{jhudata} Private fee-for-service repository services, such as \textit{labarchives} and \textit{figshare}, are also evolving to meet growing demand for accessible data storage.\cite{labarchives,figshare} Additionally, ASM International is working to create a prototype materials data repository through its close association with Granta Design. Termed the Computational Materials Data Network (CMDN), this is an interesting option as the data repository will provide a structured database specifically for materials data, but the business model for CMDN has not yet been solidified.\cite{cmdn} A key open question remains how funding agencies will respond to the OSTP open research policy memo, and how they will fund activities making data open to the public.