Alberto Pepe edited discussion.tex  almost 10 years ago

Commit id: fc26185b720ce27f10e70c0ba218d838229152f2

deletions | additions      

       

\item provide a data citation for every dataset uploaded. The citation includes a persistent identifier which links to the data, and can be added to the the references sections of any publication.  \end{enumerate}  For the everyday astronomer TheAstroData flips the equation of data sharing in a virtual obsevatory context on its head. It trades interoperability that comes with homogenized data sets for ease of data sharing by astronomers. Search functions focus on descriptive metadata instead of quantified slicing of datasets by physical quantities such as location on the sky. This trade off is not permanent, and we assert that the kinds of data access envisioned by \citet{2001Sci...293.2037S} for small published datasets can be achieved ex post facto. Our plans are to re-index (or expose the file level metadata related to) shared data files, extracting addtional additional  numerical metadata fields to enable finer grain search. Further, the audience for TheAstroData is completely transparent and focused on individual scientists or projects that have derived (and often heterogeneous) datasets to share or to publish along side a refereed paper. It is already the case that TheAstroData datasets are linked to literature publication records in two ways. Foremost, we provide primary publication-to-dataset links to the SAO-NASA Astrophysical Data System \href{http://adsabs.harvard.edu/}{ADS}, which is the universal literature resource for all of astronomy; an astronomer's TheAstroData datasets appear as "Data Archive" links in the primary publication's ADS record. Second, our records are listed in the \href{http://wokinfo.com/products_tools/multidisciplinary/dci}{Thomson-Reuters Data Citation Index}, which makes use of the Dataverse Network's OAI-PMH harvesting interface. Our future plans include transmutating the rich DDI metadata standards adopted by the Dataverse Network and enhanced with our astronomy specific extensions means into VO standards and exporting this version to indexing tools such as the VO Registry (or similar data publishing registry). In addition to providing a curation and long-term preservation plan for derived data in astronomy, TheAstroData has two additional benefits for everyday astronomers. First, it natively supports data analysis capabilities, and we plan to integrate it with existing tools for the analysis and visualization of astronomy datasets. Second, the stamping of TheAstroData datasets with a standardized data citation will facilitate the adoption of data citation by publishers - it is critical that ``citations to data" become part of the references sections in publications, and are easily traceable to derive their impact.