Jace Harker edited materials_and_methods.tex  about 9 years ago

Commit id: 9e1b0811afa2efe82096f5234d6eb5e9c53fcc4b

deletions | additions      

       

performed the following filtering workflow. First, we removed links to  domains that are scholarly repositories and which obviously do not  host data (or which did not host data prior to 2008). These include  domains such as \url{http://dx.doi.org}, \url{http://arxiv.org}, \url{http://xxx.lanl.gov}, \href{http://dx.doi.org}{dx.doi.org}, \href{http://arxiv.org}{arxiv.org}, \href{http://xxx.lanl.gov}{xxx.lanl.gov},  and \url{http://adsabs.harvard.edu}. \href{http://adsabs.harvard.edu}{adsabs.harvard.edu}.  Removing links to these domains, which are obviously pointers to articles, narrowed down the corpus to  $26663$.