Alberto Pepe edited subsectionData_collection_Our_analysis.tex  over 11 years ago

Commit id: cc2c2812a660e13bc830c17649544ad0d2aa8d75

deletions | additions      

       

\section{Data and study overview}  \subsection{Data collection}  Our analysis is based on a corpus of 4,606 scientific articles submitted to the preprint database arXiv between October 4, 2010 and May 2, 2011. For each article in this cohort, we gathered information about their downloads from the arXiv server weekly download logs, their daily number of mentions on Twitter using a large-scale collection of Twitter data collected over that period, and their early citations in the scholarly record from Google Scholar. Table \ref{data_specs} summarizes the discussed data collection and Figure \ref{fig:TAC_timeline} provides an overview of the data collection timelines.