Alberto Pepe edited subsectionData_collection_Our_analysis.tex  over 11 years ago

Commit id: 09f7dfc4611038476807aca9682d292d99150763

deletions | additions      

       

\section{Data and study overview}  \subsection{Data collection}  Our analysis is based on a corpus of 4,606 scientific articles submitted to the preprint database arXiv between October 4, 2010 and May 2, 2011. For each article in this cohort, we gathered information about their downloads from the arXiv server weekly download logs, their daily number of mentions on Twitter using a large-scale collection of Twitter data collected over that period, and their early citations in the scholarly record from Google Scholar. Table \ref{data_specs} 1  summarizes the discussed data collection and Figure \ref{fig:TAC_timeline} provides an overview of the data collection timelines.