Authorea

Juan de Monasterio Deleted File almost 8 years ago

Commit id: 21ed51e9f902df782a59ae1af10545dc3da03225

deletions | additions

% Procedimiento: % Obtener la lista de antenas de GC % Determinar la casa de los usuarios % Determinar, para cada usuario, si se comunicó con un habitante de GC % Determinar las agregaciones por antena % Visualizar (agrego alfa, min_volume y zoom en regiones) \section{Methodology} \label{methods} \subsection{Home detection} The first step of the process involves determining the area in which each user lives. Having the granularity of the geolocated data at the antenna level, we can at most match each user $u \in \calN_C$ with its \textit{home antenna} $H_u$. To do so, we assume $H_u$ as the antenna in which user $u$ spends most of the time during weekday nights. This, according to our categorization of types of days of the week, corresponds to Monday to Thursday nights, from 8pm to 6am of the following day. This was based on the assumption that on any given day, users will be located at home during night time. Note that users for which the inferred home antenna is located in \textit{Gran Chaco} (the risk area) will be considered the set of \textit{residents of Gran Chaco}. \subsection{Detection of vulnerable users} Given the set of inhabitants of the risk area, we want to find those with a high communication with residents of the endemic zone. To do this, we get the list of calls for each user and then determine the set of neighbors in the social graph $\calG_C$. For each resident of the endemic zone, we tag all his neighbors as potentially vulnerable. We also tag the calls to (from) a certain antenna from (to) residents of the endemic area as \textit{vulnerable calls}. The next step is to aggregate this data for every antenna. Given an antenna $a$, we will have: \begin{itemize} \item The total number of residents $N_a$ (this is, the number of people for which that is their home antenna). \item The total number of residents which are vulnerable $V_a$. \end{itemize} We also aggregated the total volume of outgoing calls $C_a$ from every antenna and from these we extracted every call that had a user whose home is in the endemic area as a receiver $VC_a$ (\textit{vulnerable calls}). These four numbers are the indicators for each antenna in the studied country. % Reviewed up to here \subsection{Heatmap generation} With the collected data, we generated heatmaps to visualize the mentioned antenna indicators, overlapping these heatmaps with political maps of the region taken for study. A circle is generated for each cell, where: %Generamos un c\'irculo por cada celda donde: \begin{itemize} \item the \textbf{area} depends on the population living in the antenna $N_a$. %el \textbf{\'area} depende de la cantidad de usuarios (habitantes), \item the \textbf{color}, is related to the fraction ${V_a}/{N_a}$ of vulnerable users living there. %el \textbf{color} corresponde al porcentaje de usuarios vulnerables que viven en esa antena. \end{itemize} We used two filtering parameters to control which antennas are plotted. \begin{itemize} \item $\beta$: The antenna is plotted if its percentage of vulnerable users is higher than $\beta$. \item $min\_volume$: The antenna is plotted if its population is bigger than $min\_volume$. \end{itemize} These parameters were tuned differently for different regions. For example: an antenna whose vulnerable percentage would be considered low at the national level can be locally high when zooming in at a more regional level.