Authorea

Nicolas Saunier edited section_Introduction_The_use_of__.tex almost 9 years ago

Commit id: f272af1a1655f5992695142f64dbfd6c33c8bf29

deletions | additions

\section{Introduction} The use of video data for automatic traffic data collection and analysis has been on an upward trend as more powerful computational tools, detection and tracking technology become available. Not only have video sensors been able to emulate inductive loops to collect basic traffic variables such as counts and speed as in the commercial system Autoscope \cite{michalopoulos91autoscope}, but they can also provide more and more accurately higher-level information regarding road user behavior and interactions. Examples include pedestrian gait parameters \cite{saunier11stride-length-trr}, crowd dynamics~\cite{johansson08crowd} and surrogate safety analysis applied to motorized and non-motorized road users in various road facilities~\cite{St_Aubin_2013,Sakshaug_2010,Autey_2012}. Video sensors are relatively inexpensive and easy to install or already installed for example by transportation agencies for traffic monitoring: large datasets can therefore be collected for large scale or long term traffic analysis. This so-called ``big data'' phenomenon brings offers to better understand transportation systems, with its own challenges for data analysis~\cite{st-aubin15big-data}. Despite the undeniable progress of the video sensors and computer vision algorithms in their varied transportation applications, there persists a distinct lack of large comparisons of the performance of video sensors in varied conditions such as the complexity of the traffic scene, the characteristics of cameras~\cite{Wan_201} and its installation (height, angle), the environmental conditions (e.g.\ the weather)~\cite{Fu_2015}, etc. This is particularly hampered by the poor characterization of the datasets used for performance evaluation, the limited availability of benchmarks and public video datasets for transportation applications~\cite{saunier14dataset}. Tracking performance is often reported using ad hoc and incomplete metrics such as ``detection rates'' instead of standard and more suitable metrics such as CLEAR MOT~\cite{Bernardin_2008}. Finally, the computer vision algorithms are typically manually adjusted by trial and error using a small dataset covering few conditions affecting performance while performance evaluated on the same dataset is thus over-estimated: compared to other fields such as machine learning, it should be clear that the algorithms should be systematically optimized on a calibration dataset, while performance should be reported for a separate validation dataset~\cite{ettehadieh15systematic}. The improvement of computer vision is particularly important in surrogate safety analysis such as in \cite{St_Aubin_2013} where automated tracking of trajectories is used to evaluate While thesafety performance ofmerge zones on highway ramps, in \cite{Sakshaug_2010} where the integration of cyclists and motor vehicles determined that accident statistics were not sufficient and video detection was instead used, and in \cite{Autey_2012} where a video-based automated traffic conflict analysis was performed to evaluate a new design sensors for right-turn lanes. \cite{Moreno_2013} performed a study evaluating speed as a measure of safety performance where specialized equipment needed to be installed. The Ministère des Transports du Québec already close to full video coverage of more simple traffic variables has been more extensively studied, not all factors have been systematically analyzed and the highways in issues with parameter optimization and the city lack of Montreal. With proper computer vision technology, speed profiles could be developed separate calibration and evaluated at larger scale than was previously feasible. validation datasets abound. Besides, the relationship of tracking performance with performance for traffic parameters has never been investigated. While flow rates and speed counts can be assessed through various means, computer vision is one The objective of this paper is first to improve the most efficient performance of existing detection and tracking methods for video data in terms of measuring other roadway interactions. Many studies are seeking to analyze microscopic movements accuracy of tracking, but also different kinds of traffic data such as \cite{St_Aubin_2013} in roundabouts counts, speeds, gaps and \cite{Hill_2015} in relation to driver behavior. At this scale, accurately representing road user interactions. This is done through the exact characteristics optimization of tracking parameters using a genetic algorithm comparing the moving vehicle tracker output with manually annotated trajectories. The method is crucial applied to provide accurate analyses a set of traffic videos extracted from a large surrogate safety indicators such as gap time, time-to-collision(TTC) and user pairs. Additionally, the performance study of computer vision tools are not consistent depending on several roundabout merging zones~\cite{st-aubin15big-data}, covering factors such as the distance of road users to the camera, two types of cameras, the camera resolution and type two weather conditions. The second objective is to explore the transferability of camera \cite{Wan_2014} parameters for separate datasets with the same properties (consecutive video samples) and across different properties, by reporting how optimizing tracking for one condition impacts performance in terms of tracking and traffic parameters for the weather conditions \cite{Fu_2015}. other conditions. This paper is a follow up on \cite{ettehadieh15systematic} that investigates more factors and how tracking performance is related to the accuracy of traffic parameters. The objective of this This paper is to improve the performance of feature-based tracking in merging zones of roundabouts through parameterization using a genetic algorithm comparing tracked trajectories to sets of manually annotated ground truths. There is an emphasis on cross-validating sets of parameters on separate datasets with similar properties, as well organized as exploring follows: it provides in the performance on datasets with different properties. This paper will provide next section a brief overview of the current state of computer vision and calibration in traffic applications, then detail presents the detailed methodology including the ground truth inventory, measures of performance and calibration procedure, followed by a presentation and discussion of the results and summarized in the conclusion to conclude with a summary and recommendations for future research.big data follow up on \cite{ettehadieh15systematic} Despite the progress in the applications, there is a surprising lack of public data, benchmarks, etc as noted in \cite{saunier14dataset}