Clayton Miller edited We_investigate_these.tex  almost 10 years ago

Commit id: 186605619d5b60850c0ca1fd3fa8b9048cabab76

deletions | additions      

       

We investigate these scenarios further by taking the most frequent pattern found for each set of parameter inputs and using a box plot to visualize the statistical distribution of the days contained in each pattern according to different windows within the daily profile. Figure \ref{fig:Wmod_boxplot} shows this analysis for the $W$ window count input. A strong difference exists between the variability and, thus, the detail captured by the selection of this input. A relatively high variance is present for $W$=2 during occupied hours between 06:00 and 12:00. Values of $W$ of 8 and 4 created profiles of more similar distribution with slightly higher variance for the latter. These results reinforce the observation that smaller window sizes create more tightly grouped patterns, albeit with many more variations.   Figure \ref{fig:Amod_boxplot} illustrates the same process with the most frequent patterns from the SAX alphabet size, $A$, experiments. The alphabet size decision modulates the magnitude resolution available to the algorithm. Thus it is intuitive that the pattern has less variance as the value of $A$ increases. This observation is also apparent in this analysis where an alphabet size of 4 produces tighter distributions than a value of 2, in this case primarily during occupied hours as well.  Based on these observations of the $A$ and $W$ parameters, we found that setting $A$=3 and $W$=4 resulted in the best balance between number of patterns generated and resolution of detail needed to adequately filter discords in a 24 hour period. While these findings are specific to our case studies, we hypothesize that similar settings will be useful when analyzing other building performance data due to the generally reoccurring daily patterns. These initial parameter setting may be used as a default when implementing the \emph{DayFilter} process and adjusted accordingly based on visualizations similar those developed in this section.