Clustering performance per simulation. On the top panel, the number of Additional Classes was calculated for each simulation and algorithm. Computing a paired sign test for this metric, Spikes_link_WC obtained p<4e-3 against the other algorithms. On the bottom panel, the Adjusted Rand Index measures the similarity between the ground truth and the spike sorting output (i.e. the class label assigned to each spike). A paired sign test was used to evaluate the differences between Spikes_link_WC and the other methods, leading to p<7e-5.