Results of audio diversity test for user entered search queries. The y-axis represents how many generated audio sets (samples in a set are generated with the same conditioning text) were judged to have the diversity shown on the x-axis. Diversity ranges from one (all samples in the set are the same) to five (very high diversity within the set).