The design really is art, IMO. It could play a robot character in a movie!
If it chooses its behavior purely from the music data (frequencies, loudness, speaking voice, chorus, iterations, etc.) and from saving its own behavior to build up patterns of behavior, then it is very good.
I fear that it has extra data in addition to the music, because one video showed synchronized rollies.