Damn, I'm uncovered!

The tests do not have a random outcome. The randomness is indeed only in real trading, and is generated by shifting the bar start time and some of the signals by a small random amount. This causes trades sometimes to be triggered at a slightly different time, or sometimes not at all.