The rotation matrices would indeed be the easiest, and almost only way to do it.
and most microcontrollers have at least 1 MHz, so speed doesn't have to be a serious problem.


Check out the throwing game here: The throwing game