Use Reinforcement Learning To Generate Testing Commands For Onboard Software Of Small Satellites