Automatically Extracting GPU Instruction-Level Parallelism