Why is my assembler subroutine for Shellsort not sorting the array when integrated with a C program - EdgeBench