Optimizing a bubble sort implementation in C for an x86-64 architecture - EdgeBench