    t.c – the example’s source code
    run – the run script that sets up the environment, builds the example’s binary, and launches it with user-provided parameters
    t.simple.c – a simpler version of the example that does not support manual synchronization but is a bit easier to understand (see below)

The example can run the offload and AO DGEMM either one after another (if the first command-line argument is ‘0’)
or concurrently (if the first command-line argument is ‘1’).

Manual synchronization mode is enabled by passing ‘1’ as the second command-line argument. It mimics ORSL by placing mutexes around the offload and AO DGEMM calls. It also lets AO DGEMM fall back to the host if the lock is already held. The simpler version of the example (t.simple.c) does not support manual synchronization.
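
The locking pattern described above can be sketched roughly as follows. This is an illustration only, not the code from t.c: the names coprocessor_lock, run_offload, run_ao_dgemm and target_t are hypothetical, and the actual offload region and DGEMM call are replaced by placeholder comments. It assumes a POSIX mutex, where the offload path waits for the lock and the AO DGEMM path only tries it and computes on the host when the lock is busy.

```c
#include <assert.h>
#include <pthread.h>

/* All names below are hypothetical; they do not come from t.c. */
typedef enum { RAN_ON_COPROCESSOR, RAN_ON_HOST } target_t;

/* One lock guarding the coprocessor, shared by both kinds of work. */
pthread_mutex_t coprocessor_lock = PTHREAD_MUTEX_INITIALIZER;

/* Offload path: block until the coprocessor is free, then use it. */
target_t run_offload(void)
{
    int rc = pthread_mutex_lock(&coprocessor_lock);
    assert(rc == 0);

    /* ... the offloaded computation would run here (placeholder) ... */

    rc = pthread_mutex_unlock(&coprocessor_lock);
    assert(rc == 0);
    return RAN_ON_COPROCESSOR;
}

/* AO DGEMM path: try the lock without waiting; if it is already
 * held, fall back to computing on the host instead. */
target_t run_ao_dgemm(void)
{
    if (pthread_mutex_trylock(&coprocessor_lock) != 0) {
        /* ... DGEMM with automatic offload disabled (placeholder) ... */
        return RAN_ON_HOST;
    }

    /* ... DGEMM with automatic offload enabled (placeholder) ... */

    int rc = pthread_mutex_unlock(&coprocessor_lock);
    assert(rc == 0);
    return RAN_ON_COPROCESSOR;
}
```

In this sketch the fallback decision is made once, at the moment of the trylock call, so a DGEMM that starts on the host stays on the host even if the coprocessor becomes free shortly afterwards. Whether t.c behaves the same way is not stated here.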

# time -p ./run 0 0 4096

# time -p ./run 0 1 4096

# time -p ./run 1 1 4096
