Tutorial / Example of targetting GPU with llvm_instrinsic or __mlir_op etc? - Modular