Basically, instead of a controller or a joystick, you use the ESP32's WiFi and Bluetooth capabilities to control it through web pages, e.g. "10.xx.xx.xx/moveForward".
I also taped an ESP32S3 camera to it to act as "eyes", so you could be in another room and still control it!
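
Since every action is just a web page on the robot, you don't even need a browser to drive it; a few lines of script on a laptop would do. Here's a rough sketch of what that could look like in Python. Only "moveForward" comes from the setup above; the other command names, the `command_url` and `send_command` helpers, and the example address are made up for illustration, and it assumes the robot answers plain HTTP GET requests.

```python
from urllib.parse import quote
from urllib.request import urlopen

# Only "moveForward" is taken from the post; the rest are hypothetical
# names for endpoints the robot's firmware might expose.
COMMANDS = ("moveForward", "moveBackward", "turnLeft", "turnRight", "stop")


def command_url(host: str, command: str) -> str:
    """Build the URL for one robot command, e.g. http://<host>/moveForward."""
    if command not in COMMANDS:
        raise ValueError(f"unknown command: {command!r}")
    return f"http://{host}/{quote(command)}"


def send_command(host: str, command: str, timeout: float = 2.0) -> int:
    """Request the command's page with an HTTP GET and return the status code."""
    with urlopen(command_url(host, command), timeout=timeout) as response:
        return response.status


# Example use (10.0.0.42 is a placeholder, not the robot's real address):
#   send_command("10.0.0.42", "moveForward")
```

The short timeout is there so the script doesn't hang if the robot drops off the network, and rejecting unknown command names up front avoids sending typos to the robot.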