
Re-imagined the Power Automate Desktop recorder using GPT-V to combine UI automation with voice instructions, creating a more intuitive and accessible desktop flow creation experience
Back in September 2023, OpenAI announced a new multimodal AI model (GPT-V). In other words, the model can take inputs such as images, text, and sound, and generate images, text, or code as output. I collaborated closely with a cross-disciplinary team across Paris, Athens, and Seattle to explore how we could re-imagine UI automation and disrupt the RPA market.
The RPA AI Recorder represents a significant advancement in automation technology:
Introducing the AI-powered recorder for desktop automation