ICML 2025 Workshop, Vancouver, Canada - Friday, July 18th, 2025
what? A full-day workshop to advance research on improving the performance, safety, and deployment of computer use models in real-world settings.
when, where? At ICML 2025 in the Vancouver Convention Center on July 18, 2025
Follow us on social media or check our official website for updates on schedules, keynote speakers, and more.
We invite submissions that address challenges in building and deploying agents that perform tasks on computer systems.
📝 Paper Track: Up to 8 pages (excluding references & appendix)
🖥️ Demo Track: demo video and up to 2 pages
Submission Deadline: May 18, 2025
Paper Decision Notification: June 9, 2025
Camera-Ready Deadline: June 30, 2025
Submissions must be made via OpenReview and adhere to double-blind guidelines.
Please visit the "Call for Papers" page for more details.
If you would like to review for the workshop, please register here.
Computer use models are attracting significant interest in academia and industry due to their ability to perform complex tasks in non-deterministic environments. However, they are far from being ready for unattended deployment, as evidenced by their performance on the OSWorld benchmark where they achieve only a small fraction of human performance. The rapid evolution of these agents raises important questions regarding their accuracy, safe deployment, and potential impact on the future of work.
The topics we would like to cover are:
Learning Algorithms - which new architectures and learning techniques (e.g. memory mechanisms for extended tasks, exploration strategies) can enhance the intrinsic ability of computer use agents to acquire, represent, and refine knowledge?
Orchestration - what novel frameworks or control methods (e.g. dynamic task planning, modular coordination, multi-agent systems) can efficiently manage and integrate multiple learning components to optimize overall agent performance?
Interfaces - how should agents perceive and act within their environments (e.g., via APIs or UI interactions), and should we design unified systems or specialized agents for different modalities?
Guardrails, safety & societal implications - what guardrails do we need in order to make computer use models safe for deployment ``in the wild'' while ensuring that they have a positive impact on society?
Benchmarking & tools - how can we develop robust environments and evaluation metrics that capture the diversity of real-world settings? Do we need new tools or frameworks to make research on computer use more efficient and accessible?
Human-agent interaction - how will future interactions evolve? Should we optimize agents for full autonomy or design them as personalized, human-centric collaborators?
Broader applications - what are some practical applications for computer use agents across domains such as healthcare, scientific research, software engineering and testing etc.?
Capability horizon - what breakthroughs or engineering challenges are required to enable agents orders of magnitude more capable than today, and what implications would such advances have?
Thanks to generous industry sponsors, we’ve secured support for travel, registration, and accommodation for selected participants. More details coming soon!