
Imagine asking AI to plan your trip itinerary, book and pay for all of your flights, and arrange your airport transportation, all within a single click. Luckily, an international research team is making this vision a reality.
The team, composed of researchers from the University of Waterloo, the University of Hong Kong, Salesforce Research and Carnegie Mellon University, developed Computer Agent Arena, an evaluation platform for building and improving computer agents.
A computer agent is a type of software that can perform tasks on behalf of a person or organization without needing constant human intervention. It can interpret the state of the computer and act autonomously to help users solve problems. Examples of computer agents include voice assistants like Siri and Alexa, which can help users send messages and schedule meetings.
AI-based computer agents struggle with complex computer tasks because these tasks require controlling multiple applications and completing numerous steps. For example, filing an expense report may be difficult because it requires updating a spreadsheet by searching multiple emails and folders full of bank statements and receipts.
Computer Agent Arena is the first interactive computer-use evaluation platform that focuses on performing diverse tasks across multiple applications. This work is an extension of the researchers’ work on OSWorld, the world’s first scalable and real computer environment for multimodal agents.
“Computer Agent Arena provides a platform for the research community to develop effective and efficient agents that generalize to real-world computer usage,” says co-developer Dr. Victor Zhong, assistant professor at the Cheriton School of Computer Science. Like other Waterloo researchers, he’s investigating human-technology interactions, exploring how to mitigate everyday problems by developing novel technologies.
“Computer Agent Arena is distinct from similar research like Mind2Web and WebArena because it provides unified application programming interfaces for comprehensive observations and actions in an executable environment with multiple applications.”
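To make the idea of a unified observation-and-action interface concrete, here is a minimal Python sketch of an agent loop. It is not Computer Agent Arena's actual API: the names `Observation`, `Action`, `Environment`, `ToyEnvironment` and `run_agent`, along with all of their fields, are hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass
from typing import Protocol


@dataclass
class Observation:
    """What the agent sees at each step (hypothetical fields)."""
    description: str
    done: bool = False


@dataclass
class Action:
    """One step the agent takes (hypothetical fields)."""
    kind: str    # e.g. "click" or "type"
    target: str  # e.g. the name of a button or text field


class Environment(Protocol):
    """Assumed shape of a unified interface: one way to observe, one way to act."""
    def observe(self) -> Observation: ...
    def act(self, action: Action) -> None: ...


class ToyEnvironment:
    """Stand-in environment that reports 'done' after a fixed number of actions."""

    def __init__(self, steps_needed: int) -> None:
        self.steps_needed = steps_needed
        self.history: list[Action] = []

    def observe(self) -> Observation:
        remaining = self.steps_needed - len(self.history)
        return Observation(f"{remaining} steps remaining", done=remaining <= 0)

    def act(self, action: Action) -> None:
        self.history.append(action)


def run_agent(env: Environment, max_steps: int = 10) -> int:
    """Observe, pick an action, act; stop when done or when the step budget runs out."""
    steps = 0
    while steps < max_steps:
        observation = env.observe()
        if observation.done:
            break
        # A real agent would ask a model to choose the action; this placeholder always clicks.
        env.act(Action(kind="click", target="next"))
        steps += 1
    return steps


print(run_agent(ToyEnvironment(steps_needed=3)))
```

Because the loop depends only on `observe` and `act`, the same agent code could in principle run against any application that exposes those two calls, which is the benefit a unified interface is meant to provide.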
Through Computer Agent Arena, users can assess and compare various computer agents based on large language models (LLMs) and vision language models. First, users select an operating system such as Windows, and applications like Google Chrome and Excel. Users can then prompt the computer agent with a task, which will be performed simultaneously by two AI models in real time. After completion, users can rate each model’s performance and provide feedback.
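The workflow described above (two models attempt the same task, then the user rates each) could be recorded and summarized roughly as follows. This is an illustrative sketch, not the platform's actual data model or scoring method: the `Session` record, the 1-to-5 rating scale, the model names and the sample data are all assumptions.

```python
from collections import defaultdict
from dataclasses import dataclass
from statistics import mean


@dataclass
class Session:
    """One head-to-head session: two models attempt the same task (hypothetical record)."""
    task: str
    operating_system: str
    applications: tuple[str, ...]
    ratings: dict[str, int]  # model name -> user rating, assumed 1 (poor) to 5 (good)


def average_ratings(sessions):
    """Average each model's user ratings across every session it appeared in."""
    collected = defaultdict(list)
    for session in sessions:
        for model, rating in session.ratings.items():
            collected[model].append(rating)
    return {model: mean(values) for model, values in collected.items()}


def head_to_head_wins(sessions):
    """Count sessions in which a model was rated strictly higher than its opponent."""
    wins = defaultdict(int)
    for session in sessions:
        (model_a, rating_a), (model_b, rating_b) = session.ratings.items()
        if rating_a > rating_b:
            wins[model_a] += 1
        elif rating_b > rating_a:
            wins[model_b] += 1
    return dict(wins)


# Made-up sample data for illustration only.
sessions = [
    Session("File an expense report", "Windows", ("Google Chrome", "Excel"),
            {"model_x": 4, "model_y": 2}),
    Session("Schedule a meeting", "Windows", ("Google Chrome",),
            {"model_x": 3, "model_y": 5}),
    Session("Update a spreadsheet", "Windows", ("Excel",),
            {"model_x": 5, "model_y": 3}),
]

print(average_ratings(sessions))
print(head_to_head_wins(sessions))
```

Keeping both an average rating and a head-to-head tally is a design choice for this sketch: the average shows how well a model does overall, while the tally shows how often it beats the model it was paired with on the same task.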
Ultimately, the team seeks to provide a diverse and dynamic platform for building and evaluating agents that can perform real-world computer tasks as safely, effectively and efficiently as humans do.
“Our current findings show that foundation models such as GPT4 and Claude are far from being able to act safely and effectively as assistant computer agents,” Zhong says. “Computer Agent Arena provides a timely testbed to develop the next generation of AI agents.”
University of Waterloo
Citation:
New platform helps evaluate AI for complex computer use (2025, February 20)
retrieved 22 February 2025
from https://techxplore.com/information/2025-02-platform-ai-complex.html