site:www.cs.utexas.edu

www.cs.utexas.edu5d

Data-Efficient Policy Evaluation Through Behavior Policy Search

We consider the task of evaluating a policy for a Markov decision process (MDP).The standard unbiased technique for evaluating a policy is to deploy the policyand observe its performance. We show that ...

www.cs.utexas.edu10d

Artificial Intelligence and Life in 2030

Artificial Intelligence and Life in 2030. Peter Stone, Rodney Brooks, Erik Brynjolfsson, Ryan Calo, Oren Etzioni, Greg Hager, Julia Hirschberg, Shivaram ...

www.cs.utexas.edu11d

E. Allen Emerson

E. Allen Emerson has a longstanding interest in formal methods for establishing program correctness. This was inspired in part by reading in the mid-1970's a CACM paper by Tony Hoare "Proof of Program ...

www.cs.utexas.edu10d

TAMER: Training an Agent Manually via Evaluative Reinforcement

Though computers have surpassed humans at many tasks, especially computationally intensive ones, there are many tasks for which human expertise remains necessary and/or useful. For such tasks, it is ...

www.cs.utexas.edu10d

Overlapping Layered Learning

Patrick MacAlpine and Peter Stone.

www.cs.utexas.edu10d

Boosting for Regression Transfer

David Pardoe and Peter Stone.

www.cs.utexas.edu11d

David Harwath

My research interests are in the area of machine learning for speech, language, and sound processing. I am particularly interested in multimodality and unsupervised ...

www.cs.utexas.edu2d

Pandemic Resilience: Case studies of an AI-calibrated ensemble of models to inform decision making (2024)

This report from Global Partnership on Artificial Intelligence (GPAI)'s Pandemic Resilience project follows its 2023 report and is focused on practically implementing the concepts previously developed ...

www.cs.utexas.edu10d

Transfer Learning for Reinforcement Learning Domains: A Survey

Transfer Learning for Reinforcement Learning Domains: A Survey. Matthew E. Taylor and Peter Stone. Journal of Machine Learning Research, 10(1):1633–1685, 2009.

www.cs.utexas.edu10d

Multiagent Traffic Management: A Reservation-Based Intersection Control Mechanism

Multiagent Traffic Management: A Reservation-Based Intersection Control Mechanism. Kurt Dresner and Peter Stone. In The Third International Joint Conference on Autonomous Agents and Multiagent Systems ...

www.cs.utexas.edu10d

TEXPLORE: Real-Time Sample-Efficient Reinforcement Learning for Robots

TEXPLORE: Real-Time Sample-Efficient Reinforcement Learning for Robots. Todd Hester and Peter Stone. Machine Learning, 90(3):385–429, 2013.

Some results have been hidden because they may be inaccessible to you

Show inaccessible results