Legged robots have consistently captured our collective imagination through various forms of media, from Hollywood films, anime, and viral Youtube videos of robots accomplishing incredible feats of acrobatics. These robots have the potential to navigate our environments, capable of completing tasks that would otherwise require human intervention. However, developing controls...
In recent years, model-free Deep Reinforcement Learning (RL) has become an increasingly popular alternative to more traditional model-based or optimization-based control methods in solving robotic legged locomotion. However, deploying RL in the real world can be a significant undertaking. Constructing reward functions which compel controllers to learn the desired behavior...
Learning latent space representations of high-dimensional world states has been at the core of recent rapid growth in reinforcement learning(RL). At the same time, RL algo- rithms have suffered from ignored uncertainties in the predicted estimates of model-free or model-based methods. In our work, we investigate both of these aspects...
Reinforcement learning has emerged as a popular tool for solving control tasks, with multiple works focusing on the complex and dynamic task of locomotion. However, the naive application of reinforcement learning to this problem often produces maladaptive policies that exploit the model or reward function. This results in behavior that...
Papers proposing novel machine learning algorithms tend to present the algorithm or technique in question in the best possible light. The standard practice is generally for authors to emphasize their proposed algorithms' performance in the precise setting where it is maximally impressive, often by only fully evaluating their best known...
We take for granted how quickly we, as humans, form mental models of the world around us. By the time we are toddlers, we have an observable intuition around the physical rules of the world. Stacking blocks such that they don’t fall over becomes such a trivial task, that it...
Dynamic bipedal locomotion is among the most difficult and yet relevant problems in modern robotics. While a multitude of classical control methods for bipedal locomotion exist, they are often brittle or limited in capability. In recent years, work in applying reinforcement learning to robotics has lead to superior performance across...
Anomaly detection has been used in variety of applications in practice, including cyber-security, fraud detection and detecting faults in safety critical systems, etc. Anomaly detectors produce a ranked list of statistical anomalies, which are typically examined by human analysts in order to extract the actual anomalies of interest. Unfortunately, most...
This thesis focuses on the problem of object tracking. Given a video, the general objective of tracking is to track the location over time of one or more targets in the image sequence. This is a very challenging task as algorithms need to deal with problems such as appearance variations,...
We model the popular board game of Clue as an MDP and evaluate Monte-Carlo policy rollout in a simulated environment pitting different agents and policies against each other. We describe the choices we made in the representation, along with some of the problems we encountered along the way. We find...