Upcoming Conference Publication

by sross on March 22, 2010

A new paper from the project set to appear at AISTATS 2010:

S. Ross and J. A. Bagnell. Efficient Reductions for Imitation Learning. To appear in Proc. of the International Conference on Artificial Intelligence and Statistics (AISTATS), 2010. [pdf] [supplementary material]

Abstract: Imitation Learning, while applied successfully on many large real-world problems, is typically addressed as a standard supervised learning problem, where it is assumed the training and testing data are i.i.d..  This is not true in imitation learning as the learned policy influences the future test inputs (states) upon which it will be tested. We show that this leads to compounding errors and a regret bound that grows quadratically in the time horizon of the task. We propose two alternative algorithms for imitation learning where training occurs over several episodes of interaction. These two approaches share in common that the learner’s policy is slowly modified from executing the expert’s policy to the learned policy. We show that this leads to stronger performance guarantees and demonstrate the improved performance on two challenging problems: training a learner to play 1) a 3D racing game (Super Tux Kart) and 2) Mario Bros.; given input images from the games and corresponding actions taken by a human expert and near-optimal planner respectively.

{ Comments on this entry are closed }

Upcoming Conference Publications (cont.)

by Alex Grubb on February 17, 2010

There are a number of papers from the project set to appear at ICRA 2010 in May:

P. Henry, C. Vollmer, B. Ferris, and D. Fox. Learning to navigate through crowded environments. To appear in Proc. of the IEEE International Conference on Robotics & Automation (ICRA), 2010. [pdf]

Abstract: The goal of this research is to enable mobile robots to navigate through crowded environments such as indoor shopping malls, airports, or downtown side walks. The key research question addressed in this paper is how to learn planners that generate human-like motion behavior. Our approach uses inverse reinforcement learning (IRL) to learn human-like navigation behavior based on example paths. Since robots have only limited sensing, we extend existing IRL methods to the case of partially observable environments. We demonstrate the capabilities of our approach using a realistic crowd flow simulator in which we modeled multiple scenarios in crowded environments. We show that our planner learned to guide the robot along the flow of people when the environment is crowded, and along the shortest path if no people are around.

B. Tovar and S. M. LaValle. Searching and mapping among indistinguishable convex obstacles. To appear in Proc. of the IEEE International Conference on Robotics & Automation (ICRA), 2010. [pdf]

Abstract: This paper considers a robot that moves in the plane and is only able to sense the cyclic order of landmarks with respect to its current position. No metric information is available regarding the robot or landmark positions; moreover, the robot does not have a compass or odometers (e.g., coordinates). We carefully study the information space of the robot, and establish its capabilities in terms of mapping the environment and accomplishing tasks, such as navigation and patrolling. When the robot moves exclusively inside the convex hull of the set of landmarks, the information space can be nicely characterized as an order type, which provides information powerful enough to determine which points lie inside the convex hulls of subsets of landmarks. Additionally, if the robot is allowed to move outside the convex hull of the set of landmarks, the information space is described with a swap cell decomposition, which is an aspect graph in which each aspect is a cyclic permutation of landmarks. We show how to construct such decomposition through its dual, using two kinds of feedback motion commands based on the landmarks sensed.

J. Yu and S. M. LaValle. Probabilistic shadow information spaces. To appear in Proc. of the IEEE International Conference on Robotics & Automation (ICRA), 2010. [pdf]

Abstract: This paper introduces a Bayesian ?lter that is specifically designed for counting targets that move outside of the field of view while performing a sensor sweep. Information space concepts are used to dramatically reduce the filter complexity so that information is processed only when the shadow region (all points invisible to the sensors) changes combinatorially or targets pass in and out of view. Previous work assumed perfect observations; however, this paper extends the approach to enable probabilistic disturbances. Practical algorithms are introduced, implemented, and demonstrated for computing the filter outputs based on realistic data.

You can see more publications from the RIRIS project here.

{ Comments on this entry are closed }

Upcoming Conference Publications

February 17, 2010

Geoff Gordon and his group have papers appearing in the upcoming ICASSP 2010 and AAMAS 2010:
B. Boots, G. Gordon, and S. Siddiqi. Closing the Learning-Planning Loop with Predictive State Representations (extended abstract). To appear in 9th Intl. Conf. on Autonomous Agents and Multiagent Systems (AAMAS), 2010. [pdf]
J. Ramos, S. Siddiqi, A. Dubrawski, [...]

Read the full article →

Information Space Tutorial @ IROS 2009

October 7, 2009

Steven LaValle, one of the group’s members, is giving a tutorial on Information Spaces at the upcoming IROS 2009 workshop. More information on the tutorial can be found at http://msl.cs.uiuc.edu/~lavalle/iros09/.

There is also an accompanying paper at http://msl.cs.uiuc.edu/~lavalle/iros09/paper.pdf with a number of minimalist abstract sensor models including detection sensors, gap sensors, and relational sensors [...]

Read the full article →

Software Frameworks for Collaborative Robotics

September 23, 2009

“An Overview of MOOS-IvP and a Brief Users Guide to the IvP Helm Autonomy Software”
http://dspace.mit.edu/handle/1721.1/45569

“Extending a MOOS-IvP Autonomy System and Users Guide to the IvPBuild Toolbox”
http://dspace.mit.edu/handle/1721.1/46361

“MOOS-IvP Autonomy Tools Users Manual”
http://dspace.mit.edu/handle/1721.1/43708

“A Tour of MOOS-IvP Autonomy Software Modules”
http://dspace.mit.edu/handle/1721.1/44590

Read the full article →

Warning: getimagesize() [function.getimagesize]: URL file-access is disabled in the server configuration in /home/lairlab/public_html/ispace/wp-content/themes/thesis_151/lib/functions/post_images.php on line 63

Warning: getimagesize(http://lairlab.org/ispace/InfoMatrixSmall.jpg) [function.getimagesize]: failed to open stream: no suitable wrapper could be found in /home/lairlab/public_html/ispace/wp-content/themes/thesis_151/lib/functions/post_images.php on line 63

What is Information Space?

August 11, 2009

Any system with sensors finds itself living in an information space (called I-space for short), whether it wants to or not. What does this mean and why does this happen? Consider a complicated scenario such as a team of robots operating in the real world. What is the appropriate notion of state?

In computer science, state is usually [...]

Read the full article →