At the beginning of February I implemented neuralnetworks and reinforcement learning C++ libraries which I use in my iCub application. The most of the time I spent by implementation of the application which uses the CACLA algorithm to train iCub to reach objects. I’m also planning to create some UML package and class diagrams to [...]
On 22th of December I watched videos of Machine learning lectures focused on Reinforcement Learning. Lectuer: Associate Professor Andrew Ng, Stanford University Applications of Reinforcement Learning, Markov Decision Process (MDP), Defining Value & Policy Functions, Value Function, Optimal Value Function, Value Iteration, Policy Iteration (watch on YouTube) Generalization to Continuous States, Discretization & Curse of [...]
The goal of this post is to capture my progress in learning of object reaching on my iCub. Suppose iCub stands in the front of the object, which is placed on a table, so the robot can reach it without moving other joints than these on one arm. iCub has to set its arm joint [...]
BPTT, RL, iCub, YARP, inverse kinematics problem… what does it all mean? The aim of this article is to introduce you what these acronyms and abbreviations stands for. Note: This post is not complete yet and will be updated later. Back-propagation through time … Reinforcement learning Literature Peters J, Vijayakumar S, Schaal S (2003), RL [...]
About
Master thesis of Lukáš Zdechovan,
a master degree student of Applied Informatics at Comenius University. Thesis is lead by docent Igor Farkaš.Categories
- AI Theory (1)
- iCub (12)
- Information (1)
- Machine Learning (2)
- Study log (7)
- Uncategorized (1)
Calendar
April 2023 M T W T F S S « Mar 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 iCub on Twitter
Archives
- March 2012 (1)
- November 2011 (1)
- October 2011 (3)
- September 2011 (1)
- July 2011 (1)
- April 2011 (1)
- March 2011 (2)
- February 2011 (1)
- January 2011 (1)
- December 2010 (3)
- November 2010 (4)
- October 2010 (1)
Machine learning