Monthly Archives: March 2012

Solving a Boundary Value Problem in MATLAB – An Example

Posted on March 27, 2012 by machinelearning1

In this post, I demonstrate a simple function to solve a BVP problem. Here the problem is an ODE. After solving the problem, we plot the result. Copy- paste , all the below code into an m-file in MATLAB and … Continue reading →

Posted in Linux, Machine Learning, MATLAB, programming, Software, Ubuntu | Tagged boundary, boundary condition, boundary value problem, bvp, code, example, linux, Matlab, ode, ordinary differential equation, programming, sample, simple, software, solve, windows | Leave a comment

Rewards VS. Values (in the concept of Reinforcement Learning)

Posted on March 20, 2012 by machinelearning1

In the concept of Reinforcement Learning, we usually deal with two pretty similar keywords, reward and value, that can be confusing for beginners. In this brief post, I will mention the differences and actual definitions of these terms. Rewards are … Continue reading →

Posted in Machine Learning, Reinforcement Learning, Robotics, Thoughts | Tagged artificial intelligence, control, definition, learning, Machine Learning, optimization, Reinforcement Learning, reward, reward function, rl, robot, theory, value, value function | Leave a comment

Elements

Posted on March 20, 2012 by machinelearning1

elements of Unsupervised Learning (Reinforcement Learning) 1-Agent (behaviour at a given time) 2-Environment 3-Policy 4-Value Function (specifies what is good in the long run) 5-Reward Function (Define Goal – what is good in an immediate sense) 6-Model of the environment … Continue reading →

Posted in Machine Learning, Reinforcement Learning | Leave a comment

General Categorization of Machine Learning methods

Posted on March 20, 2012 by machinelearning1

There are three main branches of learning methods: 1-Supervised learning – These methods require labeled data to learn from. For instance given a 10000 images of cats and dogs which are labeled correctly by a supervisor, a machine learning approach … Continue reading →

Posted in Machine Learning, Neural Networks, Reinforcement Learning, Robotics, Thoughts | Tagged action, anomaly detection, bayes, bayes classifier, classification, classifier, clustering, data, direct policy search, fuzzy, gaussian, gaussian process, k-means, label, labeled, Machine Learning, neural networks, observation, q-learning, radial basis network, Reinforcement Learning, reward, sarsa, state, supervised, supervised learning, td-learning, unsupervised, unsupervised learning | Leave a comment

Policy Function – definition

Posted on March 14, 2012 by machinelearning1

(An Introduction to Reinforcement Learning) – Part.1 Policy Function: defines action related to a state / maps any state to a related action. There are three kinds of Policies: 1- Stochastic Policy: (probability – uses a transition function that … Continue reading →

Posted in Machine Learning, Reinforcement Learning, Robotics | Tagged Machine Learning, Policy, Reinforcement Learning, Robotics | Leave a comment

	machinelearning1 on install tinycore linux version…
	machinelearning1 on Install TinyCore Linux on Virt…
	Tiago NET on Install TinyCore Linux on Virt…
	kerl on install tinycore linux version…
	machinelearning1 on Convert a grayscale image into…

M	T	W	T	F	S	S
			1	2	3	4
5	6	7	8	9	10	11
12	13	14	15	16	17	18
19	20	21	22	23	24	25
26	27	28	29	30	31

Monthly Archives: March 2012

Solving a Boundary Value Problem in MATLAB – An Example

Rewards VS. Values (in the concept of Reinforcement Learning)

Elements

General Categorization of Machine Learning methods

Policy Function – definition

Recent Posts

Recent Comments

Archives

Categories

Stay Tuned....

MyCalendar

Meta