Sunday, May 13, 2007

Saturday, May 12, 2007

[AIMA JAVA Pre Final] Implemented PassiveADPAgent And PassiveTDAgent

The PassiveADPAgent converges much faster than the PassiveTDAgent, which is as it should be.
The alpha symbol was a bit confusing until I recognized it as the probability normalizing constant! (I had mistaken it for a learning constant like the ones in neural networks.)
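
One way to see why the ADP agent converges faster is that it learns a transition model from its observations and then solves for utilities with that model, while the TD agent only nudges utilities one observed transition at a time. The following is a minimal sketch of the model-learning step only, not the actual AIMA Java code: the class name `TransitionModelSketch`, its methods, and the use of plain strings for states are all assumptions made for illustration. Because a passive agent follows a fixed policy, the action is left out of the counts.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical sketch: estimates P(nextState | state) for a fixed policy
// by counting observed transitions and normalizing the counts.
public class TransitionModelSketch {
    // counts.get(state).get(nextState) = how often nextState followed state
    private final Map<String, Map<String, Integer>> counts = new HashMap<>();

    // Record one observed transition state -> nextState.
    public void observe(String state, String nextState) {
        counts.computeIfAbsent(state, k -> new HashMap<>())
              .merge(nextState, 1, Integer::sum);
    }

    // Estimated probability of reaching nextState from state;
    // returns 0.0 if state has never been observed.
    public double probability(String state, String nextState) {
        Map<String, Integer> successors = counts.get(state);
        if (successors == null) {
            return 0.0;
        }
        int total = 0;
        for (int c : successors.values()) {
            total += c;
        }
        return successors.getOrDefault(nextState, 0) / (double) total;
    }
}
```

Dividing each count by the total for its source state is what turns raw counts into probabilities that sum to one.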

Now onto Q Learning!

Wednesday, May 9, 2007

Friday, May 4, 2007

[AIMA JAVA Pre Final] Implemented Value Iteration

The code is now generic enough to handle any M-by-N environment with any number (>= 0) of blocked squares. The MDP is derived from the world structure. A further refinement might be a text file reader so that people could design new worlds; I'll wait and see whether there is enough demand for it.
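
For readers who want a feel for what value iteration over such a grid looks like, here is a rough, self-contained sketch. It is not the AIMA Java implementation: the class `GridValueIterationSketch`, the array-based world description, and the 0.8/0.1/0.1 movement noise (borrowed from the textbook's 4 by 3 world) are assumptions for illustration. Moves into a wall or a blocked square leave the agent where it is.

```java
// Hypothetical sketch of value iteration on an M-by-N grid with blocked squares.
public class GridValueIterationSketch {
    // Moves in clockwise order: up, right, down, left (row delta, column delta).
    private static final int[][] MOVES = {{-1, 0}, {0, 1}, {1, 0}, {0, -1}};

    // blocked[r][c]:  square cannot be entered
    // terminal[r][c]: utility of the square is just its reward
    // reward[r][c]:   reward received in the square
    // Iterates until the largest utility change is at most epsilon.
    public static double[][] solve(boolean[][] blocked, boolean[][] terminal,
                                   double[][] reward, double gamma, double epsilon) {
        int rows = blocked.length;
        int cols = blocked[0].length;
        double[][] u = new double[rows][cols];
        double delta;
        do {
            delta = 0.0;
            double[][] next = new double[rows][cols];
            for (int r = 0; r < rows; r++) {
                for (int c = 0; c < cols; c++) {
                    if (blocked[r][c]) {
                        continue; // blocked squares keep utility 0
                    }
                    if (terminal[r][c]) {
                        next[r][c] = reward[r][c];
                    } else {
                        double best = Double.NEGATIVE_INFINITY;
                        for (int a = 0; a < 4; a++) {
                            // Assumed noise: 0.8 intended direction,
                            // 0.1 for each perpendicular direction.
                            double q = 0.8 * utilityAfter(u, blocked, r, c, a)
                                     + 0.1 * utilityAfter(u, blocked, r, c, (a + 1) % 4)
                                     + 0.1 * utilityAfter(u, blocked, r, c, (a + 3) % 4);
                            best = Math.max(best, q);
                        }
                        // Bellman update: reward plus discounted best expected utility.
                        next[r][c] = reward[r][c] + gamma * best;
                    }
                    delta = Math.max(delta, Math.abs(next[r][c] - u[r][c]));
                }
            }
            u = next;
        } while (delta > epsilon);
        return u;
    }

    // Utility of the square reached by move a, or of the current square
    // if the move would leave the grid or enter a blocked square.
    private static double utilityAfter(double[][] u, boolean[][] blocked,
                                       int r, int c, int a) {
        int nr = r + MOVES[a][0];
        int nc = c + MOVES[a][1];
        boolean stays = nr < 0 || nr >= u.length || nc < 0 || nc >= u[0].length
                || blocked[nr][nc];
        return stays ? u[r][c] : u[nr][nc];
    }
}
```

Describing the world as three same-sized arrays keeps the sketch independent of grid size and of how many squares are blocked, which is roughly the kind of generality described above.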