Engineering AI Agents
Proximal Policy Optimization