Institution
Udemy
Build agentic AI, Generative AI, and Reinforcement Learning projects end-to-end. Learn Q-Learning, DQN, A3C, PPO, SAC, transformers, LoRA/QLoRA, and LLM fine-tuning.
This 2025 edition of Artificial Intelligence A-Z blends agentic AI, generative AI, and reinforcement learning into a single practical track. You start with hands-on prompting basics, how to shape factual answers, summaries, extractions, classifications, conversations, code, and reasoning, and learn to evaluate responses with objective and subjective metrics and parameter tuning. From there you build a suite of reinforcement learning systems: classic Q Learning, Deep Q Learning, and a convolutional DQN for games like Pac Man and control tasks like a lunar lander. You then move to policy gradient families including A3C, PPO, and SAC for more robust decision making, including self driving simulations. On the generative side, you work with transformers and large language models, applying LoRA and QLoRA to fine tune efficiently and using knowledge augmentation to specialize an LLM, for example a medical chatbot. The course code is built in Google Colab to avoid setup pain, with downloadable .py and .ipynb templates so you can remix quickly. Along the way, intuition first explanations keep the math approachable while still giving you a correct mental model of how each algorithm learns. Extras include DDPG, world models, and evolution strategies and genetic algorithms, plus a bonus mini course on generative AI and cloud. By the end you have multiple working AIs, agentic LLM apps and RL agents, plus a toolkit for evaluating prompts, tuning models, and turning ideas into runnable systems.
Pricing and availability can change. Always check the provider page.
Build practical AIs—agents, LLM apps, and reinforcement learners—by coding them end-to-end.— Course team on Udemy
Udemy
30-day money-back guarantee on Udemy. Course also available via Udemy Personal Plan subscription.
Last updated:
We only gather anonymous usage signals to keep Sup AI fast and helpful. Your choice stays in your browser for 30 days, then we will ask again.
What we collect: Anonymous interaction patterns (for example clicks or scroll depth) to improve the experience.
What we do not collect: Personal data, off-site browsing, or anything that can identify you.
How long we store it: Your decision lives in local storage for 30 days. After that, we will ask again.