What you'll learn
- What “reasoning models” really are (and aren’t)
- Compare reasoning styles across models
- The PRM800K dataset that powered this breakthrough
- Prompt and context engineering built for reasoning models
- The importance of the Generator–Verifier Gap (GVG)
- How Reinforcement Learning + Process Reward Models shape reasoning
- Dive deep into AI research papers
- Why reasoning models may be lying to you...
Why Learn About Reasoning Models?
Reasoning models are one of the biggest breakthroughs in AI because they let a model think through a problem using a "scratchpad" of sorts, introducing something more akin to System 2 thinking.
Reasoning models feel like magic…until they don’t. This course helps you understand what’s happening under the hood, so you know how to use them, why they work, and where this technology is going.
You’ll start with the basics, then quickly move into real model behavior. You’ll practice your skills and explore reasoning models through real, hands-on exercises.
Then you’ll connect the dots to how these systems are trained: reinforcement learning, RLHF, process reward models, the PRM800K dataset, and the impact of the latest scaling law on the future of AI: test-time compute.
You’ll also explore the tricky (but maybe more interesting) part: when reasoning models lie to you and the secrets they keep.
Wait...What's a Byte?
Bytes are shorter courses that allow you to upgrade your skills and knowledge in a single day!
Learning is hard. And sometimes you just need a quick learning fix, right? To learn something awesome, interesting, and relevant to your career goals.
That's why we've created Bytes.
What's The Bottom Line?
This course is not about superficial fluff or AI hype.
Instead, this course will take you from a complete beginner to understanding Reasoning Models on a deeper level than 99% of people out there and being able to start working with AI. 🚀
And... you have nothing to lose.
You can start learning right now and if this course isn't everything you expected, we'll refund you 100% within 30 days. No hassles and no questions asked.
I would recommend Andrei's ZTM courses to anyone who wants to learn web dev inside and out. His Junior to Senior course even helped me land my job at Tesla. They asked a lot of security questions in my interview and I answered them thanks to this course.
Who You Will Learn With
You're getting more than just a course
Our instructors, TAs, mentors, alumni, and fellow students go above and beyond to help guide you and ensure you're on the right path to achieve your goals. Our private ZTM Discord server is a key factor in taking your skills, confidence, and career to the next level.