Humans are capable of complex reasoning. When posed with a problem, they break it into smaller steps and work through each one iteratively, learning as they go until they reach the end goal. Until now, most AI models have not been capable of this kind of complex reasoning, which requires multi-step thinking and adaptive learning. Last month OpenAI released o1, which addresses these challenges by incorporating Chain-of-Thought and Reinforcement Learning to achieve near-human reasoning capabilities.