The company finally unveiled the new system in September, revealing it as OpenAI’s first “reasoning” model and renaming it “o1.” Much like the two-stage release of GPT-2, where a stripped ...
The starting point of the project was Qwen2.5-32B-Instruct, an open-source LLM released by Alibaba Group Holding Ltd. last year. The researchers created s1-32B by customizing Qwen2.5-32B-Instruct ...