
Advancements

Alibaba next to outperform OpenAI’s o1?

Chinese tech giant Alibaba has released a new AI model, capable of advanced reasoning, that outperforms OpenAI’s o1-preview on some benchmarks

November 28, 2024

The Chinese tech firm Alibaba has released a new AI model, called QwQ-32B-Preview (or QWQ for short), which has advanced reasoning capabilities: it can ‘reason’ its way through tasks and answer complex, multi-step questions on difficult subjects such as logic, math, and coding. The release puts QWQ in direct competition with OpenAI’s recently released AI model, o1-preview.

According to benchmark results, QWQ scored higher than OpenAI’s o1-preview (and o1-mini) on the MATH benchmark, which assesses mathematical reasoning, achieving an accuracy rate of 90.6% compared with o1-preview’s 85.5%. It also scored higher on AIME, a benchmark built from competition-level math problems, scoring 50% vs o1-preview’s 44.6% (for added context, OpenAI’s GPT-4o scored only 9.3% on this test).

QWQ has limitations. It doesn’t accept prompts longer than 32,000 words (o1 accepts up to 96,000), it refuses to answer certain political questions (due to pressure from the Chinese government to build AI models that embody “core socialist values”), and it is prone to switching languages unexpectedly, getting stuck in loops, and underperforming on “common sense reasoning” tasks. Even so, it has been released as an open-source model, which means it can be downloaded (from the dev platform Hugging Face) and used for commercial purposes. However, only certain parts of the model have been released, so users can’t replicate the model or see its inner workings.




Copyright © AI Tool Report