Coding Reasoning Maths

DeepSeek-V4 launch: New 1.6T parameter model challenges US AI supremacy in coding and math

The much-awaited update from DeepSeek comes more than a year after its R1 and V3 models went viral last year and broke all ...

Geeky Gadgets

OpenAI o3-Mini Review & Performance Tested : Coding, Math and Logical Reasoning

Whether it’s automating tedious coding tasks, solving complex logic puzzles, or even weighing in on ethical dilemmas, AI tools like OpenAI’s o3-Mini promise to make our lives easier. But let’s be ...

Hosted on MSN

GPT-5.5 scores 93/100 in ZDNET review on coding, reasoning

OpenAI’s GPT-5.5 achieved a 93/100 score in ZDNET’s 10-part evaluation, showing strong performance in coding, reasoning, and creative writing. The model excelled in tasks from algorithmic ...

VentureBeat

Microsoft’s GRIN-MoE AI model takes on coding and math, beating competitors in key benchmarks

Join our daily and weekly newsletters for the latest updates and exclusive content on industry-leading AI coverage. Learn More Microsoft has unveiled a groundbreaking artificial intelligence model, ...

Geeky Gadgets

New ChatGPT-o1-mini excels at STEM, especially math and coding

OpenAI has also today released its the ChatGPT-o1-mini AI large language model, designed to be a cost-effective alternative to the o1-preview while maintaining strong performance in reasoning tasks.

1mon

Nvidia's Nemotron-Cascade 2 wins math and coding gold medals with 3B active parameters — and its post-training recipe is now open-source

Nvidia's Nemotron-Cascade 2 is a 30B MoE model that activates only 3B parameters at inference time, yet achieved gold medal-level performance at the 2025 IMO, IOI, and ICPC World Finals. Nvidia has ...

scmp.com

Show inaccessible results

DeepSeek-V4 launch: New 1.6T parameter model challenges US AI supremacy in coding and math

OpenAI o3-Mini Review & Performance Tested : Coding, Math and Logical Reasoning

GPT-5.5 scores 93/100 in ZDNET review on coding, reasoning

Microsoft’s GRIN-MoE AI model takes on coding and math, beating competitors in key benchmarks

New ChatGPT-o1-mini excels at STEM, especially math and coding

Nvidia's Nemotron-Cascade 2 wins math and coding gold medals with 3B active parameters — and its post-training recipe is now open-source

Alibaba upgrades flagship Qwen3 model to outperform OpenAI, DeepSeek in maths, coding

Google revamps Gemini 2.5 Pro again, claiming superiority in coding and math

Imandra’s new AI coding assistant CodeLogician uses ‘reasoning’ to guarantee the accuracy of its code

OpenAI debuts new ‘reasoning’ models and coding agent as it seeks to stay at the front of the AI pack