Programming Language Benchmarks

18h

OpenAI’s GPT-5.3-Codex thinks deeper and wider about coding work

The company says its latest model’s agentic skills also apply to a broader set of knowledge work such as presentations and ...

OpenAI introduces Frontier agent management platform and new GPT-5.3-Codex model

OpenAI Group PBC today introduced a platform called Frontier that companies can use to build and manage artificial ...

Qwen3-Coder-Next offers vibe coders a powerful open source, ultra-sparse model with 10x higher throughput for repo tasks

On SWE-Bench Verified, the model achieved a score of 70.6%. This performance is notably competitive when placed alongside ...

GPT-5.3-Codex: OpenAI introduces new coding model

Codex, a new coding model that, according to the development team, was significantly involved in its own development.

Bito's AI Architect Achieves Highest Success Rate of 60.8% on SWE-Bench Pro

Bito, the company building deep context graphs for coding agents, today announced evaluation results for its AI Architect context engine. A Claude Sonnet 4.5 agent augmented with Bito's AI Architect ...

Le Lézard

Alibaba Launches A Large Model Trained Inside a Coding Platform

SINGAPORE, SG / ACCESS Newswire / February 3, 2026 / Alibaba today announced the release of Qwen- Coder-Qoder, a large ...

Evolving Into An AI-Native Product Organization

AI lets product teams turn ideas into working prototypes in hours. When building is easier than it's ever been, the hard part ...

11h

VIDEO: Think Small to Win Big - How Helikai Is Proving That Micro AI Agents Beat the Billion-Dollar Brute-Force Approach

Every CEO in the Fortune 500 has issued some version of the same mandate: We need an AI strategy. Most of them have also ...

Le Lézard

Honda and Mythic Announce Joint Development of 100x Energy-Efficient Analog AI Chip for Next-Generation Vehicles

Honda Motor Co., Ltd. and Mythic announce a joint development agreement in which Honda R&D Co. Ltd., the R&D subsidiary of Honda, will license Mythic's Analog Processing Unit (APU) technology and the ...

India Today on MSN

After SaaS scare, Anthropic launches new Claude AI with agent teams that build C compilers on their own

Days after putting SaaS companies on alert with Claude Cowork, Anthropic has now revealed that its Claude Opus 4.6 model can ...

Briefly on MSN

“It’s nothing, bro”: Man reveals senior Java developer payslip, SA reacts to tech industry earnings

A man in SA shared a Senior Java Developer’s payslip, revealing salaries, deductions, experience, and sparking discussions on ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results