Programming Language Benchmarks

Quesma Releases OTelBench: Independent Benchmark Reveals Frontier LLMs Struggle with Real-World SRE Tasks

New benchmark shows top LLMs achieve only 29% pass rate on OpenTelemetry instrumentation, exposing the gap between ...

Qwen3-Max Thinking

Discover Qwen3-Max Thinking, Alibaba's advanced AI model with extended reasoning capabilities. Learn about its features, ...

26d

Nous Research's NousCoder-14B is an open-source coding model landing right in the Claude Code moment

B, an open-source AI coding model trained in four days on Nvidia B200 GPUs, publishing its full reinforcement-learning stack ...

The Official Microsoft Blog

Maia 200: The AI accelerator built for inference

Today, we’re proud to introduce Maia 200, a breakthrough inference accelerator engineered to dramatically improve the economics of AI token generation. Maia 200 is an AI inference powerhouse: an ...

Microsoft Extends Its Phi Models To Physical AI With Rho-Alpha

Physical AI marks a transition from robots as programmed tools to robots as adaptable collaborators. That transition will ...

Open Source Kimi K2.5 Resets the AI Pecking Order

Kimi K2.5 adds Agent Swarm with up to 100 parallel helpers and a 256k window, so teams solve complex work faster.

13d

VoidLink cloud malware shows clear signs of being AI-generated

The recently discovered cloud-focused VoidLink malware framework is believed to have been developed by a single person with the help of an artificial intelligence model.

Evolving Into An AI-Native Product Organization

AI lets product teams turn ideas into working prototypes in hours. When building is easier than it's ever been, the hard part ...

New York News on MSN

Producer Lavender Wang on innovation, pressure, and leading in sports broadcasting

We love live TV. In an era where content consumption shifts rapidly from mobile-first vertical screens to massive live ...

12d

Model Showcase 2: New Architectural Approaches in Language Models

This column focuses on open-weight models from China, Liquid Foundation Models, performant lean models, and a Titan from ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results