Stay updated on LLM benchmarks and evaluations. MMLU, GPQA, coding benchmarks, and model comparisons. Daily updates.
These days, large language models handle increasingly complex tasks, writing intricate code and engaging in sophisticated reasoning. Yet when it comes to four-digit multiplication, a skill taught in elementary school, even state-of-the-art systems fail. Why?
In this post, we demonstrate how to optimize large language model (LLM) inference on Amazon SageMaker AI using BentoML's LLM-Optimizer to systematically identify the best serving configurations for your workload.
In this post, we demonstrate how to use Foundation Models (FMs) from Amazon Bedrock and the newly launched Amazon Bedrock AgentCore alongside W&B Weave to help build, evaluate, and monitor enterprise AI solutions. We cover the complete development lifecycle from tracking individual FM calls to monitoring complex agent workflows in production.
In this post, we share how dLocal worked closely with the AWS team to help shape the product roadmap, reinforce its role as an industry innovator, and set new benchmarks for operational excellence in the global fintech landscape.
I've been having an absurd amount of fun recently using LLMs for cooking. I started out using them for basic recipes, but as I've grown more confident in their culinary abilities I've leaned into them for more advanced tasks. Today I tried something new: having Claude vibe-code up a custom application to help with the timing for a complicated meal preparation. It worked really well!

A custom timing app for two recipes at once

We have family staying at the moment, which means cooking for four. We subscribe to a meal delivery service called Green Chef, mainly because it takes the thinking out of cooking three times a week: grab a bag from the fridge, follow the instructions, eat.

Each bag serves two portions, so cooking for four means preparing two bags at once. I have done this a few times now and it is always a mad flurry of pans and ingredients and timers and desperately trying to figure out what should happen when and how to get both recipes finished at the same time. It's fun but it's also chaotic and error-prone.

This time I decided to try something different, and potentially even more chaotic and error-prone: I outsourced the planning entirely to Claude.

I took this single photo of the two recipe cards side-by-side and fed it to Claude Opus 4.5 (in the Claude iPhone app) with this prompt:

> Extract both of these recipes in as much detail as possible

This is a moderately challenging vision task in that there is quite a lot of small text in the photo. I wasn't confident Opus could handle it. I hadn't read the recipe cards myself. The responsible thing to do here would be a thorough review or at least a spot-check - I chose to keep things chaotic and didn't do any more than quickly eyeball the result.

I asked what pots I'd need:

> Give me a full list of pots I would need if I was cooking both of them at once

Then I prompted it to build a custom application to help me with the cooking process itself:

> I am going to cook them both at the same time. Build me a no react, mobile, friendly, interactive, artifact that spells out the process with exact timing on when everything needs to happen have a start setting at the top, which starts a timer and persists when I hit start in localStorage in case the page reloads. The next steps should show prominently with countdowns to when they open. The full combined timeline should be shown slow with calculated times tor when each thing should happen

I copied the result out onto my own hosting (you can try it here) because I wasn't sure if localStorage would work inside the Claude app and I really didn't want it to forget my times! Then I clicked "start cooking"!

Here's the full Claude transcript.

There was just one notable catch: our dog, Cleo, knows exactly when her dinner time is, at 6pm sharp. I forgot to mention this to Claude, which had scheduled several key steps to collide with Cleo's meal. I got woofed at. I deserved it.

To my great surprise, it worked. I followed the recipe guide to the minute and served up both meals exactly 44 minutes after I started cooking.

The best way to learn the capabilities of LLMs is to throw tasks at them that may be beyond their abilities and see what happens. In this case I fully expected that something would get forgotten or a detail would be hallucinated and I'd end up scrambling to fix things halfway through the process. I was surprised and impressed that it worked so well.
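The core trick that prompt asks for - persisting the start time in localStorage so a page reload doesn't lose the clock - is simple enough to sketch. Here is a rough, illustrative outline in plain JavaScript (the step list and labels are made up for the example; this is not the code Claude generated):

```javascript
// Hypothetical combined timeline: minutes after "start cooking" for each step.
const STEPS = [
  { at: 0,  label: "Preheat oven, boil kettle" },
  { at: 5,  label: "Recipe A: roast veg in" },
  { at: 12, label: "Recipe B: start rice" },
  { at: 38, label: "Plate up both meals" },
];

const KEY = "cooking-start-time";

// Persist the start timestamp so a reload picks up where we left off.
function start() {
  if (!localStorage.getItem(KEY)) {
    localStorage.setItem(KEY, Date.now().toString());
  }
  setInterval(render, 1000);
}

// Compute a countdown for every step relative to the persisted start time.
function render() {
  const started = Number(localStorage.getItem(KEY));
  const elapsedMin = (Date.now() - started) / 60000;
  for (const step of STEPS) {
    const remaining = step.at - elapsedMin;
    console.log(
      remaining > 0
        ? `${step.label}: in ${Math.ceil(remaining)} min`
        : `${step.label}: now (or done)`
    );
  }
}

start();
```

The real artifact presumably renders those countdowns into the page rather than logging them, but the localStorage persistence is the part the prompt calls out explicitly.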
Some credit for the app idea should go to my fellow hackers at /dev/fort 2 in 2009, when we rented Knockbrex Castle in Dumfries, Scotland for a week and attempted to build a cooking timer application for complex meals.

Generating recipes from scratch

Most of my other cooking experiments with LLMs have been a whole lot simpler than this: I ask for a recipe, ask for some variations and then cook one of them and see what happens. This works remarkably well considering LLMs have no taste buds.

I've started to think of this as asking LLMs for the average recipe for a dish, based on all of the recipes they have hoovered up during their training. It turns out the mean version of every guacamole recipe on the internet is a decent guacamole!

Here's an example of a recipe I tried recently that worked out really well. I was helping Natalie run her ceramic stall at the farmers market and the stall next to us sold excellent dried beans. I've never used dried beans before, so I took a photo of their selection and asked Claude what I could do with them:

> Identify these beans

It took a guess at the beans, then I said:

> Get me excited about cooking with these! If I bought two varietiew what could I make

"Get me excited" switches Claude into a sort of hype-man mode, which is kind of entertaining:

> Oh, you're about to enter the wonderful world of bean cooking! Let me get you pumped about some killer two-bean combos: [...] Mixed bean salad with lemon, olive oil, fresh herbs, cherry tomatoes - light but satisfying [...]

I replied:

> OK Bean salad has me interested - these are dried beans. Give me some salad options I can make that would last a long time in the fridge

... and after some back and forth we arrived at the recipe in this transcript, which I cooked the following day (asking plenty of follow-up questions) and thoroughly enjoyed.

I've done this a bunch of times with a bunch of different recipes across both Claude and ChatGPT and honestly I've not had a notable miss yet. Being able to say "make it vegan" or "I don't have coriander, what can I use instead?" or just "make it tastier" is a really fun way to explore cooking. It's also fun to repeat "make it tastier" multiple times to see how absurd you can get.

I really want someone to turn this into a benchmark!

Cooking with LLMs is a lot of fun. There's an opportunity here for a really neat benchmark: take a bunch of leading models, prompt them for recipes, follow those recipes and taste-test the results!

The logistics of running this are definitely too much for me to handle myself. I have enough trouble cooking two meals at once; for a solid benchmark you'd ideally have several models serving meals up at the same time to a panel of tasters. If someone else wants to try this please let me know how it goes!

Tags: cooking, devfort, tools, ai, generative-ai, llms, anthropic, claude, vision-llms, vibe-coding
Introducing GPT-5.2-Codex

The latest in OpenAI's Codex family of models (not the same thing as their Codex CLI or Codex Cloud coding agent tools).

> GPT‑5.2-Codex is a version of GPT‑5.2 further optimized for agentic coding in Codex, including improvements on long-horizon work through context compaction, stronger performance on large code changes like refactors and migrations, improved performance in Windows environments, and significantly stronger cybersecurity capabilities.

As with some previous Codex models, this one is available via their Codex coding agents now and will be coming to the API "in the coming weeks". Unlike previous models, there's a new invite-only preview process that gives vetted cybersecurity professionals access to "more permissive models".

I've been very impressed recently with GPT-5.2's ability to tackle multi-hour agentic coding challenges. GPT-5.2-Codex scores 64% on the Terminal-Bench 2.0 benchmark, against 62.2% for GPT-5.2. I'm not sure how concrete that 1.8% improvement will be!

I didn't hack API access together this time (see previous attempts), instead opting to just ask Codex CLI to "Generate an SVG of a pelican riding a bicycle" while running the new model (effort medium). Here's the transcript in my new Codex CLI timeline viewer, and here's the pelican it drew:

Tags: ai, openai, generative-ai, llms, pelican-riding-a-bicycle, llm-release, codex-cli, gpt-codex
swift-justhtml

First there was Emil Stenström's JustHTML in Python, then my justjshtml in JavaScript, then Anil Madhavapeddy's html5rw in OCaml, and now Kyle Howells has built a vibespiled dependency-free HTML5 parser for Swift using the same coding agent tricks against the html5lib-tests test suite.

Kyle ran some benchmarks to compare the different implementations:

Rust (html5ever) total parse time: 303 ms
Swift total parse time: 1313 ms
JavaScript total parse time: 1035 ms
Python total parse time: 4189 ms

Tags: html5, ai, generative-ai, llms, ai-assisted-programming, vibe-coding, swift
It continues to be a busy December, if not quite as busy as last year. Today's big news is Gemini 3 Flash, the latest in Google's "Flash" line of faster and less expensive models.

Google are emphasizing the comparison between the new Flash and their previous generation's top model Gemini 2.5 Pro:

> Building on 3 Pro’s strong multimodal, coding and agentic features, 3 Flash offers powerful performance at less than a quarter the cost of 3 Pro, along with higher rate limits. The new 3 Flash model surpasses 2.5 Pro across many benchmarks while delivering faster speeds.

Gemini 3 Flash's characteristics are almost identical to Gemini 3 Pro: it accepts text, image, video, audio, and PDF inputs, outputs only text, handles up to 1,048,576 input tokens and up to 65,536 output tokens, and has the same knowledge cut-off date of January 2025 (also shared with the Gemini 2.5 series).

The benchmarks look good. The cost is appealing: 1/4 the price of Gemini 3 Pro for prompts ≤200k tokens and 1/8 the price of Gemini 3 Pro for prompts >200k, and it's nice not to have a price increase for the new Flash at larger token lengths.

It's a little more expensive than previous Flash models - Gemini 2.5 Flash was $0.30/million input tokens and $2.50/million on output; Gemini 3 Flash is $0.50/million and $3/million respectively. Google claim it may still end up cheaper though, due to more efficient output token usage:

> Gemini 3 Flash is able to modulate how much it thinks. It may think longer for more complex use cases, but it also uses 30% fewer tokens on average than 2.5 Pro.

Here's a more extensive price comparison on my llm-prices.com site.

Generating some SVGs of pelicans

I released llm-gemini 0.28 this morning with support for the new model. You can try it out like this:

llm install -U llm-gemini
llm keys set gemini # paste in key
llm -m gemini-3-flash-preview "Generate an SVG of a pelican riding a bicycle"

According to the developer docs the new model supports four different thinking level options: minimal, low, medium, and high. This is different from Gemini 3 Pro, which only supported low and high. You can run those like this:

llm -m gemini-3-flash-preview --thinking-level minimal "Generate an SVG of a pelican riding a bicycle"

Here are four pelicans, for thinking levels minimal, low, medium, and high:

I built the gallery component with Gemini 3 Flash

The gallery above uses a new Web Component which I built using Gemini 3 Flash to try out its coding abilities. The code on the page looks like this:

<image-gallery width="4">
  <img src="https://static.simonwillison.net/static/2025/gemini-3-flash-preview-thinking-level-minimal-pelican-svg.jpg"
       alt="A minimalist vector illustration of a stylized white bird with a long orange beak and a red cap riding a dark blue bicycle on a single grey ground line against a plain white background." />
  <img src="https://static.simonwillison.net/static/2025/gemini-3-flash-preview-thinking-level-low-pelican-svg.jpg"
       alt="Minimalist illustration: A stylized white bird with a large, wedge-shaped orange beak and a single black dot for an eye rides a red bicycle with black wheels and a yellow pedal against a solid light blue background." />
  <img src="https://static.simonwillison.net/static/2025/gemini-3-flash-preview-thinking-level-medium-pelican-svg.jpg"
       alt="A minimalist illustration of a stylized white bird with a large yellow beak riding a red road bicycle in a racing position on a light blue background."
  />
  <img src="https://static.simonwillison.net/static/2025/gemini-3-flash-preview-thinking-level-high-pelican-svg.jpg"
       alt="Minimalist line-art illustration of a stylized white bird with a large orange beak riding a simple black bicycle with one orange pedal, centered against a light blue circular background." />
</image-gallery>

Those alt attributes are all generated by Gemini 3 Flash as well, using this recipe:

llm -m gemini-3-flash-preview --system '
You write alt text for any image pasted in by the user.
Alt text is always presented in a fenced code block to make it easy to copy and paste out.
It is always presented on a single line so it can be used easily in Markdown images.
All text on the image (for screenshots etc) must be exactly included.
A short note describing the nature of the image itself should go first.' \
  -a https://static.simonwillison.net/static/2025/gemini-3-flash-preview-thinking-level-high-pelican-svg.jpg

You can see the code that powers the image gallery Web Component here on GitHub. I built it by prompting Gemini 3 Flash via LLM like this:

llm -m gemini-3-flash-preview '
Build a Web Component that implements a simple image gallery. Usage is like this:

<image-gallery width="5">
  <img src="image1.jpg" alt="Image 1">
  <img src="image2.jpg" alt="Image 2" data-thumb="image2-thumb.jpg">
  <img src="image3.jpg" alt="Image 3">
</image-gallery>

If an image has a data-thumb= attribute that one is used instead, other images are scaled down.
The image gallery always takes up 100% of available width. The width="5" attribute means that
five images will be shown next to each other in each row. The default is 3. There are gaps
between the images. When an image is clicked it opens a modal dialog with the full size image.

Return a complete HTML file with both the implementation of the Web Component several example
uses of it. Use https://picsum.photos/300/200 URLs for those example images.'

It took a few follow-up prompts using llm -c:

llm -c 'Use a real modal such that keyboard shortcuts and accessibility features work without extra JS'
llm -c 'Use X for the close icon and make it a bit more subtle'
llm -c 'remove the hover effect entirely'
llm -c 'I want no border on the close icon even when it is focused'

Here's the full transcript, exported using llm logs -cue.

Those five prompts took:

225 input, 3,269 output
2,243 input, 2,908 output
4,319 input, 2,516 output
6,376 input, 2,094 output
8,151 input, 1,806 output

Added together that's 21,314 input and 12,593 output tokens for a grand total of 4.8436 cents (the arithmetic is sketched out below).

The guide to migrating from Gemini 2.5 reveals one disappointment:

> Image segmentation: Image segmentation capabilities (returning pixel-level masks for objects) are not supported in Gemini 3 Pro or Gemini 3 Flash. For workloads requiring native image segmentation, we recommend continuing to utilize Gemini 2.5 Flash with thinking turned off or Gemini Robotics-ER 1.5.

I wrote about this capability in Gemini 2.5 back in April. I hope it comes back in future models - it's a really neat capability that is unique to Gemini.

Tags: google, ai, web-components, generative-ai, llms, llm, gemini, llm-pricing, pelican-riding-a-bicycle, llm-release
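To double-check that cost figure against the Gemini 3 Flash prices quoted earlier ($0.50/million input tokens, $3/million output tokens for prompts in the ≤200k tier), here's the arithmetic as a quick JavaScript sketch:

```javascript
// Token counts for the five follow-up prompts listed above.
const prompts = [
  { input: 225,  output: 3269 },
  { input: 2243, output: 2908 },
  { input: 4319, output: 2516 },
  { input: 6376, output: 2094 },
  { input: 8151, output: 1806 },
];

// Gemini 3 Flash pricing (≤200k tokens): $0.50 per million input, $3 per million output.
const INPUT_PER_M = 0.50;
const OUTPUT_PER_M = 3.00;

const totalInput = prompts.reduce((sum, p) => sum + p.input, 0);   // 21,314
const totalOutput = prompts.reduce((sum, p) => sum + p.output, 0); // 12,593

const dollars =
  (totalInput / 1_000_000) * INPUT_PER_M +
  (totalOutput / 1_000_000) * OUTPUT_PER_M;

console.log(`${totalInput} input + ${totalOutput} output = ${(dollars * 100).toFixed(4)} cents`);
// -> 21314 input + 12593 output = 4.8436 cents
```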
OpenAI introduces FrontierScience, a benchmark testing AI reasoning in physics, chemistry, and biology to measure progress toward real scientific research.
It’s a weird time to be an AI doomer. This small but influential community of researchers, scientists, and policy experts believes, in the simplest terms, that AI could get so good it could be bad—very, very bad—for humanity. Though many of these people would be more likely to describe themselves as advocates for AI safety…
When the generative AI boom took off in 2022, Rudi Miller and her law school classmates were suddenly gripped with anxiety. “Before graduating, there was discussion about what the job market would look like for us if AI became adopted,” she recalls.  So when it came time to choose a speciality, Miller—now a junior associate…
I recently came across JustHTML, a new Python library for parsing HTML released by Emil Stenström. It's a very interesting piece of software, both as a useful library and as a case study in sophisticated AI-assisted programming.

First impressions of JustHTML

I didn't initially know that JustHTML had been written with AI assistance at all. The README caught my eye due to some attractive characteristics:

- It's pure Python. I like libraries that are pure Python (no C extensions or similar) because it makes them easy to use in less conventional Python environments, including Pyodide.
- "Passes all 9,200+ tests in the official html5lib-tests suite (used by browser vendors)" - this instantly caught my attention! HTML5 is a big, complicated but meticulously written specification.
- 100% test coverage. That's not something you see every day.
- CSS selector queries as a feature. I built a Python library for this many years ago and I'm always interested in seeing new implementations of that pattern.
- html5lib has been inconsistently maintained over the last few years, leaving me interested in potential alternatives.
- It's only 3,000 lines of implementation code (and another ~11,000 of tests).

I was out and about without a laptop so I decided to put JustHTML through its paces on my phone. I prompted Claude Code for web on my phone and had it build this Pyodide-powered HTML tool for trying it out:

This was enough for me to convince myself that the core functionality worked as advertised. It's a neat piece of code!

Turns out it was almost all built by LLMs

At this point I went looking for some more background information on the library and found Emil's blog entry about it: How I wrote JustHTML using coding agents:

> Writing a full HTML5 parser is not a short one-shot problem. I have been working on this project for a couple of months on off-hours.
>
> Tooling: I used plain VS Code with Github Copilot in Agent mode. I enabled automatic approval of all commands, and then added a blacklist of commands that I always wanted to approve manually. I wrote an agent instruction that told it to keep working, and don't stop to ask questions. Worked well!

Emil used several different models - an advantage of working in VS Code Agent mode rather than a provider-locked coding agent like Claude Code or Codex CLI. Claude Sonnet 3.7, Gemini 3 Pro and Claude Opus all get a mention.

Vibe engineering, not vibe coding

What's most interesting about Emil's 17-step account covering those several months of work is how much software engineering was involved, independent of typing out the actual code.

I wrote about vibe engineering a while ago as an alternative to vibe coding. Vibe coding is when you have an LLM knock out code without any semblance of code review - great for prototypes and toy projects, definitely not an approach to use for serious libraries or production code. I proposed "vibe engineering" as the grown-up version of vibe coding, where expert programmers use coding agents in a professional and responsible way to produce high quality, reliable results.

You should absolutely read Emil's account in full. A few highlights:

- He hooked in the 9,200-test html5lib-tests conformance suite almost from the start. There's no better way to construct a new HTML5 parser than using the test suite that the browsers themselves use.
- He picked the core API design himself - a TagHandler base class with handle_start() etc. methods - and told the model to implement that.
- He added a comparative benchmark to track performance compared to existing libraries like html5lib, then experimented with a Rust optimization based on those initial numbers.
- He threw the original code away and started from scratch as a rough port of Servo's excellent html5ever Rust library.
- He built a custom profiler and new benchmark and let Gemini 3 Pro loose on it, finally achieving micro-optimizations to beat the existing pure Python libraries.
- He used coverage to identify and remove unnecessary code.
- He had his agent build a custom fuzzer to generate vast numbers of invalid HTML documents and harden the parser against them (a tiny sketch of that idea follows at the end of this post).

This represents a lot of sophisticated development practices, tapping into Emil's deep experience as a software engineer. As described, this feels to me more like a lead architect role than a hands-on coding one. It perfectly fits what I was thinking about when I described vibe engineering. Setting the coding agent up with the html5lib-tests suite is also a great example of designing an agentic loop.

"The agent did the typing"

Emil concluded his article like this:

> JustHTML is about 3,000 lines of Python with 8,500+ tests passing. I couldn't have written it this quickly without the agent. But "quickly" doesn't mean "without thinking." I spent a lot of time reviewing code, making design decisions, and steering the agent in the right direction. The agent did the typing; I did the thinking. That's probably the right division of labor.

I couldn't agree more. Coding agents replace the part of my job that involves typing the code into a computer. I find what's left to be a much more valuable use of my time.

Tags: html, python, ai, generative-ai, llms, ai-assisted-programming, vibe-coding, coding-agents
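That fuzzing step is worth a tiny illustration. Emil's fuzzer was presumably Python and tied to JustHTML's internals; the sketch below is just the general shape of the loop in JavaScript - generate deliberately broken HTML, feed it to the parser, and fail loudly if the parser ever throws. parseHtml is a hypothetical stand-in for whichever parser is being hardened:

```javascript
// Mutation fuzzer sketch: corrupt valid HTML snippets and check the parser never throws.
// parseHtml is a hypothetical stand-in for the parser under test.
function fuzzParser(parseHtml, iterations = 10_000) {
  const seeds = [
    "<p>Hello <b>world</b></p>",
    "<table><tr><td>cell</td></tr></table>",
    "<!DOCTYPE html><html><body><div id=x>text</div></body></html>",
  ];
  const mutations = [
    (s) => s.slice(0, Math.floor(Math.random() * s.length)),          // truncate mid-tag
    (s) => s.replace(/</g, () => (Math.random() < 0.2 ? "<<" : "<")), // double some brackets
    (s) => s.split("").reverse().join(""),                            // reverse everything
    (s) => s + "<" + s,                                               // dangling open bracket + repeat
  ];

  for (let i = 0; i < iterations; i++) {
    let doc = seeds[i % seeds.length];
    // Apply a random number of random mutations to the seed document.
    const rounds = 1 + Math.floor(Math.random() * 3);
    for (let r = 0; r < rounds; r++) {
      doc = mutations[Math.floor(Math.random() * mutations.length)](doc);
    }
    try {
      parseHtml(doc); // HTML5 parsers must accept any input without raising.
    } catch (err) {
      console.error("Parser crashed on input:", JSON.stringify(doc));
      throw err;
    }
  }
  console.log(`Survived ${iterations} malformed documents.`);
}
```

In a browser you could point the harness at the built-in parser, e.g. fuzzParser(html => new DOMParser().parseFromString(html, "text/html")), to sanity-check the loop itself; a conforming HTML5 parser should never raise on any input.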
GPT-5.2 is OpenAI’s strongest model yet for math and science, setting new state-of-the-art results on benchmarks like GPQA Diamond and FrontierMath. This post shows how those gains translate into real research progress, including solving an open theoretical problem and generating reliable mathematical proofs.
Systematically evaluating the factuality of large language models with the FACTS Benchmark Suite.
Learn how evals help businesses define, measure, and improve AI performance—reducing risk, boosting productivity, and driving strategic advantage.
At what point will AI change your daily life?
The future is biomechanical computation
OpenAI introduces IndQA, a new benchmark for evaluating AI systems in Indian languages. Built with domain experts, IndQA tests cultural understanding and reasoning across 12 languages and 10 knowledge areas.
gpt-oss-safeguard-120b and gpt-oss-safeguard-20b are two open-weight reasoning models post-trained from the gpt-oss models to reason from a provided policy in order to label content under that policy. In this report, we describe gpt-oss-safeguard’s capabilities and provide safety evaluations of the gpt-oss-safeguard models, using the underlying gpt-oss models as a baseline. For more information about the development and architecture of the underlying gpt-oss models, see the original gpt-oss model card.
This system card details GPT-5’s improvements in handling sensitive conversations, including new benchmarks for emotional reliance, mental health, and jailbreak resistance.