DeepSeek V4 Pro is a large-scale Mixture-of-Experts model from DeepSeek with 1.6T total parameters and 49B activated parameters, supporting a 1M-token context window. It is designed for advanced reasoning, coding,...

OpenRouter·4/24/2026·Deepseek Release Coding

DeepSeek: DeepSeek V4 Flash (deepseek/deepseek-v4-flash)

DeepSeek V4 Flash is an efficiency-optimized Mixture-of-Experts model from DeepSeek with 284B total parameters and 13B activated parameters, supporting a 1M-token context window. It is designed for fast inference and...

OpenRouter·4/24/2026·Deepseek Release Speed

Apr 21

China’s open-source bet

Silicon Valley AI companies follow a familiar playbook: Keep the secret sauce behind an API, and charge for every drop. China’s leading AI labs are playing a different game: They ship models as downloadable “open-weight” packages. This lets developers adapt the models and run them on their own hardware to build products without negotiating a…

MIT Technology Review AI·4/21/2026·Z-ai Hugging Face Open Source Benchmarks

LLMs+

When ChatGPT launched as an experimental prototype in late 2022, OpenAI’s chatbot became an everyday everything app for hundreds of millions of people. LLMs like ChatGPT were the new future: The entire tech industry was consumed by the inferno, with companies racing to spin up rival products. The ashes of the old tech world still…

MIT Technology Review AI·4/21/2026·Deepseek OpenAI Speed Image

[AINews] Moonshot Kimi K2.6: the world's leading Open Model refreshes to catch up to Opus 4.6 (ahead of DeepSeek v4?)

Yay Kimi!!!

Latent Space·4/21/2026·OpenRouter Anthropic Open Source Benchmarks

Apr 14

[AINews] Top Local Models List - April 2026

a quiet day lets us check in on the local models scene

Latent Space·4/14/2026·Meta-llama Qwen Open Source Benchmarks

Apr 13

Want to understand the current state of AI? Check out these charts.

If you’re following AI news, you’re probably getting whiplash. AI is a gold rush. AI is a bubble. AI is taking your job. AI can’t even read a clock. The 2026 AI Index from Stanford University’s Institute for Human-Centered Artificial Intelligence, AI’s annual report card, comes out today and cuts through some of that noise. …

MIT Technology Review AI·4/13/2026·Anthropic Baidu Open Source Benchmarks

Apr 9

Waiting for DeepSeek: new model to test China's AI ambitions

For weeks now, the global tech industry has been waiting for a major artificial intelligence launch from DeepSeek, seen as a benchmark for China's progress in the fast-moving field.

TechXplore·4/9/2026·Deepseek Benchmarks

Apr 6

Accelerate agentic tool calling with serverless model customization in Amazon SageMaker AI

In this post, we walk through how we fine-tuned Qwen 2.5 7B Instruct for tool calling using RLVR. We cover dataset preparation across three distinct agent behaviors, reward function design with tiered scoring, training configuration and results interpretation, evaluation on held-out data with unseen tools, and deployment.

AWS Machine Learning·4/6/2026·Meta-llama Qwen Open Source Coding

Import AI 452: Scaling laws for cyberwar; rising tides of AI automation; and a puzzle over GDP forecasting

How much could AI revolutionize the economy?

Import AI·4/6/2026·Anthropic Google Open Source Benchmarks

Apr 3

Marc Andreessen introspects on The Death of the Browser, Pi + OpenClaw, and Why "This Time Is Different"

The legend needs no intro... if you pardon our pun

Latent Space·4/3/2026·Deepseek Nvidia Open Source Hardware

[AINews] Gemma 4: The best small Multimodal Open Models, dramatically better than Gemma 3 in every way

A welcome update from Google!

Latent Space·4/3/2026·Qwen Google Open Source Benchmarks

Mar 28

[AINews] H100 prices are melting UP

a quiet day lets us report an important GPU trend

Latent Space·3/28/2026·Z-ai Ibm Open Source Benchmarks

Mar 19

Multiverse Computing pushes its compressed AI models into the mainstream

After compressing models from major AI labs including OpenAI, Meta, DeepSeek and Mistral AI, Multiverse Computing has launched both an app that showcases the capabilities of its compressed models and an API that makes them more widely available.

TechCrunch AI·3/19/2026·Meta-llama Deepseek Open Source

Mar 16

ImportAI 449: LLMs training other LLMs; 72B distributed training run; computer vision is harder than generative text

Will AI cause a political interregnum?

Import AI·3/16/2026·Anthropic Meta-llama Open Source Benchmarks

Mar 11

Introducing Fireworks AI on Microsoft Foundry: Bringing high performance, low latency open model inference to Azure

With Fireworks AI, we're extending the Microsoft Foundry platform with high performance inference for state-of-the-art open models on Azure.
