Join the conversation on AI models, pricing, and tools. Price Per Token Community

|Follow:

Z-ai News

Latest Z-ai AI news and updates. Model releases, announcements, benchmarks, and developments. Updated daily.

All categories

Get our weekly newsletter on pricing changes, new releases, and tools.

Join the Price Per Token Community

News Feed

Apr 21

China’s open-source bet

Silicon Valley AI companies follow a familiar playbook: Keep the secret sauce behind an API, and charge for every drop. China’s leading AI labs are playing a different game: They ship models as downloadable “open-weight” packages. This lets developers adapt the models and run them on their own hardware to build products without negotiating a…

MIT Technology Review AI·4/21/2026·Z-ai Hugging Face Open Source Benchmarks

Apr 10

[AINews] AI Engineer Europe 2026

Two quiet days in a row let us reflect on the first AIE in London.

Latent Space·4/10/2026·Z-ai Anthropic Open Source Benchmarks

New GPT $100 plan and Anthropic's managed agents

Price Per Token·4/10/2026·Anthropic OpenRouter Benchmarks Release

Apr 7

Z.ai: GLM 5.1 (z-ai/glm-5.1)

GLM-5.1 delivers a major leap in coding capability, with particularly significant gains in handling long-horizon tasks. Unlike previous models built around minute-level interactions, GLM-5.1 can work independently and continuously on...

OpenRouter·4/7/2026·Z-ai Release Coding

Apr 1

Z.ai: GLM 5V Turbo (z-ai/glm-5v-turbo)

GLM-5V-Turbo is Z.ai’s first native multimodal agent foundation model, built for vision-based coding and agent-driven tasks. It natively handles image, video, and text inputs, excels at long-horizon planning, complex coding, and task execution, and works seamlessly with agents to complete the full loop of “perceive → plan → execute“.

OpenRouter·4/1/2026·Z-ai Release Coding

Mar 28

[AINews] H100 prices are melting UP

a quiet day lets us report an important GPU trend

Latent Space·3/28/2026·Z-ai Ibm Open Source Benchmarks

Mar 19

[AINews] MiniMax 2.7: GLM-5 at 1/3 cost SOTA Open Model

congrats MiniMax!!

Latent Space·3/19/2026·Z-ai Xiaomi Open Source Benchmarks

Feb 24

[AINews] Anthropic accuses DeepSeek, Moonshot, and MiniMax of >16 million "industrial-scale distillation attacks"

the US-China cold war takes a big step up

Latent Space·2/24/2026·Z-ai Anthropic Open Source Benchmarks

Feb 17

[AINews] Qwen3.5-397B-A17B: the smallest Open-Opus class, very efficient model

Congrats Qwen team!

Latent Space·2/17/2026·Z-ai Deepseek Open Source Benchmarks

Feb 12

What’s next for Chinese open-source AI

The past year has marked a turning point for Chinese AI. Since DeepSeek released its R1 reasoning model in January 2025, Chinese companies have repeatedly delivered AI models that match the performance of leading Western models at a fraction of the cost. Just last week the Chinese firm Moonshot AI released its latest open-weight model,…

MIT Technology Review AI·2/12/2026·Z-ai Minimax Open Source Benchmarks

[AINews] Z.ai GLM-5: New SOTA Open Weights LLM

We have Opus 4.5 at home

Latent Space·2/12/2026·Z-ai OpenRouter Open Source Benchmarks

Feb 11

Z.ai: GLM 5 (z-ai/glm-5)

GLM-5 is Z.ai’s flagship open-source foundation model engineered for complex systems design and long-horizon agent workflows. Built for expert developers, it delivers production-grade performance on large-scale programming tasks, rivaling leading closed-source models. With advanced agentic planning, deep backend reasoning, and iterative self-correction, GLM-5 moves beyond code generation to full-system construction and autonomous execution.

OpenRouter·2/11/2026·Z-ai Open Source Release

Jan 19

Z.AI: GLM 4.7 Flash (z-ai/glm-4.7-flash)

As a 30B-class SOTA model, GLM-4.7-Flash offers a new option that balances performance and efficiency. It is further optimized for agentic coding use cases, strengthening coding capabilities, long-horizon task planning, and tool collaboration, and has achieved leading performance among open-source models of the same size on several current public benchmark leaderboards.

OpenRouter·1/19/2026·Z-ai Open Source Benchmarks

Jan 8

Startups go public in litmus test for Chinese AI

Leading Chinese artificial intelligence startup Zhipu AI soared as it went public in Hong Kong on Thursday, a day before rival MiniMax also makes its market debut in a litmus test for the country's rapidly developing sector.

TechXplore·1/8/2026·Z-ai Minimax Funding

Dec 22

Z.AI: GLM 4.7 (z-ai/glm-4.7)

GLM-4.7 is Z.AI’s latest flagship model, featuring upgrades in two key areas: enhanced programming capabilities and more stable multi-step reasoning/execution. It demonstrates significant improvements in executing complex agent tasks while delivering more natural conversational experiences and superior front-end aesthetics.

OpenRouter·12/22/2025·Z-ai Release Coding

Dec 8

Z.AI: GLM 4.6V (z-ai/glm-4.6v)

GLM-4.6V is a large multimodal model designed for high-fidelity visual understanding and long-context reasoning across images, documents, and mixed media. It supports up to 128K tokens, processes complex page layouts and charts directly as visual inputs, and integrates native multimodal function calling to connect perception with downstream tool execution. The model also enables interleaved image-text generation and UI reconstruction workflows, including screenshot-to-HTML synthesis and iterative visual editing.

OpenRouter·12/8/2025·Z-ai Release Coding

Sep 30

Z.AI: GLM 4.6 (z-ai/glm-4.6)

Compared with GLM-4.5, this generation brings several key improvements: Longer context window: The context window has been expanded from 128K to 200K tokens, enabling the model to handle more complex agentic tasks. Superior coding performance: The model achieves higher scores on code benchmarks and demonstrates better real-world performance in applications such as Claude Code、Cline、Roo Code and Kilo Code, including improvements in generating visually polished front-end pages. Advanced reasoning: GLM-4.6 shows a clear improvement in reasoning performance and supports tool use during inference, leading to stronger overall capability. More capable agents: GLM-4.6 exhibits stronger performance in tool using and search-based agents, and integrates more effectively within agent frameworks. Refined writing: Better aligns with human preferences in style and readability, and performs more naturally in role-playing scenarios.

OpenRouter·9/30/2025·Z-ai Anthropic Benchmarks Release

Aug 11

Z.AI: GLM 4.5V (z-ai/glm-4.5v)

GLM-4.5V is a vision-language foundation model for multimodal agent applications. Built on a Mixture-of-Experts (MoE) architecture with 106B parameters and 12B activated parameters, it achieves state-of-the-art results in video understanding, image Q&A, OCR, and document parsing, with strong gains in front-end web coding, grounding, and spatial reasoning. It offers a hybrid inference mode: a "thinking mode" for deep reasoning and a "non-thinking mode" for fast responses. Reasoning behavior can be toggled via the `reasoning` `enabled` boolean. Learn more in our docs

OpenRouter·8/11/2025·Z-ai Open Source Release

Jul 25

Z.AI: GLM 4.5 (z-ai/glm-4.5)

GLM-4.5 is our latest flagship foundation model, purpose-built for agent-based applications. It leverages a Mixture-of-Experts (MoE) architecture and supports a context length of up to 128k tokens. GLM-4.5 delivers significantly enhanced capabilities in reasoning, code generation, and agent alignment. It supports a hybrid inference mode with two options, a "thinking mode" designed for complex reasoning and tool use, and a "non-thinking mode" optimized for instant responses. Users can control the reasoning behaviour with the `reasoning` `enabled` boolean. Learn more in our docs

OpenRouter·7/25/2025·Z-ai Release Coding

Z.AI: GLM 4.5 Air (free) (z-ai/glm-4.5-air)

GLM-4.5-Air is the lightweight variant of our latest flagship model family, also purpose-built for agent-centric applications. Like GLM-4.5, it adopts the Mixture-of-Experts (MoE) architecture but with a more compact parameter size. GLM-4.5-Air also supports hybrid inference modes, offering a "thinking mode" for advanced reasoning and tool use, and a "non-thinking mode" for real-time interaction. Users can control the reasoning behaviour with the `reasoning` `enabled` boolean. Learn more in our docs

Z-ai News

News Feed

China’s open-source bet

[AINews] AI Engineer Europe 2026

New GPT $100 plan and Anthropic's managed agents

Z.ai: GLM 5.1 (z-ai/glm-5.1)

Z.ai: GLM 5V Turbo (z-ai/glm-5v-turbo)

[AINews] H100 prices are melting UP

[AINews] MiniMax 2.7: GLM-5 at 1/3 cost SOTA Open Model

[AINews] Anthropic accuses DeepSeek, Moonshot, and MiniMax of >16 million "industrial-scale distillation attacks"

[AINews] Qwen3.5-397B-A17B: the smallest Open-Opus class, very efficient model

What’s next for Chinese open-source AI

[AINews] Z.ai GLM-5: New SOTA Open Weights LLM

Z.ai: GLM 5 (z-ai/glm-5)

Z.AI: GLM 4.7 Flash (z-ai/glm-4.7-flash)

Startups go public in litmus test for Chinese AI

Z.AI: GLM 4.7 (z-ai/glm-4.7)

Z.AI: GLM 4.6V (z-ai/glm-4.6v)

Z.AI: GLM 4.6 (z-ai/glm-4.6)

Z.AI: GLM 4.5V (z-ai/glm-4.5v)

Z.AI: GLM 4.5 (z-ai/glm-4.5)

Z.AI: GLM 4.5 Air (free) (z-ai/glm-4.5-air)

Tools

Directories

Models & Pricing

Endpoints

Rankings

News

Z-ai News

News Feed

China’s open-source bet

[AINews] AI Engineer Europe 2026

New GPT $100 plan and Anthropic's managed agents

Z.ai: GLM 5.1 (z-ai/glm-5.1)

Z.ai: GLM 5V Turbo (z-ai/glm-5v-turbo)

[AINews] H100 prices are melting *UP*

[AINews] MiniMax 2.7: GLM-5 at 1/3 cost SOTA Open Model

[AINews] Anthropic accuses DeepSeek, Moonshot, and MiniMax of >16 million "industrial-scale distillation attacks"

[AINews] Qwen3.5-397B-A17B: the smallest Open-Opus class, very efficient model

What’s next for Chinese open-source AI

[AINews] Z.ai GLM-5: New SOTA Open Weights LLM

Z.ai: GLM 5 (z-ai/glm-5)

Z.AI: GLM 4.7 Flash (z-ai/glm-4.7-flash)

Startups go public in litmus test for Chinese AI

Z.AI: GLM 4.7 (z-ai/glm-4.7)

Z.AI: GLM 4.6V (z-ai/glm-4.6v)

Z.AI: GLM 4.6 (z-ai/glm-4.6)

Z.AI: GLM 4.5V (z-ai/glm-4.5v)

Z.AI: GLM 4.5 (z-ai/glm-4.5)

Z.AI: GLM 4.5 Air (free) (z-ai/glm-4.5-air)

Tools

Directories

Models & Pricing

Endpoints

Rankings

News

[AINews] H100 prices are melting UP