AI News

GLM-5.2: Zhipu AI's 753 Billion Parameter Open-Weights Model Enters the Arena.

GLM-5.2 launched on 15 June 2026 with 753 billion parameters and an open-weights licence. We analyse how Zhipu AI's flagship competes in an increasingly crowded open-model landscape.

Daniel Fleuren2026-06-1811 min readFounders and operatorsUpdated 2026-06-19

Written by

Daniel Fleuren

Founder, AI Kick Start. 20+ years enterprise IT

Updated 2026-06-19

AI Kick Start editorial image for GLM-5.2: Zhipu AI's 753 Billion Parameter Open-Weights Model Enters the Arena.

Decision

Shortlist

Score tools by workflow fit, data handling, owner readiness, and cost at scale before buying seats.

Risk to watch

Shelfware

A capable tool still fails if nobody owns the workflow or checks whether it is used weekly.

Proof to collect

Pilot score

Run one real task through each shortlisted tool and record quality, time saved, and support burden.

TL;DR

TL;DR: GLM-5.2, released in mid-June 2026 by Chinese AI lab Zhipu AI (which goes by Z.ai internationally), is a 753-billion-parameter open-weights model. It is among the largest open-weights models released so far, and it lands as a clear sign that Chinese labs are real competitors in the open-model space. Reported figures for its pricing, benchmark scores and context window vary by source, so treat the specifics below with that in mind.

Key takeaways

GLM-5.2 is among the largest open-weights models released to date at 753 billion parameters, built as a Mixture-of-Experts design (Source: [Hugging Face, zai-org/GLM-5.2](https://huggingface.co/zai-org/GLM-5.2))
The weights are MIT-licensed with no regional restrictions, so self-hosting is allowed, if you can afford the hardware (Source: [Hugging Face, GLM-5.2 LICENSE](https://huggingface.co/zai-org/GLM-5.2/blob/main/LICENSE))
Verified API pricing is around $1.40/$4.40 per million tokens, placing it in the mid-tier of the current market (Source: [OpenRouter, GLM-5.2](https://openrouter.ai/z-ai/glm-5.2))
The release reportedly coincided with the US export-control suspension of Claude Fable 5, and open weights cannot be effectively restricted by export controls once released (Source: [Fortune, Anthropic disables Fable/Mythos](https://fortune.com/2026/06/13/anthropic-disables-fable-mythos-export-controls-national-security-threat/))

Analysis

Most big model launches arrive with a marketing machine behind them. When Zhipu AI put out GLM-5.2 in mid-June 2026, it landed quietly: a model card on Hugging Face, an API update, and not much fanfare compared to what you get from OpenAI, Google or Anthropic.

The timing was the loud part. The release reportedly coincided with the day the US government ordered Anthropic to suspend foreign access to its Fable 5 and Mythos 5 models on national-security grounds. A Chinese lab shipping a 753-billion-parameter model under an open licence, in the same week Washington was tightening the screws, is hard to read as a coincidence.

For Australian business teams, the headline is simpler than the geopolitics. There is now another large, capable, openly available model in the mix. You can call it through an API, or if you have the hardware and the appetite, run it yourself. That changes the maths on what serious AI work has to cost.

GLM-5.2 is the latest in Zhipu AI's General Language Model line, which traces back to 2021. Zhipu was spun out of Tsinghua University and has been building the GLM series steadily since then.

Architecture and Scale

At 753 billion parameters, GLM-5.2 sits among the largest open-weights models released to date (Source: Hugging Face, zai-org/GLM-5.2). Some sources round the count to 743-744B; 753B is the figure most often cited.

It is built as a Mixture-of-Experts (MoE) model rather than a dense one. Of the ~753B total parameters, only around 40B are active for any given token, using sparse attention and an optimisation Zhipu calls "IndexShare". That design keeps inference cost down relative to a dense model of the same headline size, which matters a lot once you start thinking about running it. (Note: earlier drafts of this story described GLM-5.2 as a dense, non-MoE design; the official model card confirms it is MoE.)

Its predecessor, GLM-5.1, was released in April 2026 and was itself a roughly 754B-parameter MoE model (256+1 experts, ~40B active), according to deployment write-ups. So GLM-5.2 is not a dramatic jump in raw size over 5.1, the two are in the same weight class.

On context length, reported figures differ. Zhipu's own materials and listings such as OpenRouter describe a 1,000,000-token (1M) context window, one of the model's headline features, with output up to roughly 131,000 tokens. Earlier coverage circulated a much smaller 128K figure, which appears to be wrong by about 8x.

Training-set size is less settled. Several secondary write-ups put the training corpus at around 28.5 trillion tokens, drawn from web pages, books, code repositories and academic papers, with heavy Chinese-language content. We have not been able to confirm a single authoritative number, so treat any precise token count as unverified. What is consistent across sources is the emphasis on Chinese-language data, which shows up in how the model performs on Chinese tasks.

Supporting AI Kick Start editorial image for glm-52-zhipu-ai-753b-parameter-open-model. — Generated AI Kick Start editorial visual used to explain the article's practical workflow and trade-offs.

Benchmark Performance

A word of caution before the numbers: the specific benchmark scores that circulated with early coverage of GLM-5.2, figures like MMLU-Pro 80.1%, HumanEval 85.3% and MATH 69.4% on English tasks, could not be confirmed in any source we checked, and should be treated as unverified. The benchmarks Zhipu actually emphasised at launch were coding and agentic ones, where independent coverage reports results such as SWE-bench Pro around 62, AIME 2026 around 99, and GPQA-Diamond around 91 (Hugging Face, GLM-5.2).

On the comparison points often quoted alongside it, GPT-5.5 at 86.4% MMLU-Pro, Claude Opus 4.8 at 87.6% SWE-bench, we could not verify those exact figures either, so read them as illustrative rather than precise. What is well established is that GLM-5.2 is routinely benchmarked against GPT-5.5 and Claude Opus 4.8, and that it still trails Opus 4.8 on coding.

Reported Chinese-language scores, for instance C-Eval around 88.7% and CMMLU around 86.2%, billed as best-in-class among open-weights models, likewise could not be confirmed in any source we found, so treat the Chinese benchmark table as unverified. The broader, sturdier claim is the one supported by the training mix: a model trained on this much Chinese content tends to do well on Chinese-language work, and for teams serving Chinese-speaking markets that is the real draw.

The same training mix points to decent cross-lingual ability across Chinese, English, Japanese and Korean. That is a plausible read given the data, but the head-to-head translation rankings quoted in early coverage are not independently verified.

Pricing and Deployment

Pricing has been reported inconsistently. Z.ai's own API pricing is listed at around $1.40 per million input tokens and $4.40 per million output tokens, with cached input near $0.26; OpenRouter shows roughly $1.20 input and $4.10 output (Source: OpenRouter, GLM-5.2 pricing). An earlier $0.80/$2.40 figure that did the rounds is not supported by any source we checked. Even at the higher, verified numbers, GLM-5.2 sits in the mid-tier: well above bargain-basement options and well below premium models like Claude Opus 4.8 at $5/$25.

The more interesting option is self-hosting. Because the weights are released under an MIT licence with no regional restrictions, you can run it on your own hardware (Source: Hugging Face, GLM-5.2 LICENSE). At 753B parameters that is not trivial. Coverage is clear that the model is hard to run locally, and the often-quoted requirements, roughly 16x H100 GPUs at full precision, around $400,000 in hardware, dropping to 4-8x H100s with quantisation, are plausible but unverified. Treat those as ballpark, not gospel. The canonical code lives in the zai-org/GLM-5 repository.

Geopolitical Implications

The release landed at a tense moment. The US export controls that suspended Claude Fable 5 were designed to slow Chinese AI development by cutting off access to advanced chips (Source: Fortune, Anthropic disables Fable/Mythos). Shipping a competitive model under those conditions is a pointed answer.

Open weights also scramble the regulatory picture. A closed API can be gated, throttled or cut off. Open weights cannot, once they are out, they are out. So you end up with a lopsided situation: export controls can slow the release of closed American models while doing little to stop the spread of open Chinese ones.

Worth noting alongside the technical story: Zhipu's listed shares reportedly surged around 33% on the news, and several outlets tied that market reaction directly to the release landing in the same window as the US Fable 5 and Mythos 5 ban.

Source trail

Primary references to keep this briefing grounded

AI and automation information changes quickly. Use these official or primary references to verify the claims, pricing, product behaviour, and compliance details before committing budget or production data.

Z.ai GLM documentation

What to do next

Write the job-to-be-done before looking at another product.
Score each shortlisted tool for workflow fit, data handling, cost, and owner readiness.
Run one small pilot and remove anything the team does not use weekly.

Want help applying this? Explore the AI tools directory.

AI Kick Start is an Illawarra-based AI studio in Figtree, helping businesses across Wollongong, Shellharbour and Kiama and right across Australia put AI to work.

Explore with AI

Use the article as a decision prompt

Summarise this AI Kick Start article for an Australian business owner. Focus on the useful decision, the risks, and the first practical next step: GLM-5.2: Zhipu AI's 753 Billion Parameter Open-Weights Model Enters the Arena

Read with ChatGPT Open Claude Search with AI Mode

Turn this into a practical roadmap.

Use the guide as a starting point, then map the first workflow worth building.

Book an AI strategy call