Analysis
Most big model launches arrive with a marketing machine behind them. When Zhipu AI put out GLM-5.2 in mid-June 2026, it landed quietly: a model card on Hugging Face, an API update, and not much fanfare compared to what you get from OpenAI, Google or Anthropic.
The timing was the loud part. The release reportedly coincided with the day the US government ordered Anthropic to suspend foreign access to its Fable 5 and Mythos 5 models on national-security grounds. A Chinese lab shipping a 753-billion-parameter model under an open licence, in the same week Washington was tightening the screws, is hard to read as a coincidence.
For Australian business teams, the headline is simpler than the geopolitics. There is now another large, capable, openly available model in the mix. You can call it through an API, or if you have the hardware and the appetite, run it yourself. That changes the maths on what serious AI work has to cost.
GLM-5.2 is the latest in Zhipu AI's General Language Model line, which traces back to 2021. Zhipu was spun out of Tsinghua University and has been building the GLM series steadily since then.
Architecture and Scale
At 753 billion parameters, GLM-5.2 sits among the largest open-weights models released to date (Source: Hugging Face, zai-org/GLM-5.2). Some sources round the count to 743-744B; 753B is the figure most often cited.
It is built as a Mixture-of-Experts (MoE) model rather than a dense one. Of the ~753B total parameters, only around 40B are active for any given token, using sparse attention and an optimisation Zhipu calls "IndexShare". That design keeps inference cost down relative to a dense model of the same headline size, which matters a lot once you start thinking about running it. (Note: earlier drafts of this story described GLM-5.2 as a dense, non-MoE design; the official model card confirms it is MoE.)
Its predecessor, GLM-5.1, was released in April 2026 and was itself a roughly 754B-parameter MoE model (256+1 experts, ~40B active), according to deployment write-ups. So GLM-5.2 is not a dramatic jump in raw size over 5.1, the two are in the same weight class.
On context length, reported figures differ. Zhipu's own materials and listings such as OpenRouter describe a 1,000,000-token (1M) context window, one of the model's headline features, with output up to roughly 131,000 tokens. Earlier coverage circulated a much smaller 128K figure, which appears to be wrong by about 8x.
Training-set size is less settled. Several secondary write-ups put the training corpus at around 28.5 trillion tokens, drawn from web pages, books, code repositories and academic papers, with heavy Chinese-language content. We have not been able to confirm a single authoritative number, so treat any precise token count as unverified. What is consistent across sources is the emphasis on Chinese-language data, which shows up in how the model performs on Chinese tasks.

Benchmark Performance
A word of caution before the numbers: the specific benchmark scores that circulated with early coverage of GLM-5.2, figures like MMLU-Pro 80.1%, HumanEval 85.3% and MATH 69.4% on English tasks, could not be confirmed in any source we checked, and should be treated as unverified. The benchmarks Zhipu actually emphasised at launch were coding and agentic ones, where independent coverage reports results such as SWE-bench Pro around 62, AIME 2026 around 99, and GPQA-Diamond around 91 (Hugging Face, GLM-5.2).
On the comparison points often quoted alongside it, GPT-5.5 at 86.4% MMLU-Pro, Claude Opus 4.8 at 87.6% SWE-bench, we could not verify those exact figures either, so read them as illustrative rather than precise. What is well established is that GLM-5.2 is routinely benchmarked against GPT-5.5 and Claude Opus 4.8, and that it still trails Opus 4.8 on coding.
Reported Chinese-language scores, for instance C-Eval around 88.7% and CMMLU around 86.2%, billed as best-in-class among open-weights models, likewise could not be confirmed in any source we found, so treat the Chinese benchmark table as unverified. The broader, sturdier claim is the one supported by the training mix: a model trained on this much Chinese content tends to do well on Chinese-language work, and for teams serving Chinese-speaking markets that is the real draw.
The same training mix points to decent cross-lingual ability across Chinese, English, Japanese and Korean. That is a plausible read given the data, but the head-to-head translation rankings quoted in early coverage are not independently verified.
Pricing and Deployment
Pricing has been reported inconsistently. Z.ai's own API pricing is listed at around $1.40 per million input tokens and $4.40 per million output tokens, with cached input near $0.26; OpenRouter shows roughly $1.20 input and $4.10 output (Source: OpenRouter, GLM-5.2 pricing). An earlier $0.80/$2.40 figure that did the rounds is not supported by any source we checked. Even at the higher, verified numbers, GLM-5.2 sits in the mid-tier: well above bargain-basement options and well below premium models like Claude Opus 4.8 at $5/$25.
The more interesting option is self-hosting. Because the weights are released under an MIT licence with no regional restrictions, you can run it on your own hardware (Source: Hugging Face, GLM-5.2 LICENSE). At 753B parameters that is not trivial. Coverage is clear that the model is hard to run locally, and the often-quoted requirements, roughly 16x H100 GPUs at full precision, around $400,000 in hardware, dropping to 4-8x H100s with quantisation, are plausible but unverified. Treat those as ballpark, not gospel. The canonical code lives in the zai-org/GLM-5 repository.
Geopolitical Implications
The release landed at a tense moment. The US export controls that suspended Claude Fable 5 were designed to slow Chinese AI development by cutting off access to advanced chips (Source: Fortune, Anthropic disables Fable/Mythos). Shipping a competitive model under those conditions is a pointed answer.
Open weights also scramble the regulatory picture. A closed API can be gated, throttled or cut off. Open weights cannot, once they are out, they are out. So you end up with a lopsided situation: export controls can slow the release of closed American models while doing little to stop the spread of open Chinese ones.
Worth noting alongside the technical story: Zhipu's listed shares reportedly surged around 33% on the news, and several outlets tied that market reaction directly to the release landing in the same window as the US Fable 5 and Mythos 5 ban.


