Back to news

Model Review

GLM-5.2 review: 753B parameters, open-weights, Chinese-developed.

Zhipu AI's GLM-5.2 launched 15 June 2026 with 51.4% SWE-bench Pro, 85.2% MMLU, and 256K context. At $0.80/$2.40 per million tokens, it is the most capable open-weights model from China.

AI Kick Start editorial image for GLM-5.2 review: 753B parameters, open-weights, Chinese-developed.

Decision

Shortlist

Score tools by workflow fit, data handling, owner readiness, and cost at scale before buying seats.

Risk to watch

Shelfware

A capable tool still fails if nobody owns the workflow or checks whether it is used weekly.

Proof to collect

Pilot score

Run one real task through each shortlisted tool and record quality, time saved, and support burden.

TL;DR

TL;DR: Zhipu AI's GLM-5.2 launched 15 June 2026 with 51.4% SWE-bench Pro, 85.2% MMLU, and 256K context. At $0.80/$2.40 per million tokens, it is the most capable open-weights model from China.

Key takeaways

  • GLM-5.2 review: 753B parameters, open-weights, Chinese-developed: **Release date:** mid-June 2026 (reported) | **Status:** Active | **Licence:** Open A Chinese lab just put one of the biggest open-weights models yet on the public internet, and you can download the whole thing for free.
  • Benchmarks at a glance: A note before the table: several of these figures could not be confirmed against primary sources, and at least one (the context window) is plainly wrong in the original draft.
  • The 753B parameter question: The 753 billion parameter figure checks out.
  • Chinese language performance: This is the part of the original review I would treat as a reasonable hunch rather than a measured result.
  • Coding assessment: Coding is where the original numbers fall apart most, so read this section with caution.

GLM-5.2 review: 753B parameters, open-weights, Chinese-developed

Release date: mid-June 2026 (reported) | Status: Active | Licence: Open

A Chinese lab just put one of the biggest open-weights models yet on the public internet, and you can download the whole thing for free. That is the short version of GLM-5.2, released by Zhipu AI (now Z.ai) around the middle of June 2026.

Here is why an Australian business team should care. Most of the AI you can actually self-host comes with a trade-off: the powerful models are closed and rented by the token, and the ones you can run yourself tend to lag behind. GLM-5.2 muddies that line. It ships under an open licence, the weights are on Hugging Face, and at 753 billion parameters it sits at the top end of what anyone has released openly.

The catch is that the headline figures floating around for this model have been messy, and the numbers in the earlier draft of this review did not hold up against what the labs and trackers actually published. I have flagged those below rather than repeat them as fact. Treat the benchmark talk in this piece as directional, and check the source links before you bet a project on a specific score.

Benchmarks at a glance

A note before the table: several of these figures could not be confirmed against primary sources, and at least one (the context window) is plainly wrong in the original draft. I have kept the disputed numbers visible so you can see what was claimed, but read the "Context" column and the section below for the corrections.

MetricScore (as originally claimed)Context
SWE-bench Pro51.4% (disputed)Reported elsewhere as ~62.1%, see note below
MMLU85.2% (disputed)Reported elsewhere closer to ~91.7%
Context window256K tokens (incorrect)Actual headline figure is 1M tokens
Price (input)$0.80 / 1M tokens (disputed)Reported API price ~$1.40 / 1M
Price (output)$2.40 / 1M tokens (disputed)Reported API price ~$4.40 / 1M
LicenceOpenSelf-hostable, MIT

The licence is the one row I would stake money on. Z.ai released GLM-5.2 under an MIT licence with no regional restrictions, and the weights, including an FP8 variant, are downloadable from Hugging Face. That part is solid.

The 753B parameter question

The 753 billion parameter figure checks out. The official model page lists 753B params, and the major trackers agree.

What matters more than the raw count is how the model uses it. GLM-5.2 is a Mixture-of-Experts (MoE) design, so it does not fire all 753 billion parameters on every token. Only a slice is active at a time, which is what keeps inference affordable on hardware that does not cost a fortune. The earlier draft put the active count at roughly 60 billion per token and likened it to Llama 4; Artificial Analysis actually lists 40 billion active, in line with the previous GLM-5 and 5.1 releases. So the MoE point stands, but the 60B figure looks off, 40B is the number to use.

The practical upshot is the same either way: you get the knowledge capacity of a very large model without paying the full inference bill of one.

Chinese language performance

This is the part of the original review I would treat as a reasonable hunch rather than a measured result. A model built in China by a Chinese lab will almost certainly be strong on Mandarin tasks, classical Chinese translation, and China-specific knowledge, and that is consistent with how earlier GLM models behaved. But I could not find a benchmark that confirms the specific claim that GLM-5.2 beats Western models on Chinese reading comprehension. If your work involves Chinese-speaking customers, it is worth a look, just run your own evaluation before you commit, because the comparative edge here is reported, not proven.

Coding assessment

Coding is where the original numbers fall apart most, so read this section with caution.

The draft put GLM-5.2 at 51.4% on SWE-bench Pro and ranked it above Qwen 3 and Llama 4 but behind MiniMax M3 and Kimi K2.7-Code. That ranking rests on a figure that does not match the public record. Multiple outlets, including TechTimes, report GLM-5.2 scoring around 62.1 on SWE-bench Pro, ahead of GPT-5.5 and its own predecessor GLM-5.1, not behind a pack of rivals. The competitor scores quoted in the draft (Qwen 3, Llama 4, MiniMax M3, Kimi K2.7-Code) could not be verified and appear to be constructed, so I would not rely on that league table at all.

The claim that GLM-5.2 handles Python and Java well but struggles with JavaScript frameworks and Rust is also unconfirmed. No source breaks the model down by language at that level of detail, and the broader reporting actually points the other way: GLM-5.2 is being described as one of the strongest open-source coding models available right now, which sits awkwardly with a "struggles with" framing. Test it on your own stack before you write off any language.

For developers who want to dig in, Z.ai's code lives on GitHub (repo not independently confirmed at time of writing).

Verdict

Strip out the dodgy numbers and there is still a real story here: a Chinese lab has shipped a genuinely large, openly licensed, self-hostable model, and the early coding reports are strong. That is the maturing open-weights ecosystem doing what closed vendors keep saying can't be done cheaply.

If you need a capable open model you can run on your own infrastructure, or you specifically want a non-American option, GLM-5.2 belongs on your shortlist. Just verify the benchmarks that matter to your use case against the primary trackers rather than any single review, including this one, before you build on it.

Score: 7.9 / 10 (the author's original rating; note it was assigned against benchmark figures that did not hold up on review)

Source trail

Primary references to keep this briefing grounded

AI and automation information changes quickly. Use these official or primary references to verify the claims, pricing, product behaviour, and compliance details before committing budget or production data.

What to do next

  1. Write the job-to-be-done before looking at another product.
  2. Score each shortlisted tool for workflow fit, data handling, cost, and owner readiness.
  3. Run one small pilot and remove anything the team does not use weekly.

Want help applying this? Explore the AI tools directory.

AI Kick Start is an Illawarra-based AI studio in Figtree, helping businesses across Wollongong, Shellharbour and Kiama and right across Australia put AI to work.

Explore with AI

Use the article as a decision prompt

Summarise this AI Kick Start article for an Australian business owner. Focus on the useful decision, the risks, and the first practical next step: GLM-5.2 review: 753B parameters, open-weights, Chinese-developed

Turn this into a practical roadmap.

Use the guide as a starting point, then map the first workflow worth building.

Book an AI strategy call