Third-party firm Artificial Analysis’s latest global large-model ranking gave
Qwen3.7-Max a score of 57, close to top GPT, Claude and Gemini models, and
ranked it first among Chinese models. The model handles programming tasks from
page prototyping to complex multi-file projects; in office workflows it can
integrate tools and coordinate multi-agent collaboration to automate multi-step
processes. In fully autonomous tests it remained coherent over runs exceeding 35
hours and more than 1,000 tool calls. It also demonstrates cross-agent-framework
generalization, performing well when connected to various agent frameworks and
tools.