Apriel-1.6-15b-Thinker: Cost-efficient Frontier Multimodal Performance

ServiceNow-AI released Apriel-1.6-15b-Thinker, a 15-billion parameter multimodal reasoning model that achieves state-of-the-art (SOTA) performance, rivaling models ten times its size. The model builds on Apriel-1.5-15b-Thinker, focusing on improved text and vision reasoning with better token efficiency and was trained on NVIDIA DGX™ Cloud with GB200 Grace™ Blackwell Superchips.

Current large language models (LLMs) often require significant computational resources, hindering accessibility and increasing deployment costs. Apriel-1.6 addresses this by demonstrating that high intelligence and reasoning capabilities can be achieved with a relatively smaller model size, making it more practical for enterprise applications and reducing the financial burden of AI implementation.

Key Insights

Artificial Analysis Index Score: Apriel-1.6 scores 57 on the Artificial Analysis Index, outperforming models like Gemini 2.5 Flash and Claude Haiku 4.5 (ServiceNow-AI, 2025).
Token Efficiency: The model reduces reasoning token usage by over 30% compared to its predecessor, Apriel-1.5-15b-Thinker (ServiceNow-AI, 2025).
Cost-Efficiency: Apriel-1.6 achieves performance comparable to Qwen3 235B A22B, but with significantly lower computational requirements (ServiceNow-AI, 2025).

Working Example

(No code provided in context)

Practical Applications

ServiceNow AI: Utilizing Apriel-1.6 to power intelligent automation and reasoning within the Now Platform, enabling more efficient and accurate service delivery.
Pitfall: Over-reliance on complex models when a smaller, more efficient model like Apriel-1.6 can achieve comparable performance, leading to unnecessary infrastructure costs and slower inference times.

References:

https://huggingface.co/blog/ServiceNow-AI/apriel-1p6-15b-thinker
Radhakrishna, S., Tiwari, A., Shukla, A., Hashemi, M., Maheshwary, R., Malay, S.K.R., Mehta, J., Pattnaik, P., Mittal, S., Slimi, K., Ogueji, K., Oladipo, A., Parikh, S., Bamgbose, O., Liang, T., Masry, A., Mahajan, K., Mudumba, S.R., Yadav, V., Madhusudhan, S.T., Scholak, T., Davasam, S., Sunkara, S. and Chapados, N., 2025. Apriel-1.5-15b-Thinker. arXiv preprint arXiv:2510.01141.
Zheng, C., Liu, S., Li, M., Chen, X.-H., Yu, B., Gao, C., Dang, K., Liu, Y., Men, R., Yang, A., Zhou, J. and Lin, J., 2025. Group Sequence Policy Optimization. arXiv preprint arXiv:2507.18071.

On This Page

Apriel-1.6-15b-Thinker: Cost-efficient Frontier Multimodal Performance