Skip to main content

On This Page

Apriel-1.6-15b-Thinker: Cost-efficient Frontier Multimodal Performance

2 min read
Share

These articles are AI-generated summaries. Please check the original sources for full details.

Apriel-1.6-15b-Thinker: Cost-efficient Frontier Multimodal Performance

ServiceNow-AI released Apriel-1.6-15b-Thinker, a 15-billion parameter multimodal reasoning model that achieves state-of-the-art (SOTA) performance, rivaling models ten times its size. The model builds on Apriel-1.5-15b-Thinker, focusing on improved text and vision reasoning with better token efficiency and was trained on NVIDIA DGX™ Cloud with GB200 Grace™ Blackwell Superchips.

Current large language models (LLMs) often require significant computational resources, hindering accessibility and increasing deployment costs. Apriel-1.6 addresses this by demonstrating that high intelligence and reasoning capabilities can be achieved with a relatively smaller model size, making it more practical for enterprise applications and reducing the financial burden of AI implementation.

Key Insights

  • Artificial Analysis Index Score: Apriel-1.6 scores 57 on the Artificial Analysis Index, outperforming models like Gemini 2.5 Flash and Claude Haiku 4.5 (ServiceNow-AI, 2025).
  • Token Efficiency: The model reduces reasoning token usage by over 30% compared to its predecessor, Apriel-1.5-15b-Thinker (ServiceNow-AI, 2025).
  • Cost-Efficiency: Apriel-1.6 achieves performance comparable to Qwen3 235B A22B, but with significantly lower computational requirements (ServiceNow-AI, 2025).

Working Example

(No code provided in context)

Practical Applications

  • ServiceNow AI: Utilizing Apriel-1.6 to power intelligent automation and reasoning within the Now Platform, enabling more efficient and accurate service delivery.
  • Pitfall: Over-reliance on complex models when a smaller, more efficient model like Apriel-1.6 can achieve comparable performance, leading to unnecessary infrastructure costs and slower inference times.

References:

  • https://huggingface.co/blog/ServiceNow-AI/apriel-1p6-15b-thinker
  • Radhakrishna, S., Tiwari, A., Shukla, A., Hashemi, M., Maheshwary, R., Malay, S.K.R., Mehta, J., Pattnaik, P., Mittal, S., Slimi, K., Ogueji, K., Oladipo, A., Parikh, S., Bamgbose, O., Liang, T., Masry, A., Mahajan, K., Mudumba, S.R., Yadav, V., Madhusudhan, S.T., Scholak, T., Davasam, S., Sunkara, S. and Chapados, N., 2025. Apriel-1.5-15b-Thinker. arXiv preprint arXiv:2510.01141.
  • Zheng, C., Liu, S., Li, M., Chen, X.-H., Yu, B., Gao, C., Dang, K., Liu, Y., Men, R., Yang, A., Zhou, J. and Lin, J., 2025. Group Sequence Policy Optimization. arXiv preprint arXiv:2507.18071.

Related Content