Large Language Model
54 articles in this category (Page 2 of 3)
Luma Labs Uni-1: Bridging the Intent Gap with Autoregressive Reasoning Transformers
Luma Labs Uni-1 utilizes a decoder-only autoregressive transformer to reason through spatial logic before generation, outperforming Flux Max on RISEBench at $0.10 per image.
Building Uncertainty-Aware LLM Systems with Confidence Estimation and Automated Web Research
A technical implementation of a three-stage LLM pipeline using Python to enable self-reported confidence scores, meta-cognitive self-evaluation, and automated web research for higher reliability.
Yuan 3.0 Ultra: Optimizing Trillion-Parameter MoE Efficiency via LAEP
YuanLab AI releases Yuan 3.0 Ultra, a 1T-parameter MoE model that achieves a 49% boost in pre-training efficiency. By utilizing Layer-Adaptive Expert Pruning and a Reflection Inhibition Reward Mechanism, it reduces total parameters by 33.3% while maintaining state-of-the-art performance in multimodal retrieval and enterprise benchmarks.
ByteDance AI Maps Molecular Bonds in Reasoning to Stabilize Long Chain-of-Thought Models
ByteDance researchers introduce MOLE-SYN, a framework that treats AI reasoning as molecular structures, stabilizing Long CoT performance across benchmarks like GSM8K and MATH-500.
Gemini 3.1 Pro: 1M Token Context and 77.1% ARC-AGI-2 Reasoning for AI Agents
Google releases Gemini 3.1 Pro with a 1M token context window and 77.1% ARC-AGI-2 reasoning score, targeting the high-performance autonomous AI agent market. This release focuses on reasoning stability, software engineering, and tool-use reliability for developers building next-generation autonomous agents and complex technical workflows.
Moonshot AI Releases Kimi K2.5: An Open Source Visual Agentic Intelligence Model with Native Swarm Execution
Moonshot AI launched Kimi K2.5, an open-source visual agentic intelligence model boasting a 1T parameter scale and achieving state-of-the-art results in agentic benchmarks.