New Releases
9 articles in this category
AntAngelMed: Optimizing 103B-Parameter Medical LLMs via 1/32 MoE Activation
AntAngelMed is a 103B-parameter open-source medical LLM that uses a 1/32 MoE activation ratio to deliver 200+ tokens/s while outperforming proprietary models on OpenAI's HealthBench.
NVIDIA Dynamo v0.9.0 Overhauls Distributed Inference with FlashIndexer, Multi-Modal Support
NVIDIA's Dynamo v0.9.0 simplifies large-scale model deployment by removing the NATS and etcd dependencies, enhancing multi-modal support, and previewing FlashIndexer for reduced latency.
Prior Labs Launches TabPFN-2.5: Scaling Tabular Foundation Models for Enhanced Performance and Efficiency
Prior Labs introduces TabPFN-2.5, a major update to its tabular foundation model that handles datasets of up to 50,000 samples and 2,000 features with no training required, while outperforming traditional models on benchmarks.
Moonshot AI Introduces Kimi K2 Thinking: A Breakthrough in Long-Horizon Reasoning and Tool Use
Moonshot AI releases Kimi K2 Thinking, an open-source thinking model capable of executing 200–300 sequential tool calls without human intervention, optimized for long-horizon reasoning and agentic tasks.
OpenAI Releases gpt-oss-safeguard: Open-Weight Safety Reasoning Models for Custom Policy Enforcement
OpenAI introduces two open-weight safety reasoning models, gpt-oss-safeguard-120b and gpt-oss-safeguard-20b, enabling developers to apply custom safety policies at inference time without retraining. The models are available under Apache 2.0 and optimized for hardware deployment.
Liquid AI Releases LFM2-ColBERT-350M: A Compact Late Interaction Model for Multilingual and Cross-Lingual Retrieval
Liquid AI introduces LFM2-ColBERT-350M, a 350M-parameter late interaction retriever optimized for multilingual and cross-lingual search, offering high accuracy and fast inference speeds.