New Releases

9 articles in this category

AI NewsNew ReleasesSoftware Engineering

AntAngelMed: Optimizing 103B-Parameter Medical LLMs via 1/32 MoE Activation

AntAngelMed is a 103B-parameter open-source medical LLM utilizing a 1/32 MoE activation ratio to deliver 200+ tokens/s while outperforming proprietary models on OpenAI's HealthBench.

May 12, 2026

AI NewsVoice AINew Releases

OpenAI Launches GPT-Realtime-2 and Specialized Audio Models in General Availability

OpenAI moves the Realtime API to general availability, introducing GPT-Realtime-2 with GPT-5-class reasoning and a 128K context window.

May 8, 2026

AI NewsAudio Language ModelNew Releases

Mistral AI Unveils Voxtral TTS: A 4B Parameter Open-Weight Model for 70ms Low-Latency Speech

Mistral AI releases Voxtral TTS, a 4B parameter open-weight model achieving 70ms latency and 9.7x real-time factor across 9 languages.

Mar 28, 2026

AI NewsAudio Language ModelNew Releases

IBM Granite 4.0 1B Speech: A High-Efficiency Multilingual Model for Edge AI

IBM's Granite 4.0 1B Speech model reduces parameter count by 50% while achieving a 5.52 Average WER, optimized for edge-style multilingual ASR and AST.

Mar 15, 2026

AI NewsAI InfrastructureNew Releases

NVIDIA Dynamo v0.9.0 Overhauls Distributed Inference with FlashIndexer, Multi-Modal Support

NVIDIA's Dynamo v0.9.0 simplifies large-scale model deployment by removing NATS/ETCD, enhancing multi-modal support, and previewing FlashIndexer for reduced latency.

Feb 19, 2026

AI NewsMachine LearningNew Releases

Prior Labs Launches TabPFN-2.5: Scaling Tabular Foundation Models for Enhanced Performance and Efficiency

Prior Labs introduces TabPFN-2.5, a major update to its tabular foundation model, enabling handling of 50,000 samples and 2,000 features with no training required, while outperforming traditional models on benchmarks.

Nov 8, 2025

AI NewsAgentic AIArtificial Intelligence

Moonshot AI Introduces Kimi K2 Thinking: A Breakthrough in Long-Horizon Reasoning and Tool Use

Moonshot AI releases Kimi K2 Thinking, an open-source thinking model capable of executing 200–300 sequential tool calls without human intervention, optimized for long-horizon reasoning and agentic tasks.

Nov 6, 2025

AI NewsSecurityArtificial Intelligence

OpenAI Releases gpt-oss-safeguard: Open-Weight Safety Reasoning Models for Custom Policy Enforcement

OpenAI introduces two open-weight safety reasoning models, gpt-oss-safeguard-120b and gpt-oss-safeguard-20b, enabling developers to apply custom safety policies at inference time without retraining. The models are available under Apache 2.0 and optimized for hardware deployment.

Oct 31, 2025

AI NewsAI ShortsApplications

Liquid AI Releases LFM2-ColBERT-350M: A Compact Late Interaction Model for Multilingual Cross-Lingual Retrieval

Liquid AI introduces LFM2-ColBERT-350M, a 350M-parameter late interaction retriever optimized for multilingual and cross-lingual search, offering high accuracy and fast inference speeds.

Oct 28, 2025