Natural Language Processing
12 articles in this category
Post-Transformer Frontier Models for Enhanced AI Attention Span
Pathway's Baby Dragon Hatchling model achieves a significant breakthrough in AI attention span, enabling continual learning and long-term reasoning with a 50% success rate in tasks lasting up to 2 hours and 70 minutes.
Conductor Quantum Introduces Coda, a Natural Language Interface for Quantum Computing
Conductor Quantum has announced Coda, a natural language interface for running quantum programs on real quantum hardware, reducing setup and low-level programming overhead.
Google Releases TranslateGemma Open Models for Efficient Multilingual Translation
Google's TranslateGemma models come in 4B, 12B, and 27B parameter variants, support 55 languages, and are designed to deliver high translation quality across a range of platforms.
Inside ChatGPT: Deconstructing "Attention Is All You Need" (Part 1)
This article explains the shift from Recurrent Neural Networks (RNNs) to the Transformer architecture, detailing the vanishing gradient problem and the core concepts of self-attention.
StepFun AI Releases Step-Audio-EditX: A New Open-Source 3B LLM-Grade Audio Editing Model Excelling at Expressive and Iterative Audio Editing
StepFun AI introduces Step-Audio-EditX, a 3B-parameter open-source model that enables precise, iterative audio editing in a way that resembles editing text.