ML & Data Engineering
20 articles in this category
OpenAI's Open Responses Specification Unifies Agentic LLM Workflows
OpenAI's Open Responses standardizes agentic AI workflows, reducing API fragmentation and enabling seamless transitions between proprietary and open-source models with a unified specification.
Google DeepMind Introduces ATLAS Scaling Laws for Multilingual Language Models
Google DeepMind researchers introduce ATLAS, a set of scaling laws for multilingual language models, revealing that doubling the number of languages requires a 1.18× increase in model size and a 1.66× increase in total training data.
Anthropic Launches Claude Code on Web and Mobile
Anthropic expands the availability of Claude Code, its AI-powered development environment, to web and mobile platforms, enabling developers to write, edit, and execute code directly in a browser or on mobile devices.
AI Agents Evolve: From Assistance to Execution Engines in Enterprise Architecture
A significant shift is occurring in enterprise software architecture as AI agents transition from providing assistance to autonomously executing tasks. This article details the architectural changes, adoption rates, real-world examples, and key considerations for implementing agentic AI, including governance, transparency, and cost management.
NVIDIA Unveils OmniVinci: A Research-Focused Multimodal LLM
NVIDIA Research has released OmniVinci, a research-only large language model designed for cross-modal understanding of text, vision, audio, and robotics data. It demonstrates strong performance while using a smaller training dataset than its competitors, but its non-commercial license has sparked debate within the AI community.
DeepSeek AI Introduces DeepSeek-OCR: A Novel Approach to Context Compression for LLMs
DeepSeek AI has released DeepSeek-OCR, an open-source system that uses optical 2D mapping to compress long text efficiently, which could change how large language models handle extensive inputs.
Google Launches LLM-Evalkit for Data-Driven Prompt Engineering
Google introduces LLM-Evalkit, an open-source framework built on Vertex AI SDKs that standardizes and measures prompt engineering for large language models, promoting a data-driven, collaborative workflow.