Skip to main content
← All Tags

Generative AI

24 articles in this category

AI NewsBackend EngineeringGenerative AI

Why Backend Engineering is Fundamental to Generative AI Systems

Backend engineers are uniquely positioned to solve the systems engineering challenges inherent in scaling Generative AI beyond simple demos.

Read more
AI NewsVoice AIGenerative AI

Sakana AI Introduces KAME: Real-Time LLM Knowledge Injection for Near-Zero Latency Speech

Sakana AI's new KAME architecture boosts S2S model MT-Bench scores from 2.05 to 6.43 while maintaining near-zero latency by injecting back-end LLM knowledge in real-time.

Read more
AI NewsGenerative AIAI Infrastructure

Mastering OpenAI GPT-OSS: A Technical Guide to Open-Weight Inference Workflows

Deploy OpenAI's gpt-oss-20b using native MXFP4 quantization on hardware with 16GB VRAM for advanced structured generation and tool use.

Read more
AI NewsAgentic AIGenerative AI

How to Build a Secure Local-First Agent Runtime with OpenClaw

Learn to deploy a secure local-first agent runtime using OpenClaw Gateway and structured skills, enabling deterministic tool execution and model routing within a loopback-bound environment.

Read more
AI NewsGenerative AISoftware Engineering

Liquid AI LFM2-24B-A2B: Hybrid Architecture for Efficient Edge-Capable AI

Liquid AI's LFM2-24B-A2B model uses a 1:3 Attention-to-Base ratio and Sparse MoE to deliver 24B parameter intelligence with only 2.3B active parameters, fitting into 32GB of RAM for high-performance edge deployment.

Read more
AI NewsDeveloper ToolsGenerative AI

Gemini AI Accelerates Developer Portfolio Build

Irina Zaytseva's portfolio build saw significant acceleration via Gemini Code Assist and the Gemini CLI, reducing context switching.

Read more
AI NewsGenerative AIVideo

Waypoint-1: Real-time Interactive Video Diffusion

Overworld releases Waypoint-1, achieving 30,000 token-passes/sec on a 5090 GPU for real-time interactive video generation.

Read more
AI NewsGenerative AIVideo Production

Higgsfield Leverages OpenAI Models to Generate 4 Million Cinematic Social Videos Daily

Higgsfield utilizes GPT-4.1, GPT-5, and Sora 2 to produce 4 million short-form videos daily, increasing share velocity by 150%.

Read more
AI NewsRoboticsGenerative AI

NVIDIA brings agents to life with DGX Spark and Reachy Mini

NVIDIA unveiled a system combining the DGX Spark and Reachy Mini robot, requiring approximately 93GB of disk space for the reasoning and vision models.

Read more
AI NewsArtificial IntelligenceGenerative AI

Tencent Releases HY-Motion 1.0: A Billion-Parameter Text-to-Motion Model

Tencent’s HY-Motion 1.0 achieves a 78.6% SSAE score, representing a significant advance in text-to-3D human motion generation.

Read more
AI NewsGenerative AIAI Architecture

SIMA 2 Uses Gemini and Self-Improvement to Generalize Across Unseen 3D and Photorealistic Worlds

Google DeepMind’s SIMA 2 agent, built on the Gemini model, demonstrates robust generalization across multiple 3D game environments and novel photorealistic settings.

Read more
AI NewsGenerative AIRetail

Target Improves Add to Cart Interactions by 11 Percent with Generative AI Recommendations

Target’s GRAM system, leveraging large language models, increased add-to-cart interactions for Home accessories by 11%.

Read more
AI NewsGenerative AIFilmmaking

Higgsfield Cinema Studio: AI Filmmaking with Real Camera Controls

Higgsfield Cinema Studio offers filmmakers precise control over AI video generation, moving beyond lottery-style prompting to achieve cinematic intent.

Read more
AI NewsUI/UXGenerative AI

Promptions: Dynamic prompting UI that improves gen AI interaction

Promptions helps developers add dynamic controls to AI chat interfaces, improving user precision and reducing prompt engineering effort.

Read more
AI NewsGenerative AIFinancial Technology

Robinhood's LoRA Fine-Tuning Cuts AI Latency by 50% in Production

Robinhood's LoRA fine-tuning cuts AI latency by 50% in production, achieving 1-2 second response times.

Read more
AI NewsMicrosoftGenerative AI

Microsoft Copilot Fall Release Includes Collaboration and Personalization Features

Microsoft's Copilot Fall Release adds collaboration tools and health-focused AI, targeting 40% of users with weekly health queries.

Read more
AI NewsGenerative AICode Generation

Developing Claude Code at Anthropic at AI Speed

Anthropic's Claude Code generates 90% of its production code, redefining AI-driven software development at QConSF 2025.

Read more
AI NewsCloud ComputingGenerative AI

Announcing the updated AWS Well-Architected Generative AI Lens

AWS updated the Well-Architected Generative AI Lens with new best practices for responsible AI, data architecture, and agentic workflows.

Read more
AI NewsGenerative AIComputer Vision

Learn-to-Steer: NVIDIA’s 2025 Spatial Fix for Text-to-Image Diffusion

NVIDIA’s Learn-to-Steer framework improves spatial reasoning in text-to-image models, achieving gains on GenEval and T2I-CompBench.

Read more
AI NewsLarge Concept ModelsGenerative AI

Kimi’s K2 Opensource LLM Achieves 71.3% on SWE-Bench Verified

Kimi released K2, a 1.04 trillion parameter Mixture-of-Experts model, achieving 71.3% on the SWE-Bench Verified benchmark.

Read more
AI NewsGenerative AIArtificial Intelligence

MBZUAI Researchers Introduce PAN: A General World Model For Interactable Long Horizon Simulation

MBZUAI’s PAN world model achieves 70.3% agent simulation accuracy, enabling interactive long-horizon video generation.

Read more
AI NewsGenerative AIAI Architecture

Anthropic Launches Sandboxed Claude Code with Web Access for Enhanced AI Coding Security

Anthropic released sandboxing and a web version of Claude Code, mitigating security risks associated with AI code generation and reducing developer approval fatigue.

Read more
AI NewsAI ArchitectureGenerative AI

New Claude Haiku 4.5 Model Promises Faster Performance at One-Third the Cost

Anthropic’s Claude Haiku 4.5 delivers performance comparable to Sonnet 4 at one-third the cost and twice the speed.

Read more
AI NewsAI ArchitecturePlatform Engineering

Inside the Architectures Powering Modern AI Systems: QCon San Francisco 2025

QCon San Francisco 2025 focuses on real-world AI architecture challenges, featuring insights from Netflix, Meta, Intuit, and Anthropic on building scalable, reliable AI systems and infrastructure.

Read more