QConAI: Balancing Probabilistic and Deterministic Systems for Reliable Agentic AI
These articles are AI-generated summaries. Please check the original sources for full details.
Balancing Probabilistic and Deterministic Systems for Reliable Agentic AI
At QCon AI NYC 2025, Aaron Erickson positioned agentic AI as a core engineering problem, shifting the focus from prompt engineering to robust system design. He argued that reliable agentic AI requires a blend of probabilistic models for exploration and deterministic systems for execution and control.
Treating agentic AI as a layer over existing operational systems, rather than a replacement, is crucial for achieving reliability; however, relying solely on large language models for complex tasks like SQL query generation leads to rapidly diminishing accuracy as schema complexity increases. This accuracy degradation can translate to significant operational costs and potential failures in production systems.
Key Insights
- Schema Complexity & Accuracy: Accuracy of LLM-generated SQL queries falls sharply with complex schemas and numerous joins.
- Classification vs. Code Generation: Classification tasks exhibit higher reliability than arbitrary code generation.
- Tool Choice Paradox: LLMs struggle with excessive tool options, leading to suboptimal or unsafe selections.
Working Example
(No code included in the source context)
Practical Applications
- Fraud Detection: A system using an agent to classify transactions as potentially fraudulent, then routing them to a deterministic rule-based system for final adjudication.
- Pitfall: Over-reliance on LLMs for complex decision-making without deterministic safeguards can lead to unpredictable and potentially harmful outcomes.
References:
Continue reading
Next article
Why Modern Websites Fail Without a Strategic SEO and UX Foundation
Related Content
The 7 Statistical Concepts You Need to Succeed as a Machine Learning Engineer
Master seven foundational statistical concepts to build reliable machine learning systems, as outlined in a 2025 guide from MachineLearningMastery.com.
OpenAI’s Agent RFT: Reinforcement Fine-Tuning for Tool-Using Agents
OpenAI's Agent RFT, unveiled at QCon AI NYC 2025, uses reinforcement fine-tuning to improve tool-using agent performance by optimizing prompts and tasks before model adjustments.
DPO vs SimPO: Engineering Decisive Preference Optimization for LLMs
Analyze DPO and SimPO objectives to resolve training mismatches and evaluate lift, such as the 22.73% vs 18.18% improvement in SalesConversion-Bench.