Tag

MLOps

Insights and guides tagged MLOps from dypsis.ai.

InfrastructureFebruary 2026

Running GPUStack with NVIDIA MIG: A Deep Dive into Multi-Instance GPU Orchestration

Multi-Instance GPU (MIG) technology promises to maximize GPU utilization by partitioning a single GPU into isolated instances. But getting MIG to work with container orchestration tools like GPUStack requires navigating a maze of CDI configuration, device enumeration, and runtime patches. This technical deep-dive shares our battle-tested solutions.

Frederico Vicente

15 min read

Webinar

Artificial IntelligenceJanuary 2026

Small AI Models in Production

Discover how small, highly capable AI models are enabling faster, more cost-effective, and more controllable AI systems in real production environments. Learn why efficient models matter and when lighter architectures outperform larger ones.

Frederico Vicente

60 min

Architecting AI Agent Systems: A Strategic Framework for Production Deployment

Article

Artificial IntelligenceNovember 2025

Architecting AI Agent Systems: A Strategic Framework for Production Deployment

Choosing the right LLM framework is a strategic business decision that determines scalability, cost control, and system resilience. Learn how to navigate the trade-offs between speed, flexibility, and governance when building production-grade AI automation.

Frederico Vicente

12 min read

Rethinking AI Coding Agents: From Prompt Completion to Structured Engineering

Article

Software EngineeringNovember 2025

Rethinking AI Coding Agents: From Prompt Completion to Structured Engineering

The bottleneck in AI-assisted development isn't model capability - it's workflow design. Learn how to transform coding agents from autocompleters into systematic engineering partners through structured planning, context engineering, and disciplined process execution.

Frederico Vicente

14 min read

Enabling Private LLM Execution: Trusted Execution Environments and Encrypted Containers

Article

InfrastructureNovember 2025

Enabling Private LLM Execution: Trusted Execution Environments and Encrypted Containers

Running LLM inference and fine-tuning on private datasets requires bridging theoretical cryptography with practical high-throughput systems. Learn how TEEs and encrypted containers create compliance-ready, hardware-isolated execution environments for confidential AI workloads.

Frederico Vicente

16 min read

Model Context Protocol: Standardizing Context and Tool Integration for Agentic AI

Article

Artificial IntelligenceNovember 2025

Model Context Protocol: Standardizing Context and Tool Integration for Agentic AI

As LLMs evolve from stateless prompt responders to stateful, tool-using agents, fragile hand-wired orchestration is breaking down. MCP provides a vendor-neutral protocol for connecting models with structured context, tools, and external systems at runtime.

Frederico Vicente

15 min read

RAG vs Fine-Tuning: Why the Best AI Systems Combine Both

Article

Generative AISeptember 2025

RAG vs Fine-Tuning: Why the Best AI Systems Combine Both

Should you choose Retrieval-Augmented Generation (RAG) or fine-tuning to optimize your LLM? The answer is not either-or. Learn how combining RAG with fine-tuning delivers accuracy, adaptability, and cost efficiency in real-world AI systems.

Frederico Vicente

7 min read

← View all insights