Skip to main content

Tag

MLOps

Insights and guides tagged MLOps from dypsis.ai.

Running GPUStack with NVIDIA MIG: A Deep Dive into Multi-Instance GPU Orchestration
Article
InfrastructureFebruary 2026

Running GPUStack with NVIDIA MIG: A Deep Dive into Multi-Instance GPU Orchestration

Multi-Instance GPU (MIG) technology promises to maximize GPU utilization by partitioning a single GPU into isolated instances. But getting MIG to work with container orchestration tools like GPUStack requires navigating a maze of CDI configuration, device enumeration, and runtime patches. This technical deep-dive shares our battle-tested solutions.

Frederico VicenteFrederico Vicente
15 min read
Small AI Models in Production
Webinar
Artificial IntelligenceJanuary 2026

Small AI Models in Production

Discover how small, highly capable AI models are enabling faster, more cost-effective, and more controllable AI systems in real production environments. Learn why efficient models matter and when lighter architectures outperform larger ones.

Frederico VicenteFrederico Vicente
60 min
Architecting AI Agent Systems: A Strategic Framework for Production Deployment
Article
Artificial IntelligenceNovember 2025

Architecting AI Agent Systems: A Strategic Framework for Production Deployment

Choosing the right LLM framework is a strategic business decision that determines scalability, cost control, and system resilience. Learn how to navigate the trade-offs between speed, flexibility, and governance when building production-grade AI automation.

Frederico VicenteFrederico Vicente
12 min read
Rethinking AI Coding Agents: From Prompt Completion to Structured Engineering
Article
Software EngineeringNovember 2025

Rethinking AI Coding Agents: From Prompt Completion to Structured Engineering

The bottleneck in AI-assisted development isn't model capability - it's workflow design. Learn how to transform coding agents from autocompleters into systematic engineering partners through structured planning, context engineering, and disciplined process execution.

Frederico VicenteFrederico Vicente
14 min read
Enabling Private LLM Execution: Trusted Execution Environments and Encrypted Containers
Article
InfrastructureNovember 2025

Enabling Private LLM Execution: Trusted Execution Environments and Encrypted Containers

Running LLM inference and fine-tuning on private datasets requires bridging theoretical cryptography with practical high-throughput systems. Learn how TEEs and encrypted containers create compliance-ready, hardware-isolated execution environments for confidential AI workloads.

Frederico VicenteFrederico Vicente
16 min read
Model Context Protocol: Standardizing Context and Tool Integration for Agentic AI
Article
Artificial IntelligenceNovember 2025

Model Context Protocol: Standardizing Context and Tool Integration for Agentic AI

As LLMs evolve from stateless prompt responders to stateful, tool-using agents, fragile hand-wired orchestration is breaking down. MCP provides a vendor-neutral protocol for connecting models with structured context, tools, and external systems at runtime.

Frederico VicenteFrederico Vicente
15 min read
RAG vs Fine-Tuning: Why the Best AI Systems Combine Both
Article
Generative AISeptember 2025

RAG vs Fine-Tuning: Why the Best AI Systems Combine Both

Should you choose Retrieval-Augmented Generation (RAG) or fine-tuning to optimize your LLM? The answer is not either-or. Learn how combining RAG with fine-tuning delivers accuracy, adaptability, and cost efficiency in real-world AI systems.

Frederico VicenteFrederico Vicente
7 min read