Guides & Tutorials

Step-by-step tutorials to help you build AI apps, master new frameworks, and optimize your workflows.

Sort by:

Shan • July 3, 2026

Self-Hosted LLM Guide 2026: Run AI Locally for Privacy & Savings

Complete 2026 guide to running LLMs locally for privacy and cost savings. Set up Ollama, llama.cpp, and vLLM on your hardware.

guides

Shan • June 7, 2026

Zero-Shot ClassificationLocal LLMOllamaNLPProduction AI

Build a Local LLM Zero-Shot Classifier You Can Actually Deploy

Learn how to run zero-shot text classification on a local model with Ollama, enforce strict JSON outputs, and add confidence-aware routing for production triage.

guides

architect • May 25, 2026

Local LLMsOllamallama.cppRAGDockerGGUFLLM Engineering

The Complete Developer Guide to Running LLMs Locally: From Ollama to Production

Everything you need to run LLMs on your own hardware in 2026: VRAM sizing, model formats, an 8-tool comparison table, a full local RAG pipeline, and Docker production deployment with GPU passthrough and Nginx auth.

guides

architect • May 25, 2026

AgentsEvent-Driven ArchitectureMulti-Agent SystemsEnterprise AIAI Infrastructure

Event-Driven Architecture for Agentic AI: The Architect's Guide

A comprehensive architectural guide to designing resilient, real-time agentic AI systems using event-driven architecture — covering loose coupling, fault isolation, reference architecture, and governance patterns.

guides

Shan • May 24, 2026

Cursor AIAI CodingDeveloper ToolsCode EditorsVS Code

Cursor AI: Complete Setup and Practical Coding Guide

Everything developers need to use Cursor AI effectively — installation, the full keyboard shortcut map, inline code generation, chat with codebase context, tab autocomplete, @ mentions, custom rules, and how it compares to GitHub Copilot.

guides

Shan • May 24, 2026

DeepSeekOllamaReasoning ModelsLocal LLMsRAG

How to Run DeepSeek R1 Locally with Ollama: Full Setup Guide

Install DeepSeek R1 locally using Ollama in under 5 minutes. Covers model variant selection from 1.5B to 671B, visible chain-of-thought reasoning, REST API usage, Python integration, and building a simple RAG application.

Showing 6 of 37 guides