Guides & Tutorials

Step-by-step tutorials to help you build AI apps, master new frameworks, and optimize your workflows.

Sort by:
guides
architectMay 25, 2026
Local LLMsOllamallama.cppRAGDockerGGUFLLM Engineering

The Complete Developer Guide to Running LLMs Locally: From Ollama to Production

Everything you need to run LLMs on your own hardware in 2026: VRAM sizing, model formats, an 8-tool comparison table, a full local RAG pipeline, and Docker production deployment with GPU passthrough and Nginx auth.

Read more
guides
architectMay 25, 2026
AgentsEvent-Driven ArchitectureMulti-Agent SystemsEnterprise AIAI Infrastructure

Event-Driven Architecture for Agentic AI: The Architect's Guide

A comprehensive architectural guide to designing resilient, real-time agentic AI systems using event-driven architecture — covering loose coupling, fault isolation, reference architecture, and governance patterns.

Read more
guides
ShanMay 24, 2026
Cursor AIAI CodingDeveloper ToolsCode EditorsVS Code

Cursor AI: Complete Setup and Practical Coding Guide

Everything developers need to use Cursor AI effectively — installation, the full keyboard shortcut map, inline code generation, chat with codebase context, tab autocomplete, @ mentions, custom rules, and how it compares to GitHub Copilot.

Read more
guides
ShanMay 24, 2026
DeepSeekOllamaReasoning ModelsLocal LLMsRAG

How to Run DeepSeek R1 Locally with Ollama: Full Setup Guide

Install DeepSeek R1 locally using Ollama in under 5 minutes. Covers model variant selection from 1.5B to 671B, visible chain-of-thought reasoning, REST API usage, Python integration, and building a simple RAG application.

Read more
guides
ShanMay 24, 2026
DeepSeekLocal LLMsllama.cppGPUGGUF

Run DeepSeek V4 Flash Locally with llama.cpp on a Single GPU

Step-by-step guide to running the full DeepSeek V4 Flash GGUF locally on a single RTX Pro 6000 GPU using a modified llama.cpp build, llama-server, and the Hugging Face hf_transfer downloader.

Read more
guides
ShanMay 24, 2026
Qwen3OllamaLocal LLMsPythonReasoning Models

How to Run Qwen3 Locally with Ollama: Setup, API, and a Gradio App

Set up Qwen3 locally in minutes using Ollama. Covers every model variant, thinking mode control with /think and /no_think tags, CLI, REST API, Python SDK, and a practical Gradio reasoning app.

Read more

Showing 6 of 36 guides