DEV Community

# rag

Retrieval augmented generation, or RAG, is an architectural approach that can improve the efficacy of large language model (LLM) applications by leveraging custom data.

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
Deploying Scalable LLM Tools via Remote MCP on Kubernetes

Deploying Scalable LLM Tools via Remote MCP on Kubernetes

Comments
10 min read
Build a Knowledge-Based Q&A Bot using Bedrock + S3 + DynamoDB/OpenSearch via AWS CDK

Build a Knowledge-Based Q&A Bot using Bedrock + S3 + DynamoDB/OpenSearch via AWS CDK

Comments
25 min read
VectorDatabase Showdown 2025: Pinecone vs Qdrant vs Weaviate con Benchmarks Reales

VectorDatabase Showdown 2025: Pinecone vs Qdrant vs Weaviate con Benchmarks Reales

Comments
3 min read
Retrieval-Augmented Generation (RAG) Agents: How to Build Grounded, Tool‑Using GenAI Systems

Retrieval-Augmented Generation (RAG) Agents: How to Build Grounded, Tool‑Using GenAI Systems

Comments
9 min read
Why RAG is the Future of Search (And How Elastic Search Makes it Possible )

Why RAG is the Future of Search (And How Elastic Search Makes it Possible )

1
Comments
4 min read
Graph RAG: Why Vector Search Alone Is Not Enough for Serious Backend Systems

Graph RAG: Why Vector Search Alone Is Not Enough for Serious Backend Systems

Comments
2 min read
Build a RAG Pipeline with n8n: Visual Workflows vs. Code-First

Build a RAG Pipeline with n8n: Visual Workflows vs. Code-First

Comments
6 min read
If your agent can delete user data, your prompt isn’t a prompt, it’s a contract

If your agent can delete user data, your prompt isn’t a prompt, it’s a contract

Comments 2
4 min read
RAG Without the Internet - Lessons From Building an Internal-Only AI Assistant on Markdown and Confluence

RAG Without the Internet - Lessons From Building an Internal-Only AI Assistant on Markdown and Confluence

Comments
6 min read
Building a RAG based agent using DronaHQ

Building a RAG based agent using DronaHQ

Comments
8 min read
AI Agents Feel Simple in Demos and Complicated in Production

AI Agents Feel Simple in Demos and Complicated in Production

Comments
2 min read
The Tradeoffs Behind AI Agents

The Tradeoffs Behind AI Agents

Comments
1 min read
Optimizing Milvus Standalone for Production: Achieving 72% Memory Reduction While Maintaining Performance

Optimizing Milvus Standalone for Production: Achieving 72% Memory Reduction While Maintaining Performance

Comments
3 min read
**Chunklet-py: One Library to Split Them All - Sentence, Code, Docs**

**Chunklet-py: One Library to Split Them All - Sentence, Code, Docs**

Comments
2 min read
The Missing Step in RAG: Why Your Vector DB is Bloated (and how to fix it locally)

The Missing Step in RAG: Why Your Vector DB is Bloated (and how to fix it locally)

1
Comments
3 min read
Building a Layered Memory Architecture for Long-Term AI Learning

Building a Layered Memory Architecture for Long-Term AI Learning

Comments
11 min read
Beyond Simple RAG: Building an Agentic Workflow with Next.js, Python, and Supabase

Beyond Simple RAG: Building an Agentic Workflow with Next.js, Python, and Supabase

Comments 1
2 min read
Dual-Source AI: Integrating RAG & Live Search for Real-Time Answers

Dual-Source AI: Integrating RAG & Live Search for Real-Time Answers

Comments
5 min read
RAG(Retrieval-Augmented Generation) Demystified: A Question-First Guide for Software Developers

RAG(Retrieval-Augmented Generation) Demystified: A Question-First Guide for Software Developers

Comments
7 min read
Data Governance in RAG Systems: Security, Privacy, and Compliance by Design

Data Governance in RAG Systems: Security, Privacy, and Compliance by Design

1
Comments
4 min read
The Orphan Axiom Problem in Ontology-Based RAG

The Orphan Axiom Problem in Ontology-Based RAG

Comments
6 min read
Building a Scalable RAG System for Repository Intelligence

Building a Scalable RAG System for Repository Intelligence

Comments 1
3 min read
Agentic RAG: Letting LLMs Choose What to Retrieve

Agentic RAG: Letting LLMs Choose What to Retrieve

Comments
11 min read
Optimal Chunking for Ontology RAG: Empirical Analysis & Orphan Axiom Problem

Optimal Chunking for Ontology RAG: Empirical Analysis & Orphan Axiom Problem

Comments
12 min read
Semantic Caching with Bifrost: Reduce LLM Costs and Latency by Up to 70%

Semantic Caching with Bifrost: Reduce LLM Costs and Latency by Up to 70%

Comments
7 min read
loading...