<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>向量检索 on 云原生技术栈</title><link>https://blog.ivanwz.com/tags/%E5%90%91%E9%87%8F%E6%A3%80%E7%B4%A2/</link><description>Recent content in 向量检索 on 云原生技术栈</description><generator>Hugo -- gohugo.io</generator><language>zh-cn</language><managingEditor>hello@example.com (Ivan)</managingEditor><webMaster>hello@example.com (Ivan)</webMaster><copyright>© 2026 Ivan</copyright><lastBuildDate>Fri, 19 Jun 2026 00:00:00 +0000</lastBuildDate><atom:link href="https://blog.ivanwz.com/tags/%E5%90%91%E9%87%8F%E6%A3%80%E7%B4%A2/index.xml" rel="self" type="application/rss+xml"/><item><title>RAG 应用的基础架构拆解</title><link>https://blog.ivanwz.com/posts/posts/ai/llm-rag-starter/</link><pubDate>Wed, 17 Jun 2026 00:00:00 +0000</pubDate><author>hello@example.com (Ivan)</author><guid>https://blog.ivanwz.com/posts/posts/ai/llm-rag-starter/</guid><description>&lt;p&gt;RAG 是把外部知识接入大模型的常见方式。它并不神秘，本质上是一条从文档到答案的工程链路。&lt;/p&gt;

&lt;h2 class="relative group"&gt;文档切分
 &lt;div id="文档切分" class="anchor"&gt;&lt;/div&gt;
 
 &lt;span
 class="absolute top-0 w-6 transition-opacity opacity-0 -start-6 not-prose group-hover:opacity-100 select-none"&gt;
 &lt;a class="text-primary-300 dark:text-neutral-700 !no-underline" href="#%e6%96%87%e6%a1%a3%e5%88%87%e5%88%86" aria-label="锚点"&gt;#&lt;/a&gt;
 &lt;/span&gt;
 
&lt;/h2&gt;
&lt;p&gt;文档需要被切成适合检索的片段。切分太大，召回不精准；切分太小，上下文容易丢失。常见做法是按标题、段落和固定 token 数组合切分。&lt;/p&gt;</description></item></channel></rss>