Skip to content

overview

Krixik Search Pipelines

🇨🇴 Versión en español de este documento

Search Pipeline Overview

Search pipelines are those that enable document search on textual documents. These documents may be the initial pipeline input or they may be generated mid-pipeline, as may for instance be the case in pipelines that begin with transcription (where audio is converted to text) or image captioning (image to text) modules.

Such search capabilities are often employed in RAG (Retrieval-Augmented Generation) systems today, but the pipelines described in this section can also have general application with recommendation systems, image and video retrieval based on content similarity, and personalized content delivery, to name a few possibilitiess.

Two types of document search can be enabled: semantic search and keyword search. Depending on which of these is sought, the final module of the pipeline must respectively be vector-db or keyword-db.

Search pipelines are more complex than other pipelines because they require an additional step.

Search Pipeline Examples