Technical Process
The technical process generally follows this sequence:
- Load the CSV file.
- Embed the selected data into a vector database.
- Create a
compression_retrieverthat I’ll introduce in details - Perform a
similarity_search. - Generate a prompt based on previous conversations.
- Define a chain with source data using
RetrievalQAWithSourcesChain - Send the output to the LLM for semantic optimization.
The following diagram illustrates the main work flow of the data processing.
file CSVLoader
file embedding
database vectorstore
file base_retriever
file compression_retriever
file llm
file result
CSVLoader -> embedding
embedding -> vectorstore
vectorstore --> base_retriever
compression_retriever <- base_retriever
compression_retriever --> llm: prompt
llm -> result