Building upon our previous work with AI Search, I implemented a Naive RAG (Retrieval-Augmented Generation) system in Copilot Studio.
Naive RAG
- Retrieve relevant information from external databases or documents based on user queries
- Generate responses using a language model (such as GPT) based on the retrieved information
While Advanced RAG techniques and Agentic Workflow/Agentic Design Patterns are becoming increasingly popular, I decided to start with the basic Naive RAG implementation as a learning exercise.
Tip: Official Recommendation for Using Knowledge Feature
If you need to create a custom Copilot / RAG (Retrieval Augmented Generation) application to chat with SharePoint data, the recommended approach is to use Microsoft Copilot Studio instead of this preview feature.
Excerpt from official documentation
While I’m building a custom implementation for this experiment, for production environments, a practical approach would be to first try the Knowledge feature, and only develop a custom RAG solution if the accuracy doesn’t meet your requirements.
data:image/s3,"s3://crabby-images/0651d/0651d3f6c2cdfec668dba45a3838655227c2982a" alt=""
Implementation
data:image/s3,"s3://crabby-images/65d1c/65d1c3fdb80c3d5ca5bcc25bb8ea03171e9a1795" alt=""
Note: We use recent information because using older data wouldn’t be interesting – the LLM could answer queries using its own knowledge base.
data:image/s3,"s3://crabby-images/fa442/fa44236c1ecef6d6bb58243af3a5f24d16f2aa63" alt="Screenshot showing SharePoint data structure and chunking"
For details on data preparation and AI Search integration, please refer to:
data:image/s3,"s3://crabby-images/924d1/924d16e93b35db6f0719df2071ac7ef69328b7ff" alt=""
- Create topics and generate search keywords
- Perform search (AI Search call)
- Generate responses and send messages
- Optional: Invoke from Conversational boosting
Step 1: Create Topics and Generate Search Keywords
data:image/s3,"s3://crabby-images/08ab0/08ab0c4ec7367e869b636d0ab860aa4c5984ca36" alt="Screenshot showing topic creation and prompt action setup"
Note: Since our main goal here is to implement a basic Naive RAG system, we’re not focusing on optimizing accuracy at this stage.
data:image/s3,"s3://crabby-images/83417/8341781d9347714f8a44a3ec6bb5bcb5fbda4e1c" alt="Screenshot showing prompt configuration with few-shot examples"
Here are examples of user questions and their corresponding search queries. Use these examples to guide your transformation of the input question. Example Input 1: "How to connect to a database using Python?" Example Output: "Python database connection method" Example Input 2: "What are the effects of climate change on ecosystems?" Example Output: "climate change ecosystem impact" Example Input 3: "What are the ethical challenges of AI?" Example Output: "AI ethics challenges" Now, based on these examples, transform the following user question into an effective search query. Input: {input}
Step 2: Search (AI Search Call)
data:image/s3,"s3://crabby-images/be427/be427ad7cab399e9cbd2140d5c5d3f9522b9c0b0" alt="Screenshot showing AI Search HTTP request configuration"
// POST to the following URL https://[AI-Search-Resource-Name].search.windows.net/indexes/[Index-Name]/docs/search?api-version=2024-11-01-preview
Note: This is a sample implementation, so we’re handling the API key in a simplified manner.
data:image/s3,"s3://crabby-images/bf32e/bf32e0c8c752dd6f00a702e5390bd8f12fca1be7" alt="Screenshot showing header configuration with API key"
Note: We’ll generate responses from the top 3 search results.
data:image/s3,"s3://crabby-images/e4782/e47821c7b61fb62edfc7b8f3f15fdec46e802072" alt="Screenshot showing JSON body configuration"
JSON({ search:Topic.queryOutput.structuredOutput.query, count:true, top:Topic.Top // Variable "Top" is preset to 3 })
Note: The schema definition varies depending on your AI Search index – it’s easiest to generate it from a sample response.
kind: Record properties: @odata.context: String @odata.count: Number value: type: kind: Table properties: @search.score: Number content: String id: String metadata_spo_item_content_type: String metadata_spo_item_last_modified: String metadata_spo_item_name: String metadata_spo_item_path: String metadata_spo_item_size: Number
Step 3: Generate Response and Send Message
data:image/s3,"s3://crabby-images/f8f92/f8f9266f38fc1883e0b49af4c03136cd69d6e6d3" alt="Screenshot showing prompt action configuration for response generation"
// Pass the "file name" and "file content" as search results JSON( ForAll(Topic.search_result.value, { fileName:metadata_spo_item_name,content:content } ))
data:image/s3,"s3://crabby-images/6fcec/6fcec347fedf7669902afdb84b45188a3e21d657" alt="Screenshot showing prompt configuration with JSON output format"
If the query cannot be answered based on the provided context, respond with 'Unable to answer this question with the provided information.' Otherwise, generate a complete and accurate answer, and as an annotation, output the name of the referenced files at the end.
Step 4 (Optional): Invoke from Conversational Boosting
Finally, since I plan to experiment with various RAG methods in the future, I’ll set up Conversational boosting to handle user messages.
data:image/s3,"s3://crabby-images/53db6/53db6cbad6ae2a0ed143ee1988967b4d94b57509" alt="Screenshot showing RAG method selection interface"
data:image/s3,"s3://crabby-images/bc6dc/bc6dc6f96544aaf614cb1aa2f7b82960688ea99a" alt="Screenshot showing redirection flow based on RAG method selection"
This completes the implementation.
Testing the Implementation
data:image/s3,"s3://crabby-images/2c225/2c22512e27e5824b127ee477546c1fa068b263b3" alt="Screenshot showing successful RAG response"
data:image/s3,"s3://crabby-images/5b7a2/5b7a2f575dff7f68e5afbcc7a02fe5dbfe3fb287" alt="Screenshot showing 4o's response with inaccuracies"
data:image/s3,"s3://crabby-images/3251a/3251a16af7141a0b538b297abcbf8cf3c439cd6b" alt="Screenshot showing system limitations with certain queries"
In future articles, I’ll explore various Advanced RAG implementations to improve the system’s performance.
コメント