Chatbot - Build Vector Store

Chatbot - Building Vector Store

Overview

The Vector Store is a semantic search index that enables RAG (Retrieval-Augmented Generation). Without it, your chatbot uses only general AI knowledge. With it, your chatbot can find and reference your specific documents to provide accurate, context-aware answers.

What is a Vector Store?

Vector Store = Semantic Understanding

  • Converts your documents into numerical representations (embeddings)
  • Understands meaning and context, not just keywords
  • Enables intelligent document retrieval
  • Powers RAG-enhanced responses

Analogy: Think of it like an intelligent librarian who has read all your documents and can instantly find relevant information when asked questions, even if you don't use exact words from the documents.


Prerequisites

Before building a vector store:

1. Documents Must Be Completed

  • All knowledge base items show status: "Completed"
  • No items should be "Processing" or "Failed"
  • Check the Knowledge Base Items table

2. Have Documents Uploaded

  • At least one completed knowledge base item
  • More documents = better knowledge base
  • Quality documents work better than many poor ones

3. Chatbot Must Be Configured

  • You've designed your chatbot (Step 1)
  • System is ready for vector store


Building the Vector Store

Step-by-Step Process

1. Verify Knowledge Base Status

  • Go to your Chatbot project page
  • Scroll to "Knowledge Base Items" section
  • Verify all items show "Completed"
  • If any show "Processing", wait for them to complete
  • If any show "Failed", delete and re-upload

2. Locate Vector Store Section

  • Find "Vector Store" section on project page
  • Shows current status
  • Typically shows "Unavailable" if not built yet

3. Click "Build Vector Store"

  • Button located in workflow or vector store section
  • Clicking starts the build process
  • Confirmation may appear

4. Wait for Processing

  • Status changes to "Processing"
  • Overlay may appear showing progress
  • Can take 5-30 minutes depending on:
  • Number of documents
  • Total document size
  • Server load

5. Monitor Progress

  • Status updates automatically
  • Page shows "Building..." or similar
  • Don't close browser (optional, but recommended)

6. Completion

  • Status changes to "Completed"
  • Vector store is ready
  • Can now use RAG features


What Happens During Building

1. Document Analysis

  • System reads all your knowledge base items
  • Extracts text content
  • Identifies structure and sections

2. Chunking

  • Documents split into smaller chunks
  • Each chunk is a manageable size
  • Overlapping chunks for better context

3. Embedding Generation

  • Each chunk converted to vector embedding
  • Numerical representation of meaning
  • Uses AI models for semantic understanding

4. Index Creation

  • Embeddings stored in vector database
  • Indexed for fast retrieval
  • Organized for efficient searching

5. Completion

  • Vector store ready for queries
  • RAG functionality enabled
  • Can retrieve relevant chunks


Understanding Status Indicators

Vector Store Statuses

Unavailable:

  • No vector store exists
  • Need to build first
  • RAG features not available

Processing:

  • Building in progress
  • Do not interrupt
  • Wait for completion

Updating:

  • Updating existing vector store
  • Usually after adding/removing documents
  • Faster than initial build

Completed:

  • Ready to use
  • RAG features enabled
  • Can query documents


When to Rebuild Vector Store

Automatic Updates

In some cases, vector store updates automatically:

  • After significant changes
  • When documents added/removed
  • System-triggered updates

Manual Rebuild

Rebuild When:

  • Added many new documents
  • Removed important documents
  • Want to refresh embeddings
  • Noticing poor RAG performance

How to Rebuild: 1. Click "Recreate Vector Store" or "Rebuild" 2. Confirm action 3. Wait for completion 4. Typically faster than initial build

Rebuild Process:

  • May show "Updating" instead of "Processing"
  • Preserves existing structure where possible
  • Updates with new content


Processing Time

Factors Affecting Build Time

Document Count:

  • Few documents (< 10): 5-10 minutes
  • Medium (10-50): 10-20 minutes
  • Many documents (50+): 20-30+ minutes

Document Size:

  • Small documents: Faster processing
  • Large documents: Slower processing
  • Total content volume matters most

Server Load:

  • Busy times: May take longer
  • Off-peak: Usually faster
  • Professional accounts may have priority

Estimated Times

| Documents | Estimated Time | |-----------|----------------| | 1-5 | 5-10 minutes | | 6-20 | 10-15 minutes | | 21-50 | 15-25 minutes | | 51-100 | 25-35 minutes | | 100+ | 30-60+ minutes |

Actual times may vary


Troubleshooting

Vector Store Won't Build

Problem: Documents Not Completed

  • Solution: Wait for all items to show "Completed"
  • Check for failed items
  • Delete and re-upload failed items

Problem: Build Stuck in Processing

  • Solution: Wait 30-45 minutes
  • Refresh the page
  • Contact support if stuck > 1 hour

Problem: Build Failed

  • Solution: Check error message
  • Verify documents are valid
  • Try rebuilding
  • Contact support if persists

Poor RAG Performance

Problem: Irrelevant Results

  • Solution: Rebuild vector store
  • Ensure documents are relevant
  • Add more quality documents
  • Check document quality

Problem: Missing Information

  • Solution: Add more documents
  • Rebuild vector store
  • Ensure documents contain needed info


Using Vector Store

Once Built

RAG Features Enabled:

  • Chatbot can query your documents
  • Retrieves relevant information
  • Combines with AI for answers
  • More accurate responses

Testing RAG:

  • Use "Compare AI vs RAG Chatbot" feature
  • Test queries about your documents
  • See difference in responses
  • Verify accuracy

Best Practices

✅ Keep Documents Updated

  • Refresh documents when content changes
  • Rebuild vector store after major updates

✅ Monitor Performance

  • Test chatbot responses
  • Verify accuracy
  • Rebuild if performance degrades

✅ Quality Matters

  • Better documents = better vector store
  • Well-structured content works best
  • Remove outdated or irrelevant content


Account Considerations

All Account Types

  • Can build vector stores
  • Same functionality
  • Processing time similar

Professional Accounts

  • May have priority processing
  • Can handle larger vector stores
  • More storage capacity

Technical Details

How It Works

Embedding Model:

  • Uses advanced AI models
  • Converts text to numerical vectors
  • Captures semantic meaning
  • Understands relationships

Storage:

  • Stored efficiently
  • Fast retrieval
  • Scalable to large datasets
  • Optimized for search

Retrieval:

  • Semantic similarity search
  • Finds relevant chunks
  • Ranks by relevance
  • Returns top matches


Next Steps

After vector store is built:

1. Compare AI vs RAG - See the difference 2. Deploy Your Chatbot - Go live with RAG 3. Test Queries - Try questions about your documents 4. Monitor Performance - Check response accuracy

Your vector store is the foundation of RAG - build it carefully and maintain it well!

Chatbot Icon
AIInSync Chatbot