Configuration
When creating an AutoRAG instance, you can customize how your RAG pipeline ingests, processes, and responds to data using a set of configuration options. Some settings can be updated after the instance is created, while others are fixed at creation time.
The table below lists all available configuration options:
| Configuration | Editable after creation | Description | 
|---|---|---|
| Data source | no | The source where your knowledge base is stored | 
| Chunk size | yes | Number of tokens per chunk | 
| Chunk overlap | yes | Number of overlapping tokens between chunks | 
| Embedding model | no | Model used to generate vector embeddings | 
| Query rewrite | yes | Enable or disable query rewriting before retrieval | 
| Query rewrite model | yes | Model used for query rewriting | 
| Query rewrite system prompt | yes | Custom system prompt to guide query rewriting behavior | 
| Match threshold | yes | Minimum similarity score required for a vector match | 
| Maximum number of results | yes | Maximum number of vector matches returned (top_k) | 
| Generation model | yes | Model used to generate the final response | 
| Generation system prompt | yes | Custom system prompt to guide response generation | 
| Similarity caching | yes | Enable or disable caching of responses for similar (not just exact) prompts | 
| Similarity caching threshold | yes | Controls how similar a new prompt must be to a previous one to reuse its cached response | 
| AI Gateway | yes | AI Gateway for monitoring and controlling model usage | 
| AutoRAG name | no | Name of your AutoRAG instance | 
| Service API token | yes | API token granted to AutoRAG to give it permission to configure resources on your account. | 
Was this helpful?
- Resources
 - API
 - New to Cloudflare?
 - Products
 - Sponsorships
 - Open Source
 
- Support
 - Help Center
 - System Status
 - Compliance
 - GDPR
 
- Company
 - cloudflare.com
 - Our team
 - Careers
 
- 2025 Cloudflare, Inc.
 - Privacy Policy
 - Terms of Use
 - Report Security Issues
 - Trademark