GenAI RAG

GenAI RAG is Nexla's conversational AI for querying your data. It uses an AI agent to search across one or more nexsets, reason over the retrieved data, and generate natural language answers with inline citations. The agent dynamically decides which nexsets to query, what search terms to use, and how to combine results.

You can use GenAI RAG through the web UI or the API.

Web UI

The web interface at genai.nexla.com provides a chat-based experience with real-time streaming, citation exploration, and filter management — no code required.

| Guide | Description |
| --- | --- |
| Getting Started | Set up authentication, select credentials and nexsets, and ask your first question |
| Chat & Querying | Full guide to the chat interface, streaming, citations, and message actions |
| Canvas Panel | Explore citations, sources, tool calls, and the agent's reasoning trace |
| Settings | Authentication, response tuning, filter management, and sidebar configuration |
| Keyboard Shortcuts | Keyboard shortcuts for common actions |

API

The GenAI RAG API provides programmatic access at api-genai.nexla.io. All requests require an Authorization header.

Base URL: https://api-genai.nexla.io

Authentication (either header form is accepted):

```
Authorization: Basic <your_service_key>
Authorization: Bearer <your_service_key_or_jwt>
```

The server detects the credential type automatically. A Bearer-prefixed value with three dot-separated segments is treated as a JWT; anything else is treated as a Nexla service key. When a JWT is supplied, its user_id claim takes precedence over user_context.user_id in the request body, and an org_id claim is required (otherwise the request is rejected with 401).
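The detection rule described above can be sketched in Python. This is a hypothetical helper for illustration, not part of any Nexla SDK; it only mirrors the stated rule (Bearer value with three dot-separated segments ⇒ JWT, anything else ⇒ service key):

```python
def classify_credential(header: str) -> str:
    """Mirror the server's stated detection rule: a Bearer-prefixed
    value with three dot-separated segments is treated as a JWT;
    anything else is treated as a Nexla service key."""
    scheme, _, value = header.partition(" ")
    if scheme == "Bearer" and len(value.split(".")) == 3:
        return "jwt"
    return "service_key"
```

For example, `classify_credential("Bearer aaa.bbb.ccc")` classifies as a JWT, while `classify_credential("Basic abc123")` and an opaque Bearer token both classify as a service key.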

A missing or invalid credential returns 401 Unauthorized.

You can create a service key from https://dataops.nexla.io/settings/authentication.

| Endpoint Group | Base Path | Description |
| --- | --- | --- |
| Agentic RAG Query | /v2/agentic-rag | Query nexsets with the AI agent; streaming, citations, multi-turn conversations, and cache management |
| Filter Registration | /v2/nexsets/{id}/filters | Register and manage per-nexset filter schemas for access control and pre-retrieval filtering |
| Available Models | /list_models | List available LLM and embedding models by provider |
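As a minimal sketch, a request to the Available Models endpoint can be prepared with Python's standard library. The path and header come from the table above; the helper name and the response handling are assumptions for illustration:

```python
import urllib.request

BASE_URL = "https://api-genai.nexla.io"

def build_list_models_request(service_key: str) -> urllib.request.Request:
    """Build (but do not send) a GET request to /list_models,
    authenticated with a Nexla service key."""
    return urllib.request.Request(
        f"{BASE_URL}/list_models",
        headers={"Authorization": f"Basic {service_key}"},
        method="GET",
    )

# Sending it would be:
# with urllib.request.urlopen(build_list_models_request(key)) as resp:
#     models = resp.read()
```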

Quick Start (API)

Send a query to the Agentic RAG endpoint:

```shell
curl -X POST https://api-genai.nexla.io/v2/agentic-rag \
  -H "Content-Type: application/json" \
  -H "Authorization: Basic YOUR_API_KEY" \
  -d '{
    "user_prompt": "What are the latest sales figures for Q2?",
    "nexsets": ["10000", "10001"],
    "user_context": {
      "user_id": "user-123"
    },
    "llm_config": {
      "credential_id": "cred-456"
    }
  }'
```
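The same request can be assembled in Python with the standard library. The field names and values mirror the curl example above; the helper function is a hypothetical convenience, and the final network call is left commented out:

```python
import json
import urllib.request

def build_rag_query(prompt, nexset_ids, user_id, credential_id):
    """Assemble the JSON body for POST /v2/agentic-rag,
    mirroring the fields in the curl quick start."""
    return {
        "user_prompt": prompt,
        "nexsets": nexset_ids,
        "user_context": {"user_id": user_id},
        "llm_config": {"credential_id": credential_id},
    }

body = build_rag_query("What are the latest sales figures for Q2?",
                       ["10000", "10001"], "user-123", "cred-456")
req = urllib.request.Request(
    "https://api-genai.nexla.io/v2/agentic-rag",
    data=json.dumps(body).encode(),
    headers={"Content-Type": "application/json",
             "Authorization": "Basic YOUR_API_KEY"},
    method="POST",
)
# urllib.request.urlopen(req) would send the query.
```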

The response contains the generated answer with inline citation markers and full source metadata:

```json
{
  "answer": "The Q2 sales figures show a 12% increase over Q1, reaching $4.2M in total revenue [1].",
  "citations": [
    {
      "index": 1,
      "nexset_id": "10000",
      "nexset_name": "Sales Reports",
      "document_id": "doc-q2-2025",
      "title": "Q2 2025 Revenue Summary",
      "page_numbers": [3],
      "relevance_score": 0.94
    }
  ],
  "usage": {
    "requests": 3,
    "tool_calls": 2,
    "input_tokens": 1250,
    "output_tokens": 340,
    "total_tokens": 1590
  },
  "model": "gpt-4o",
  "provider": "openai"
}
```
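One way to consume such a response is to map the inline [n] markers back to their sources. The following is a hypothetical helper; the field names match the sample response above:

```python
def footnotes(response: dict) -> list[str]:
    """Render each citation as a '[n] Title (Nexset, p. x, y)' footnote line,
    using the citation fields shown in the sample response."""
    lines = []
    for c in response["citations"]:
        pages = ", ".join(str(p) for p in c.get("page_numbers", []))
        line = f"[{c['index']}] {c['title']} ({c['nexset_name']}"
        line += f", p. {pages})" if pages else ")"
        lines.append(line)
    return lines
```

Applied to the sample above, this yields `["[1] Q2 2025 Revenue Summary (Sales Reports, p. 3)"]`, which can be printed beneath the answer text.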