AI Studio is where you compose your model, knowledge, tools and guardrails into a production-grade assistant. Bring your own provider — OpenAI, Anthropic, Google, Mistral, or open-weight Llama on your GPUs — wire in retrieval over your docs, expose your existing APIs as tools, and deploy to WhatsApp, web chat, email or MCP clients in minutes.
There are two ways to ship AI in production today, and both are broken. Option one is a vendor chatbot that answers FAQs from a scraped sitemap — it cannot place an order, check a balance, or modify a subscription. It is a glorified search box. Option two is a six-month internal build where engineering wraps OpenAI, writes the embedding pipeline, builds a tool layer, adds rate limiting, builds an evaluation harness, and ships an MVP just as the model landscape shifts.
The gap is where most operators live. They need an agent that knows their catalog, their pricing rules, their refund policy, and can actually act on those — issue a refund, escalate a ticket, book a slot — not just chat about them. They need this in weeks, not quarters. They need it auditable, multi-tenant, swappable across providers, and cheap to evaluate.
AI Studio is built for that gap. Pick a model, point at your knowledge sources, declare your tools, set your guardrails, and you have a deployable agent. Swap the model next quarter without rewriting your tools. Switch providers without re-embedding your docs. Ship the same agent to WhatsApp, web chat and your internal Slack with one config.
AI Studio is a composition surface for production AI agents. The unit is an Assistant: a configured combination of model, system prompt, knowledge sources, tools, and guardrails. You can have many assistants per tenant — a sales agent, a support agent, an internal HR bot — each with its own scope and personality. Each assistant is versioned, evaluated, and deployable to any channel through SabFlow nodes or our API.
Retrieval is first-class. Connect a knowledge source (PDF folder, Notion workspace, Google Drive, public website, custom database query) and AI Studio handles chunking, embedding (using your provider or our local model), vector storage, and re-ranking. Every assistant response includes citation links back to the source chunks, and you can see retrieval quality in the eval harness. Updating a doc re-embeds only the changed chunks — incremental, fast, cheap.
Tool-calling turns the assistant from a talker into a doer. Declare a tool — `get_order_status`, `issue_refund`, `book_slot` — with a JSON schema and a backing handler (HTTP endpoint, SabFlow, or built-in CRM action). The model decides when to call the tool based on the conversation. You see every tool call in the trace, can require human approval for sensitive operations, and rate-limit per assistant or per contact. The MCP server exposes these same tools to external AI clients (Claude Desktop, Cursor, etc.) over the Model Context Protocol.
Safety and observability are non-negotiable. Every assistant ships with configurable guardrails: PII redaction on inputs (Aadhaar, PAN, credit cards, emails), output filters (no medical advice, no financial recommendations), refusal policies, and token budgets per conversation. The audit log captures every prompt, every tool call, every response — exportable for compliance review. For India deployments we honor DPDP requirements; for EU, GDPR; for healthcare, basic HIPAA-aligned redaction.
Capabilities
Switch between OpenAI (GPT-4, GPT-4o), Anthropic (Claude Sonnet, Opus, Haiku), Google (Gemini Pro, Flash), Mistral, Cohere, or self-hosted Llama / Qwen / DeepSeek behind a unified interface. Same prompt, same tools, different backend. Swap providers per assistant or A/B test two side by side.
Ingest PDFs, web pages, Notion, Google Drive, S3, GitHub wikis, or a SQL query result. We handle chunking (token-aware, semantic), embedding, vector storage and re-ranking. Every response cites the chunks it drew from. Re-index incrementally as sources change.
Declare tools with JSON schema. Backing handler can be an HTTP endpoint, a SabFlow, or a built-in CRM action (create_lead, move_stage, add_tag). The model decides when to call. Required-approval mode pauses execution for human sign-off on sensitive tools.
Every assistant is automatically exposed as an MCP server endpoint. Connect Claude Desktop, Cursor, Zed or any MCP client and the tools you defined for WhatsApp work in your IDE. One source of truth for AI actions across customer and team-facing surfaces.
Upload a CSV of (input, expected_output) pairs or curate from real conversations. Run an eval against any model and prompt combination. See win-rate, latency, cost and failure modes side-by-side. Block deploys that regress eval scores below the threshold.
Input filters strip Aadhaar, PAN, GST numbers, credit cards, IBAN, US SSN, emails and phone numbers before they reach the model. Output filters block policy violations. Token budgets cap runaway conversations. All configurable per assistant.
Every assistant invocation captures the full trace: input message, retrieved chunks, system prompt, model response, tool calls, final output, latency, cost. Export for compliance, replay for debugging, or pipe into your observability stack via webhook.
Use cases
Assistant indexes the product catalog, ingredient docs and review summaries. Tools include `find_products`, `check_stock`, `add_to_cart`. A customer asks "which face wash for oily skin under ₹500" and the agent searches, filters, and replies with three options and add-to-cart buttons. Conversion lifts 2-3× over static catalog browsing.
Indexed on policy docs, leave calendar API and payroll system. Tools include `request_leave`, `download_payslip`, `check_balance`. Deployed to Slack and the company intranet. Cuts HR ticket volume by 70% and gives the policy team a metric for where docs are unclear.
Agent ingests product factsheets and regulatory rules. Tools include `pull_credit_report`, `calculate_emi`, `start_application`. PII redaction strips Aadhaar before logging. Required-approval mode forces human sign-off on `start_application`. Trace export feeds RBI audit reports.
Assistant indexes course catalogs, fee structures, and admission timelines in five languages. Tools include `book_campus_visit`, `request_brochure`, `connect_counsellor`. Deployed on WhatsApp and the .edu site. Handles 12,000 monthly applicants with two human counsellors on standby.
Knowledge sources include the shipment SOP wiki and the tracking API. Tools include `track_shipment`, `raise_dispute`, `request_redelivery`. Deflects 80% of "where is my package" tickets while preserving the option for hand-off to a human for genuinely stuck shipments.
How it works
AI Studio is included on every SabNode workspace. No separate billing, no extra setup, flip it on from your workspace settings.
Choose your provider, paste an API key (or use SabNode-managed credits), set temperature, max tokens, and the system prompt. Start with a template — sales, support, internal — and customise.
Connect Drive, Notion, S3, a website, or upload PDFs. AI Studio chunks, embeds and stores vectors. Initial ingestion runs in the background and surfaces progress per source.
Add tools with JSON schema and backing handler. Toggle guardrails: PII redaction, output filters, token budgets, refusal patterns. Mark sensitive tools as approval-required.
Run the eval harness against a curated set. See win-rate, cost and latency. Iterate on prompt, model or tools until the assistant clears your threshold.
Drop an AI Generate node in a SabFlow, expose the MCP endpoint, or call the assistant API directly. Same assistant, multiple surfaces — WhatsApp, Web Chat, Slack, IDE.
Connect directly with your existing stack or leverage the Platform Core tools to extend capabilities natively.
Enhance this feature with deep integrations into our core infrastructure. Connect via API, utilize webhooks, or embed directly using our SDKs.
Manage all settings seamlessly within the core UI.
Extend functionality with custom automated workflows.
No credit card. No sales call required. Spin up a workspace, plug in a number, and your team is live in under an hour.