xAI · LLM model

Grok 4.3

Grok AI's flagship model is Grok 4.3

One API key. Every model.

Commercial use included · Verified May 26, 2026 · Outputs are yours · No training on your data
Endpoints

Start building with Grok 4.3.

One model. Three ways to call it. Same key, same bill.

IN

LLM

One call. Same key. Same bill.

$2.13 / 1m tokens

OUT

LLM

One call. Same key. Same bill.

$4.25 / 1m tokens

Capabilities

What it does best.

Grok AI capabilities

Context and I/O

  • 1,000,000-token context window per request[^1]
  • Text and image inputs; text output[^2]
  • Image inputs capped at 20 MiB each; JPG and PNG formats only[^3]

Reasoning

  • Configurable reasoning effort: none, low, medium, or high[^4]
  • Reasoning off (none) minimises latency for fast-response chat[^4]
  • High reasoning effort for complex multi-step tasks[^4]
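
A minimal sketch of picking a reasoning effort per request. The four levels come from the list above; the `reasoning_effort` field name and the `build_payload` helper are assumptions made for illustration, so check the API reference for the actual parameter name before relying on it.

```python
# The four effort levels listed on this page.
ALLOWED_EFFORTS = ("none", "low", "medium", "high")


def build_payload(prompt, effort="none"):
    """Return a request body dict with a validated reasoning effort."""
    if effort not in ALLOWED_EFFORTS:
        raise ValueError(f"effort must be one of {ALLOWED_EFFORTS}, got {effort!r}")
    return {
        "model": "grok-4.3",
        "prompt": prompt,
        # ASSUMPTION: hypothetical field name, not confirmed by this page.
        "reasoning_effort": effort,
    }
```

Defaulting to "none" suits fast chat; pass "high" explicitly for multi-step tasks.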

Tooling

  • Function calling (tool use) for connecting to external systems[^5]
  • Structured outputs for organised, machine-readable responses[^5]
  • Server-side Web Search (billed separately by xAI)[^6]
  • Server-side X Search for real-time social signals (billed separately by xAI)[^6]
  • Sandboxed Python Code Execution (billed separately by xAI)[^6]
  • File Attachments support (billed separately by xAI)[^6]
  • Collections Search / RAG (billed separately by xAI)[^6]
  • Remote MCP Tools via token-based billing[^6]

Rate limits and regions

  • 1,800 requests per minute and 10,000,000 tokens per minute[^7]
  • Available in us-east-1 and eu-west-1[^7]
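
To stay under those limits on the client side, one option is a sliding 60-second budget. This is only a sketch: the `MinuteBudget` class is hypothetical, the caller supplies the clock and a token estimate, and the server may count the window differently.

```python
from collections import deque

REQUESTS_PER_MINUTE = 1_800      # from the rate-limit line above
TOKENS_PER_MINUTE = 10_000_000   # from the rate-limit line above


class MinuteBudget:
    """Tracks requests and tokens over a sliding 60-second window."""

    def __init__(self, rpm=REQUESTS_PER_MINUTE, tpm=TOKENS_PER_MINUTE):
        self.rpm = rpm
        self.tpm = tpm
        self._events = deque()  # (timestamp_seconds, tokens)

    def try_acquire(self, now, tokens):
        """Record the request and return True if it fits, else return False."""
        # Forget requests that are 60 seconds old or older.
        while self._events and now - self._events[0][0] >= 60.0:
            self._events.popleft()
        used_tokens = sum(t for _, t in self._events)
        if len(self._events) >= self.rpm or used_tokens + tokens > self.tpm:
            return False
        self._events.append((now, tokens))
        return True
```

In a real client, pass `time.monotonic()` as `now` and sleep or queue the request when `try_acquire` returns False.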

Model aliases

  • The aliases grok-4, grok-4-latest, grok-3, and grok-3-latest all resolve to Grok 4.3[^8]
  • Existing integrations built on those aliases require no code changes[^8]
API

Call Grok 4.3 in three lines.

One key. One base URL. Same SDK shape you already use.

cURL
# 1. set your key
export COINIS_API_KEY="sk_live_..."

# 2. call the model
curl https://api.app.coinis.com/v1/llm/generate \
  -H "Authorization: Bearer $COINIS_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{"prompt":"neon city, rain, tracking shot"}'

TypeScript
import { Coinis } from "@coinis/sdk";
const coinis = new Coinis(process.env.COINIS_API_KEY);

const job = await coinis.llm.generate({
  model: "models/xai/grok-4",
  prompt: "neon city, rain, tracking shot",
});

Python
import os

from coinis import Coinis
coinis = Coinis(os.environ["COINIS_API_KEY"])

job = coinis.llm.generate(
    model="models/xai/grok-4",
    prompt="neon city, rain, tracking shot",
)
Response
{
  "id": "gen_8fa2c1",
  "status": "succeeded",
  "model": "models/xai/grok-4",
  "output": {
    "image_url": "https://cdn.coinis.com/gen_8fa2c1.mp4",
    "format": "mp4"
  },
  "tokens_used": 10
}

Already on another provider's SDK? Change the host. Keep the call.

Pricing

Token pricing. No surprises.

One wallet across every model. No API accounts to juggle.

Grok 4.3 · IN
Frontier LLM
$2.13 / 1m tokens
≈ 21.3 wallet tokens per 1m model tokens (1 wallet token = $0.10)
One key. Every model. One invoice.
Start free. 15 tokens a week.

No credit card.

Why pay through Coinis
  • One wallet for every model. No per-provider API keys. No separate bills.
  • Generate ads. Launch to Meta. Track in one place.
  • On-brand output from your Brand Profile.

1 token = $0.10 pay-as-you-go. Less on a plan.

Input vs output

Two rates. One model.

IN

Tokens you send: prompts and context
Price $2.13 / 1m tokens

OUT

Tokens the model returns
Price $4.25 / 1m tokens
Use cases

Two buyers. One model.

For builders

Resell every model. One key. One bill.

Unified API across video, image, audio, and LLM.

Generate 500 variants overnight.

Async queue plus webhooks. Batch at scale.

White-label the output.

Ship it under your brand. Outputs are yours.

For creatives

Ship a Reel before lunch.

Prompt to platform-native clip in minutes.

Same product. Ten formats.

One generation, every aspect ratio.

Commercial UGC without a creator.

Authentic selfie-style ads, on brand.

Autonomous agent workflows. Grok 4.3 leads xAI's model lineup on non-hallucination rate and agentic tool calling[^9]. Teams building multi-step agents get reliable function calls and fewer off-script responses.

Enterprise automation and data extraction. Structured output support means the model returns clean, schema-compliant JSON. Plug it directly into ETL pipelines, CRM enrichment flows, or document processing queues[^5].
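
Before a structured response enters a pipeline, it is worth checking it on your side as well. The sketch below assumes the model's text output is a JSON object; the invoice schema and the `parse_record` helper are hypothetical examples, not part of the Coinis or xAI API.

```python
import json

# Hypothetical schema: field name -> accepted Python type(s).
INVOICE_FIELDS = {
    "vendor": str,
    "total": (int, float),
    "currency": str,
}


def parse_record(raw_text, fields=INVOICE_FIELDS):
    """Parse model output as JSON and verify required fields and types."""
    record = json.loads(raw_text)  # raises ValueError on malformed JSON
    if not isinstance(record, dict):
        raise ValueError("expected a JSON object")
    for name, expected in fields.items():
        if name not in record:
            raise ValueError(f"missing field: {name}")
        if not isinstance(record[name], expected):
            raise ValueError(f"wrong type for field: {name}")
    return record
```

A record that fails the check can be retried or routed to review instead of being written downstream.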

Real-time research and monitoring. Activate the server-side Web Search or X Search tools to pull live information into any prompt[^6]. Monitor breaking news, track brand mentions, or surface competitor activity without a separate scraping layer.

Long-document analysis. A 1,000,000-token context window lets you load entire contracts, codebases, or research corpora in a single call[^1]. No chunking required for most real-world documents.
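
A rough pre-flight check can tell you whether a set of documents is likely to fit in one call. This sketch assumes about 4 characters per token, which is a common heuristic and not the model's actual tokenizer, and it assumes the output allowance counts against the same window; both function names are hypothetical.

```python
CONTEXT_WINDOW_TOKENS = 1_000_000   # context window stated on this page
ASSUMED_CHARS_PER_TOKEN = 4         # ASSUMPTION: rough heuristic only


def estimate_tokens(text):
    """Estimate the token count by rounding up len(text) / 4."""
    return -(-len(text) // ASSUMED_CHARS_PER_TOKEN)


def fits_in_context(documents, reserved_for_output=8_000):
    """Return True if the estimated input plus an output allowance fits."""
    total = sum(estimate_tokens(doc) for doc in documents)
    return total + reserved_for_output <= CONTEXT_WINDOW_TOKENS
```

Treat a result near the limit as uncertain, since real token counts vary with language and content.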

Latency-sensitive chat. Set reasoning effort to none and Grok 4.3 skips the thinking step entirely[^4]. Use it for customer-facing chatbots or any workflow where response speed matters more than deep reasoning.

Renders in seconds. Set a seed. Get the same frame back.

Outputs are yours. Sell them.

Safe for paid ads.

Your prompts are never used for training.

FAQ

Grok 4.3 FAQs

How does Grok AI pricing work on Coinis vs. the xAI API directly?

On the xAI API, Grok 4.3 is priced at $1.25 per 1M input tokens and $2.50 per 1M output tokens. On Coinis retail, the rates are $2.125 per 1M input tokens (variant grok-4-3-in) and $4.25 per 1M output tokens (variant grok-4-3-out). Coinis adds unified billing across all models in the catalog, so one account covers Grok 4.3 alongside OpenAI, Anthropic, and Google models with a single invoice.
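
As a worked example of the Coinis retail rates above, the sketch below estimates the charge for one request and converts it to wallet tokens at the pay-as-you-go rate of 1 token = $0.10 quoted on this page. The function names are illustrative, and the estimate ignores plan discounts and separately billed server-side tools.

```python
from decimal import Decimal

INPUT_USD_PER_1M = Decimal("2.125")     # grok-4-3-in retail rate
OUTPUT_USD_PER_1M = Decimal("4.25")     # grok-4-3-out retail rate
USD_PER_WALLET_TOKEN = Decimal("0.10")  # pay-as-you-go rate on this page


def estimate_cost_usd(input_tokens, output_tokens):
    """Estimate the token charge in USD for one request."""
    weighted = (
        Decimal(input_tokens) * INPUT_USD_PER_1M
        + Decimal(output_tokens) * OUTPUT_USD_PER_1M
    )
    return weighted / Decimal(1_000_000)


def usd_to_wallet_tokens(usd):
    """Convert a USD amount to Coinis wallet tokens."""
    return usd / USD_PER_WALLET_TOKEN
```

For instance, 200,000 input tokens and 50,000 output tokens come to $0.425 + $0.2125 = $0.6375, or 6.375 wallet tokens.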

Is there a Grok API I can call, and how does the Coinis endpoint work?

Yes. You call POST https://api.app.coinis.com/v1/llm/generate with model set to grok-4.3. The Coinis endpoint is OpenAI-compatible, so most existing SDKs work without modification. Full parameter reference is on the API sub-page.
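
If you prefer not to install an SDK, the call can be sketched with the Python standard library alone. The URL and model name are the ones given in this answer and the `prompt` field follows the cURL example on this page; the `build_generate_request` helper and the JSON content type are assumptions, and the request is only constructed here, not sent.

```python
import json
import urllib.request

COINIS_URL = "https://api.app.coinis.com/v1/llm/generate"


def build_generate_request(prompt, api_key, model="grok-4.3"):
    """Build (but do not send) a POST request for the generate endpoint."""
    body = json.dumps({"model": model, "prompt": prompt}).encode("utf-8")
    return urllib.request.Request(
        COINIS_URL,
        data=body,
        method="POST",
        headers={
            "Authorization": f"Bearer {api_key}",
            # ASSUMPTION: the endpoint accepts a JSON body.
            "Content-Type": "application/json",
        },
    )
```

To send it, pass the result to `urllib.request.urlopen` and decode the response body with `json.load`.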

What is the difference between Grok 4, Grok 4.3, and the older Grok 3 aliases?

Grok 4.3 is the current flagship. The aliases grok-4, grok-4-latest, grok-3, and grok-3-latest all resolve to Grok 4.3[^8]. If your integration already uses any of those aliases, it is already running Grok 4.3 with no code change required.

Does Grok 4.3 support real-time web and X search?

Yes, via server-side tools. Activate Web Search or X Search in your request and the model retrieves live data before generating its response[^6]. Without these tools, the model's knowledge is limited to its training data with a cut-off of November 2024[^3]. Server-side tool calls are billed separately by xAI on top of standard token charges; see the official docs at https://docs.x.ai/docs/models.

What is Grok 4.3's context window and knowledge cut-off?

The context window is 1,000,000 tokens per request[^1]. The training knowledge cut-off is November 2024[^3]. For information after that date, enable the Web Search or X Search server-side tools.

How do I reduce Grok 4.3 costs? Do cached tokens and Batch API discounts apply on Coinis?

xAI offers cached input tokens at a significant discount versus standard input rates, and the Batch API can reduce token charges further for non-time-sensitive workloads[^10]. Contact Coinis support to confirm which xAI discount tiers are passed through on your plan. Coinis retail rates are grok-4-3-in at $2.125/1M and grok-4-3-out at $4.25/1M.

Which use cases favor Grok 4.3 over other large language models?

xAI positions Grok 4.3 as the best choice for agentic tool calling and low-hallucination autonomous workflows[^9]. The 1,000,000-token context window is well-suited to long-document tasks such as contract review or full-codebase analysis[^1]. Teams running output-heavy workloads will find the $4.25/1M output retail rate and the model's instruction-following precision a useful combination for high-volume generation jobs.

Start free

Your wallet. Every model. One call away.

Start free. 15 tokens a week. No card.

Generate on Coinis


Pricing and capabilities verified 2026-05-26. Read the docs.