IN
LlmOne call. Same key. Same bill.
Kimi AI is the flagship intelligence layer from Moonshot AI
One API key. Every model.
One model. Three ways to call it. Same key, same bill.
One call. Same key. Same bill.
One call. Same key. Same bill.
Capabilities
Context
Modalities
Reasoning modes
Tooling
Coding and instruction following
One key. One base URL. Same SDK shape you already use.
# 1. set your key
export COINIS_API_KEY="sk_live_..."
# 2. call the model
curl https://api.app.coinis.com/v1/llm/generate \
-H "Authorization: Bearer $COINIS_API_KEY" \
-d '{"prompt":"neon city, rain, tracking shot"}' import { Coinis } from "@coinis/sdk";
const coinis = new Coinis(process.env.COINIS_API_KEY);
const job = await coinis.llm.generate({
model: "models/moonshot/kimi",
prompt: "neon city, rain, tracking shot",
}); from coinis import Coinis
coinis = Coinis(os.environ["COINIS_API_KEY"])
job = coinis.llm.generate(
model="models/moonshot/kimi",
prompt="neon city, rain, tracking shot",
) {
"id": "gen_8fa2c1",
"status": "succeeded",
"model": "models/moonshot/kimi",
"output": {
"image_url":
"https://cdn.coinis.com/gen_8fa2c1.mp4"
,
"format": "mp4"
},
"tokens_used": 10
} Already on another provider's SDK? Change the host. Keep the call.
One wallet across every model. No API accounts to juggle.
No credit card.
1 token = $0.10 pay-as-you-go. Less on a plan.
Unified API across video, image, audio, and LLM.
Async queue plus webhooks. Batch at scale.
Ship it under your brand. Outputs are yours.
Prompt to platform-native clip in minutes.
One generation, every aspect ratio.
Authentic selfie-style ads, on brand.
Long-context agentic coding Feed an entire repository into a 256K context window and let Kimi K2.6 write, refactor, and debug across multiple files in a single session. Stable long-horizon code writing means output quality holds across extended completions[^1].
Autonomous tool-using agents ToolCalls and agent-mode execution are built into the model rather than wrapped on top[^1]. Connect external APIs, run multi-step workflows, and let the model self-correct when a tool call fails.
Multimodal document and screenshot analysis Pass in images alongside text prompts for document parsing, screenshot Q&A, and UI review tasks. Image input is natively supported. video input is documented by Moonshot and should be verified in the latest API docs before production use[^1].
Deep-reasoning research workflows Switch on thinking mode for complex analytical problems that require multi-step reasoning. The model works through intermediate steps before producing a final answer, reducing the need for manual chain-of-thought prompting[^1].
Web-grounded RAG and live-data chat Built-in internet search lets the model retrieve real-time information inside a conversation[^1]. Use it for competitive research, news summarisation, or any workflow where static training data is insufficient.
Renders in seconds. Set a seed. Get the same frame back.
Outputs are yours. Sell them.
Safe for paid ads.
Your prompts are never used for training.
Start free
Start free. 15 tokens a week. No card.
Generate on CoinisNo credit card.
Pricing and capabilities verified 2026-05-26. Read the docs .