Excloud India Private Limited ₹ INR PRICING
Excloud

Blackwell GPUs.Rupee rates.

up to 96 GiB VRAM · billed per hour

Dedicated GPU instances for LLM inference, ML training and professional visualization. The whole card is yours — billed per hour, on-demand, no upfront commitment.

on-demand · INR · per published docs

Three cards. Three rates.

Every instance gets a dedicated RTX Pro Blackwell card — no time-slicing, no fractional vGPU. Pick the VRAM you need, pay by the hour, terminate when you're done.

NVMe-backed storage, on the same platform — and the same invoice — as the rest of your compute. No upfront commitment.

GPU pricing in the docs
rtx pro blackwell 96 gib vram (nv1a.4xlarge) vram dedicated · no time-slice

Fig. G-1 · The whole card

Table G-1 · GPU instances, on-demand
Instance GPU VRAM vCPU RAM ₹ / hr
nv2a.xlarge RTX 4500 Pro Blackwell 32 GiB 4 16 GiB ₹44.554
nv3a.2xlarge RTX 5000 Pro Blackwell 48 GiB 8 32 GiB ₹63.849
nv1a.4xlarge RTX 6000 Pro Blackwell 96 GiB 16 64 GiB ₹126.784

Billed per hour · on-demand · no upfront commitment

two ways to run inference

Rent the whole card. Or just buy tokens.

EXC-GPU

The whole card

A dedicated GPU instance. Your weights, your runtime, your quantization — fine-tune, train, render, serve any model you like. Capacity is fixed and the meter is the clock.

dedicated card · your stack

from ₹44.554/hr

per instance · on-demand

EXC-LLM

Just the tokens

Token-priced LLM inference on a hosted Qwen model (Qwen3.6-27B). No instance to manage, no idle hours, no minimum commit — you pay only for what the model reads and writes.

hosted qwen · pay per token

₹20 · ₹60/1M tok

input · output · no minimum commit

LLM inference pricing → docs.excloud.in

Rule of thumb: steady high-volume or custom models → rent the card. Bursty traffic on a stock model → buy the tokens.

quota request · usually quick

Access by requisition.

GPU capacity is allocated by quota so every instance maps to a real card. Send a quota request, get approved, then provision from the console like any other instance — same hourly billing, same bill.

Request GPU quota
Requisition note Form EXC-GPU/Q
To
support@excloud.dev
Subject
GPU quota request
State
instance type(s) · quantity · workload
Then
provision at console.excloud.dev

Provision it now.

Console, CLI, API or Terraform — same prices everywhere.