Advanced reasoning model with chain-of-thought capabilities, rivaling OpenAI o1 at a fraction of the cost.
| Spec | Value |
|---|---|
| Parameters | 671B (MoE, 37B active) |
| Context | 128,000 tokens |
| Organization | DeepSeek |
Start using DeepSeek R1 in minutes. VoltageGPU provides an OpenAI-compatible API — just change the base_url.
```python
from openai import OpenAI

client = OpenAI(
    base_url="https://api.voltagegpu.com/v1",
    api_key="YOUR_VOLTAGE_API_KEY",
)

response = client.chat.completions.create(
    model="deepseek-ai/DeepSeek-R1",
    messages=[
        {"role": "user", "content": "Prove that there are infinitely many prime numbers."}
    ],
    max_tokens=4096,
    temperature=0.6,
)

print(response.choices[0].message.content)
```

```shell
curl -X POST https://api.voltagegpu.com/v1/chat/completions \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer YOUR_VOLTAGE_API_KEY" \
  -d '{
    "model": "deepseek-ai/DeepSeek-R1",
    "messages": [
      {"role": "user", "content": "Prove that there are infinitely many prime numbers."}
    ],
    "max_tokens": 4096,
    "temperature": 0.6
  }'
```

| Component | Price | Unit |
|---|---|---|
| Input tokens | $0.14 | per 1M tokens |
| Output tokens | $0.28 | per 1M tokens |
New accounts receive $5 free credit. No credit card required to start.
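A quick way to sanity-check spend against the pricing table above. This is a minimal sketch using the listed per-1M-token rates; `estimate_cost` is an illustrative helper, not part of the VoltageGPU SDK:

```python
def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimate request cost in USD from the listed per-1M-token prices."""
    INPUT_PRICE_PER_M = 0.14   # $ per 1M input tokens
    OUTPUT_PRICE_PER_M = 0.28  # $ per 1M output tokens
    return (input_tokens / 1e6) * INPUT_PRICE_PER_M + \
           (output_tokens / 1e6) * OUTPUT_PRICE_PER_M

# A reasoning-heavy request with 2,000 prompt tokens and 4,000
# generated tokens costs well under a cent.
cost = estimate_cost(2_000, 4_000)  # 0.0014 USD
```

Note that reasoning models like R1 often generate long chains of thought, so output tokens typically dominate the bill.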
DeepSeek R1 achieves state-of-the-art results on reasoning benchmarks including AIME 2024 (79.8%), MATH-500 (97.3%), and Codeforces (2029 Elo rating). It supports chain-of-thought reasoning with transparent step-by-step problem solving. The model handles complex multi-step mathematical proofs, competitive programming challenges, logical deduction, and scientific reasoning. Its MoE architecture ensures efficient inference despite the massive 671B total parameter count.
DeepSeek R1 is a cutting-edge reasoning model developed by DeepSeek, featuring 671 billion parameters in a Mixture-of-Experts architecture with 37 billion active parameters per forward pass. It excels at complex reasoning tasks including mathematics, coding, logic puzzles, and scientific analysis. R1 uses chain-of-thought reasoning to break down complex problems step by step, producing transparent and verifiable reasoning chains. It matches or exceeds the performance of models like OpenAI o1 on many benchmarks while being fully open source and significantly more affordable.
- Solve complex math problems, proofs, and competition-level questions with step-by-step reasoning.
- Write, review, and debug code across multiple programming languages with logical analysis.
- Analyze scientific papers, design experiments, and reason through complex hypotheses.
- Process complex documents requiring multi-step logical reasoning and evidence evaluation.
- Explain complex concepts step-by-step with transparent reasoning chains students can follow.
Endpoint: https://api.voltagegpu.com/v1/chat/completions

| Header | Value | Requirement |
|---|---|---|
| Authorization | Bearer YOUR_VOLTAGE_API_KEY | Required |
| Content-Type | application/json | Required |

Model ID: `deepseek-ai/DeepSeek-R1`. Use this value as the `model` parameter in your API requests.
```shell
curl -X POST https://api.voltagegpu.com/v1/chat/completions \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer YOUR_VOLTAGE_API_KEY" \
  -d '{
    "model": "deepseek-ai/DeepSeek-R1",
    "messages": [
      {"role": "user", "content": "Prove that there are infinitely many prime numbers."}
    ],
    "max_tokens": 4096,
    "temperature": 0.6
  }'
```

Recommended GPU for large models requiring high VRAM and memory bandwidth.
Best performance for large model inference with HBM3 memory.
Access this model and 140+ others through our OpenAI-compatible API.
Compare GPU cloud pricing and model hosting features.
View GPU compute and AI inference pricing with no hidden fees.
Deploy a GPU pod in under 60 seconds to run models locally.
DeepSeek R1 matches or exceeds OpenAI o1 on many reasoning benchmarks while being open source and up to 95% cheaper. On AIME 2024 math benchmarks, R1 scores 79.8% compared to o1's 83.3%, and on MATH-500 it achieves 97.3% vs o1's 96.4%. The main advantage is cost: R1 is priced at $0.14 per 1M input tokens, a fraction of OpenAI o1's rates.
Chain-of-thought (CoT) reasoning means the model explicitly shows its step-by-step thinking process before arriving at a final answer. This makes the reasoning transparent and verifiable, allowing you to see exactly how the model arrived at its conclusion and catch any errors in logic.
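If you want to separate the reasoning trace from the final answer programmatically, a minimal sketch is shown below. It assumes the common convention where the serving stack wraps the chain of thought in `<think>...</think>` tags; check the actual response format of your deployment before relying on this, as some stacks expose reasoning in a separate field instead.

```python
import re

def split_reasoning(text: str) -> tuple[str, str]:
    """Split a response into (chain_of_thought, final_answer).

    Assumes reasoning is wrapped in <think>...</think> tags; returns an
    empty chain of thought if no such tags are present.
    """
    match = re.search(r"<think>(.*?)</think>", text, re.DOTALL)
    if not match:
        return "", text.strip()
    reasoning = match.group(1).strip()
    answer = text[match.end():].strip()
    return reasoning, answer

sample = "<think>Assume finitely many primes p1..pn; consider their product plus one.</think>Therefore there are infinitely many primes."
cot, answer = split_reasoning(sample)
```

This lets you log or display the reasoning separately, or strip it before passing the answer downstream.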
Yes, DeepSeek R1 is fully open source under the MIT license. The model weights, training methodology, and technical report are all publicly available. You can run it locally or use it through VoltageGPU's API for managed inference.
DeepSeek R1 supports a context window of 128,000 tokens, allowing it to process long documents, codebases, and multi-turn conversations. This is equivalent to roughly 96,000 words or about 300 pages of text.
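The words-and-pages figures above follow from a common rule of thumb (roughly 0.75 English words per token and about 320 words per page); a sketch of the arithmetic:

```python
# Rough conversion from tokens to words and pages. The ratios below are
# rule-of-thumb estimates for English prose, not exact tokenizer figures.
CONTEXT_TOKENS = 128_000
WORDS_PER_TOKEN = 0.75   # typical for English text
WORDS_PER_PAGE = 320     # typical single-spaced page

words = int(CONTEXT_TOKENS * WORDS_PER_TOKEN)  # 96,000 words
pages = words // WORDS_PER_PAGE                # 300 pages
```

Actual token counts vary with language and content (code and non-English text usually tokenize less efficiently), so treat these as order-of-magnitude estimates.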
You can use DeepSeek R1 through VoltageGPU's OpenAI-compatible API. Simply use the OpenAI Python SDK with base_url set to "https://api.voltagegpu.com/v1" and model set to "deepseek-ai/DeepSeek-R1". No code changes are needed if you're already using the OpenAI SDK.
Get $5 free credit when you sign up. No credit card required. Deploy in under 30 seconds with our OpenAI-compatible API.