RightNow AI is a research lab and software company working on GPU programming tools, CUDA development workflows, model-hardware co-design, and inference infrastructure.

Which NVIDIA GPUs are supported by RightNow AI?

RightNow AI supports all NVIDIA GPUs with CUDA Toolkit 11.0-12.5, including GeForce RTX 40/30/20 series, GTX 16/10 series, Quadro RTX, Tesla, A100, and H100.

How much does RightNow AI cost?

RightNow AI is free to use with unlimited profiling and benchmarking. RightNow Pro costs $29 per month and adds GPU emulator access (50+ GPUs), multi-GPU comparison, and 1,000 AI credits per month.

What CUDA development workflow does RightNow AI support?

RightNow AI supports CUDA development workflows that combine editing, profiling, emulation, remote GPU execution, and benchmarked performance analysis.

Can I use RightNow AI on macOS?

Yes, RightNow AI is fully available on macOS (Apple Silicon and Intel). Mac users can use remote GPUs for free or our built-in GPU emulator for CUDA profiling.

╭────────────╮
│     AI     │
├────────────┤
│   ◦──◦     │
│    ╲  ╱    │
│     ◦      │
│    ╱  ╲    │
│   ◦──◦     │
╰────────────╯

AI Features

Pro

Two ways to use AI: chat interactively to write and debug CUDA code, or let the optimizer automatically improve your kernels with one click. Both know your GPU and support custom agents with skills and MCP integrations.

15+Providers

Local GPUModels

CustomAgents

Two Ways to Use AI

Chat Assistant

Pro

Interactive chat for writing kernels, debugging issues, and asking questions. GPU-aware responses tailored to your hardware.

• Write new kernels from descriptions
• Debug errors and race conditions
• Explain complex GPU code
• Ask about your hardware specs
• Create custom agents with skills
• Use MCP tools in workflows

Auto Optimizer

Pro

One-click optimization that reads profiling data, rewrites your code, verifies correctness, and shows the speedup.

• Analyzes NCU profiling data
• Applies targeted optimizations
• Auto-verifies correctness
• Instant rollback if needed

Choose Your Provider

Bring your own API key or run completely offline with local models. The model list shows cloud providers and local GPU-backed models in one place.

Cloud Providers

├─ OpenAI       GPT-4o, GPT-4 Turbo
├─ Anthropic    Claude 3.5 Sonnet
├─ Google       Gemini 1.5 Pro
├─ Deepseek     Deepseek V3
└─ OpenRouter   100+ models

Local / Private

├─ Ollama       Qwen2.5, Llama 3
├─ LM Studio    Any GGUF model
└─ vLLM         Self-hosted

GPU-Aware Context

Both the chat assistant and optimizer know your hardware. No generic suggestions.

// Detected hardware:
GPU: RTX 4090 (sm_89, 128 SMs, 24GB)
CUDA: 12.4
Features: Tensor cores, async copy, L2 persistence

// AI uses this for all suggestions and optimizations

Try AI Features

Chat assistant and auto optimizer included in Pro plan.

Download Setup Guide

Related Features

GPU Profiler

Performance analysis for optimization

Code Analysis

PTX and SASS inspection

GPU Emulator

Test without hardware

╭────────────╮ │ AI │ ├────────────┤ │ ◦──◦ │ │ ╲ ╱ │ │ ◦ │ │ ╱ ╲ │ │ ◦──◦ │ ╰────────────╯