Changelog

Latest updates and improvements to RightNow AI

0.0.45Oct 30, 2025
Execution-Driven Emulator & Agentic AI Optimization - RightNow AI 0.0.45

Execution-Driven Emulator & Agentic AI Optimization

Cycle-accurate GPU emulation with 96-98% accuracy. No physical GPU required. AI automatically iterates and optimizes kernels to peak performance.

+ NEW
New GPU Emulator built from scratch with cycle-accurate scheduling and multi-warp latency simulation
PTX and SASS translation for deep low-level analysis and debugging
Remote connection support (SSH + WSL) to run and profile kernels anywhere
Agentic AI "Iterate Mode" that writes, profiles, and optimizes kernels automatically until peak performance
Kernel fusion detection for better performance across sequential and parallel operations (Beta Users)
IMPROVED
Enhanced benchmarking and profiling with full metric breakdowns and bottleneck detection
Local LLMs now work perfectly
96-98% emulator accuracy vs real GPUs
Simulation speed under 100ms for 1,000 instructions
30% more accuracy than previous builds
0.0.31Sep 22, 2025
Remote GPU Access & AI Insights - RightNow AI 0.0.31

Remote GPU Access & AI Insights

Code anywhere, run everywhere. Connect to remote GPUs with SSH and cloud providers.

+ NEW
Remote GPU connection via SSH integration
Native support for GPU cloud providers (RunPod, Google Cloud, AWS, Azure, Paperspace, Vast.ai, Lambda Labs)
Seamless profiling on remote GPUs as if they were local
Automatic GPU detection on remote machines
Smart Profiling Terminal with AI-powered insights
Automatic bottleneck detection (memory-bound vs compute-bound)
NCU-compatible metrics without requiring hardware
AI-generated optimization suggestions (memory coalescing, bank conflicts, occupancy, branch divergence)
IMPROVED
Fixed NCU GUI integration for report generation
Enhanced profiling UI with collapsible sections
WebWorker-based analysis for non-blocking performance
LRU caching for instant re-analysis
Improved error handling and fallback mechanisms
0.0.30Sep 18, 2025
Full GPU Emulator - No Hardware Required - RightNow AI 0.0.30

Full GPU Emulator - No Hardware Required

Profile any CUDA kernel without a physical GPU. Choose from 86+ GPU architectures.

+ NEW
Full GPU emulator for profiling without physical hardware
86+ GPU architectures supported
Static kernel analysis engine (under 100ms)
Roofline model implementation with ±15% accuracy
Architecture comparison across multiple GPUs instantly
0.0.29Sep 14, 2025
Benchmarking Terminal & Static Profiling - RightNow AI 0.0.29

Benchmarking Terminal & Static Profiling

Full benchmarking terminal with visual kernel comparisons and instant CodeLens insights.

+ NEW
Benchmarking Terminal for benchmark sweeps and custom kernel configurations
Visual comparison between kernels
Static Profiling with instant CUDA kernel insights in CodeLens
Real-time registers, shared memory, and occupancy analysis while typing
Profile with Configs - complete cycle with persistent configs and history
Tools Detector (nvidia-smi, nsight compute, nvcc)
0.0.28Sep 14, 2025
CUDA Benchmarking System - RightNow AI 0.0.28

CUDA Benchmarking System

Comprehensive benchmarking with execution time, memory bandwidth, occupancy, and multi-GPU support.

+ NEW
Execution time, memory bandwidth, occupancy, SM efficiency, and register usage metrics
Data size presets, warmup runs, and execution controls
Grid/block optimization with automatic suggestions
Multi-GPU support with device-specific benchmarking
Session management with persistence across restarts
Sortable results with performance indicators
CSV export for sharing benchmark results
0.0.20Aug 18, 2025
Multi-LLM Provider Support - RightNow AI 0.0.20

Multi-LLM Provider Support

Support for 15+ AI providers including local models.

+ NEW
OpenAI, Anthropic, Deepseek integration
Local Ollama and vLLM support
BYOK (Bring Your Own Key) flexibility
Fill-in-the-Middle autocomplete
0.0.10Aug 5, 2025
Initial Release - RightNow AI 0.0.10

Initial Release

First public release of RightNow AI.

+ NEW
NVIDIA Nsight Compute integration
Real-time GPU performance metrics
Hardware detection and optimization
CUDA syntax highlighting and IntelliSense