cuDF is a GPU DataFrame library that provides a pandas-like API for GPU-accelerated data manipulation. As part of NVIDIA RAPIDS, it can deliver 10-100x speedups for data processing, depending on the workload and GPU.

CUDA Integration: cuDF stores data in GPU memory in the Apache Arrow columnar format. Operations execute as CUDA kernels, with automatic memory management and optional spilling to host memory for datasets that exceed GPU memory.
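Because cuDF mirrors the pandas API, most code ports by changing only the import. A minimal sketch of that equivalence, shown here with pandas so it runs without a GPU (on a RAPIDS machine, `import cudf as pd` executes the same lines on the GPU):

```python
import pandas as pd  # on a GPU machine: import cudf as pd

# The same DataFrame code runs on CPU (pandas) or GPU (cuDF)
df = pd.DataFrame({'category': ['a', 'b', 'a', 'b'],
                   'value': [1.0, 2.0, 3.0, 4.0]})
means = df.groupby('category')['value'].mean()
print(means['a'], means['b'])  # prints: 2.0 3.0
```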
Install via conda:

```shell
conda create -n cudf -c rapidsai -c conda-forge -c nvidia cudf=24.02 python=3.10 cuda-version=12.0
conda activate cudf
```

Verify the install:

```python
import cudf; print(cudf.__version__)
```

Using cuDF like pandas.
```python
import cudf
import pandas as pd

# Create from dict
gdf = cudf.DataFrame({'a': [1, 2, 3], 'b': [4, 5, 6]})

# From pandas (new name, so gdf above keeps its 'a' and 'b' columns)
pdf = pd.DataFrame({'x': range(1000)})
gdf_x = cudf.from_pandas(pdf)

# Operations
result = gdf.groupby('a').mean()
filtered = gdf[gdf['a'] > 1]
# Join (illustrative: both frames must share a 'key' column; other_gdf is not defined above)
merged = gdf.merge(other_gdf, on='key')
```

Complete data processing pipeline.
```python
import cudf
# import dask_cudf  # optional: scale the same pipeline across multiple GPUs

# Read large dataset
gdf = cudf.read_parquet('data/*.parquet')

# Clean
gdf = gdf.dropna()
gdf['date'] = cudf.to_datetime(gdf['timestamp'])

# Feature engineering
gdf['day_of_week'] = gdf['date'].dt.dayofweek
# transform('mean') broadcasts each group's mean back to every row
gdf['group_mean'] = gdf.groupby('category')['value'].transform('mean')

# Aggregate
result = gdf.groupby(['category', 'day_of_week']).agg({
    'value': ['mean', 'std', 'count']
})

# To pandas for visualization
pdf = result.to_pandas()
```

Performance tips:

- Prefer Parquet over row-based formats like CSV: it is columnar, so it is much faster to read.
- Avoid intermediate DataFrames; chain operations where possible.
- Use smaller dtypes (e.g. cast repeated strings to categorical) for lower memory use and faster groupbys.
- Enable spilling with `cudf.set_option("spill", True)` to avoid out-of-memory errors on large datasets.
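The memory effect of categorical dtypes is easy to check, and the same `astype('category')` call works on a cuDF frame. A quick sketch with pandas (column contents are made up):

```python
import pandas as pd

# Repeated strings stored as object dtype vs. categorical
s_obj = pd.Series(['red', 'green', 'blue'] * 10_000)
s_cat = s_obj.astype('category')

obj_bytes = s_obj.memory_usage(deep=True)
cat_bytes = s_cat.memory_usage(deep=True)
print(obj_bytes > cat_bytes)  # prints: True — categorical stores small integer codes
```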
Typical speedups versus pandas (indicative; varies by GPU and data):

| Task | Speedup vs pandas | Notes |
|---|---|---|
| Read Parquet | 5-10x | Depends on the data |
| GroupBy | 20-100x | Best with many groups |
| Join | 10-50x | Large tables |
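Since these speedups are workload-dependent, it is worth timing your own data. A minimal harness sketch, shown with pandas (on a GPU machine, the same code with `import cudf as pd` times the GPU path; the sizes and column names here are invented for illustration):

```python
import time
import numpy as np
import pandas as pd  # on a GPU machine: import cudf as pd

rng = np.random.default_rng(0)
df = pd.DataFrame({
    'key': rng.integers(0, 1_000, size=1_000_000),
    'value': rng.random(1_000_000),
})

# Time one representative operation; repeat and average in real benchmarks
start = time.perf_counter()
result = df.groupby('key')['value'].mean()
elapsed = time.perf_counter() - start
print(f"groupby over {len(df):,} rows: {elapsed:.4f}s, {len(result)} groups")
```

For GPU timings, note that the first call includes kernel compilation and memory-transfer overhead, so warm up before measuring.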
**Does cuDF support the full pandas API?** Most of it, but not all; check the cuDF API docs before porting.

**What if my data doesn't fit in GPU memory?** Enable the spill option, or use Dask-cuDF to scale across multiple GPUs.

**Can I use cuDF with CPU libraries like scikit-learn?** Convert to pandas first, or use cuML for GPU-native machine learning.
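The hand-off pattern for CPU libraries is a single conversion at the boundary. A sketch using pandas in place of cuDF (a real cuDF frame would add one `.to_pandas()` call) and plain NumPy as a stand-in for the downstream library:

```python
import numpy as np
import pandas as pd

# With cuDF this frame would live in GPU memory
gdf_result = pd.DataFrame({'feature': [0.1, 0.4, 0.7], 'target': [0, 1, 1]})

# Bring results to host; with cuDF: pdf = gdf_result.to_pandas()
pdf = gdf_result

# Hand off to any CPU library (scikit-learn, matplotlib, ...) via numpy
X = pdf[['feature']].to_numpy()
y = pdf['target'].to_numpy()
print(X.shape, y.mean())  # prints: (3, 1) 0.6666666666666666
```

Keep the conversion at the very end of the GPU pipeline: each `.to_pandas()` copies data from device to host, so converting once on the final (usually much smaller) result is far cheaper than converting intermediates.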