Announced May 2025: Dataproc Serverless is now Google Cloud Serverless for Apache Spark

Google Cloud Serverless for Apache Spark

Focus on your code, not your infrastructure

Run your Apache Spark jobs easier on a customizable zero-ops platform, smarter with Gemini assistance, and faster with the performance of Lightning Engine.

Get $300 in credits

Contact sales

Apache Spark is a trademark of The Apache Software Foundation.

Product highlights

Features

Industry-leading performance

Supercharge your jobs with Lightning Engine, our next-generation vectorized engine. Get over 4.3x faster performance and lower TCO for your serverless Spark workloads, automatically.

Learn about Lightning Engine

Zero-Ops with intelligent autoscaling

Eliminate cluster management with intelligent autoscaling. Resources scale up, and down automatically to perfectly match your job's needs, ensuring maximum performance, and cost-efficiency without paying for idle time.

Learn about autoscaling

AI-powered development

Accelerate your entire workflow. Write and debug PySpark, Scala, and Java code with Gemini Code Assist in BigQuery Studio and launch GPU-accelerated environments with pre-configured ML Runtimes.

Explore Gemini assistance

Unified Spark and SQL experience

Eliminate context switching. Develop and run your workloads in a single environment like BigQuery Studio, seamlessly blending powerful SQL with the flexibility of PySpark in the same notebook.

Read about PySpark in BigQuery Studio

Two tiers of performance

Two tiers of performance	Tiers to match your specific needs, from standard batch processing to the most demanding, performance-critical jobs.
Tier	Best for
Standard	Ideal for cost-effective batch processing, data transformations, and general-purpose Spark jobs. General purpose Spark ETL Scheduled data pipelines Cost-sensitive batch jobs
Premium	For the most demanding workloads, offering maximum performance with Lightning Engine, AI/ML acceleration, and interactive capabilities. Performance-critical jobs powered by Lightning Engine for 4.3x boost Interactive data science and analysis GPU-accelerated AI and ML Complex, large-scale data processing

Compare tiers of Serverless Spark on Google in more detail.

Two tiers of performance

Tiers to match your specific needs, from standard batch processing to the most demanding, performance-critical jobs.

Standard

Best for

Ideal for cost-effective batch processing, data transformations, and general-purpose Spark jobs.

General purpose Spark ETL
Scheduled data pipelines
Cost-sensitive batch jobs

Premium

Best for

For the most demanding workloads, offering maximum performance with Lightning Engine, AI/ML acceleration, and interactive capabilities.

Performance-critical jobs powered by Lightning Engine for 4.3x boost
Interactive data science and analysis
GPU-accelerated AI and ML
Complex, large-scale data processing

Compare tiers of Serverless Spark on Google in more detail.

How It Works

Develop your Apache Spark application in your favorite tools, including BigQuery Studio notebooks. Submit your serverless Spark job with a single command, and let Google handle the rest—no clusters to create, configure, or manage.

View documentation

Common Uses

Interactive Data Science

Empower data scientists to explore data and rapidly iterate on Spark ML models. Unify SQL and Spark in a single BigQuery Studio notebook, moving seamlessly from data exploration with SQL to model building with PySpark without ever managing infrastructure.

Learn how to run PySpark code in BigQuery Studio notebooks

Tutorials, quickstarts, & labs

Interactive Data Science

Learn how to run PySpark code in BigQuery Studio notebooks

Automated ETL Pipelines

Build robust, event-driven Spark ETL pipelines that automatically scale on demand. Pay only for what you use, making it perfect for spiky or unpredictable workloads.

Learn how to apply data lineage

Tutorials, quickstarts, & labs

Automated ETL Pipelines

Build robust, event-driven Spark ETL pipelines that automatically scale on demand. Pay only for what you use, making it perfect for spiky or unpredictable workloads.

Learn how to apply data lineage

AI/ML at scale

Accelerate large-scale model training and batch inference with serverless Spark. Attach NVIDIA GPUs with pre-configured libraries with a single command.

View GPU documentation

Learning resources

AI/ML at scale

Accelerate large-scale model training and batch inference with serverless Spark. Attach NVIDIA GPUs with pre-configured libraries with a single command.

View GPU documentation

Pricing

Transparent, value-driven pricing	Serverless for Apache Spark pricing is based on per-second usage of compute (DCUs), GPUs, and shuffle storage.
Services and usage	Subscription type	Price (USD)
Data Compute Unit (DCU)	Standard	Starting at $0.06 per hour
Premium	Starting at $0.089 per hour
Shuffle storage	Standard	Starting at $0.04 per GB/month
Premium	Starting at $0.1 per GB/month
Accelerator pricing	a100 40 GB	Starting at $3.52069 per hour
a100 80 GB	Starting at $4.713696 per hour
L4	Starting at $0.672048 per hour

View pricing details for Google Cloud Serverless for Apache Spark.

Transparent, value-driven pricing

Serverless for Apache Spark pricing is based on per-second usage of compute (DCUs), GPUs, and shuffle storage.

Data Compute Unit (DCU)

Subscription type

Standard

Price (USD)

Starting at

$0.06

per hour

Premium

Subscription type

Starting at

$0.089

per hour

Shuffle storage

Subscription type

Standard

Price (USD)

Starting at

$0.04

per GB/month

Premium

Subscription type

Starting at

$0.1

per GB/month

Accelerator pricing

Subscription type

a100 40 GB

Price (USD)

Starting at

$3.52069

per hour

a100 80 GB

Subscription type

Starting at

$4.713696

per hour

Subscription type

Starting at

$0.672048

per hour

View pricing details for Google Cloud Serverless for Apache Spark.

Pricing calculator

Calculate your monthly costs by region.

Estimate your costs

Custom quote

Connect with our sales team to get a custom quote for your organization.

Request a quote

Get started today

Tutorial for getting started

Get $300 in credits

Have a large project?

Contact sales

Product documnetation

Read here

Use BigQuery connector with Serverless for Apache Spark

Read guide

Use GPUs with Serverless for Apache Spark

Read guide

Business Case

Build your business case for Google Cloud Serverless for Apache Spark

The economic benefits of Google Cloud Dataproc and Serverless Spark versus alternative solutions

See how Serverless for Apache Spark delivers significant TCO savings and business value compared to on-prem and other cloud solutions.

Download the report

Google Cloud Serverless for Apache Spark

Focus on your code, not your infrastructure

Product highlights

Industry-leading performance

Zero-Ops with intelligent autoscaling

AI-powered development

Unified Spark and SQL experience

Develop your Apache Spark application in your favorite tools, including BigQuery Studio notebooks. Submit your serverless Spark job with a single command, and let Google handle the rest—no clusters to create, configure, or manage.

Interactive Data Science

Tutorials, quickstarts, & labs

Interactive Data Science

Automated ETL Pipelines

Tutorials, quickstarts, & labs

Automated ETL Pipelines

AI/ML at scale

Learning resources

AI/ML at scale

Pricing calculator

Custom quote

Get started today

Tutorial for getting started

Have a large project?

Product documnetation

Use BigQuery connector with Serverless for Apache Spark

Use GPUs with Serverless for Apache Spark

Related Content

When should I choose Serverless for Apache Spark versus Dataproc?

Do I need to install my own libraries like PyTorch or XGBoost?

How do I get the best performance and how does pricing work?