Overview
Google BigQuery is a fully managed, serverless data warehouse that enables scalable analysis over large datasets. Datazone connects to BigQuery using a Google Cloud service account, allowing you to extract tables and views from any dataset within your GCP project.Connection Parameters
| Parameter | Required | Description |
|---|---|---|
| Name | Yes | A unique identifier for your BigQuery source |
| Project ID | Yes | Your Google Cloud project ID (e.g. my-gcp-project) |
| Credentials JSON | Yes | The full contents of your service account key file in JSON format (stored encrypted) |
Setting Up a Service Account
- Go to IAM & Admin → Service Accounts in the Google Cloud Console
- Create a new service account (e.g.
datazone-reader) - Grant the following roles:
BigQuery Data Viewer— read access to datasets and tablesBigQuery Job User— permission to run query jobs
- Create a JSON key for the service account
- Copy the full contents of the downloaded JSON key file into the Credentials JSON field
Required Permissions
The service account needs the following IAM roles:roles/bigquery.dataViewer— for reading table dataroles/bigquery.jobUser— for executing queries
BigQuery Data Viewer on the specific datasets in the source project.
Limitations
- BigQuery extracts use the BigQuery Storage API for efficient large-scale reads
- Partitioned and clustered tables are supported
- Supported BigQuery regions: all standard GCP regions
Next Steps
After configuring your BigQuery source:- Create extracts to specify which tables or views to ingest
- Configure scheduling for recurring extracts
- Integrate the source into your data pipelines