Pipeline Examples
Learn how to create and manage data pipelines in Datazone through practical examples
Basic Data Pipeline
Example 1: CSV to Processed Dataset
This example uploads a CSV file to Datazone, applies basic transformations, and saves the results.
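The read, transform, and save flow can be sketched in plain Python. This is a minimal illustration using only the standard library, not the Datazone API: the sample data, the column names, the `process_csv` and `to_csv` helpers, and the 20% tax rate are all hypothetical stand-ins for whatever your dataset actually contains.

```python
import csv
import io

# Hypothetical sample input; in a real pipeline this would be the uploaded CSV.
RAW_CSV = """order_id,customer,amount
1,alice,10.50
2,bob,
3,carol,7.25
"""


def process_csv(text):
    """Parse CSV text, drop rows with a missing amount, and add a derived column."""
    rows = []
    for row in csv.DictReader(io.StringIO(text)):
        if not row["amount"]:
            continue  # skip rows that have no amount
        amount = float(row["amount"])
        rows.append({
            "order_id": int(row["order_id"]),
            "customer": row["customer"].strip().title(),
            "amount": amount,
            "amount_with_tax": round(amount * 1.2, 2),  # assumed 20% rate
        })
    return rows


def to_csv(rows):
    """Serialize the processed rows back to CSV text."""
    out = io.StringIO()
    writer = csv.DictWriter(out, fieldnames=list(rows[0].keys()), lineterminator="\n")
    writer.writeheader()
    writer.writerows(rows)
    return out.getvalue()


processed = process_csv(RAW_CSV)
print(to_csv(processed))
```

In a Datazone transform, the parsing and saving steps would be handled by the platform's dataset inputs and outputs; only the row-level logic would carry over.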
Data Quality Pipeline
Example 2: Data Validation and Reporting
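A minimal sketch of a validation-and-reporting step, written in plain Python rather than against the Datazone API. The `validate` function, the `id` and `amount` columns, and the three rules are hypothetical; substitute the checks your data requires.

```python
def validate(rows):
    """Check each row against simple rules and return a summary report."""
    errors = []  # list of (row_index, message) tuples
    seen_ids = set()
    for i, row in enumerate(rows):
        # Rule 1 and 2: id must be present and unique.
        if row.get("id") is None:
            errors.append((i, "missing id"))
        elif row["id"] in seen_ids:
            errors.append((i, "duplicate id"))
        else:
            seen_ids.add(row["id"])
        # Rule 3: amount must be a non-negative number.
        amount = row.get("amount")
        if not isinstance(amount, (int, float)) or amount < 0:
            errors.append((i, "invalid amount"))
    bad_rows = {i for i, _ in errors}
    return {
        "total_rows": len(rows),
        "valid_rows": len(rows) - len(bad_rows),
        "errors": errors,
    }


sample = [
    {"id": 1, "amount": 10.0},
    {"id": 1, "amount": 5.0},     # duplicate id
    {"id": 2, "amount": -3.0},    # negative amount
    {"id": None, "amount": 4.0},  # missing id
]
report = validate(sample)
print(report)
```

Returning the report as a dictionary keeps the validation logic separate from how the report is stored or displayed.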
Multi-Source Pipeline
Example 4: Combining Data from Multiple Sources
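Combining sources usually comes down to a join on a shared key. The sketch below shows a left join over two in-memory record lists in plain Python; it does not use the Datazone API, and the `left_join` helper, the `orders` and `customers` data, and the `customer_id` key are hypothetical.

```python
def left_join(left, right, key):
    """Left-join two lists of dicts on `key`.

    Rows in `left` with no match in `right` get None for the right-hand columns.
    """
    index = {row[key]: row for row in right}
    right_cols = [c for c in (right[0] if right else {}) if c != key]
    joined = []
    for row in left:
        match = index.get(row[key])
        extra = {c: (match[c] if match else None) for c in right_cols}
        joined.append({**row, **extra})
    return joined


# Two hypothetical sources sharing a customer_id column.
orders = [
    {"customer_id": 1, "amount": 20},
    {"customer_id": 2, "amount": 35},
    {"customer_id": 9, "amount": 5},  # no matching customer
]
customers = [
    {"customer_id": 1, "name": "Alice"},
    {"customer_id": 2, "name": "Bob"},
]

combined = left_join(orders, customers, "customer_id")
for row in combined:
    print(row)
```

A left join keeps every order even when the customer record is missing, which makes gaps between sources visible instead of silently dropping rows.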
Logging Pipeline
Example 6: Pipeline with Comprehensive Logging
Different logging levels are available:
- logger.debug(): Detailed information for debugging
- logger.info(): General information about pipeline progress
- logger.warning(): Warning messages for potential issues
- logger.error(): Error messages for caught exceptions
- logger.critical(): Critical failures that require immediate attention
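The five levels listed above match the methods of Python's standard `logging` module, so their use can be sketched as follows. This assumes a standard-library logger; how Datazone provides the `logger` object inside a transform may differ, and the `run_step` function and the `pipeline` logger name are hypothetical.

```python
import logging

logging.basicConfig(
    level=logging.DEBUG,
    format="%(asctime)s %(levelname)s %(name)s: %(message)s",
)
logger = logging.getLogger("pipeline")  # hypothetical logger name


def run_step(values):
    """Sum numeric values, logging each event at a matching level."""
    logger.info("Starting step with %d values", len(values))
    total = 0.0
    for v in values:
        logger.debug("Processing value %r", v)
        try:
            total += float(v)
        except (TypeError, ValueError):
            # A caught exception: record it and continue.
            logger.error("Skipping non-numeric value %r", v)
    if total == 0:
        logger.warning("Step produced a zero total")
    logger.info("Step finished, total=%s", total)
    return total


try:
    result = run_step(["1.5", "abc", 2])
except Exception:
    # An unexpected failure that stops the pipeline.
    logger.critical("Pipeline step failed", exc_info=True)
    raise
```

Passing values as arguments (`"%r", v`) rather than pre-formatting the string lets the logging module skip the formatting work when a level is disabled.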
Usage Instructions
- Save these transforms in your project’s transform directory
- Configure the dataset IDs to match your environment
- Create a pipeline including the transforms in the desired order
- Set up appropriate scheduling and monitoring