How to Run the End-to-End Pipeline
This guide explains how to orchestrate the full GeoLift workflow using the
installed geolift pipeline command.
Prerequisites
- A prepared dataset (CSV format)
- A configuration YAML file
- GeoLift installed
Running the Pipeline
Packaged-install example:
Source-checkout example with the shipped demo configs:
Pipeline Config Behaviour
--configmust point to one canonical stage YAML file- in a source checkout, GeoLift resolves the sibling canonical YAMLs from the same directory
- GeoLift does not use a separate
pipeline_config.yaml - built artifacts do not include the repo’s
data-config/; packaged users should supply their own YAML path instead
Stage Selection
You can run partial pipelines by skipping specific stages:
--skip-power: Skips the power analysis stage--skip-donor: Skips the donor evaluation stage--only-inference: Runs only the final inference stage
Report Generation
Control report generation with:
--report(enabled by default)--no-report
Output Directory
Specify where results should be saved:
This will generate the following structure:
results/campaign_a/multicell_power_analysis/results/campaign_a/multicell_donor_eval/results/campaign_a/multicell_geolift_analysis/results/campaign_a/geolift_pipeline_report.mdresults/campaign_a/geolift_pipeline_report.html
For a source-tree walkthrough using the shipped demo configs, start with Quick Start.