mirror of
https://github.com/dataforcanada/d4c-datapkg-orthoimagery.git
synced 2026-06-13 14:10:53 +02:00
Vancouver 2022 Orthoimagery — Download Script
This directory contains the automation script for acquiring the City of Vancouver 2022 Orthophoto Imagery dataset (7.5 cm resolution) from Vancouver Open Data.
What the Script Does
download.sh performs four sequential steps:
- Download Index — Uses
aria2cto fetch the dataset catalogue as a Parquet file from the Vancouver Open Data API. - Extract URLs — Queries the Parquet file with
duckdbto extract all MrSID image URLs into a plain-text file suitable for batch downloading. - Create Output Directory — Ensures the data input directory exists at
../../data/input/ca-bc_vancouver-2022A00055915022_d4c-datapkg-orthoimagery_2022_075mm/(relative to this script). - Download Images — Uses
aria2cto download all images in parallel (12 concurrent connections, 4 connections per server) into the data input directory.
Dependencies
The following command-line tools must be installed and available on your PATH:
| Tool | Purpose | Install |
|---|---|---|
| aria2c | High-speed parallel downloads | sudo apt install aria2 |
| duckdb | Query Parquet files from the CLI | Install guide |
Usage
cd scripts/ca-bc_vancouver-2022A00055915022_d4c-datapkg-orthoimagery_2022_075mm
bash download.sh
The script will print progress for each step. Once complete, the downloaded MrSID image files will be located in:
data/input/ca-bc_vancouver-2022A00055915022_d4c-datapkg-orthoimagery_2022_075mm/