6.3 KiB
title, toc
| title | toc |
|---|---|
| Welcome to Data for Canada | false |
Mission
Data for Canada exists to bridge the gap between open data availability and data usability. We curate, clean, and re-engineer high-value Canadian datasets into high-performance, analysis-ready formats for researchers, developers, and systems.
The Problem
Canada creates incredible amounts of open data, from foundational road networks, federal census statistics, orthoimagery, and other. However, these datasets are often locked in legacy formats, fragmented portals, or structures that require significant engineering effort to normalize before they can be used. For a researcher or a system developer, the "time-to-insight" is often bottlenecked by data preparation.
The Solution
We act as the transformation layer. We aggregate datasets with permissive licenses and process them into "digestible" standards optimized for modern downstream applications.
- For Researchers and Developers: Skip the cleaning phase. Access normalized, documented data ready for analysis.
- For Systems: Standardized data structures designed to feed directly into pipelines, data warehouses, and downstream services.
High-Level Overview
flowchart TD
subgraph "Data Sources"
StatProducts@{ shape: lean-l, label: "Statistical Products"}
Orthoimagery@{ shape: lean-l}
FieldImagery@{ shape: lean-l, label: "Field Imagery"}
Elevation@{ shape: lean-l}
EnvironmentClimate@{ shape: lean-l, label: "Environmental & Climate"}
end
subgraph "Processing Pipeline"
Raw@{ shape: rect, label: "Raw Data Ingestion"}
Transform@{ shape: rect, label: "Transform"}
Opt@{ shape: rect, label: "Optimization"}
end
subgraph "Dissemination Formats"
Parquet@{ shape: lean-l}
FlatGeoBuf@{ shape: lean-l}
PMTiles@{ shape: lean-l}
COG@{ shape: lean-l}
Zarr@{ shape: lean-l}
JPEGXL@{ shape: lean-l, label: "JPEG XL"}
AV1@{ shape: lean-l, label: "AV1"}
end
subgraph "Distribution Infrastructure"
ObjectStorage@{ shape: bow-rect, label: "Object Storage"}
Metadata@{ shape: rect}
HTTP@{ shape: rect, label: "Static Files & API"}
DecentralizedDistribution@{ shape: rect, label: "Decentralized Distribution"}
end
subgraph ei [Experimental Infrastructure]
Panoramax@{ shape: rect}
Martin@{ shape: rect}
GeoServer@{ shape: rect}
ZOOProject@{ shape: rect, label: "ZOO-Project"}
BBOXServer@{ shape: rect, label: "BBOX Server"}
end
subgraph "Consumption"
DataSci@{ shape: rect, label: "Researchers & Developers"}
Systems@{ shape: rect, label: "Systems"}
end
%% Relationships
Elevation a3@--> Raw
a3@{animate: true, animation: slow}
EnvironmentClimate a31@--> Raw
a31@{animate: true, animation: slow}
FieldImagery a32@--> Raw
a32@{animate:true, animation: fast}
Orthoimagery a2@--> Raw
a2@{animate: true, animation: slow}
StatProducts a1@--> Raw
a1@{animate: true, animation: slow}
Raw a4@--> Transform
a4@{animate: true, animation: slow}
Transform a5@--> Opt
a5@{animate: true, animation: slow}
Opt a6@--> Parquet
a6@{animate: true, animation: slow}
Parquet a7@--> FlatGeoBuf
a7@{animate: true, animation: slow}
Parquet a8@--> PMTiles
a8@{animate: true, animation: slow}
Opt a9@--> COG
a9@{animate: true, animation: slow}
Opt a29@--> Zarr
a29@{animate: true, animation: slow}
Opt a33@--> JPEGXL
a33@{animate: true, animation: slow}
JPEGXL a34@ --> ObjectStorage
a34@{animate: true, animation: fast}
Opt a35@--> AV1
a35@{animate: true, animation: slow}
AV1 a36@ --> ObjectStorage
a36@{animate: true, animation: slow}
Zarr a30@--> ObjectStorage
a30@{animate: true, animation: slow}
Parquet a10@--> ObjectStorage
a10@{animate: true, animation: slow}
FlatGeoBuf a11@--> ObjectStorage
a11@{animate: true, animation: slow}
PMTiles a12@--> ObjectStorage
a12@{animate: true, animation: slow}
COG a13@--> ObjectStorage
a13@{animate: true, animation: slow}
COG a28@--> PMTiles
a28@{animate: true, animation: slow}
ObjectStorage a14@--> Metadata
a14@{animate: true, animation: slow}
Metadata a15@--> HTTP
a15@{animate: true, animation: slow}
HTTP a16@--> ei
a16@{animate: true, animation: slow}
HTTP a17@--> DecentralizedDistribution
a17@{animate: true, animation: slow}
HTTP a18@--> DataSci
a18@{animate: true, animation: slow}
DecentralizedDistribution a22@--> Systems
a22@{animate: true, animation: fast}
DecentralizedDistribution a23@--> DataSci
a23@{animate: true, animation: fast}
Systems a38@ --> DataSci
a38@{animate: true, animation: fast}
Martin a24@--> DataSci
a24@{animate: true, animation: slow}
GeoServer a25@--> DataSci
a25@{animate: true, animation: slow}
ZOOProject a26@--> DataSci
a26@{animate: true, animation: slow}
BBOXServer a27@--> DataSci
a27@{animate: true, animation: slow}
Panoramax a37@--> DataSci
a37@{animate: true, animation: slow}
click Parquet "https://github.com/apache/parquet-format/" _blank
click FlatGeoBuf "https://flatgeobuf.org/" _blank
click PMTiles "https://github.com/protomaps/PMTiles/blob/main/spec/v3/spec.md" _blank
click Zarr "https://github.com/zarr-developers/geozarr-spec/" _blank
click StatProducts "/docs/processes/statistical_products/" _blank
click Orthoimagery "/docs/processes/orthoimagery/" _blank
%%click FieldImagery "/docs/processes/field_imagery/" _blank
%%click EnvironmentClimate "/docs/processes/environmental_climate/" _blank
%%click Elevation "/docs/processes/elevation/" _blank
click JPEGXL "https://jpeg.org/jpegxl/" _blank
click AV1 "https://aomedia.org/specifications/av1/" _blank
click DecentralizedDistribution "/docs/dissemination/" _blank
click Metadata "https://stac-utils.github.io/stac-geoparquet/latest/spec/stac-geoparquet-spec/" _blank
click Martin "https://martin.maplibre.org/" _blank
click GeoServer "https://geoserver.org/" _blank
click Panoramax "https://gitlab.com/panoramax" _blank
click COG "https://cogeo.org/" _blank
click ZOOProject "https://zoo-project.org/" _blank
click BBOXServer "https://www.bbox.earth/" _blank