diff --git a/content/_index.md b/content/_index.md
index 2c1d892..fd87f4a 100644
--- a/content/_index.md
+++ b/content/_index.md
@@ -4,12 +4,15 @@ toc: false
---
## Mission
+
Data for Canada exists to bridge the gap between open data availability and data usability. We curate, clean, and re-engineer high-value Canadian datasets into high-performance, analysis-ready formats for researchers, developers, and systems.
## The Problem
+
Canada creates incredible amounts of open data, from foundational road networks to federal census statistics. However, these datasets are often locked in legacy formats, fragmented portals, or structures that require significant engineering effort to normalize before they can be used. For a researcher or a system developer, the "time-to-insight" is often bottlenecked by data preparation.
## The Solution
+
We act as the transformation layer. We aggregate datasets with permissive licenses and process them into "digestible" standards optimized for modern downstream applications.
* **For Researchers:** Skip the cleaning phase. Access normalized, documented data ready for analysis.
@@ -26,13 +29,13 @@ flowchart TD
classDef consumer fill:#f3e5f5,stroke:#8e24aa,stroke-width:2px
subgraph "Data Sources"
- StatCan[("Statistical Products")]:::source
+ StatProducts[("Statistical Products")]:::source
Orthoimagery[("Orthoimagery")]:::source
end
subgraph "Processing Pipeline"
Raw[Raw Data Ingestion
CSVs, Shapefiles, ECW]:::process
- Transform[Transformation Engine
Open & Closed Source]:::process
+ Transform[Transformation Engine]:::process
Opt[Optimization]:::process
end
@@ -43,35 +46,41 @@ flowchart TD
end
subgraph "Distribution Infrastructure"
- S3_COMPLIANT_STORAGE[S3 Compliant Storage]:::storage
- Decentralized_Distribution[Decentralized Distribution]:::storage
- Serverless[Cloudflare Worker
API & Serving]:::storage
+ ObjectStorage[Object Storage]:::storage
+ DecentralizedDistribution[Decentralized Distribution]:::storage
+ Serverless[Serverless Worker
API & Serving]:::storage
end
subgraph "Consumption / End Users"
DataSci[DuckDB, Python, QGIS, Jupyter]:::consumer
WebApps[Web Applications]:::consumer
- DataSci[Python]:::consumer
+ DataSci[Python, R, Julia]:::consumer
Systems[Systems]:::consumer
end
%% Relationships
- StatCan --> Raw
+ StatProducts --> Raw
Raw --> Transform
Transform --> Opt
Opt --> Parquet
Opt --> PMTiles
Opt --> FlatGeoBuf
- Parquet --> S3_COMPLIANT_STORAGE
- PMTiles --> S3_COMPLIANT_STORAGE
- FlatGeoBuf --> S3_COMPLIANT_STORAGE
- S3_COMPLIANT_STORAGE --> Decentralized_Distribution
- S3_COMPLIANT_STORAGE --> Serverless
- Decentralized_Distribution --> Systems
+ Parquet --> ObjectStorage
+ PMTiles --> ObjectStorage
+ FlatGeoBuf --> ObjectStorage
+ ObjectStorage --> DecentralizedDistribution
+ ObjectStorage --> Serverless
+ DecentralizedDistribution --> Systems
Serverless --> WebApps
Serverless --> DataSci
%% Concept Annotations
Transform -.->|"Join Spatial & Tabular"| Parquet
- PMTiles -.->|"Stream tiles"| WebApps
+ PMTiles -.->|"Stream"| WebApps
+ FlatGeoBuf -.->|"Stream"|DataSci
+ FlatGeoBuf -.->|"Stream"|WebApps
+
+ click StatProducts "https://www.dataforcanada.org/docs/processes/statistical_products/" _blank
+ click Orthoimagery "https://www.dataforcanada.org/docs/processes/statistical_products/" _blank
+ click DecentralizedDistribution "https://www.dataforcanada.org/docs/dissemination/" _blank
```