mirror of
https://github.com/dataforcanada/d4c-datapkg-statistical.git
synced 2026-06-13 14:10:55 +02:00
be1396e652bea3ff373e0fbd5674be1779b0fa2b
Remove vector tiles HTML experiment. All the work is available at https://www.diegoripley.ca/files/census_of_population_2021_vector_tiles_august_12_2025/. I have the code in another repo, but I need to clean it up before publishing
Table of Contents
About
process-statcan-data-dev is a set of scripts that helps you load and prepare Statistics Canada data for analysis. It performs schema normalization, field name standardization, and adds derived fields (such as DGUIDs) to improve consistency, interoperability, and usability across datasets.
All output datasets are written in GeoParquet format to support modern geospatial workflows and ensure broad compatibility across platforms.
This project processes the following datasets:
- Geographic Boundaries (2001–2021)
- Census of Population (2001–2021)
- National Household Survey (2011–2016)
- Census of Agriculture (2001–2021)
- National Address Register (2022–2024)
- Road Network Files (2001–2021)
How to Run
This project uses a Dev Container environment for setup and execution. If you are using VS Code all you need is the Dev Containers extension installed and Docker installed on your system.
# Clone the repository
git clone https://github.com/dataforcanada/process-statcan-data-dev.git
# Navigate to the project directory
cd process-statcan-data-dev
# In Dev Container
./main.sh
License
This product is distributed under an MIT license.
Languages
Jupyter Notebook
94.1%
Shell
5.4%
Python
0.4%