Update README

This commit is contained in:
Diego Ripley
2025-06-02 14:13:58 +00:00
parent 7e0b001075
commit 5a188a469c
+16 -12
View File
@@ -1,23 +1,27 @@
## Table of Contents ## Table of Contents
- [About](#-about) - [About](#about)
- [How to Run](#-how-to-run) - [How to Run](#how-to-run)
- [License](#-license) - [License](#license)
## About ## About
**process-statcan-data** is a set of scripts that helps you load all Statistics Canada data required for data analysis. Specifically this loads: **process-statcan-data** is a set of scripts that helps you load and prepare Statistics Canada data for analysis. It performs schema normalization, field name standardization, and adds derived fields (such as DGUIDs) to improve consistency, interoperability, and usability across datasets.
- **Boundaries**: Geographic boundaries from 2001 to 2021. All output datasets are written in GeoParquet format to support modern geospatial workflows and ensure broad compatibility across platforms.
- **Road Network Files**: Road network files from 2001 to 2021.
- **Health Regions**: Health Regions from 2003 to 2023. This project processes the following datasets:
- **National Address Register**: National Address Register files from 2022 to 2024.
- **Census of Population**: Census of Population data from 2001 to 2021. - **Geographic Boundaries** (20012021)
- **Census of Agriculture**: Census of Agriculture from 2001 to 2021. - **Road Network Files** (20012021)
- **National Household Survey**: National Household Survey from 2011 to 2016. - **Health Regions** (20032023)
- **National Address Register** (20222024)
- **Census of Population** (20012021)
- **Census of Agriculture** (20012021)
- **National Household Survey** (20112016)
## How to Run ## How to Run
We make use of Dev Container to run the code: This project uses a **Dev Container** environment for setup and execution:
```shell ```shell
# Clone the repository # Clone the repository