Commit Graph

42 Commits

Author SHA1 Message Date
Diego Ripley 79a37a3afc Some fixes for ubuntu user 2025-06-27 08:47:11 -04:00
Diego Ripley 2f346bfa4c Fix #4. Needed to mount ~/.ssh and make sure that it has the same UID/GID as the host user 2025-06-26 15:14:33 +00:00
Diego Ripley a55e1d325d Made changes 2025-06-26 13:35:37 +00:00
Diego Ripley 4ed5fb4bbb Add DuckDB example for duplicate column name 2025-06-25 15:38:12 +00:00
Diego Ripley b71a7b326e DuckDB issue with duplicate column names (e.g. 'Value' and 'VALUE' are treated the same) 2025-06-25 15:30:36 +00:00
Diego Ripley e929850d4a Finish comment on issue with Value and VALUE columns being treated the same by DuckDB 2025-06-21 18:03:29 +00:00
Diego Ripley 8875722d10 Made changes to processing of data tables 2025-06-21 18:01:16 +00:00
Diego Ripley 7c8211cb5f Found some issues with the output parquet files 2025-06-21 05:26:50 +00:00
Diego Ripley 887291d2f7 Read all DGUIDs from subset parquet output (100,000 records each) 2025-06-21 00:54:26 -04:00
Diego Ripley 72ca6c87e1 Made changes 2025-06-20 17:32:01 -04:00
Diego Ripley 5a95616b3c Calculate CSV file size by viewing inside of zip file 2025-06-20 16:01:20 -04:00
Diego Ripley e836363cd1 Had to optimize the code. Leaving it outside of function for now in case I need to continue working on it 2025-06-20 16:00:51 -04:00
Diego Ripley f6d88c5fd0 Continue work on processing data tables 2025-06-19 15:58:30 -04:00
Diego Ripley ab8f40c708 Keeping track of processed files in case processing crashes and I have to restart again 2025-06-19 11:46:31 -04:00
Diego Ripley faa63451ab Experiment with Jupyter notebook on downloading and processing statcan cubes 2025-06-18 21:26:51 +00:00
Diego Ripley c0899080f4 Remove scratch files after processing. Was running out of space 2025-06-18 09:26:18 -04:00
Diego Ripley f85ce79ff2 Add polars to Dockerfile 2025-06-17 21:19:42 +00:00
Diego Ripley ea603f2914 Convert statcan CSV into parquet 2025-06-17 20:46:24 +00:00
Diego Ripley 4d16ef8232 Process 2024-06 national address register 2025-06-07 15:39:28 +00:00
Diego Ripley b3c5f8767f Move export of 2021 geographic boundaries 2025-06-06 22:51:49 +00:00
Diego Ripley 5e26ec282e Process 2024-12 national address register. Still need to make some improvements 2025-06-06 22:49:47 +00:00
Diego Ripley 2350d6d8d7 Update README files 2025-06-06 11:20:41 +00:00
Diego Ripley ce999a3a4d Remove unused code 2025-06-04 17:02:21 +00:00
Diego Ripley 3565b7c5a4 Update DuckDB lonboard example 2025-06-04 17:01:04 +00:00
Diego Ripley fad884efc8 Create 2021 cartographic boundary files 2025-06-04 16:36:45 +00:00
Diego Ripley 13a86dc3dc Add 2006, 2011, 2016 cartographic boundary files 2025-06-03 14:34:53 +00:00
Diego Ripley 0203e04b45 Add 2021 cartographic boundary files 2025-06-03 13:33:03 +00:00
Diego Ripley 23e3133118 Add 2021 cartographic boundary files 2025-06-02 22:03:49 +00:00
Diego Ripley 5a188a469c Update README 2025-06-02 14:13:58 +00:00
Diego Ripley 7e0b001075 Enable downloading of 2021 hydro 2025-06-02 13:56:50 +00:00
Diego Ripley 4eae3622d2 Update DuckDB lonboard example 2025-05-30 22:30:35 +00:00
Diego Ripley 34d2c50046 Update DuckDB lonboard example 2025-05-30 17:29:58 +00:00
Diego Ripley bd7e8ee9f4 Should have used count_total_4 in calculation instead of count_total_1 2025-05-30 16:15:27 +00:00
Diego Ripley daf5a2d154 Fix #6 2025-05-30 16:11:45 +00:00
Diego Ripley 6aedfe7c46 Update DuckDB example so it calculates the area of a polygon properly 2025-05-30 15:26:22 +00:00
Diego Ripley a8b05a96f9 Add jenkspy package to Dockerfile 2025-05-27 15:23:59 +00:00
Diego Ripley b7f8506708 Add DuckDB and lonboard example. Still need lots of work to get it right 2025-05-27 15:22:10 +00:00
Diego Ripley d4a25af7b1 Format DuckDB SQL code 2025-05-25 20:05:15 +00:00
Diego Ripley b4b7e20219 Add buckaroo and remove rclone from the Dockerfile 2025-05-25 19:39:19 +00:00
Diego Ripley b260d259c8 Add DuckDB example for selecting a Census of Population variable at the Dissemination Area level, calculating a field, and exporting the dataset into multiple formats 2025-05-25 19:35:30 +00:00
Diego Ripley 8dab61a685 Update URL for FED 2021, 2013 representation order 2025-05-24 19:37:12 +00:00
Diego Ripley f93e4d0cec Initial commit 2025-05-24 13:37:31 -04:00
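
Commits b71a7b326e, e929850d4a and 4ed5fb4bbb mention that DuckDB treats column names such as 'Value' and 'VALUE' as the same. The sketch below is one possible way to spot such collisions before loading a table. It is not taken from the repository: the helper name `find_case_collisions` and the sample column names are hypothetical, and it only assumes that names are compared without regard to case.

```python
from collections import defaultdict


def find_case_collisions(columns):
    """Group column names that become identical once case is ignored.

    Hypothetical helper for illustration; not part of the repository.
    """
    groups = defaultdict(list)
    for name in columns:
        # casefold() gives a case-insensitive key for comparison
        groups[name.casefold()].append(name)
    # Keep only the keys shared by more than one original spelling
    return {key: names for key, names in groups.items() if len(names) > 1}


if __name__ == "__main__":
    # Illustrative header, not read from an actual data table
    header = ["REF_DATE", "GEO", "Value", "VALUE"]
    print(find_case_collisions(header))
```

Running the check on a CSV header before conversion would make it possible to rename one of the colliding columns up front instead of finding the problem in the output files.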