Running the pipeline¶
The following section describes how to run the 3dbag-pipeline to produce the core of 3DBAG, excluding the extension packages (floors_estimation and party_walls).
Sourcing the inputs and generating the 3DBAG data requires a lot of storage. As an indication, running the whole pipeline for a single AHN tile requires about 70GB space and 1-2 hours processing, depending on the machine.
The core package consists of several jobs that need to be executed in a certain order:
bgt— Load the latest BGT Pand layer.ahn_tile_index— Download the AHN tile index (bladwijzer) and checksum files, and create the metadata tables for AHN 3, 4 and 5. Must be run before the AHN download jobs.ahn3/ahn4/ahn5— Download the LAZ files and record metadata for each AHN version. These jobs are partitioned by tile and can be run independently. Theahn_checksum_sensorautomates triggering these jobs when PDOK updates the upstream checksum files.ahn_metadata_index— Create indices on the AHN metadata tables. Run after all AHN tile downloads are complete.source_input— Download and stage BAG and TOP10NL source data, then prepare the input for reconstruction (tiling, intermediary processing).nl_reconstruct— Run the crop and reconstruct steps for the Netherlands (partitioned by tile).nl_export— Run the tyler export and 3D Tiles steps. Usenl_export_after_floorsinstead if the floors_estimation package has been run first — it includes the GeoPackage and 3D Tiles outputs.nl_deploy— Deploy the Netherlands data to the production servers.nl_release— Perform the final steps for the 3DBAG release (publish data and webservices).
For debugging reconstruction, nl_reconstruct_debug runs the same steps as nl_reconstruct with additional debug output.