Checking Best Practices¶
The gpio check commands validate GeoParquet files against best practices.
Run All Checks¶
gpio check all myfile.parquet
Runs all validation checks:
- Spatial ordering
- Compression settings
- Bbox structure and metadata
- Row group optimization
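To run the same checks over many files, the command can be wrapped in a small shell loop. The sketch below is one possible approach, not part of gpio itself: the function name check_all_in_dir is hypothetical, and it uses only the gpio check all command shown above. It makes no assumption about gpio's exit codes and simply prints a header before each report.

```shell
# Hypothetical helper: run `gpio check all` on every .parquet file in a directory.
check_all_in_dir() {
  dir="${1:-.}"
  count=0
  for file in "$dir"/*.parquet; do
    # Skip the unexpanded pattern when the directory has no .parquet files.
    [ -e "$file" ] || continue
    echo "== $file =="
    gpio check all "$file"
    count=$((count + 1))
  done
  echo "Checked $count file(s)"
}
```

For example, check_all_in_dir ./data would check each .parquet file directly inside ./data (subdirectories are not searched).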
Individual Checks¶
Spatial Ordering¶
gpio check spatial myfile.parquet
Checks whether the data is spatially ordered, using random sampling. Spatially ordered data improves:
- Query performance
- Compression ratios
- Cloud access patterns
Compression¶
gpio check compression myfile.parquet
Validates geometry column compression settings.
Bbox Structure¶
gpio check bbox myfile.parquet
Verifies:
- Bbox column structure
- GeoParquet metadata version
- Bbox covering metadata
Row Groups¶
gpio check row-group myfile.parquet
Checks row group size optimization for cloud-native access.
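When only some of the individual checks are of interest, they can be run in sequence from a small wrapper. The following is a minimal sketch under stated assumptions: run_checks is a hypothetical function name, and the accepted check names are just the four subcommands listed above.

```shell
# Hypothetical helper: run a chosen subset of the individual checks on one file.
# Accepted names are the subcommands documented above:
# spatial, compression, bbox, row-group.
run_checks() {
  file="$1"
  shift
  for check in "$@"; do
    case "$check" in
      spatial|compression|bbox|row-group)
        echo "== $check =="
        gpio check "$check" "$file"
        ;;
      *)
        # Reject anything that is not one of the documented subcommands.
        echo "Unknown check: $check" >&2
        return 2
        ;;
    esac
  done
}
```

For example, run_checks myfile.parquet compression row-group runs only those two checks, in that order.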
Options¶
# Verbose output with details
gpio check all myfile.parquet --verbose
# Custom sampling for spatial check
gpio check spatial myfile.parquet --random-sample-size 200 --limit-rows 1000000
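Because the spatial check relies on random sampling, it can be useful to repeat it with different sample sizes and compare the reports. The sketch below assumes only the two flags shown above (--random-sample-size and --limit-rows); the function name spatial_sweep and the sample sizes in the usage example are illustrative, not recommendations from gpio.

```shell
# Hypothetical helper: repeat the spatial check with several sample sizes.
# Usage: spatial_sweep FILE LIMIT_ROWS SIZE [SIZE...]
spatial_sweep() {
  file="$1"
  limit_rows="$2"
  shift 2
  for size in "$@"; do
    echo "== sample size $size, row limit $limit_rows =="
    gpio check spatial "$file" --random-sample-size "$size" --limit-rows "$limit_rows"
  done
}
```

For example, spatial_sweep myfile.parquet 1000000 100 200 500 runs the check three times with the same row limit. Whether the results differ between runs depends on the data.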