Remote Files
Read GeoParquet files from cloud storage and HTTPS URLs.
Supported Protocols
- HTTPS: Public and private URLs
- S3: s3://bucket/path/file.parquet
- Azure: az://container/file.parquet or https://account.blob.core.windows.net/...
- GCS: gs://bucket/path/file.parquet
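As a rough illustration of how the prefixes listed above map to storage backends, the sketch below classifies a path by its scheme. The function name remote_kind is hypothetical and is not part of gpio; it only shows the naming convention and says nothing about how gpio itself resolves paths.

```shell
# Hypothetical helper (not part of gpio): map a path to the backend
# implied by its prefix, following the list of supported protocols.
remote_kind() {
  case "$1" in
    s3://*) echo "s3" ;;
    az://*) echo "azure" ;;
    # Azure Blob HTTPS endpoints must be tested before generic HTTPS.
    https://*.blob.core.windows.net/*) echo "azure" ;;
    gs://*) echo "gcs" ;;
    https://*) echo "https" ;;
    *) echo "local" ;;
  esac
}
```

The Azure HTTPS pattern is deliberately loose; a stricter check would parse the host name separately.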
Basic Usage
# Inspect remote file
gpio inspect https://data.source.coop/path/file.parquet
# Check remote file
gpio check all s3://bucket/data.parquet
# Convert remote to local
gpio convert https://example.com/data.geojson local.parquet
# Process remote, upload result
gpio sort hilbert s3://bucket/input.parquet local-sorted.parquet
gpio upload local-sorted.parquet s3://bucket/output/sorted.parquet
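The process-then-upload pattern above can be wrapped in one function that sorts into a temporary directory and uploads the result. This is a minimal sketch: the function name sort_and_upload and the GPIO override variable are assumptions introduced here (GPIO lets you substitute echo for a dry run), and only the gpio sort hilbert and gpio upload invocations shown above are used.

```shell
# Hypothetical wrapper around the two commands shown above.
# Set GPIO=echo to print the commands instead of running them.
sort_and_upload() {
  local src="$1" dest="$2" workdir sorted status
  workdir=$(mktemp -d) || return 1
  sorted="$workdir/sorted.parquet"
  # Outputs are written locally first, then transferred with gpio upload.
  if "${GPIO:-gpio}" sort hilbert "$src" "$sorted" &&
     "${GPIO:-gpio}" upload "$sorted" "$dest"; then
    status=0
  else
    status=$?
  fi
  rm -rf "$workdir"   # remove the temporary copy whether or not the run succeeded
  return "$status"
}
```

A call such as sort_and_upload s3://bucket/input.parquet s3://bucket/output/sorted.parquet then replaces the two separate commands.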
Authentication
AWS S3
# Environment variables
export AWS_ACCESS_KEY_ID=your_key
export AWS_SECRET_ACCESS_KEY=your_secret
# Or use AWS CLI
aws configure
# Or use profile
export AWS_PROFILE=your-profile
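A script can check up front whether one of the environment-based options above is in place. The sketch below is an assumption-laden convenience, not gpio behavior: the function name aws_env_configured is hypothetical, and it only looks at environment variables, so it reports failure for credentials stored by aws configure under the default profile.

```shell
# Hypothetical pre-flight check (not part of gpio): succeed if a profile
# is selected, or if both access-key variables are non-empty.
aws_env_configured() {
  if [ -n "${AWS_PROFILE:-}" ]; then
    return 0
  fi
  [ -n "${AWS_ACCESS_KEY_ID:-}" ] && [ -n "${AWS_SECRET_ACCESS_KEY:-}" ]
}
```

Treat a failure as a hint to check your setup, not as proof that S3 access will fail.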
Azure
export AZURE_STORAGE_ACCOUNT_NAME=account
export AZURE_STORAGE_ACCOUNT_KEY=key
# Or SAS token
export AZURE_STORAGE_SAS_TOKEN=token
Google Cloud
export GOOGLE_APPLICATION_CREDENTIALS=/path/to/service-account.json
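A common failure is a credentials variable that points at a missing or unreadable file. The following sketch checks for that before running any command; the function name gcp_credentials_readable is hypothetical, and the check does not validate the contents of the key file.

```shell
# Hypothetical pre-flight check (not part of gpio): succeed only if the
# variable is set and names a file the current user can read.
gcp_credentials_readable() {
  [ -n "${GOOGLE_APPLICATION_CREDENTIALS:-}" ] &&
    [ -r "$GOOGLE_APPLICATION_CREDENTIALS" ]
}
```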
Limitations
- Outputs write to local filesystem only
- Use gpio upload to transfer results to cloud storage
- STAC generation requires local files
- HTTPS wildcards (*.parquet) not supported
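Because HTTPS wildcards are not expanded, one workaround is to list the file names explicitly and build each URL yourself. The sketch below assumes you already know the file names; the function name expand_urls and the example.com base URL are hypothetical.

```shell
# Hypothetical helper (not part of gpio): print one full URL per file name.
# Usage: expand_urls BASE_URL NAME...
expand_urls() {
  base="${1%/}"   # drop a trailing slash so the join below is clean
  shift
  for name in "$@"; do
    printf '%s/%s\n' "$base" "$name"
  done
}

# Example (requires gpio):
#   expand_urls https://example.com/data a.parquet b.parquet |
#     while read -r url; do gpio inspect "$url"; done
```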