Data Source
The Blockchair API provides blockchain data for 6 UTXO cryptocurrencies.
Inputs & Outputs
Inputs:
- API Key
- Date Range
- Coin Selection
Outputs:
- Blocks
- Transactions
- Inputs
- Outputs
- Addresses
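As a rough illustration of how the inputs above could map to the listed dump types, the following sketch expands a coin selection and a date range into one download job per coin, table, and day. The names `DumpJob`, `plan_jobs`, and `URL_TEMPLATE` are hypothetical, the coin names are placeholders, and the URL layout is an assumption that should be checked against Blockchair's dump documentation. Whether every table is published per day is also an assumption. API key handling is left out.

```python
from dataclasses import dataclass
from datetime import date, timedelta
from typing import List, Sequence

# The five dump types named in the list above.
TABLES = ("blocks", "transactions", "inputs", "outputs", "addresses")

# Assumed URL layout (not confirmed by this document).
URL_TEMPLATE = (
    "https://gz.blockchair.com/{coin}/{table}/"
    "blockchair_{coin}_{table}_{day}.tsv.gz"
)


@dataclass(frozen=True)
class DumpJob:
    """One file to fetch: a single table of a single coin for a single day."""
    coin: str
    table: str
    day: date

    @property
    def url(self) -> str:
        return URL_TEMPLATE.format(
            coin=self.coin, table=self.table, day=self.day.strftime("%Y%m%d")
        )


def plan_jobs(coins: Sequence[str], start: date, end: date,
              tables: Sequence[str] = TABLES) -> List[DumpJob]:
    """Expand the coin selection and inclusive date range into jobs."""
    if end < start:
        raise ValueError("end date must not be before start date")
    jobs: List[DumpJob] = []
    day = start
    while day <= end:
        for coin in coins:
            for table in tables:
                jobs.append(DumpJob(coin, table, day))
        day += timedelta(days=1)
    return jobs
```

Calling `plan_jobs(["bitcoin", "litecoin"], date(2024, 1, 1), date(2024, 1, 2))` yields 20 jobs: 2 coins, 5 tables, 2 days.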
MugenLink's Phase-Centric ETL Pipeline processes blockchain data from 6 cryptocurrencies through an orchestrated workflow of 6 phases.
- Cryptocurrencies: 6
- Pipeline Phases: 6
- dbt Models: 15+
- Refresh Rate: Daily
The Blockchair API provides blockchain data for 6 UTXO cryptocurrencies.
A Python orchestrator downloads the TSV dumps with retry logic and bandwidth limiting.
Files are uploaded to AWS S3 with encryption and integrity validation.
COPY INTO loads the data from S3 external stages with error tolerance.
A wallet-centric transformation layer uses incremental materialization.
The web app queries the transformed data for real-time dashboards and wallet tracing.
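The download phase mentions retry logic and bandwidth limiting. The sketch below shows one common way to get both with the standard library only: exponential backoff with a little jitter, and a copy loop that sleeps to stay under a byte-rate cap. The function names, default values, and the choice to retry on `OSError` are assumptions for illustration, not MugenLink's actual implementation.

```python
import io
import random
import time
from typing import BinaryIO, Callable, TypeVar

T = TypeVar("T")


def with_retries(action: Callable[[], T], attempts: int = 5,
                 base_delay: float = 1.0,
                 sleep: Callable[[float], None] = time.sleep) -> T:
    """Call action, retrying on OSError with exponential backoff and jitter."""
    for attempt in range(1, attempts + 1):
        try:
            return action()
        except OSError:
            if attempt == attempts:
                raise  # out of attempts: surface the last error
            delay = base_delay * 2 ** (attempt - 1)
            sleep(delay + random.uniform(0, delay / 10))
    raise ValueError("attempts must be at least 1")


def throttled_copy(src: BinaryIO, dst: BinaryIO, max_bytes_per_sec: float,
                   chunk_size: int = 64 * 1024,
                   sleep: Callable[[float], None] = time.sleep,
                   clock: Callable[[], float] = time.monotonic) -> int:
    """Copy src to dst, pausing so the average rate stays under the cap."""
    started = clock()
    copied = 0
    while True:
        chunk = src.read(chunk_size)
        if not chunk:
            return copied
        dst.write(chunk)
        copied += len(chunk)
        # Minimum time the transfer should have taken at the capped rate.
        earliest = copied / max_bytes_per_sec
        elapsed = clock() - started
        if elapsed < earliest:
            sleep(earliest - elapsed)
```

In a real download, `src` could be the response object returned by `urllib.request.urlopen`, and the whole fetch could be wrapped in `with_retries`. Network failures raised by `urllib` (`URLError`) and `ConnectionError` are subclasses of `OSError`, so they would be retried.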
Blockchair API provides daily TSV dumps for 6 UTXO cryptocurrencies with blocks, transactions, inputs, outputs, and addresses.
Apache Airflow orchestrates the ETL pipeline with Python scripts for downloading, uploading, and loading data with retry logic.
dbt transforms raw data into wallet-centric analytics with incremental materialization and clustering for optimal query performance.
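To make "incremental materialization" concrete: an incremental run processes only rows that arrived since the previous run instead of rebuilding the whole table. dbt expresses this in SQL models. The Python sketch below only illustrates the idea, with a hypothetical row shape `(block_id, wallet, amount)` and a high-water mark on `block_id`. None of these names come from the MugenLink models.

```python
from typing import Dict, Iterable, Tuple

# Hypothetical raw row: (block_id, wallet, amount). Not the real schema.
Row = Tuple[int, str, int]


def incremental_update(balances: Dict[str, int], high_water_mark: int,
                       raw_rows: Iterable[Row]) -> int:
    """Fold rows newer than high_water_mark into balances.

    Returns the new high-water mark to store for the next run.
    """
    new_mark = high_water_mark
    for block_id, wallet, amount in raw_rows:
        if block_id <= high_water_mark:
            continue  # already processed in an earlier run
        balances[wallet] = balances.get(wallet, 0) + amount
        new_mark = max(new_mark, block_id)
    return new_mark
```

Running the function a second time over the same raw rows plus one new row changes only the wallet touched by the new row, which is the property that keeps a daily refresh cheap.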
Orchestration
Data Warehouse
Object Storage
Transformation