Challenge
A global enterprise compensation management platform relied on fragmented, manually managed data flows across multiple HR and payroll systems.
Each source had its own authentication model, data format, latency profile, and failure mode. Without a unified orchestration layer, data engineers spent significant time manually managing runs, investigating silent failures, and reconciling data across compensation cycles.
The core requirement was to build a fully orchestrated, fault-tolerant pipeline system that ingests data from heterogeneous sources on configurable schedules, applies business logic at ingestion time, and makes clean data available downstream — with comprehensive logging, error handling, and a structured deployment path from dev through production.
Approach
Rather than building isolated pipelines per source, we designed a feed-driven orchestration model.
A central configuration table in the database drives what runs, when, and how — making it straightforward to onboard new feeds without modifying pipeline code.
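As a sketch of what such a feed-configuration table might look like — the table and column names below are illustrative, not the production schema:

```sql
-- Illustrative feed configuration table; names are hypothetical.
CREATE TABLE etl.FeedConfig (
    FeedId       INT IDENTITY PRIMARY KEY,
    FeedName     NVARCHAR(100) NOT NULL,  -- e.g. 'Workday_HR_Daily'
    SourceType   NVARCHAR(50)  NOT NULL,  -- 'Workday', 'ADP', 'SFTP', 'API'
    ScheduleExpr NVARCHAR(50)  NOT NULL,  -- when the feed should run
    TargetTable  NVARCHAR(200) NOT NULL,  -- landing table for clean data
    IsActive     BIT NOT NULL DEFAULT 1
);

-- Onboarding a new feed is a row insert, not a pipeline change.
INSERT INTO etl.FeedConfig (FeedName, SourceType, ScheduleExpr, TargetTable)
VALUES ('ADP_Payroll_CodeA', 'ADP', 'daily@06:00', 'stg.AdpPayrollA');
```

The orchestrating pipeline reads the active rows from this table at runtime and iterates over them, so pipeline code never needs to change when a feed is added or retired.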
Six key engineering patterns were applied throughout:

1. Configuration-driven scheduling — all schedule logic lives in SQL, not ADF, allowing new feeds via a database row insert.
2. Idempotent copy design — every Copy activity is preceded by targeted DELETEs via preCopyScript.
3. Fault isolation inside ForEach loops — using SetVariable on failure patterns to continue processing.
4. ForEach/IfCondition nesting workaround — delegating complex sub-flows to child pipelines.
5. Checksum gating for SFTP ingestion — three-gate conditional handling for late, duplicate, or missing files.
6. Zero-secret pipeline architecture — all credentials in Azure Key Vault, referenced via Linked Services at runtime.
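The idempotent copy pattern can be sketched as a preCopyScript that deletes exactly the slice the Copy activity is about to write, so a retried run overwrites rather than duplicates. The table and parameter names here are hypothetical:

```sql
-- Hypothetical preCopyScript for a Copy activity. ADF string
-- interpolation (@{...}) injects the pipeline parameters at runtime;
-- the DELETE targets only the slice this run will reload.
DELETE FROM stg.AdpPayrollA
WHERE FeedRunDate = '@{pipeline().parameters.RunDate}'
  AND CompanyCode = '@{pipeline().parameters.CompanyCode}';
```

Because the DELETE and the subsequent load cover the same slice, any pipeline can be rerun after a failure with no downstream reconciliation.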
Pre- and post-processing stored procedures bracket every feed run, with structured logging capturing outputs and errors regardless of outcome. The pipeline stays thin — routing and orchestration only — while all business logic lives in SQL where it can be tested independently.
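A minimal sketch of the post-processing side of that bracket, assuming hypothetical procedure and table names:

```sql
-- Hypothetical post-processing procedure: records the outcome of a
-- feed run whether it succeeded or failed, so every run leaves an
-- audit row that can be queried independently of ADF.
CREATE PROCEDURE etl.usp_FeedRunComplete
    @FeedId       INT,
    @RunId        UNIQUEIDENTIFIER,    -- ADF pipeline run id
    @Status       NVARCHAR(20),        -- 'Succeeded' | 'Failed'
    @RowsLoaded   INT           = NULL,
    @ErrorMessage NVARCHAR(MAX) = NULL
AS
BEGIN
    SET NOCOUNT ON;
    INSERT INTO etl.FeedRunLog
        (FeedId, RunId, Status, RowsLoaded, ErrorMessage, CompletedAt)
    VALUES
        (@FeedId, @RunId, @Status, @RowsLoaded, @ErrorMessage,
         SYSUTCDATETIME());
END;
```

The pipeline calls a procedure like this from both its success and failure paths, which is what keeps the logging comprehensive regardless of outcome.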
Outcome
The pipeline architecture has been running in production through multiple annual compensation cycles, daily HR data refreshes, and on-demand payroll ingestion runs.
Fully automated ingestion across Workday, ADP (three company codes), SFTP, and market data APIs — no manual intervention during normal operations.

Feed-driven configuration allows new data sources to be onboarded by inserting a database row. Idempotent copy patterns ensure any pipeline can be safely retried without downstream data reconciliation. A four-stage DevOps release pipeline with approval gates provides a safe, auditable promotion path for all pipeline and schema changes.