Apply Now

Please mention DailyRemote when applying

AI Summary

The Senior Data Engineer is responsible for designing and implementing scalable data pipelines and optimizing large-scale Spark jobs. They also establish best practices for code quality and automate deployments across various environments.

Position Summary
The Senior Data Engineer drives the design, development, and operational excellence of our data platform built on Databricks and the Delta Lakehouse Architecture. This role requires deep expertise in scalable ETL/ELT, Spark optimization, and modern data governance using Unity Catalog. The Senior Data Engineer serves as a technical leader, establishing best practices for code quality, performance, and automated deployments across all environments.


Responsibilities

  • Design and implement scalable data pipelines using Delta Lake and manage enterprise-wide data access, security, and lineage using Unity Catalog
  • Optimize large-scale Spark jobs (PySpark/SQL) and cluster configurations (Photon) to meet stringent SLA and cost performance targets across all workflows
  • Build resilient data scheduling via Databricks Workflows (Jobs) and establish automated CI/CD pipelines for reliable code promotion across Dev, Staging, and Prod workspaces
  • Migrate data and models from relational databases to Databricks
  • Ensure best practices for development using industry standard development patterns
  • Monitor data pipelines performance
  • Partner with Data Management and Full Stack development engineers to operationalize models with current applications and processes
  • Support existing data pipelines to ensure business continuity
  • Stay updated with the latest trends and technologies in data engineering and cloud computing
  • Perform other related duties as assigned

Requirements

  • Bachelor’s degree is required
  • 5+ years of experience in Data Engineering, with a significant focus on data warehousing, ETL/ELT development, and distributed systems
  • 3+ years of hands-on experience developing enterprise solutions on the Databricks platform
  • Expertise in PySpark and high-performance SQL
  • Deep understanding practical knowledge of Delta Lake architecture and optimal maintenance best practices
  • Experience with cloud platforms (AWS preferred) and integrating Databricks with native cloud services (S3, Secret Manager, IAM)
  • Solid experience implementing CI/CD for Databricks notebooks and associated libraries
  • Healthcare experience is preferred

Skills

  • Strong development experience with: Python, Spark, or similar technologies
  • Excellent verbal, communication, negotiation, and presentation skills
  • Strong analytical and problem-solving skills
  • Ability to work autonomously while collaborating across teams to deliver timely projects
  • Ability to explain complex concepts in simple terms
  • Dedicated, hardworking employee who achieves maximum efficiency and productivity
  • Strong knowledge of domain-based design, data modeling and data structures
  • Strong knowledge of best practice in data management

No Travel

#LI-Remote

Similar Jobs

See all Remote Software Development jobs →

Personalize your Remote Job Search in 3 Easy Steps!

Discover remote opportunities in Data Engineer

Answer easy questions

Answer easy questions

200,000+ jobs across 15+ categories

Get your best job matches

Get your best job matches

Only hand-screened, legit jobs

Find a remote job faster

Find a remote job faster

No ads, scams, or junk

I was the first applicant for a remote marketing position that got listed on the company website the same day I applied. Had an interview within 48 hours!

Sarah J. — Sarah J. · Marketing Manager ★★★★★ Verified