Data Engineer

 Posted 2 hours ago
     
⭐ 2-5 years experience
Apply Now

Please mention DailyRemote when applying

AI Summary

Design and optimize data infrastructure to replicate operational data from MongoDB to Google BigQuery for analytics. Manage BI enablement via Looker Studio and implement security safeguards to isolate analytical workloads from production databases.

Company Description

Dashlabs.ai (YC W21) is a healthcare operations platform that integrates Electronic Medical Records (EMR), Laboratory Information Systems (LIS), and Radiology Information Systems (RIS) into a unified ecosystem. Built for diagnostic laboratories, imaging centers, clinics, and hospitals, Dashlabs.ai serves as the operational backbone for diagnostic providers across the Philippines and is expanding its footprint into Indonesia and Malaysia. We build robust, scalable infrastructure to streamline healthcare workflows, optimize diagnostics, and improve patient outcomes.

Job Description

Role Overview
As a Data Engineer, you will design, build, and optimize the data infrastructure that powers our reporting, analytics, and business intelligence capabilities. You will manage data synchronization across disparate database paradigms, enable stakeholders through modern BI tools, and play a critical role in safeguarding our core production databases from intensive analytical workloads and unauthorized data extraction.
This role requires a strong engineering background with a deep understanding of the MERN stack, particularly TypeScript, alongside robust database administration, data warehousing, and ETL/ELT pipeline expertise.

Key Responsibilities

1. Data Pipeline Engineering & ELT

  • Design, implement, and maintain scalable data pipelines to replicate and transform operational data from a production MongoDB environment into Google BigQuery for analytics and data warehousing.
  • Ensure data consistency, integrity, and low-latency synchronization between NoSQL documents and BigQuery's columnar storage structures.
  • Optimize BigQuery storage, partitioning, clustering, and query performance to minimize operational costs and speed up retrieval times.

2. Analytics & Business Intelligence Infrastructure

  • Own the onboarding, access management, and data enablement pipeline for users on Looker Studio, ensuring seamless integration with BigQuery datasets.
  • Construct and maintain semantic layers, optimized BigQuery views, and data sources that empower internal teams and external clients to build self-service reports.

3. Production Database Optimization & Security

  • Implement architectural safeguards (e.g., read replicas, change data capture, data lakes) to isolate analytics workloads and protect the primary production MongoDB database from resource-intensive data extraction queries.
  • Collaborate with the security and infrastructure teams to enforce strict data governance, access controls, and masking protocols for sensitive healthcare data before it reaches the data warehouse.

4. Full-Stack Data Integration (MERN Stack)

  • Write clean, maintainable, and typed code within our existing software ecosystem.
  • Develop backend services, scripts, and internal tools leveraging TypeScript and the MERN stack to facilitate automated data workflows, API integrations, and ETL orchestration.

Qualifications

Technical Stack Proficiencies:

  • Languages: JavaScript, TypeScript, and Advanced SQL (Required).
  • Frameworks: Node.js, Express.js (MERN stack paradigm).
  • Databases & Warehousing: MongoDB (NoSQL) and Google BigQuery.
  • BI Tools: Looker Studio.

Experience & Competencies:

  • Proven experience building production-grade ETL/ELT pipelines converting nested NoSQL data structures (JSON/BSON) into optimized BigQuery schemas.
  • Deep understanding of database indexing, BigQuery billing/slot optimization, and architectural patterns used to separate transactional (OLTP) and analytical (OLAP) workloads.
  • Experience managing user roles, credentials, and data source permissions within cloud-based BI and Google Cloud environments (IAM).
  • Strong commitment to data privacy, security best practices, and handling sensitive healthcare data especially the Data Privacy Act of 2012.
  • Ability to write self-documenting code, design clean APIs, and work effectively in a fast-paced, high-growth startup environment.
  • Excellent communication skills.

Additional Information

This position is remote-first. Arrangement can be flexible (full-time or part-time, etc.).

Dashlabs.ai offers outstanding career opportunities, empowerment in the workplace, and a diverse, friendly team underpinned by competitive compensation packages. Offers for a paid position may be extended post-internship. Salary and level will be commensurate to the candidate's experience, qualifications, and applicable skillsets.

Similar Jobs

See all Remote Software Development jobs β†’

Personalize your Remote Job Search in 3 Easy Steps!

Discover remote opportunities in Data Engineer

Answer easy questions

Answer easy questions

200,000+ jobs across 15+ categories

Get your best job matches

Get your best job matches

Only hand-screened, legit jobs

Find a remote job faster

Find a remote job faster

No ads, scams, or junk

I was the first applicant for a remote marketing position that got listed on the company website the same day I applied. Had an interview within 48 hours!

Sarah J. — Sarah J. · Marketing Manager ★★★★★ Verified