Design, develop, and maintain scalable data pipelines and ETL/ELT processes using Python and PySpark on AWS. Optimize data models and queries while collaborating with multidisciplinary teams to ensure data quality and governance.
Bluetab, an IBM Company
5 Remote Job Openings at Bluetab, an IBM Company
The role involves conducting process surveys to identify pain points and root causes while generating autonomous process documentation. It also requires translating business requirements into functional solutions and creating medium-complexity diagrams and prototypes.
Design, develop, and support cloud-based data solutions with a focus on automating ETL processes using AWS Glue and Kiro. The role involves optimizing data pipelines and evolving the data platform toward decentralized architectures.
Data Governance Specialist
Bluetab, an IBM Company
·
Full Time
·
2 months ago
Bluetab, an IBM Company
Design and implement data governance models and lead transversal initiatives with multiple stakeholders. The role involves managing data quality, metadata, and lineage within a large-scale Cloud Ecosystem project.
You will be responsible for building and optimizing data pipelines using ETL/ELT processes within an AWS environment. Additionally, you will manage data modeling, document end-to-end data lineage, and ensure the integrity of data structures.