Data Engineer, Web Scraping

 Posted 3 months ago
     
 $105K - $125K per year
  
2-5 years experience
Apply Now

Please mention DailyRemote when applying

AI Summary

The role involves designing, implementing, and optimizing end-to-end data pipelines for scraping and processing structured and unstructured data using GCP, alongside conducting ad hoc data collection for intelligence initiatives. Responsibilities also include preparing data via cleaning and transformation, contributing to API development, and collaborating with engineering teams to deliver insights and tools.

About 10a Labs: 10a Labs is the safety and threat-intelligence layer trusted by frontier AI labs, AI unicorns, Fortune 10 companies, and leading global technology platforms. Our adversarial red teaming, model evaluations, and intelligence collection enable engineering, safety, and security teams to stay ahead of evolving threats and deploy AI systems safely.

 

About 10a Labs: 

10a Labs is the safety and threat-intelligence layer trusted by frontier AI labs, AI unicorns, Fortune 10 companies, and leading global technology platforms. Our adversarial red teaming, model evaluations, and intelligence collection enable engineering, safety, and security teams to stay ahead of evolving threats and deploy AI systems safely.

 

In this role, you will:

  • Design, implement, and optimize end-to-end data pipelines for scraping and processing structured and unstructured data using Google Cloud Platform (or similar) and best practices; 
  • Conduct ad hoc web scraping and data collection to support research and intelligence initiatives;
  • Prepare data for further analysis, including data cleaning, transformation, anonymization, and masking;
  • Contribute to the development of internal and external APIs, following best practices; 
  • Collaborate with ML engineers, other data engineers, and software developers to deliver actionable insights and functional tools, including internal and external dashboards, APIs, and data dumps; and 
  • Drive other critical initiatives. 

Requirements:

  • Degree (or equivalent work experience) in Computer Science, Engineering, Information Science, Data Science or a related field (graduate degree preferred)
  • 2+ years of professional experience in data engineering or a closely related field
  • Ability to communicate complex technical ideas clearly to non-technical audiences
  • Proficiency in Python, SQL
  • Experience with web scraping/crawling (e.g., Beautiful Soup, Selenium, Scrapy)
  • Experience with Google Cloud Platform (or similar), including storage and database services (e.g., Cloud Storage, CloudSQL, Cloud Spanner) and workflow orchestration (e.g., Cloud Composer/Airflow, Cloud Run, Pub/Sub)
  • Experience building and managing data pipelines, especially for text data
  • Comfort working in fast-moving, high-impact environments, such as startups, AI research labs, or security-focused teams

Compensation & Benefits:

  • Salary Range: $105K–$125K, depending on experience and location
  • Bonus: Performance-based annual bonus
  • Professional Development: Support for conferences, continuing education, or leadership training
  • Work Environment: Fully remote, U.S.-based
  • Health Benefits: Comprehensive health, dental, and vision coverage
  • Time Off: Generous PTO and paid holiday schedule

Retirement: 401(k) plan

Similar Jobs

See all Remote Software Development jobs →

Personalize your Remote Job Search in 3 Easy Steps!

Discover remote opportunities in Data Engineer

Answer easy questions

Answer easy questions

200,000+ jobs across 15+ categories

Get your best job matches

Get your best job matches

Only hand-screened, legit jobs

Find a remote job faster

Find a remote job faster

No ads, scams, or junk

I was the first applicant for a remote marketing position that got listed on the company website the same day I applied. Had an interview within 48 hours!

Sarah J. — Sarah J. · Marketing Manager ★★★★★ Verified