Standard Template (New Job)

 Posted 2 days ago
     
 $115K - $125K per year
  
2-5 years experience
Apply Now

Please mention DailyRemote when applying

AI Summary

Design and maintain scalable RAG/CAG pipelines and document ingestion workflows to power AI-driven applications. Collaborate with cross-functional teams to optimize retrieval performance and ensure data security within federal cloud environments.

Standard Template (New Job)

Department: Data

Employment Type: Full Time

Location: Remote

Compensation: $115,000 - $125,000 / year



Description

At Nüvitek, customer success is our Ethos; together, we drive transformational outcomes. We only succeed when our customers succeed. We partner with our customers to achieve business objectives by using our proven customer-centric, value-driven business practices and service delivery methodologies.

Nüvitek is seeking a highly skilled Data Engineer to support the design, development, and optimization of advanced AI and data processing solutions. This role will focus on building scalable data pipelines that power large language model (LLM) applications, including retrieval systems, document ingestion workflows, and intelligent search capabilities.

The ideal candidate has hands-on experience with retrieval-augmented generation (RAG), contextual augmentation generation (CAG), OCR processing, vector databases, and modern AI data architectures. This role requires strong technical expertise, problem-solving skills, and the ability to work collaboratively within agile pod-based teams.




In assuming this position, you will be a critical contributor to meeting Nuvitek's mission: To deliver innovative, cost-effective solutions and services that enable our customers to rapidly adapt to dynamic environments.


What You Will Do

  • Design, develop, and maintain scalable RAG/CAG pipelines for AI-powered applications
  • Build and optimize document ingestion workflows for structured and unstructured data sources
  • Manage and maintain vector stores to support semantic search and retrieval capabilities
  • Develop OCR processing pipelines for historical and modern document collections spanning 1781–2025
  • Optimize retrieval performance, relevance tuning, and ranking strategies for LLM-based systems
  • Build reliable data pipelines that support integrations with large language models and AI services
  • Collaborate with engineers, UX teams, product owners, and stakeholders to deliver scalable AI solutions
  • Ensure data quality, integrity, security, and performance across ingestion and retrieval systems
  • Implement monitoring, logging, and troubleshooting for AI and data processing workflows
  • Contribute to architecture decisions, technical documentation, and engineering best practices
  • Participate in agile pod-based development teams and continuous improvement initiatives



What You Will Bring

  • 4+ years of experience in data engineering, data platform development, or AI/ML infrastructure
  • Strong experience building RAG and/or CAG pipelines
  • Hands-on experience with vector databases and semantic retrieval systems
  • Experience developing document ingestion and OCR processing workflows
  • Strong understanding of LLM integrations and AI data pipeline architectures
  • Experience working with structured, semi-structured, and unstructured datasets
  • Proficiency with Python and modern data engineering frameworks
  • Familiarity with APIs, ETL/ELT pipelines, and distributed processing systems
  • Experience building and operating data pipelines in secure federal cloud environments, including FedRAMP Moderate and Zero Trust architectures, with appropriate handling of sensitive data and Controlled Unclassified Information (CUI)
  • Ability to obtain and maintain a federal Public Trust (or higher) clearance
  • Strong analytical, troubleshooting, and performance optimization skills
  • Ability to work effectively in agile or pod-based delivery environments
  • Excellent communication and collaboration skills
  • Experience working with historical archives or large-scale document digitization efforts
  • Familiarity with cloud-native data platforms and AI infrastructure
  • Experience with search relevance tuning and ranking optimization
  • Knowledge of embedding models, chunking strategies, and retrieval optimization techniques
  • Experience with containerization and orchestration technologies such as Docker and Kubernetes
  • Familiarity with accessibility, governance, and secure data handling practices
  • Passion for building scalable AI-driven solutions that improve user experiences and operational efficiency




Benefits

Nuvitek is proud to offer a comprehensive benefits package:
  • Medical Insurance
  • Dental Insurance
  • Vision Insurance
  • Disability and Life Insurance
  • Parental Leave
  • 401K
  • Paid Time Off
Equal Opportunity Employer Statement
Nuvitek is an equal-opportunity employer as to all protected groups, including protected veterans and individuals with disabilities.

Similar Jobs

See all Remote Others jobs →

Personalize your Remote Job Search in 3 Easy Steps!

Discover remote opportunities in Others

Answer easy questions

Answer easy questions

200,000+ jobs across 15+ categories

Get your best job matches

Get your best job matches

Only hand-screened, legit jobs

Find a remote job faster

Find a remote job faster

No ads, scams, or junk

I was the first applicant for a remote marketing position that got listed on the company website the same day I applied. Had an interview within 48 hours!

Sarah J. — Sarah J. · Marketing Manager ★★★★★ Verified