Software Engineer Java + Data (PySpark)

 Posted a month ago
     
5-10 years experience
Apply Now

Please mention DailyRemote when applying

AI Summary

Design and maintain scalable backend services and high-performance data processing pipelines using Java and Python. Collaborate with cross-functional teams to implement ML-driven features and optimize system reliability.

About Lineate

Lineate is a US-based international software development company with over two decades of experience.

From Intelligent Document Processing(IDP) and Agentic RAG systems to scalable cloud architectures, we turn complex ideas into real, measurable results.

We deliver AI-driven custom solutions for FinTech, HealthTech, AdTech, and beyond, empowering businesses to grow smarter, faster, and more efficiently.

Our expertise falls into three main categories:

  • Building Custom AI Solutions: Deploying high-impact, AI-enabled technology utilizing IDP, Agentic RAG.
  • Cloud and Data Infrastructure: Optimizing business operations with our data management and cloud computing solutions.
  • Team Augmentation: Providing specialized experts in FinTech, AdTech, and HealthTech to integrate seamlessly and accelerate project timelines.
  • Our goal is not just to build technology, but to build the future operating model for our clients.

 

Responsibilities

  • Design, develop, and maintain scalable backend services using Java and Python
  • Build and optimize data processing pipelines and APIs for high-performance applications
  • Collaborate with cross-functional teams to deliver reliable and efficient solutions
  • Improve system performance, scalability, and reliability
  • Work with large datasets to support search, recommendation, or ML-driven features
  • Contribute to architecture decisions and technical design
  • Write clean, maintainable, and well-documented code

 

Requirements (Must-have)

  • 6+ years of commercial software development experience
  • Strong hands-on experience with both Java and Python (primarily PySpark code)
  • Experience in designing, developing, and optimizing scalable data processing pipelines and backend APIs for high-performance applications
  • Solid understanding of backend development principles and system design
  • Experience working with APIs, microservices, and distributed systems

Nice-to-have

  • Databricks OR AWS EMR OR Hadoop
  • Search technologies experience, such as:

Lexical search (e.g., Solr, Elasticsearch)

Semantic search, vector search, or RAG-based systems

Search relevance tuning and optimization

  • Machine Learning experience, especially in:

Recommendation systems

User behavior prediction (e.g., click-through rate prediction, relevance estimation)

Practical ML application in production systems

We offer:

  • B2B contract with our US office
  • NY working hours (at least 6 hours overlap)

Similar Jobs

See all Remote Software Development jobs →

Personalize your Remote Job Search in 3 Easy Steps!

Discover remote opportunities in Software Engineer

Answer easy questions

Answer easy questions

200,000+ jobs across 15+ categories

Get your best job matches

Get your best job matches

Only hand-screened, legit jobs

Find a remote job faster

Find a remote job faster

No ads, scams, or junk

I was the first applicant for a remote marketing position that got listed on the company website the same day I applied. Had an interview within 48 hours!

Sarah J. — Sarah J. · Marketing Manager ★★★★★ Verified