Senior Data Engineer (PySpark, NoSQL)
We are seeking a Senior Data Engineer with strong expertise in Azure and PySpark, skilled in designing, implementing, and maintaining robust data processing solutions. This role focuses on building scalable, production-grade data systems, ensuring reliability, and optimizing performance in distributed environments.
Responsibilities
Design and optimize large-scale data pipelines using PySpark
Build and maintain scalable ETL/ELT workflows in Azure
Troubleshoot production issues related to performance, latency, and availability
Work with distributed NoSQL technologies (e.g., Cosmos DB, Cassandra, DynamoDB, MongoDB, or similar)
Optimize Spark jobs (partitioning, execution plans, resource usage)
Implement best practices for scalability, security, and reliability
Collaborate with cross-functional teams on data-driven solutions
Contribute to automation, CI/CD, and operational improvements
Requirements
5+ years of experience as a Data Engineer or similar role
Strong hands-on experience with PySpark in production
Proven experience in data modeling, partitioning, indexing, and performance tuning in NoSQL systems
Strong programming skills in Python
Experience building and operating production-grade pipelines in cloud (Azure)
Experience with distributed NoSQL databases (e.g., Cosmos DB, Cassandra, DynamoDB, MongoDB)
Strong understanding of distributed systems and performance optimization
Experience with CI/CD, monitoring, troubleshooting, and production support
Strong analytical and communication skills (English B2+)
Nice to have
Experience with real-time / streaming data
Exposure to Data Science workflows
Knowledge of Big Data ecosystems
Experience with financial data
Familiarity with AI-assisted development or LLM tools
We offer/Benefits
We gather like-minded people:
Engineering community of industry professionals
Friendly team and enjoyable working environment
Flexible schedule and opportunity to work remotely within Poland
Chance to work abroad for up to 60 days annually
Business-driven relocation opportunities
We provide growth opportunities:
Outstanding career roadmap
Leadership development, career advising, soft skills, and well-being programs
Certification (GCP, Azure, AWS)
Unlimited access to LinkedIn Learning, Get Abstract, Cloud Guru
English classes
We cover it all:
Stable income (Employment Contract or B2B)
Participation in the Employee Stock Purchase Plan
Benefits package (health insurance, multisport, shopping vouchers)
Strategically located offices featuring entertainment and relaxation zones, table tennis and football, free snacks, fantastic coffee, and more
Referral bonuses
Corporate, social and well-being events
Please, note:
The set of bonuses might vary based on the role you apply for – specifics will be discussed with our recruiter during the general interview.
We will reach out to selected candidates exclusively.
EPAM is a leading global provider of digital platform engineering and development services. We are committed to having a positive impact on our customers, our employees, and our communities. We embrace a dynamic and inclusive culture. Here you will collaborate with multi-national teams, contribute to a myriad of innovative projects that deliver the most creative and cutting-edge solutions, and have an opportunity to continuously learn and grow. No matter where you are located, you will join a dedicated, creative, and diverse community that will help you discover your fullest potential.
Senior Data Engineer (PySpark, NoSQL)
Senior Data Engineer (PySpark, NoSQL)