Data Engineer with GCP

Data

Data Engineer with GCP

Data
Centrum, Warsaw

emagine Polska

Full-time
Any
Mid
Remote

Job description

Language: English (fluent), German (nice to have)

Start date for assignment: ASAP

Duration: 6 months + extensions

Workload: Full-time

Location: 100% remote

Summary

The role focuses on enhancing data-driven personalization for strategic digital products. The primary objective is to design and implement scalable data architectures on Google Cloud to improve user engagement, retention, and monetization through relevant content delivery.

Main Responsibilities

  • Design and implement scalable data pipelines using Google Cloud technologies (e.g., BigQuery, Dataflow, Pub/Sub).

  • Develop and optimize both batch and streaming data pipelines for personalization use cases.

  • Transform and provision user and content-related data for machine learning applications.

  • Implement data quality checks and establish logging and monitoring mechanisms.

  • Optimize existing data processing workflows for improved performance.

  • Integrate machine learning models into production workflows.

  • Document data structures, transformation logic, and relevant interfaces.

  • Provide technical consulting on event structures, tracking, and data architecture within the personalization framework.

Key Requirements

  • Experience with Google Cloud Platform (GCP).

  • Proficient in Python and FastAPI.

  • Strong skills in dbt, SQL, and BigQuery.

  • Familiarity with data pipeline frameworks like Airflow.

  • Understanding of Kubernetes and Docker for containerization.

Nice to Have

  • Experience with Vertex AI for machine learning initiatives.

  • Knowledge of Terraform for infrastructure as code.

  • Experience with CI/CD practices, particularly in GitLab.

  • Familiarity with project tracking tools like JIRA and Confluence.

Other Details

  • Team: Personalization Team in a digital product context.

  • Environment: Data-driven and cloud-centric.

  • Collaboration Tools: Microsoft Teams and Projektron Time Recording.

Tech stack

    English

    B1

    Documentation

    advanced

    Atlassian JIRA

    advanced

    training

    advanced

    Use Cases

    advanced

    Artificial Intelligence (AI)

    advanced

    Atlassian Confluence

    advanced

    SQL

    advanced

    Python

    advanced

    Project Management

    advanced

    CI/CD

    advanced

Office location

Data Engineer with GCP

Summary of the offer

Data Engineer with GCP

Centrum, Warsaw
emagine Polska
By applying, I consent to the processing of my personal data for the purpose of conducting the recruitment process. Informujemy, że administratorem danych jest emagine z siedzibą w Warszawie, ul.Domaniewskiej 39A (dalej jako "administrator"). Masz pr... MoreThis site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.