Data Engineer (Platform Intelligence)

Data

Data Engineer (Platform Intelligence)

Data
-, Lublin +4 Locations

Billennium

Full-time
B2B
Senior
Remote

Job description

Billennium is a global technology company with over 20 years of experience, committed to innovation and empowering businesses. As an employer, we offer a supportive, growth-focused environment where collaboration and creativity thrive. Join us to shape the future of technology together!

We are seeking a computer scientist with a strong engineering background to build the data foundation of our software portfolio efficiency Initiative. This role focuses on extracting, structuring, and analyzing quantitative data from our code repositories (Gitlab, Github), artifact management systems, and observability platforms (Grafana, Data Dog). You will implement the logic to programmatically determine relationships and enforce metadata standards across 90+ products and 50+ platform services.

Key Responsibilities

  • Data Mining & Extraction: Mine code repositories to construct data-driven software metrics. Utilize Abstract Syntax Trees (AST) for representation of structure of source code. Conduct analysis of the code structure, to identify deep dependencies and usage patterns of platform libraries and web services.

  • Metadata Standardization: Implement validation logic within DevHub (Backstage) to design and mandate automation-critical metadata and enforce Canonical Internal Automation IDs across the ecosystem.

  • Graph Database Implementation: Architect and implement graph database solutions (using Gen AI/RAG techniques where applicable) to map "Known Unknowns" and complex dependency chains between Edge, AWS Cloud, and Foundation services.

  • Telemetry Integration: Map quantitative metrics to confirmed systems to visualize real-time adoption and performance of reusable assets.

  • Algorithm Development: Develop algorithms to calculate metrics based on defined acceptance criteria.

Required Skills & Qualifications

  • Core Technical Stack: Advanced proficiency in Python, knowledge of JS, and extensive experience with Version Control Systems and CI/CD pipelines.

  • Computer Science Fundamentals: Good understanding of algorithms, data structures, and graph theory in the context of reverse engineering and static analysis. Strong grasp of algorithms, data structures, and graph theory (e.g. Abstract Syntax Trees) is mandatory.

  • Data Engineering: Experience mining large codebases running on premise and AWS cloud and integrating with Artifactory, Prometheus, and Splunk.

  • AI/ML: Familiarity with Gen AI core concepts, Graph RAG (Retrieval-Augmented Generation), and vector databases.

  • Documentation & Handoff: Proven ability to create comprehensive documentation (e.g., system architecture, data flow diagrams, operational runbooks) for seamless maintenance handoff to permanent teams.

  • Mindset: Exploratory mindset capable of defining technical solutions for data retrieval without explicit step-by-step instructions.

Perks and benefits (our offer)

  • Comprehensive benefits - enjoy Udemy for Business, private medical care, Multisport card, veterinary package, language lessons, and shopping vouchers.

  • Flexibility - adaptable working hours and remote/hybrid work options to suit your lifestyle & location.

  • Career growth - access opportunities for professional development and learning, including perks related to our official partnerships with global IT giants: Microsoft, AWS, Snowflake, Salesforce & more.

  • Global collaboration - work with a diverse, international team.

  • Innovative environment - be part of a forward-thinking and growth-oriented workplace.

  • Engaging community - Work with passionate professionals and participate in team-building events, hackathons, and CSR initiatives to make an impact beyond work.

  • Team-building events including our company tradition (annual company event in Mazury).

  • A pleasant surprise to start your journey with us in the form of a welcome pack.

Tech stack

    English

    B2

    Polish

    C2

    Code analysis

    master

    Python

    master

    Algorithms & Data Structures

    master

    Graph databases

    advanced

    GIT (Github / Gitlab)

    advanced

    CI/CD (np. GitLab CI, Jenkins)

    advanced

    Data Engineering

    advanced

    AWS

    regular

    Observability tools (Prometheus, Grafana, Datadog)

    regular

    RAG / GenAI

    regular

Office location

Data Engineer (Platform Intelligence)

Summary of the offer

Data Engineer (Platform Intelligence)

-, Lublin
Billennium
By applying, I consent to the processing of my personal data for the purpose of conducting the recruitment process. Informujemy, że administratorem danych jest Billennium S.A. z siedzibą w Warszawie, ul. Koszykowa 61 (dalej jako "administrator"). Mas... MoreThis site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.