Data Engineer (Code Mining & Telemetry)
About the Project
We are looking for a computer scientist with a strong engineering background to build the data foundation of our Client's (pharma industry) software portfolio efficiency initiative. This role focuses on extracting, structuring, and analyzing quantitative data from Client's code repositories (Gitlab, Github), artifact management systems and observability platforms (Grafana, Data Dog). You will implement the logic to programmatically determine relationships and enforce metadata standards across 90+ products and 50+ platform services.
Work Mode: remote
Start of the project: ASAP
Duration of the project: 1 year (with possibility of extension)
Job Responsibilities
Data Mining & Extraction: Mine code repositories to construct data-driven software metrics. Utilize Abstract Syntax Trees (AST) for representation of structure of source code. Conduct analysis of the code structure, to identify deep dependencies and usage patterns of platform libraries and web services.
Metadata Standardization: Implement validation logic within DevHub (Backstage) to design and mandate automation-critical metadata and enforce Canonical Internal Automation IDs across the ecosystem.
Graph Database Implementation: Architect and implement graph database solutions (using Gen AI/RAG techniques where applicable) to map "Known Unknowns" and complex dependency chains between Edge, AWS Cloud, and Foundation services.
Telemetry Integration: Map quantitative metrics to confirmed systems to visualize real-time adoption and performance of reusable assets.
Algorithm Development: Develop algorithms to calculate metrics based on defined acceptance criteria.
Expectations
Core Technical Stack: Advanced proficiency in Python, knowledge of JS, and extensive experience with Version Control Systems and CI/CD pipelines.
Computer Science Fundamentals: Good understanding of algorithms, data structures, and graph theory in the context of reverse engineering and static analysis. Strong grasp of algorithms, data structures, and graph theory (e.g. Abstract Syntax Trees) is mandatory.
Data Engineering: Experience mining large codebases running on premise and AWS cloud and integrating with Artifactory, Prometheus, and Splunk.
AI/ML: Familiarity with Gen AI core concepts, Graph RAG (Retrieval-Augmented Generation), and vector databases.
Documentation & Handoff: Proven ability to create comprehensive documentation (e.g., system architecture, data flow diagrams, operational runbooks) for seamless maintenance handoff to permanent teams.
Mindset: Exploratory mindset capable of defining technical solutions for data retrieval without explicit step-by-step instructions.
We offer
Ongoing support from a dedicated agent, taking care of your project continuity, client contact, necessary formalities, work comfort and development
Consultant Development Program – advice on growth planning based on the latest trends and market needs in IT, including consultations with agents and growth mentors
Access to 7N Learning & Development – a development and educational platform with webinars, a library of articles and industry reports, and regular invitations to one-time and recurring development events – technical, business, and lifestyle
Spectacular integration events, both for you (e.g., annual Kick-Off trip, Christmas parties, or Summer Olympics sports events) and for your loved ones (e.g., family picnics, movie premieres)
Professional development not only during the project – you can get involved in knowledge transfer to others within the 7N Services offering directed at 7N clients
Relationships and access to the knowledge of the most experienced IT experts in the market – the average professional tenure of our consultants in Poland is over 10 years
A complete benefits package, including funding for medical care, life insurance, sports cards for you and your loved ones, as well as discounts in stores in Poland and abroad
About 7N
Constantly searching for projects, difficult rate negotiations, lack of development support – sounds familiar? At 7N, you gain not only stability of contracts but also the personal involvement of a dedicated agent who ensures your professional comfort and continuous access to development initiatives.
Our mission is to provide stable and rewarding collaborations that drive your success as an IT expert and the success of our clients. We build long-lasting relationships based on Scandinavian values and 30 years of experience creating IT solutions for over 200 organizations.

7N
Ciągłe szukanie nowych projektów IT, negocjacje stawek, brak realnego wsparcia w rozwoju – brzmi znajomo? Znamy tę perspektywę, dlatego w 7N stworzyliśmy model współpracy, który zapewni Ci stabilność kontraktów, indywidu...
Data Engineer (Code Mining & Telemetry)
Data Engineer (Code Mining & Telemetry)