AI Engineer (RAG & On Prem LLMs)

4 575 - 6 189 USDGross per month - Permanent
AI/ML

AI Engineer (RAG & On Prem LLMs)

AI/ML
., Warszawa

DCG

B2B Contract
Permanent
Mid
Hybrid
4 575 - 6 189 USDGross per month - Permanent

Job description

As a recruitment company, DCG understands that every business is powered by experienced professionals. Our management style and partnership approach enable us to meet your needs and provide continuous support. Due to our ongoing growth and the large number of recruitment projects we undertake for our partners, we are currently looking for:

AI Engineer (RAG & On Prem LLMs)

Responsibilities:

  • Architect, implement, and optimize end-to-end Retrieval Augmented Generation (RAG) pipelines for enterprise use cases in on-premises environments

  • Design and integrate retrieval mechanisms (e.g. vector databases such as Neo4j) with generative models (e.g. LLAMA 3.2, Mistral)

  • Fine-tune and optimize retrieval and generation components to achieve high accuracy and low latency

  • Implement and customize inference servers using vLLM and LiteLLM for efficient and scalable LLM serving

  • Integrate open-source large language models with proprietary data sources and enterprise APIs

  • Design GPU-optimized, scalable on-prem infrastructure for model training and inference, ensuring security and data governance compliance

  • Collaborate with DevOps teams to containerize workflows using Docker and Kubernetes and automate MLOps pipelines

  • Apply performance optimization techniques such as quantization, pruning, and dynamic batching

  • Monitor system performance, troubleshoot bottlenecks, and ensure high availability

  • Work closely with data engineers and business stakeholders to translate business requirements into technical AI solutions in telco environments

Requirements:

  • At least 3 years of professional experience in ML/NLP roles, including 2+ years working with RAG systems

  • Proven experience deploying and operating LLM‑based solutions in on‑prem or hybrid environments

  • Hands‑on experience with vLLM, LiteLLM, and open‑source LLMs such as LLAMA 3.2, DeepSeek, or Mistral

  • Strong Python skills and experience with frameworks such as PyTorch, Hugging Face Transformers, and LangChain

  • Experience with vector databases (e.g. Neo4j)

  • Familiarity with Linux‑based systems and Red Hat OpenShift

  • Strong problem‑solving and analytical skills

  • Ability to clearly communicate complex AI concepts to non‑technical stakeholders

  • Bachelor's, Master's, or PhD degree in Computer Science, Artificial Intelligence, or a related field

  • Knowledge of English (B2+/C1)

Offer:

  • Private medical care co-financing

  • Sports card

  • Training & learning opportunities

  • Life insurance co-financing

Tech stack

    English

    C1

    Machine Learning

    regular

    LLM

    regular

    PyTorch

    regular

    AI

    regular

    RAG

    regular

    Langchain

    regular

    Red Hat

    regular

    Deep Learning

    regular

    Python

    regular

    Hugging Face

    regular

Office location

AI Engineer (RAG & On Prem LLMs)

4 575 - 6 189 USDGross per month - Permanent
Summary of the offer

AI Engineer (RAG & On Prem LLMs)

., Warszawa
DCG
4 575 - 6 189 USDGross per month - Permanent
By applying, I consent to the processing of my personal data for the purpose of conducting the recruitment process. Informujemy, że administratorem danych jest DCG Sp. z o.o., ul. Towarowa 28, 00-839 Warszawa (dalej jako "administrator"). Masz prawo ... MoreThis site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Check similar offers
7N

7N

Warszawa

Hybrid

Hybrid

6 344 - 7 477USD/month
GenAI
Machine Learning
LLM
SQL
Python
Data Science
MidMidB2B, PermanentB2B, Permanent
New
ADVERTISEMENT: Recommended by Just Join IT
Salary
4 575 - 6 189 USD
Gross per month - Permanent
Applied -
12 day left (until 03.07.2026)
Applied -
Check similar offers
7N

7N

Warszawa

Hybrid

Hybrid

6 344 - 7 477USD/month
GenAI
Machine Learning
LLM
SQL
Python
Data Science
MidMidB2B, PermanentB2B, Permanent
New
Addepto

Addepto

Remote

Remote

3 436 - 5 268USD/month
Machine Learning
LLM
Cloud
Backend
AI
Software Development
Docker
Python
Generative AI
Data Science
MidMidB2BB2B
New
Devapo

Devapo

Warszawa

Remote

Remote

Undisclosed Salary
LLM
AI
MidMidB2BB2B
New
ITLT

ITLT

Poland (Remote)

Remote

Remote

38 - 52USD/h
LLM
PyTorch
RAG
Langchain
Python
MidMidB2BB2B
New
PAYBACK

PAYBACK

Warszawa

Hybrid

Hybrid

Undisclosed Salary
CI/CD
AIOps
AI
Vertex AI
GCP
Langchain
Langgraph
Python
MidMidPermanentPermanent
New
ADVERTISEMENT: Recommended by Just Join IT