Data Scientist, NLP & GenAI – Model Development

Data

Remote, New York

Kratos Growth

Full-time
B2B
Senior
Remote

Job description

Our client is hiring Data Scientists, NLP & GenAI – Model Development


Join an AI-Powered Consumer Intelligence Platform Delivering Insights for the World’s Biggest Brands


Company Background


We're an AI-powered consumer intelligence platform that transforms billions of data points -- Google searches, social conversations, product reviews, and videos -- into strategic insights for Fortune 500 clients in CPG, beverages, and personal care. We deliver in days what traditional research takes months to produce.


The Role


As a Data Scientist, you'll build NLP models and GenAI solutions that extract meaning from massive consumer datasets. Your work will power two core products in brand measurement and product innovation insights. You'll develop sentiment analysis, emotion detection, and LLM-based applications that directly inform multimillion-dollar marketing and product decisions for global brands. This is a remote position reporting to our CTO.


What You'll Do


• Develop and deploy NLP models for sentiment analysis, emotion detection, and text classification that process 1M+ documents daily

• Design and implement LLM-based solutions using prompt engineering, RAG architectures, and fine-tuning approaches

• Build evaluation frameworks to measure model accuracy, establishing baselines and tracking improvements

• Collaborate with data engineers and AI engineers to productionize models via API endpoints and batch pipelines

• Work with multi-language text data (English primary, with global market expansion)

• Translate business requirements into technical model specifications

• Document model architectures, performance metrics, and usage guidelines for cross-functional teams

• Iterate on models based on client feedback and production performance data


Technologies You'll Work With


• Core Stack: Python, SQL

• AI/ML: Sentiment analysis, emotion detection, LLMs, GenAI, prompt engineering

• NLP Libraries: Modern frameworks (Hugging Face Transformers, spaCy, NLTK, LangChain, LLM APIs)

• Cloud & Data: Azure ecosystem, Databricks, large-scale datasets

• Products: Brand measurement and product innovation insights


What We're Looking For


Education

• Bachelor's degree in Computer Science, Statistics, Mathematics, or related quantitative field


Experience

• 4+ years in data science/machine learning roles with production model development

• 2+ years focused primarily on NLP tasks (sentiment analysis, classification, topic modeling, or similar)

• 1+ years working with LLMs (prompt engineering, fine-tuning, or RAG implementations)

• Minimum 1 model deployed to production or demonstrably used to inform business decisions


Technical Skills

• Python: 3+ years in production ML environments

• ML/NLP libraries: Proficiency with at least 2 of (scikit-learn, Hugging Face Transformers, spaCy, NLTK)

• SQL: Intermediate level (JOINs, aggregations, window functions for datasets >1M rows)

• Version control: Git-based workflows including code reviews and collaborative development


Working Style

• Experience handing off models to engineering teams for deployment

• Track record of self-directed work with minimal daily supervision

• Clear written communication for technical documentation


Preferred (Nice to Have)

• Master's or PhD in relevant quantitative field

• Experience with Azure cloud platform and/or Databricks

• LangChain or similar LLM orchestration framework experience

• PySpark or distributed computing experience

• Multi-language NLP experience (processing text beyond English)

• Consumer insights, CPG, or marketing analytics industry background

• Emotion detection model development specifically

• MLOps experience (model monitoring, A/B testing, automated retraining)


What We Offer


• Competitive base salary (with performance bonus + equity participation opportunities)

• Fully remote (4 hours of overlap with the US Eastern time zone desired)

• Environment: Greenfield development—modern tech stack, no legacy code constraints

• Impact: Your models directly power products used by Fortune 500 clients

• Growth: Work with cutting-edge GenAI and LLM technologies

• Team: Collaborate with senior data scientists, data engineers, and AI engineers in a flat structure





Tech stack

• Gen AI: advanced

• Azure: advanced

• Databricks: advanced

• SQL: advanced

• Python: advanced

• Hugging Face: regular

Published: 21.01.2026
