We are seeking a Senior Data Scientist with expertise in machine learning and LLM evaluation to design and execute research on how humans assess LLM features. The ideal candidate will have a strong background in statistical programming, human data collection, and research methodologies, as well as the ability to work autonomously on impactful research problems.
Responsibilities
- Work with technical and non-technical stakeholders to design and conduct research on LLM evaluation methods.
- Develop and implement human evaluation studies and automated benchmark testing for LLM performance.
- Collect and analyze data from human participants (e.g., surveys, experiments), ensuring high data quality and validity.
- Own and drive a research agenda, selecting impactful problems and autonomously executing projects.
Requirements
- Strong understanding of machine learning principles, especially in the context of LLMs.
- Knowledgeable about LLM evaluation techniques, such as human evaluation and automated benchmarks.
- Can own and pursue a research agenda, including choosing impactful research problems and autonomously carrying out projects.
- Fluent in at least one statistical programming language such as Python (preferred) or R.
- Demonstrated background in collecting data from human participants (e.g., surveys, experiments) with knowledge about data quality, data validity, etc.
- Strong verbal and written communication skills with the ability to work effectively across internal and external organizations and virtual teams.
- PhD or advanced degree in computer science, machine learning, cognitive science, psychology, economics, or similar (preferred).
We offer
- Opportunity to work on bleeding-edge projects
- Work with a highly motivated and dedicated team
- Competitive salary
- Flexible schedule
- Benefits package - medical insurance, sports
- Corporate social events
- Professional development opportunities
- Well-equipped office
About us
Grid Dynamics (NASDAQ: GDYN) is a leading provider of technology consulting, platform and product engineering, AI, and advanced analytics services. Fusing technical vision with business acumen, we solve the most pressing technical challenges and enable positive business outcomes for enterprise companies undergoing business transformation. A key differentiator for Grid Dynamics is our 8 years of experience and leadership in enterprise AI, supported by profound expertise and ongoing investment in data, analytics, cloud & DevOps, application modernization and customer experience. Founded in 2006, Grid Dynamics is headquartered in Silicon Valley with offices across the Americas, Europe, and India.