Linguist – Data Specialist - Portugese Speaking - Warsaw - Hybrid
We are seeking a detail-oriented Linguistic Data Specialist to support the development and evaluation of language technologies and internationalisation (i18n) solutions. In this role, you will work with linguistic datasets, perform data annotation and validation, and contribute to the continuous improvement of language models and related tools.
Key Responsibilities
Collect, prepare, and annotate linguistic data used for training, testing, and evaluating language models and other language technology applications.
Review and validate annotated datasets to ensure high levels of accuracy, consistency, and quality.
Maintain comprehensive documentation, including annotation guidelines, data sources, workflows, and project progress.
Collaborate with cross-functional teams to support linguistic analysis and data management initiatives.
Continuously expand knowledge of annotation methodologies, linguistic frameworks, data management practices, and emerging language technology tools.
Identify and escalate data quality issues, contributing to process improvements and best practices.
Qualifications
Required
Strong understanding of core linguistic disciplines, including syntax, semantics, pragmatics, sociolinguistics, corpus linguistics, and related areas.
Experience with linguistic data annotation, corpus management, or language data analysis.
Excellent attention to detail and strong organizational skills.
Familiarity with at least one data annotation platform or annotation tool.
Experience working with data analysis and database query tools, such as SQL, spreadsheets, or similar technologies.
Proficiency in Python for data processing, analysis, or automation tasks.
Preferred
Experience supporting machine learning, natural language processing (NLP), or AI-related projects.
Familiarity with multilingual datasets and internationalization workflows.
Experience working in a research, technology, or language services environment.
What You'll Bring
A passion for language, data quality, and emerging language technologies.
Strong analytical and problem-solving skills.
The ability to work independently while collaborating effectively within a team environment.
A commitment to accuracy, consistency, and continuous learning.
Linguist – Data Specialist - Portugese Speaking - Warsaw - Hybrid
Linguist – Data Specialist - Portugese Speaking - Warsaw - Hybrid