Principal Data Scientist with strong Software Engineering expertise
Job overview
dotData is hiring a high-caliber expert who is excited to democratize data science with automation. You will work on dotData’s proprietary core engine.
Our advanced AutoML component automatically explores and evaluates state-of-the-art ML models with automated hyper-parameter optimization and model selection. Logical plan is divided into computationally-intensive jobs that are executed in parallel with strict resiliency requirements. We take advantage of industry-standard machine learning libraries, like sklearn, XGBoost, LightGBM, TensorFlow, and Pytorch.
Things you will do:
Work closely with DS/ML researchers, propose new algorithms and optimizations to existing ones for both Machine Learning and Feature Engineering engines
Prototype, test and implement chosen solutions
Support other developers in tasks requiring DS/ML expertise
Challenge engineers by looking at their deliverables from the product perspective
Job requirements
Non-technical:
Startup experience is nearly required. Knowing what that means and seeking it out is required. This includes being able to seek out answers and fill in the blanks, proposing new ways of approaching problems, and being able to handle all sorts of different projects.
You take ownership, end-to-end, of the features that are your responsibility.
You work collaboratively, and can do so in a global multi-cultural environment.
You are willing to learn, and will not be afraid to jump into something with which you have no prior experience.
Technical:
You are expert in Data Science and Machine Learning
You were a key implementor coding, testing, and shipping multiple Scala or Java-based enterprise-grade products.
You write clean, maintainable code using the best software engineering practices.
You do not compromise on quality and you write tests to guarantee it.
You are proficient in Scala and its ecosystem.
A very strong Java programmer in the other technologies might work out, too.
Strong CS skills including such things as time / space complexity, data structures, functional programming, understanding of operating systems...
CS Master’s or equivalent.
Preferred:
Working knowledge of DevOps platforms such as Jenkins, Github, JIRA, etc.
Foundation in distributed computing.
Expertise in Python is a plus.
What we’re offering:
Direct impact on core engine of our key product (both feature and model engineering automation)
Possibility to work with key people in our company (DS research team, key engineers, customer facing team)
Becoming a co-owner of the company your are about to build
Powerful hardware, convenient setup with many monitors
Friendly atmosphere, transparent communication
Flexible working hours: we understand if you have to leave at 3pm or prefer to work in the evenings
Convenient office at the very heart of Warsaw with access to both metro lines :)
Remote only option:
2-4 week onboarding in Warsaw
then monthly visits for 2-4 days in Warsaw
after 6 months of cooperation, it will be possible to work 100% remotely
About dotData
dotData is a Silicon Valley based startup focused on full-cycle Machine Learning and Data Science automation. Our platform automates the entire process of building predictive models starting from raw business data through data and feature engineering to machine learning all the way to production. We have offices in the USA, Japan, and Poland. Fortune 500 organizations around the world use dotData to accelerate their ML and AI projects.
Unique to the dotData Platform is its AI-powered feature engineering, which eliminates the most time-consuming and labor- and skill-intensive aspects of the full data science process by discovering and evaluating millions of features derived from relational, transactional, temporal, geo-locational, or text data. This technology enabled us to become one of three leaders in the AutoML area as assessed by Forrester - an independent research & consulting company.
dotData stemmed from Dr. Ryohei Fujimaki’s experience in leading more than 100 data analysis projects at NEC, across a variety of industries. Prior to founding dotData, he was the youngest research fellow ever appointed in the 119-year history of NEC, an honor given to only six individuals worldwide among NEC’s 1000+ researchers.