4+ years’ experience in data-based decision-making or quantitative analysis
• Extracting and aggregating data from large data sets using SQL/Hive or Spark
• Analyzing large data sets using programming languages such as Python, R, SQL and/or Spark
• Generating and visualizing data-based insights in software such as Tableau or Power BI
• Proficiency in data analytics, data mining, machine learning, and related fields.
• Knowledge of Text mining, Natural Language Processing, Pattern recognition, Deep Learning, Graph algorithms required
• Proficiency in Python, Jupiter notebooks required
• Proficiency in IBM Watson APIs and Opensource ML/AI frameworks like Tensorflow, Pytorch etc nice to have
• Experience in building inference model APIs and scaling APIs preferred
• Knowledge of Hadoop and components like Hive, Oozie, Sqoop, Airflow, Nifi, Spark
• Knowledge of Kafka, Airflow, Apache Flint, Spark Streaming, Microservices, Building data products nice to have
• Knowledge of Data engineering (ETL and ELT) and Data warehousing concepts nice to have
• Knowledge of relational and NoSQL, Graph database concepts, systems architecture, and data structures
• Proficiency in Linux, Windows, Databases (Oracle, Sql server), SQL, Graph database(Neo4j)
• Excellent communication skills
Check similar offers