1 month ago

Blog Details

  • Home
  • Sr. Data Scientist

Sr. Data Scientist

1 month ago

We are looking for a Sr. Data Scientist whose primary responsibilities are listed below:

Job Responsibilities:

  • Assembling large, complex sets of data that meet non-functional and functional business requirements
  • Identifying, designing, and implementing internal process improvements including re-designing infrastructure for greater scalability, optimizing data delivery, and automating manual processes
  • Building required infrastructure for optimal extraction, transformation and loading of data from various data sources using AWS and SQL technologies
  • Building analytical tools to utilize the data pipeline, providing actionable insight into key business performance metrics including operational efficiency and customer acquisition
  • Working with stakeholders including data, design, product and executive teams and assisting them with data-related technical issues
  • Working with stakeholders including the Executive, Product, Data and Design teams to support their data infrastructure needs while assisting with data-related technical issues
  • Track record of leading and collaborating on strategic initiatives
  • Strong team and business communication skills – can walk through approach with technical clients or explain tricky concepts to non-technical people
  • Experience with big data tools: Hadoop, Spark, Scala, Kafka, etc.
  • Experience with data pipeline: Airflow, etc.
  • Ability to collaborate with people at all levels and with multi-office/region teams
  • Must thrive in a fast paced yet sometimes ambiguous environment and be able to work independently

Job Requirements:

  • Around 9+ years of Experience in application development using Python, JSON, Json Schema, GitHub, JIRA, Jenkins, MongoDB, PostgreSQL, MySQL, and Redis.
  • Extensively used open-source tools for analysis: PyCharm (Professional), JupyterLab/Notebook
  • Familiar with JSON based REST Web services and Amazon Web Services (AWS).
  • Experience in Python Development and Scientific Programming and using NumPy and Pandas in Python for Data Manipulation.
  • Proficient database development and management experiences. Write an aggregate query in MySQL, PostgreSQL databases, Cloud based AWS (EC2, S3, Lambda, DynamoDB, Cloud watch, CloudFormation, ECS Fargate, CloudTrail)
  • Ability to build and optimize data sets, ‘big data’ data pipelines and architectures
  • Ability to perform root cause analysis on external and internal processes and data to identify opportunities for improvement and answer questions
  • Excellent analytic skills associated with working on unstructured datasets
  • Ability to build processes that support data transformation, workload management, data structures, dependency, and metadata