Project Details

Consultant - Python Data Engineer

A global management consulting firm

 Mumbai / Navi Mumbai, Delhi / NCR, Kolkata, Chennai, Bangalore, Pune, Hyderabad

Posted on:  31/01/2022

Starts on:  14/02/2022

BROAD CATEGORY

Information Technology

SUB CATEGORY

Business Analysis, Software Development - Python, Data Science

INDUSTRY

Software

Company Details

A global management consulting firm

Assignment Details

Our client is a top consulting firm and is looking for a Python expert.
The consultant will be responsible for building, optimizing, and maintaining data pipelines into and within an RL-based personalization engine environment (e.g., creating API connections between the RL-based personalization engine and other systems).

More details will be provided by the client during discussions.

Skills Required

Must haves:
• Strong code development practices in Python >=3.7, with a high degree of rigor and high code standards
• Experience in quality assuring data engineering code, e.g., by reviewing pull requests
• Strong capabilities in data management using:
– Relational methods/systems (SQL)
– Object storage/big data approaches (AWS S3/HDFS/Azure Data Lake)
– Distributed computing frameworks (such as Apache Spark)
• Strong capabilities in data storage layer design, in both the physical sense (data asset organization, data type choice, data compression, data formats) and the logical sense (data cardinality and normal forms, primary/foreign key relationships, integrity constraints)
• Strong capabilities in PySpark, covering data management and performance tuning, at data scale >1TB
• Experienced in code versioning and release management through Git, e.g., following the Gitflow approach
• Experienced in unit testing and static code analysis/code linting, e.g., using pytest, flake8, black, isort
• Hands-on experience with a workflow orchestrator, preferably Apache Airflow
• Basic knowledge of DevOps/cloud-native approaches: minimally Docker, ideally Kubernetes, Terraform

Nice-to-have:
• Experienced in Python-based machine learning using scikit-learn, preferably also Spark ML

Assignment Duration

6 month(s)

Capacity Required

Full Time

No. of Positions

1

Nature of Work

Remote

Profile Requirements

Experience: 10+ years

Qualification: Graduate

fleXpertise required

Data Science, coding, SQL, Python, PySpark, Relational methods

ESTIMATED BUDGET (Total Budget)

-

 

