SHARE THIS PROJECT

Go Back

Project Details

Consultant - Python Data Engineer

A global management consulting firm

 Mumbai / Navi Mumbai, Delhi / NCR, Kolkata, Chennai, Bangalore, Pune, Hyderabad

Posted on:  31/01/2022

Cinque Terre

Starts on:  14/02/2022

BROAD CATEGORY

Information Technology

SUB CATEGORY

Business Analysis, Software Development - Python, Data Science,

INDUSTRY

Software

Company Details

A global management consulting firm

Assignment Details

Our client is a top consulting firm and is looking for a Python expert.
The consultant will be responsible for building, optimizing, maintaining data pipelines into and within RL-based personalization engine environment (e.g. creating API connection between RL-based personalization engine and other systems)

More details will be provided by the client during discussions.

Skills Required

Must haves:
• Strong code development practices in Python >=3.7 with high amount of rigor and high code standards
• Experience in quality assuring data engineering code, e.g., by reviewing pull requests
• Strong capabilities in data management using – Relational methods/systems (SQL),
– Object storage/big data approaches (AWS S3/HDFS/Azure Data Lake),
– Distributed computing frameworks (such as Apache Spark)
• Strong capabilities in data storage layer design, in the physical (data asset organization, data type choice, data compression, data formats) and logical sense (data cardinality and normal forms, primary/foreign key relationships, integrity constraints)
• Strong capabilities in PySpark, covering data management and performance tuning, at data scale >1TB
• Experienced in code versioning and release management through git, e.g., following the gitFlow approach
• Experienced in unit testing, static code analysis/code linting, using e.g., pytest, flake8, black, isort
• Hands-on experience with a workflow orchestrator, preferrably Apache Airflow
• Basic knowledge of DevOps/cloud native approaches–
minimally Docker, ideally Kubernetes, Terraform

Nice-to-have:
• Experienced in Python based machine learning, using sci-kit learn, preferably also Spark ML

Assignment Duration

6 month(s)

Capacity Required

Full Time

No. of Positions

1

Nature of Work

Remote

Profile Requirements

Experience: 10+ years

Qualification : Graduate

fleXpertise required

SQLPythonData ScienceRelational methodsPySparkcoding

ESTIMATED BUDGET (Total Budget)

-

 

info@flexingit.com | Terms of use | Privacy policy | Contact us
©2018 Flexing It® Services Private Limited. All Rights Reserved.

× We use cookies to ensure that we give you the best experience on our website. However, if you would like to change your cookie settings, please use your browser settings.