RANDSTAD INDIA PVT LTD | Jobs | BIG DATA/ DATA Engineer – 30 days To Immediate Joiners (Only) | BigDataKB.com | 03-02-22

โ€”

by

Job Location: Bangalore/Bengaluru( Electronics City Phase 1 )

Opening with one of the Top Pharma companies for an IT division for a permanent role with them.

Job Title: Data Engineer – Enterprise Big Data Platform !!!

In this role, you will be with a growing, team of data engineers, who collaborate in DevOps mode, to enable Life Science business with brand-new technology to bring to bear data as an asset and to take better informed decisions.

The Life Science Data Engineering Team is responsible designing, developing, testing, and supporting automated end-to-end data pipelines and applications on Life Sciences data management and analytics platform (Palantir Foundry, Hadoop, and other components).

The Foundry platform comprises multiple different technology stacks, which are hosted on Amazon Web Services (AWS) infrastructure or on-premises own data centers. Developing pipelines and applications on Foundry requires:

Roles & Responsibilities:

  • Develop data pipelines by ingesting various data sources structured and un-structured into Palantir Foundry. Participate in end-to-end project lifecycle, from requirements analysis to and operations of an application
  • Acts as business analyst for developing requirements for Foundry pipelines. Review code developed by other data engineers and check against platform-specific standards, cross-cutting concerns, coding and configuration standards and functional specification of the pipeline. Document technical work in a professional and transparent way. Create high quality user documentation. Work out the best possible balance between technical feasibility and business requirements (the latter can be quite strict)
  • Deploy applications on Foundry platform infrastructure with clearly defined checks. Implementation of changes and bug fixes framework and according to system engineering practices (additional training will be provided). Besides working on projects, act as third level support for critical applications; analyze and resolve sophisticated incidents/problems. Debug problems across a full stack of Foundry and code based on Python, Pyspark, and Java

Professional Experience

  • We need someone with 5+ years of experience in system engineering or software development
  • We need someone with 3+ years of experience in engineering with ETL type work with databases and Hadoop platforms.

Skills

Hadoop General

Deep knowledge of distributed file system concepts, map-reduce principles, and distributed computing. Knowledge of Spark and differences between Spark and Map-Reduce. Familiarity of encryption and security in a Hadoop cluster.

Data management / data structures

Must be proficient in technical data management tasks, i.e., writing code to read, transform and store data

Spark

Experience in launching spark jobs in client mode and cluster mode. Familiarity with the property settings of spark jobs and their implications to performance.

Must have experience in using REST APIs

SQL

Must be an expert in manipulating database data using SQL. Familiarity with views, functions, stored procedures and exception handling.

Aws

General knowledge of AWS Stack (EC2, S3, EBS, )

Apply Here

Submit CV To All Data Science Job Consultants Across India For Free

๐Ÿ” Explore All Related ITSM Jobs Below! ๐Ÿš€ โœ… Select your preferred “Job Category” in the Job Category Filter ๐ŸŽฏ ๐Ÿ”Ž Hit “Search” to find matching jobs ๐Ÿ”ฅ โž• Click the “+” icon that appears just before the company name to see the Job Detail & Apply Link ๐Ÿ“๐Ÿ’ผ

Comments

Leave a Reply

Your email address will not be published. Required fields are marked *