Lead/principal Data Engineer - Big Data Tools

Job Title : Lead / Principal Data Engineer Experience : 7-16 years Work Location : Vizag (Visakhapatnam). Required Skill Sets : Big data (Hadoop, Spark, Scala, MapReduce, Hive, Flume, Pig, Java/ Python) Job Summary : - Come work as a Senior - Data Engineer at a growing company that offers great benefits with opportunities to advance and learn alongside accomplished leaders, For Leading Global investment firm in collaboration with technology services company Innova Solutions. - Innova Solutions is a global information technology company combining a global reach with a local touch. Headquartered in Santa Clara, California, Innova employs more than 1,800 technology professionals worldwide, with field offices in New York, Chennai, Bangalore, Hyderabad, Pune, and Taipei. - From Cloud Transformation to Data Services to Managed IT Operations, Innova provides a broad array of proven, tested, cost-effective and enterprise-scale technologies and services that leverage the latest technology and delivery models to deliver high value in the cloud, in the data center, and across complex interconnected environments.Position Overview : - Data engineers are mainly tasked with transforming data into a format that can be easily analyzed. They do this by developing, maintaining, and testing infrastructures for data generation. - Data engineers work closely with our data scientists and are largely in charge of architecting solutions for data scientists that enable them to do their jobs. What are we looking for- We are looking for an accountable, multi-talented Data Engineer to facilitate the operations of our Data Scientists. The Data Engineer will be responsible for employing machine learning techniques to create and sustain structures that allow for the analysis of data, while remaining familiar with dominant programming and deployment strategies in the field. - During various aspects of this process, you should collaborate with coworkers to ensure that your approach meets the needs of each project. - To ensure success as a Data Engineer, you should demonstrate flexibility, creativity, and the capacity to receive and utilize constructive criticism. A formidable Data Engineer will demonstrate unsatiated curiosity and outstanding interpersonal skills. Responsibilities and Duties : - Create and maintain optimal data pipeline architecture. - Assemble large, complex data sets that meet functional/non-functional business requirements. - Identify, design, and implement internal process improvements : automating manual processes, optimizing data delivery, re-designing infrastructure for greater scalability, etc. - Build the infrastructure required for optimal extraction, transformation, and loading of data from a wide variety of data sources using SQL and AWS 'big data' technologies. - Build analytics tools that utilize the data pipeline to provide actionable insights into customer acquisition, operational efficiency and other key business performance metrics. - Work with stakeholders including the Executive, Product, Data and Design teams to assist with data-related technical issues and support their data infrastructure needs. - Keep our data separated and secure across national boundaries through multiple data centers and AWS regions. - Create data tools for analytics and data scientist team members that assist them in building and optimizing our product into an innovative industry leader. - Work with data and analytics experts to strive for greater functionality in our data systems. Required Candidate profileRequired Qualifications for Data Engineer : - Advanced working SQL knowledge and experience working with relational databases, query authoring (SQL) as well as working familiarity with a variety of databases. - Experience building and optimizing 'big data' data pipelines, architectures and data sets. - Experience performing root cause analysis on internal and external data and processes to answer specific business questions and identify opportunities for improvement. - Strong analytic skills related to working with unstructured datasets. - Build processes supporting data transformation, data structures, metadata, dependency and workload management. - A successful history of manipulating, processing and extracting value from large disconnected datasets. - Working knowledge of message queuing, stream processing, and highly scalable 'big data' datastore. - Experience supporting and working with cross-functional teams in a dynamic environment. Requirements : We are looking for a candidate with 4+ years of relevant experience in a Data Engineer Position with the following technology stack : 1) Experience with big data tools : Hadoop, Spark, Kafka, etc. 2) Experience with relational SQL and NoSQL databases, including Postgres and Cassandra. 3) Experience with data pipeline and workflow management tools : Azkaban, Luigi, Airflow, etc. 4) Experience with AWS cloud services : EC2, EMR, RDS, Redshift 5) Experience with stream-processing systems : Storm, Spark-Streaming, etc. 6) Experience with object-oriented/object function scripting languages : Python, Java, C++, Scala, etc.


Key Skills
Java, Hive, Data Pipeline, RDBMS, Hadoop, Big Data, Spark, AWS, Machine Learning, Python, SQL

Job Summary

  • Published on: 2020-05-22
  • Salary: NA
  • Location: Visakhapatnam, Vishakhapatnam