Cloudera Data Engineer
Mastek
posted 6hr ago
We are seeking a highly skilled and motivated Senior Cloudera Data Engineer with 8 to 9 years of experience to design, develop, and maintain data pipelines and infrastructure on the Cloudera platform. The ideal candidate will have a strong background in data engineering, hands-on experience with Cloudera tools, and a deep understanding of big data architectures, along with expertise in the Informatica toolset, such as Informatica DEI and Informatica PowerCenter.
Design, develop, and maintain scalable and robust data pipelines and ETL processes using Cloudera tools and frameworks (e.g., HDFS, Hive, Impala, HBase, Spark, Flink).
Design, develop, and implement complex ETL processes using Informatica PowerCenter.
Use Informatica DEI to integrate, transform, and manage large volumes of data across various data platforms, including Hadoop, Spark, and other big data technologies.
Collaborate with data architects, data scientists, and business stakeholders to understand requirements and deliver solutions that meet their needs.
Optimize and tune the performance of data processing jobs, ensuring efficient resource usage and fast execution times.
Implement and enforce best practices for data integration, data quality, data security, and governance.
Monitor, troubleshoot, and resolve data pipeline and infrastructure issues promptly, ensuring high availability and reliability.
Develop and maintain documentation related to data pipelines, processes, and infrastructure.
Stay up to date with the latest industry trends, technologies, and best practices in big data and cloud computing.
Provide technical guidance and mentorship to junior data engineers within the team.
Bachelor’s or master’s degree in Computer Science, Information Technology, Data Science, or a related field.
8-9 years of professional experience in data engineering, with a significant focus on Cloudera platforms.
Strong experience working on support engagements for the Cloudera data platform.
Strong hands-on experience with Cloudera tools and technologies such as Hadoop, Hive, Impala, HBase, Spark, Flink, and Kafka.
Experience in the design, development, and implementation of complex ETL processes using Informatica PowerCenter.
Experience using Informatica DEI to integrate, transform, and manage large volumes of data across various data platforms, including Hadoop, Spark, and other big data technologies.
Proficiency in programming languages such as Python, Java, or Scala.
Solid understanding of data modelling, data warehousing concepts, and big data architectures.
Experience with cloud platforms (e.g., AWS, Azure, GCP) and containerization (e.g., Docker, Kubernetes) is a plus.
Proven track record of successfully designing and implementing large-scale data processing systems.
Excellent problem-solving, analytical, and critical thinking skills.
Strong communication and interpersonal skills, with the ability to collaborate effectively with cross-functional teams.
Employment Type: Full Time, Permanent