We are looking for a Data Engineer with 3 years of expreience that will work on the collecting, storing, processing, and analyzing of huge sets of data. The primary focus will be on choosing optimal solutions to use for these purposes, then maintaining, implementing, and monitoring them. You will also be responsible for integrating them with the architecture used across the company.
Responsibilities
- Selecting and integrating any Big Data tools and frameworks required to provide requested capabilities - Implementing ETL process like importing data etc. - Implementing Data API transformations using Spark and Hadoop cluster
Skills and Qualifications
- Proficient in Python programming - Working knowledge with LLM's should be good to have Skill set - Proficient understanding of distributed computing principles - Experience with Spark SQL, Python, Spark Data frames. - Experience with integration of data from multiple data sources - Experience with SQL and NoSQL databases such as MySQL, Cassandra, MongoDB - Good understanding of Lambda Architecture/Functional programming, along with its advantages and drawbacks