Work with large datasets, performing data cleaning, feature engineering, and data exploration.
Utilize tools such as Pandas, NumPy, and PySpark to process and analyze structured and unstructured data.
Utilize Docker, Kubernetes, and cloud platforms.°Proficiency in Python, R, or Java. Experience with machine learning libraries.
Strong understanding of supervised and unsupervised learning, reinforcement learning.
Experience with large datasets°