Hands on experience in Python, Spark, SQL and AWS.
Experience on managing, maintenance, troubleshooting issues in Databricks clusterGood experience working on AWS cloud environment Add / Remove EC2 nodes as needed, and configure services to the cluster
Configuration Performance tuning of Databricks cluster / nodes
Experience setting up optimizing cluster configurations for MapReduce, Spark, Hive, Zeppelin, etc.
Experience with Backup DR procedures
User management provide access via LDAP / Ranger
Debugging knowledge of YARN. Hands-on with analyzing various Hadoop log files, compression, encoding, file formats, etc.
Monitoring / Alert services such as AWS SES, etc.
Expert in Linux shell scripting. Python scripting experience preferred