Support the development efforts of the engineering and product development teams which results in a stable environment with speed to market.
Provide technical support and system engineering to infrastructure servers that support development, test, and production environments.
Meet regularly with engineering and Ops, provide status reports and notify company management of new tasks, accomplishments, risks, deadlines and changes in customer priorities.
Automate tasks for the infrastructure by developing custom scripts.
Maintain documentation of all scripts and system tailoring that is deployed
Provide proactive support that anticipates system operation and availability.
Provide Tier-1 technical support including triaging issues, implement corrective procedures and escalate problems as required.
Perform in a Cloud/System administrator and system engineer role as required for production support and new infrastructure initiatives.
Provide recommendations to engineering team related to ongoing operational support of environments at all phases of the engineering process.
Maintain disciplined change management procedures for all environments and supporting tools.
Qualifying Requirements:
Bachelors Degree in Computer Science or an Information Technology related field, and/or related experience.
At least 6 years of experience in Linux server administration
Proficiency in Linux/Unix system administration, with certifications or relevant experience (Ubuntu Linux specifically)
Experience working with public clouds like AWS, Azure, or Google
Experience using Puppet, Jenkins, and cobbler
Experiencing using virtualization technologies suc h as LXC/LXD, VMWare
Experience working with containerization/micro services like Docker and Kubernetes
Experience working with Prometheus and Alert manager
Knowledge of databases such as Elasticsearch and Postgresql
Knowledge of Shell Scripting, Python,
Ability to meet deadlines and adjust to changing priorities to meet business goals.
Ability to work in a multi-tasking / matrixed environment
Capable of working with and without direct supervision.
Excellent troubleshooting skills.
Strong oral and written communications skills.
Effective interpersonal skills.
Strong customer service orientation.
Participate in on-call rotation
Preferred Experience:
Experience in start-up company environment a plus.
Knowledge of big data technology such as Hadoop, Spark etc
Cyber Intelligence Security or networking experience strongly desired.
Familiarity with Prometheus, Alertmanager, Ansible is desired.