Service Reliability Engineer I
Phenom
posted 3d ago
Flexible timing
Phenom Intro:
Our purpose is to help a billion people find the right work! Phenom is an AI-powered talent experience platform that is redefining the HR tech space. We have grown into a global organization with offices in 6 countries and over 1,500 employees. As an HR tech unicorn, innovation and creativity are in our DNA. Come help us make every talent moment Phenomenal!
Job Summary:
As the Site Reliability Engineer I at Phenom People, you will be responsible for driving the strategy, development, and launch of our product. You will work closely with cross-functional teams including engineering, design, sales, and marketing to deliver a best-in-class candidate experience that aligns with our company's vision and goals.
What You'll Do:
Work in a monitoring team of L1 resources covering all domains of the 24x7 IT environment (server, network, application, storage, and database)
Management of alerts raised by infrastructure elements
Management of alerts raised by Application Services
Perform daily health checks (network, servers, datacenter)
Knowledge of Windows, Linux, and network infrastructure
Perform operations based on the documented procedures
Assist in analysis of the reporting and alerts raised by various infrastructure devices
Fine-tune configuration to maintain the performance and functionality of the monitoring solutions in place
Roles and Responsibilities:
24x7 proactive monitoring of server, storage, backup, and network environment alerts via monitoring tools and email
Escalate to and follow up with the IT system admin team and the relevant application teams on pending high-priority trouble tickets
Prepare and maintain documentation and reports, and provide follow-up status on identified tasks
Escalate and report alerts on time according to the incident management process
Prepare daily and weekly reports in the agreed format and send them to the pre-assigned recipients
Send reports at the specified time and day, and inform the recipients of any delay caused by dependencies
Escalate incidents according to the standard procedure, provide rundown follow-up reports per team and area, and keep escalating incidents until closure
Maintain, update, and implement the standard escalation procedures, complete with notification matrix and escalation standards
What You've Done:
Good communication skills
Strong Linux administration skills across distributions (CentOS, Ubuntu, and Red Hat)
Troubleshooting skills for boot problems
Good skills in tracking incidents from logs
Good skills in shell scripting
Networking skills
Knowledge of web servers (Apache, Nginx, etc.)
File servers such as FTP, NFS, and Samba
Additional advantage: AWS and Azure cloud knowledge, Docker, Jenkins
Qualification:
Education: B.Tech or 16+ years of full-time education.
Work Experience: 2+ years of experience
Benefits:
We want you to be your best self and to pursue your passions!
Health and wellness benefits/programs to support holistic employee health
Flexible hours and working schedules, as well as parental leave for new parents
Growing organization with career pathing and development opportunities
Tons of perks and extras in every location for all Phenoms!
Diversity, Equity, Inclusion:
Our commitment to diversity runs deep! Diversity is essential to building phenomenal teams, products, and customer experiences. Phenom is proud to be an equal opportunity employer taking collective action to build a more inclusive environment where every candidate and employee feels welcomed. We recognize there is more to be done.
Employment Type: Full Time, Permanent