Media.net - Senior Site Reliability Engineer (4-6 yrs)
Media.net Software Services (India)
posted 2mon ago
Flexible timing
Role : Site Reliability Engineer (SRE)
About Media.net :
Founded in 2011, Media.net is a leading global advertising technology company that develops innovative products for both publishers and advertisers. Since inception, Media.net has made substantial investments in its business and built one of the most comprehensive portfolios of advertising technology in the industry across search, mobile, display, native, local, products and video globally. Our platform and products are licensed & used by some of the largest publishers, ad networks and other ad tech companies worldwide.
Media.net has 1400+ employees in key operation centers across New York, Los Angeles, Dubai, Mumbai and Bangalore. Media.net's U.S. headquarters are in New York and its global headquarters are in Dubai.
What does the SRE team do :
SRE is a key part of a closely coupled, autonomous engineering team that works together to solve business problems. There are multiple SRE teams that cater to our business in the ad-tech space. A few of our ad-tech products include domain monetization, contextual monetization, ad quality, ad exchange, programmatic advertising, etc.
One of the teams is responsible for maintaining the brains behind our contextual advertising systems. Similarly, another team works on building a real-time Ad Quality system which provides various real-time bidding metrics and real-time ad-display decisions. Each SRE team gets to work on a breadth of applications, from low-latency, high-throughput web serving to large-scale data systems.
The SRE team has carefully crafted an internal platform that allows controlled but speedy innovation while crunching a quarter billion messages a day and leveraging multiple state-of-the-art algorithms to serve the most relevant ads on some of the leading news and information websites on the planet.
To support this platform, our infrastructure strives to meet industry standards, such as maintaining high availability and reliability, while allowing our distributed applications to serve tens of terabytes of information every hour. We use open-source technologies on commodity hardware and scale them beyond the scope of enterprise solutions, hosted in public cloud and co-located datacentres.
Role & Responsibilities :
Your team is focused on improving and promoting the availability, stability and performance of our infrastructure, systems and applications.
An SRE's responsibilities will include :
- Shaping the scope and expertise for SRE practices across the team.
- Building reliability and resiliency into our infrastructure, tools, services and processes in collaboration with our development team, and establishing practices for supporting and running them so that services stay highly available to our clients, easily supportable by our developers, and operable for the company.
- Driving design, implementation, and support of large-scale infrastructure. You and your team will participate in the design and implementation phases for new and existing products.
- Developing policies and procedures that improve overall platform stability, and participating in a shared on-call schedule.
Who should apply for this role :
- B.Tech/M.Tech or Equivalent in Computer Science, Information Technology, or a related field
- 4-6 years of experience in handling services in a large-scale distributed system.
- Deep understanding of network stack (e.g., TCP/IP, routing, network topologies and hardware, SDN, etc.)
- Deep understanding of modern software architectures, including load balancing, queueing, caching, failure modes of distributed systems, microservices and big data technologies.
- Excellent programming (Python, Go, Ruby or preferred scripting languages) and automation skills
- Ability to work independently and own problem statements end-to-end.
- Great communication, interpersonal and teamwork skills.
- Adaptable to work in a fast-paced environment and alter priorities as per business needs
You have expertise in one or more of the below tools/skills :
- Container orchestration technologies like Kubernetes and Mesos
- Virtualization platforms, either on-prem or cloud-based (we use OpenStack and AWS)
- Infrastructure as code (we use Puppet, Ansible and Terraform) and containerization tool sets (we use Docker)
- Data intensive applications and platforms like Kafka, Hadoop, Spark, ZooKeeper, Cassandra, PostgreSQL OLAP, Druid
- Relational databases like MySQL, Oracle, PostgreSQL etc.
- NoSQL databases like Redis, MongoDB, Cassandra, CouchDB etc.
- One or more CI tools like Jenkins, TeamCity
- Centralized logging systems, metrics, and tooling frameworks such as ELK, Prometheus, and Grafana.
- Web and Application servers like Apache, Nginx, Tomcat
- Versioning tools such as Git
Functional Areas: Software/Testing/Networking