Lead Engineer - Site Reliability, Chennai (REQ-1453)

Location: Chennai, India (Asia Pacific)
HID Global
Functional Area: R&D Engineering

About HID Global

HID Global powers the trusted identities of the world’s people, places and things. We make it possible for people to transact safely, work productively and travel freely. Our trusted identity solutions give people secure and convenient access to physical and digital places and connect things that can be accurately identified, verified and tracked digitally. Millions of people around the world use HID products and services to navigate their everyday lives, and over 2 billion things are connected through HID technology. We work with governments, educational institutions, hospitals, financial institutions, industrial businesses and some of the most innovative companies on the planet. Headquartered in Austin, Texas, HID Global has over 3,000 employees worldwide and operates international offices that support more than 100 countries. HID Global® is an ASSA ABLOY Group brand. For more information, visit www.hidglobal.com

HID Global is the trusted source for secure identity solutions for millions of customers and users around the world. In India, we have two engineering centres (Bangalore and Chennai) with more than 200 engineering staff. The Global Engineering team is based in Chennai, and one of the Business Unit Engineering teams is based in Bangalore.

Position Summary

A rewarding career at HID Global beckons you! We are looking for a Site Reliability Engineer who will be responsible for reliability, scalability, and automation while keeping an eye on latency, performance, and capacity for some of our product lines. You will be accountable for delivering sound technical architecture for a growing number of distributed systems, incorporating third-party open-source tools when available. HID Global is a trusted source for innovative products, solutions and services that help millions of customers around the globe create, manage and use secure identities.


To perform this job successfully, an individual must be able to perform each essential duty satisfactorily. The requirements listed below are representative of the knowledge, skill, and/or ability required. Reasonable accommodations may be made to enable individuals with disabilities to perform the essential functions.


Site Reliability Engineer with experience in change management, monitoring, emergency response, and capacity planning. Candidates should be familiar with “Cloud Native Applications”. Candidates may be expected to work on both the software and the system side of the product.


  • Design, write, and maintain software to improve the availability, scalability, latency, and efficiency of product services.
  • Create new designs for a growing number of distributed systems.
  • Design and implement the tools and processes used for deployment and change management.
  • Plan and execute configuration management.
  • Own, maintain, and continuously improve all systems provided as a service, such as monitoring and datastores.
  • Engage in service capacity planning and demand forecasting, anticipating performance bottlenecks.
  • Automate the resource provisioning and allocation process.
  • Run software performance analysis and system tuning
  • Plan and execute disaster recovery drills
  • Participate in rotating on-call duties
  • Reduce organizational silos
  • Define prescriptive ways to measure values related to performance and operation of the system
  • Create a bridge between development and operations by applying a software engineering mindset to system administration topics


  • Experience with at least one cloud platform: AWS, Azure, Google Cloud, Cloud Foundry, OpenStack, etc.
  • Familiarity with algorithms, data structures, and complexity analysis
  • In-depth knowledge of operating systems (processes, threads, IPC, concurrency, locks, mutexes, semaphores, etc.)
  • Experience working with Unix/Linux systems from kernel to shell and beyond, with experience working with system libraries, file systems, and client-server protocols
  • Experience with network protocols and theory (TCP/IP, UDP, ICMP, MAC addresses, IP packets, DNS, OSI layers, load balancing, etc.)
  • Experience with Puppet or another configuration management tool
  • Systematic problem-solving approach
  • Strong sense of ownership and drive

Preferred Qualities:

  • Undergraduate degree in Information Technology, Computer Science, Engineering, or a related field required, with graduate degree preferred
  • 6 to 9 years of overall experience in the Application/Site Reliability domain
  • Experience building Cloud Native Applications
  • Expert hands-on proficiency in developing applications using one or more technology stacks (C, Java, Python, Go, JavaScript, etc.)
  • Expertise in designing, analyzing, and troubleshooting large-scale distributed systems
  • Strong hands-on understanding of scalability, security, high availability and operational requirements
  • Experience with Amazon Web Services
  • Experience with database tuning and performance
  • Experience with the full product lifecycle
  • Experience with the Atlassian suite (Jira, Confluence, etc.)
  • Excellent verbal and written communication skills

Language Skills  

  • Ability to effectively communicate in the English language, verbally and in writing.
  • Ability to read and interpret technical journals, specifications, international technical standards, etc. 

Work Environment

The work environment characteristics described here are representative of those an employee encounters while performing the essential functions of this job. Reasonable accommodations may be made to enable individuals with disabilities to perform the essential functions.

  • Employee works primarily in an office environment, in a well-ventilated area, and is exposed to moderate noise levels.

Work Requirements

  • Travel and fieldwork, including international travel, may be required; the employee should therefore hold a valid passport.