Apply now »

Senior Site Reliability Engineer

Points of Acceptance
Description: 

CORE PROFILE

The Site Reliability Engineer will ensure and drive resilience, reliability and efficiency, operability of cloud services of Maya Business. This includes building deployment pipelines and developing cloud infrastructure for that support our Enterprise applications and solutions.

 

NATURE OF WORK

  • Develop and deliver software solutions by integrating them into available infrastructure using various technologies including but not limited to IaaC (Terraform, Cloud Formation, Helm), CI/CD and automation.
  • Develop and implement solutions that monitor performance, cost, uptime, and availability metrics.
  • Develop solutions that optimize developer experience and software operation and administration.
  • Collaborate as a cross-functional member with team and other stakeholders to develop and deliver infrastructure and integration solutions that fulfill various business, technical and operational requirements.
  • Actively participate in team ceremonies and contribute accordingly as part of a cross-functional team
  • Actively participate in architecture designs and implementation planning
  • Evaluate, explore and integrate new technologies into existing stacks, services and way of working
  • Promote best practices and continuous improvement within the team
  • Provide 2nd level support in troubleshooting issues

 


DISPLAYED SKILL MASTERY

  • Strong problem solving and troubleshooting skills, including application and network-level troubleshooting ability
  • Communication, collaboration and team work
  • Shows strong expertise in AWS cloud, management and monitoring containerized and serverless applications
  • Adaptable to new technologies and ways of working
  • Shows strong accountability and ownership
  • Strong focus on maintaining a high quality of service
  • Work in a fast-paced, multi-tasking environment

 


REQUIRED QUALIFICATIONS

  • Highly technical and analytical, with 5+ years of demonstrated IT experience.
  • Experience working and implementing AWS services or other similar cloud providers.
  • Strong scripting experience with languages like Python, Shell, Perl etc.
  • Hands-on experience in developing CICD pipelines and tooling (Gitlab CI, Jenkins, CodePipeline etc ).
  • Experience with infrastructure as a code (IaaC) like Terraform and CloudFormation.
  • Experience with configuration management tools such as Puppet, Ansible.
  • Experience working with container management and container orchestration technologies like Docker, Kubernetes and similar.
  • Extensive system/network administration background in a large scale Linux environment.
  • Experience with monitoring and APM tools such as Splunk, Dynatrace, New Relic or any other monitoring tools/processes.
  • Ability to handle multiple competing priorities in a fast-paced environment.
  • Strong Automation and Problem solving skills.
  • Experience with system hardening and implementing security controls. PCIDSS experience is a plus.

Apply now »