Job Detail

DevOps Engineer
Experience: 2-4 years, Gurgaon


  • Willingness to work flexible shifts in a 24x7 rotation.

  • Respond to and triage alerts and SOP tasks on Linux and Windows VMs.

  • Experience with intrusion detection, log analysis, secure monitoring and event tracking systems.

  • Monitor production server health across parameters such as CPU load, physical memory and swap memory, and set up monitoring tools such as Nagios to track production server health.

  • Create alerts and configure monitoring of specified metrics to manage cloud infrastructure efficiently.

  • Set up and manage VPCs and subnets; establish connectivity between different zones; block suspicious IPs/subnets via ACLs.

  • Create and manage AMIs, snapshots and volumes; upgrade or downgrade AWS resources (CPU, memory, EBS).

Requirements:

  • Design, implement and maintain all AWS infrastructure and services (AWS CloudFormation, Amazon EC2, S3, VPC, etc.).

  • Design, deploy and maintain enterprise-class security, network, availability, scalability and systems management applications within an AWS environment.

  • Continual re-evaluation of existing stack and infrastructure to maintain optimal performance, availability, security and cost optimization.

  • Implement process and quality improvements through task automation. Implement infrastructure as code, security automation and automation of routine maintenance tasks.

  • Strong knowledge of Amazon SNS, AWS Lambda and Amazon Simple Queue Service (Amazon SQS).

  • Strong scripting skills (Bash, Python, PowerShell, Perl, etc.) will be an advantage.

  • Experience with automation/configuration management using Puppet/Chef/Ansible or similar.

  • Experience building a sophisticated and highly automated infrastructure.

  • Knowledge of IP networking, VPNs, DNS, load balancing and firewalling.

  • Strong knowledge of web services, API gateways, and application integration design and development.

  • Good knowledge of Apache/Nginx/Tomcat.

  • Experience writing and implementing shell scripts for various monitoring and automation tasks with Nagios.

  • BS degree in Computer Science or Engineering.

  • Ability to prioritize tasks and work independently.

  • Excellent communication skills to collaborate with teams globally.

  • RHCSA / RHCE / RHSA certified.

  • Knowledge of relational and non-relational databases.

  • Strong practical Linux-based systems administration skills in a Cloud or Virtualized environment.

  • Management of continuous integration servers such as Jenkins or Bamboo.

  • Knowledge of containerization (Docker, Amazon EKS, ECS).

  • Excellent written and oral communication skills; ability to communicate effectively with technical and non-technical staff.

Additional Skills:

  • Knowledge of other cloud platforms such as GCP is a plus.

  • Knowledge of the Jira tracking tool.

  • Knowledge of Splunk and/or Grafana to analyze and create dashboards for incident validation and triage.