Lab45 - Senior Engineer
Category: Vacant
Country: India
Bengaluru
Publish date: 2024-03-26
Description
Role and Responsibilities:
Responsible for availability, latency, performance, efficiency, change management, monitoring, emergency response, and capacity planning.
Define, analyse, and review the technical architecture on the required platform, and propose architecture options and recommendations.
Define, detail, and scope the technical requirements into solutions.
Implement monitoring and alerting systems to ensure high availability and performance of the platform, focusing on SLA and availability.
Work across key activities in configuration, systems management tools, and backup and recovery.
Support technical consultants, and lead in building solutions and providing technical mentoring and guidance.

Required Technical and Professional Expertise:
Must have working experience in installation and upgrades.
Must have working experience in application deployment and configuration.
Must have working experience in security vulnerability fixes and SSL certificate installation and renewal.
Good knowledge of shell scripting.
Must be able to configure technology components such as Apache Tomcat, Apache ActiveMQ, and JBoss Infinispan to complete end-to-end integration.
Must know Java application setup, including JVM parameters, GC parameters, network integration, and performance tuning.
Must be able to set up Kubernetes and Helm, and enable the development team to deploy a service-based architecture using containers and pods.
Must be able to automate the build and deployment processes for Java applications.
Excellent communication skills to collaborate effectively with cross-functional teams.
Experience with any infrastructure monitoring or APM tool.
Experience in log/event aggregation and monitoring systems such as Splunk, Elasticsearch (ELK), Prometheus, and Grafana.
Sound experience in a distributed environment (preferably Kubernetes), troubleshooting performance issues related to pods, networks, application servers, load balancers, etc.
Strong expertise in managing production incidents, with experience driving resolution and stakeholder communication during incidents.

Additional Skills:
Proficiency in one or more scripting languages or tools for automating systems, e.g. Bash, Python, Ansible, or Puppet, would be an asset.
Must have skills in investigating and troubleshooting complicated systems and platforms, and in identifying key points of failure.
Knowledge of the fundamental principles of distributed systems (architectures, microservices, high availability, elections) will be an added advantage.