-
loading
Solo con imagen

Monitor specific metrics availability


Listado top ventas monitor specific metrics availability

Nueve de Julio-Buenos Aires (Buenos Aires)
Sysadmin Linux Remote SSR (Night Shift, Bs As) ID100/327 Our Client is hiring a Sysadmin / Site Reliability Operator (SRO) to join our Site Operations team in Buenos aires. You will be responsible for helping the team in keeping our customers applications running at peak performance. Not only will you be the first point of contact to external worldwide customers but you will be helping to identify, analyze and resolve first-tier technical issues on large scale productive platforms. You will be helping modify and improve our monitoring infrastructure which has multiple metrics and graphs which are generated every minute from a very diverse environment. Required QualificationsAdvanced Linux skills and troubleshooting experience in a production environment.Experience with monitoring graphing metrics and alerting services.Experience in tracking problems with ticketing systems.Strong communication and teamwork skills.Strong communication skills in English.Willingness to learn from others and share knowledge within teammates. The ability to rapidly self-educate on new concepts and tools as also being actively searching for increased self-knowledge.Preferred QualificationsGood experience in on-premise infrastructure management and cloud-based infrastructure, in particular AWS.Good understanding in scripting on Bash, Python or similar.Experience managing web servers.Good understanding on following tools; Nagios, Grafana, Zabbix, and JIRA.An understanding of networking concepts of DNS, routing, load balancers, and firewalls.Job Duties And ResponsibilitiesBeing able to follow incident management procedures in production environments.Understanding Root Cause Analysis determination and timeline creation.Create and maintain documentation on installations, incidents, and procedures.Analyzing and troubleshooting large-scale distributed systems.Monitor specific metrics for availability, latency and overall system health.Development and implementation of new IT infrastructure monitoring.BenefitsCareer Path:Developing your monitoring skills by using complex systems such as Sensu or Zenoss.Interacting with Cloud Services from AWS and receiving continuous formation and courses from our AWS Specialists / Online.Using and deploying different applications with Containerization Software such as Docker Engine.Learning to automate daily tasks using Orchestration Software such as Puppet, Ansible or Salt.What We OfferOn boarding in San Francisco for 3 weeks approximate.Direct contact with clients and the opportunity to share ideas.Flexible retribution plan: you can adjust your compensation composition according to your needs.Training and certifications.Professional growth.Flexible Home-office.Trips to eventsb'/xe2/x80/xa6'and more!Location: PALERMO Bs As, Argentina 100% RemoteShiftNight: 23:00 a 7:00 AM (Monday to Friday)
Ver aviso
Nueve de Julio-Buenos Aires (Buenos Aires)
Sysadmin Linux Remote JRb'/xc2/xa0' (Afternoon Shift, Bs As) ID100/215 Our Client is hiring a Sysadmin / Site Reliability Operator (SRO) to join our Site Operations team in Buenos aires. You will be responsible for helping the team in keeping our customers applications running at peak performance. Not only will you be the first point of contact to external worldwide customers but you will be helping to identify, analyze and resolve first-tier technical issues on large scale productive platforms. You will be helping modify and improve our monitoring infrastructure which has multiple metrics and graphs which are generated every minute from a very diverse environment. Required Qualifications. (Linux, AWS, monitoreo y alarmas, manejo de tickets). Preferred QualificationsBasic experience in on-premise infrastructure management and cloud-based infrastructure, in particular AWS.Basic understanding in scripting on Bash, Python or similar.Experience managing web servers.An understanding of networking concepts of DNS, routing, load balancers, and firewalls.Job Duties And ResponsibilitiesBeing able to follow incident management procedures in production environments.Understanding Root Cause Analysis determination and timeline creation.Create and maintain documentation on installations, incidents, and procedures.Analyzing and troubleshooting large-scale distributed systems.Monitor specific metrics for availability, latency and overall system health.Development and implementation of new IT infrastructure monitoring.BenefitsCareer Path:Developing your monitoring skills by using complex systems such as Sensu or Zenoss.Interacting with Cloud Services from AWS and receiving continuous formation and courses from our AWS Specialists / Online.Using and deploying different applications with Containerization Software such as Docker Engine.Learning to automate daily tasks using Orchestration Software such as Puppet, Ansible or Salt.What We OfferOn boarding in San Francisco for 3 weeks approximate.Direct contact with clients and the opportunity to share ideas.Flexible retribution plan: you can adjust your compensation composition according to your needs.Training and certifications.Professional growth.Flexible Home-office.Trips to eventsb'/xe2/x80/xa6'and more!Location: PALERMO Bs As, Argentinab'/xc2/xa0' or 100% Remote b'/xc2/xa0'
Ver aviso
Nueve de Julio-Buenos Aires (Buenos Aires)
Summary/Mission: Perform tasks in all phases of the development cycle with little or none technical supervision. Appropriately assess problematic situations to gain adequate understanding of problems involved and assume the responsibility of delivering complex tasks on time and in scope within the teamb'/xe2/x80/x99's plan. Responsibilities: Work with the team to design for the performance, capacity and high availability of infrastructure and services Participate in problem resolution activities; Troubleshoot issues across the entire stack - software, database and infrastructure.Diagnose and troubleshoot complex distributed systems handling large volumes of data and develop solutions that have a significant impact at scale.Participate in building advanced tooling for testing, monitoring, administration and operations of multiple clusters across multiple geographically distributed data centersDevelop innovative ways to smartly measure, monitor b'&' report application and infrastructure healthExperience improving the performance of micro-services and solve scaling/performance issuesDefine and Monitor SLI/SLO Error BudgetsDrive efficiencies in systems and processes: capacity planning, configuration management, performance tuning, monitoring and root cause analysis. RequisitosRequirements / Experience: Creative when solving problems and continuously seeking improvements for processes and solutions facilitate knowledge sharing by creating and maintaining comprehensive documentation b'&' diagramsWrite high quality code to deliver automated solutions across the entire stack.Translate a passion for improvement into design b'&' roadmap contributions, despite existing technical challengesPartner with the Engineering community to establish metrics, review b'&' sign off on changes and introduce new services and schema changesStrong team player with a high degree of self-motivationAbility to learn new systems b'&' manage additional technical resources to meet the project requirementsCollaborate with development teams on best practices and infrastructure planning activities with a focus on reliability, performance and security3+ years of hands-on experience with cloud computing - including infrastructure, storage, platforms and data management, preferably in AWS.Experience with container orchestration technologies, like Docker b'&' KubernetesHands-on experience on AWS Elastic Kubernetes Service.Hands-on experience with Github Actions.Preferred Qualifications BS degree in computer science or proven software engineering capabilityExperience with traditional enterprise data-center technologies, including compute, storage appliances, virtual machines, and networkingExperience managing Databases: MySQL, MariaDB, SQL Server, or PostgreSQLExperience working with scalable networking technologies such as Load Balancers/Firewalls and web standards (REST APIs,, web security mechanisms, OWASP top 10).Broader Integration and management experience of DevOps ecosystems and relatedDeployment/orchestration tools such as Helm, Terraform, Gitlab CI/CD, Jenkins, Artifactory3+ years of experience in Linux Systems and general programming/scripting (Python, Shell, Java, Golang) and automation frameworks.Able to identify the root cause and resolve critical issues by looking across multiple layers (storage, OS, network, and application / DB stack)Play a part in incident management and emergency response b'/xc2/xa0' Location: LATAM USD Pay
Ver aviso

Avisos gratis para comprar y vender en Argentina | CLASF - copyright ©2025 www.clasf.com.ar.