Observability Operations Automation Engineer

Key Responsibilities

  • Design, develop, and maintain automation solutions to support observability and IT operations, focusing on improving system monitoring, alerting, and reporting capabilities.  This includes but is not limited to agent deployment and management, synthetic transaction monitoring, and incident remediation and validation workflows.
  • Manage source code and version control using GitHub, ensuring best practices in version control, code sharing, branching, and collaboration.
  • Develop automation scripts and tools using Ansible for configuration management, PowerShell for task automation, Terraform for infrastructure as code (IaC) implementations, and Jenkins for continuous integration/continuous deployment (CI/CD) pipelines.
  • Employ Python scripting to enhance automation efforts, contributing to more sophisticated data analysis and operational workflows.
  • Collaborate with cross-functional teams to identify automation opportunities that can streamline processes, reduce manual interventions, and improve system reliability and performance.