Role: SRE (Observability) Engineer Start Date: December 16, 2024 Note: Before taking interview candidate need to write coding test, Immediate closure opportunity. This position is remote. Candidates must pass a HackerEarth Assessment to qualify skills in Automation (Chef, Ansible, Terraform), Python, and general SRE. Please stay on top of your submitted candidates, as we will interview those that qualify next week. Description We are seeking a highly skilled SRE ( Observability) Engineer with a deep understanding of modern observability practices and tools. The ideal candidate will have hands-on experience with provisioning, configuring, and developing infrastructure solutions, along with a strong focus on automation, scalability, and reliability. This role involves a mix of development, system architecture, and troubleshooting responsibilities, providing opportunities to influence the evolution of our infrastructure. Responsibilities Design, implement, and manage observability solutions using tools like Dynatrace , Prometheus, Thanos, or Grafana. Develop metrics, alerts, and silences for comprehensive system monitoring. Automate infrastructure tasks using Chef (recipes, cookbooks), Ansible (tasks, playbooks), or Terraform with a strong focus on syntax and GitLab CI/CD configuration. Script solutions using Python , PowerShell , or Bash to enable automation across the infrastructure. Propose and implement innovative ideas to reduce manual workload and improve operational efficiency through automation. Provision and configure cloud resources via CLI or APIs on Azure , GCP , or AWS . Troubleshoot and resolve system issues with an SRE (Site Reliability Engineering) mindset , focusing on root cause analysis and corrective actions. Develop and enhance documentation, including application guides, runbooks, and system configurations, ensuring clarity in the "why" and "how" of operations. Plan, design, and execute scalable and redundant system architecture to meet organizational goals. Required Skills Observability Tools : Hands-on experience with Dynatrace, Prometheus, Thanos, or Grafana. Infrastructure Automation : Proficiency in Chef , Ansible , Terraform , and GitLab CI/CD. Scripting Languages : Advanced skills in Python , PowerShell , or Bash . Cloud Platforms : Proficient in provisioning and configuring resources on Azure and GCP (AWS experience acceptable). SRE Practices : Familiarity with troubleshooting using SRE principles, root cause analysis, and corrective action planning. Documentation : Strong ability to write clear, concise, and detailed technical documentation and runbooks. System Architecture : Solid understanding of scalability and redundancy principles. Preferred Skills Kubernetes : Basic understanding of container orchestration and CLI. Linux Administration : Configuration, package management, and troubleshooting expertise. Networking : Knowledge of VPCs, proxies, CDNs, and their integration into scalable systems. Storage Systems : Familiarity with block and object storage configuration. Resource Informatics Group
...Medical Solutions Allied is seeking a travel Pediatric Respiratory Therapist for a travel job in... ...Therapist ~ Discipline: Allied Health Professional ~ Duration: 13 weeks ~36... ...find a great place to work and a career home. Weve received Best Places to Work awards...
Job Description: Company Description:McDonalds growth strategy, Accelerating the Arches, encompasses all aspects of our business as... ...Nothing in this job posting or description should be construed as an offer or guarantee of employment.#J-18808-Ljbffr McDonald's Corporation
THERAPY 2000, a home-based provider of pediatric Speech, Occupational, and Physical Therapy services, has an exciting opportunity for a Pediatric insert... ...and federal, state and local laws applicable to home-health pediatric speech language pathology. What youll need:...
...Receptionist Corporate Headquarters 12575 Uline Drive,Pleasant Prairie, WI 53158 Office orchestrator wanted. Are you a meticulous multitasker? Then you belong at Uline! As a Receptionist, youll support office operations at our Corporate Headquarters as we continue...
...LanceSoft is seeking a travel Pediatric Respiratory Therapist for a travel job in Albuquerque... ...Therapist ~ Discipline: Allied Health Professional ~ Start Date: 06/16/2025... ...Health Centers, Drug & Alcohol Facilities, Home Health & Community Health, Urgent Care Clinics...