.

.

Senior DevOps Engineer

Location: Redwood City, CA, USA

Notice

This position is no longer open.

Job Number: 29806

Position Title: Sr DevOps Engineer

External Description:

Description 

Informatica is currently looking for a Senior Site Reliability Engineer with experience in Elastic Stack, Kubernetes to join our team.  

Job Summary  

Senior Site Reliability Engineer in Cloud Trust Team to support Centralized Elastic Stack for Observability and Security. 

As a Senior Site Reliability Engineer, you will be responsible for supporting Centralized Elastic Stack for Informatica R&D Engineers.  Also, you must be able to work and adapt in a fluid, fast-paced environment, must have strong technical, communication, collaboration, and leadership skills.

Technology You’ll Use:

Elastic Stack (ECK), Kubernetes, GitOps, Java/Go Lang, Python, AWS, Azure, GCP, Docker, Kafka, Kinesis, SQS, Grafana, Prometheus, Jaeger/Zipkin, Ansible/Chef  

What You’ll Do  

  • Build and Operate large scale ELK clusters deployed in multiple Cloud Providers.
  • Driving initiatives to evolve our current platform to increase efficiency and keep it in line with current standards and best practices. 
  • Manage and Onboard users to Elastic APM, Open Telemetry and Open Tracing.
  • Rollout GitOps to manage many Elasticsearch Clusters to ensure zero downtime, highly available during upgrades and maintenance activities.
  • Perform incident/alert troubleshooting, problem analysis and provide high quality solutions to technical issues 
  • On-call support in cases of issues on production environment  
  • Work with geographically dispersed team 
  • Manage RCA, Incident Process, and Risk Analysis of the cloud services
  • Driving successful POCs, involving latest cloud-based technologies.
  • Mentoring junior engineers on technical, architectural, design and related issues. 

Key Essentials  

  • Have experience with logging and telemetry services, specifically ELK/Elastic Stack, Jaeger, Zipkin, Grafana, Prometheus 
  • Have experience in either of Cloud Providers AWS, Azure, GCP 
  • Able to code to a good standard with any programming language, such as Python, Go, Ruby.
  • Experience writing infrastructure as code using tools such as Terraform, CloudFormation 
  • A solid understanding of configuration management principles and tools such as Chef. 
  • Comfortable with supporting and operating high availability Cloud services and provide on-call support for Sev1 incidents on production and critical development/QA cloud environments. 
  • Understanding of GitOps, CI/CD principles, Linux fundamentals, networking concepts and IP protocols 
  • You are self-motivated with strong problem solving and troubleshooting skills. 
  • Experience in building and operating Kubernetes clusters  

What Does Success Look Like? 

  • Enable Product teams with self-service capabilities and ensure zero downtime, adhering to defined SLAs for higher uptime of the system.  

Must Have  

  • 5+ years of Systems administrations or enterprise software development or operations 
  • 5+ years of experience in managing Cloud operation environments at scale in Production 
  • 3+ years of experience in full implementation of building and managing Elasticsearch clusters using Logstash, Beats
  • 2+ years of experience working in real-time data streaming tools such as Kafka, Kinesis, etc 
  • Experience in building and operating Kubernetes clusters
  • Experience in working with container runtime such as Docker, Containerd, CRI-O 
  • Experience in building dashboards in Kibana, Grafana 
  • Experience in instrumenting Distributed Tracing with Jaeger, Zipkin or Elastic APM 
  • Experience with Production level monitoring and alerting with tools like Prometheus, Grafana 
  • Experience with cloud automated deployment tools such as Chef, Ansible or Puppet is a plus 
  • Prior experience in performing or participating in compliance and security audits is a plus 
  • Prior experience in defining, configuring, and implementing disaster recovery process 
  • Ability to perform data related benchmarking, performance analysis and tuning

 

City: Redwood City

State: California

Seniority Level: Mid-Senior Level

Alternative Location(s) :

Community / Marketing Title: Senior DevOps Engineer

Remote LinkedIn Hashtag:

LinkedIN Hashtag: LI-SD1

Company Profile:

EEO Employer Verbiage:

At Informatica we know diversity drives innovation. We are proud to be an Equal Opportunity Employer dedicated to maintaining a work environment free from discrimination, one where all employees are treated with dignity and respect. All qualified applicants will receive consideration for employment without regard to race, color, gender, sex, sexual orientation, marital status, religion, age, disability, gender identity, veteran status or any other characteristic protected by applicable law and Informatica policy.

Travel Requirement: Limited

Location_formattedLocationLong: Redwood City, California US

Contact Us     Trademarks     Labor Condition Applications     Terms of Use     Privacy Policy

Facebook LinkedIn YouTube Instagram

© 2024 Informatica Inc.