What Temporal is looking for in applicants
We are expanding our team! You can be anywhere in the United States for all of our positions, and other various locations outside the U.S. for other roles to join us.
At Temporal, we are on a mission to remove the complexity in developing reliable software for the cloud. If you want to solve hard distributed system problems, have a passion for open source software and building a strong developer community, then come join us in our mission. Temporal enables developers to focus on writing important business logic, and not on managing state or worrying about the underlying infrastructure. The Temporal platform is being trusted by top-tier companies as a core technology in their mission critical systems. Our active open source community of developers, who are also our users, provide us with real-time feedback and contributions. We're backed by top VC firms, have closed Series B and have a team of professionals from start-ups and larger companies like Microsoft, Google, Amazon, Meta, Uber, Apple, Cisco and more.
Infrastructure Engineer - Observability & Monitoring
We have openings for Senior to Principal bands on our Infrastructure team. You will be building the Platform for Observability & Monitoring, which will support the Temporal Cloud and other engineering teams. This Monitoring Platform will enable self-healing by exposing a programmatic interface to our observability systems. The platform will provide insights, and improve observability services with a focus on performance and reliability.
What you’ll do
* Create interfaces or tooling for self-service Developer Customers and Internal Engineers
*Build a monitoring system using Prometheus, OpenTracing, or another widely-used monitoring/visibility platform
*Work with large scale, high volume distributed systems, distributed databases, and data pipelines
*Build, maintain, and ensure timely delivery of our high-volume event log pipelines
*Create tools, and automation to help ensure data gets to the right place
*Prototype tooling interfaces and building new features for engineering use cases
*Participate in the Monitoring
*Participate in on-call rotation
What You bring to us
* At least 10+ years experience in setting up observability infrastructure using Prometheus, Grafana, Loki, Thanos or any other widely used monitoring system
*Experience using deployment automation/configuration management, especially Terraform or Chef
*Experience with AWS and other virtualized environments
*Experience building CI/CD and release pipelines using BuildKite, Travis, Jenkins, etc.
*Experience with deployment as code systems like Terraform, Chef, Puppet and Ansible
*Experience with container and management tool chains like Docker, Kubernetes, etc.
*Experience automating operations using Kubernetes Operators
*Experience with message queue services, such as Kafka
*Scripting experience using bash, zsh, etc.
*Experience coding in Go, C, or Java