Site Reliability Engineer (SRE) - SparkPost

Save to Kiter
What Messagebird is looking for in applicants

Transform the communications world! 

We’re proud (and excited!) to be transforming the global communications landscape through our Omnichannel Platform-as-a-Service (OPaaS). 

Our Amazing #TeamSparkpost

SparkPost is the industry’s most trusted email optimization platform. SparkPost helps senders reliably reach the inbox with powerful solutions to help them plan, execute and optimize their email programs. The SparkPost platform is powered by the industry’s largest data network, a team of email experts to help brands elevate every aspect of their email program, and a security and compliance posture to support even the most regulated industries. SparkPost is the world’s largest sender, delivering 40% of all commercial email - 4-5 trillion sent annually - and also boasts the world’s largest data footprint to help enterprise-level brands make data-driven decisions to improve their email performance. The world’s most sophisticated senders, including The New York Times, Zillow, Adobe and trust SparkPost to elevate their email.

What You’ll Do in This Role to Add Value

At Sparkpost, we view Site Reliability Engineering methodologies and concepts as force multipliers. SREs play a critical role at SparkPost and are responsible for designing, building, and maintaining solutions supporting our infrastructure and applications, both internal and customer-facing, with emphasis on scalability, fault tolerance, performance, and monitoring.


  • Collaborate with your team and others within the organization to empower other engineering teams to efficiently do their best work.
  • Design and develop automation workflows to reduce operational toil.
  • Be proactive: gather supporting data, propose improvements, and work to implement them.
  • Be thorough: consider dependencies, failure modes and responses, and blast radius. Validate assumptions and outcomes.
  • Be collaborative: Participate in design discussions and reviews, help define technical solution criteria, and work effectively with team members in a remote work environment.
  • Document your work so it’s easy for the next person to use.
  • Participate in rotating 24x7 on-call shift, triaging production issues and either fixing issues or escalating to appropriate teams.

Required Skills/Qualifications

  • Experience building and managing resources using infrastructure-as-code tooling (e.g. Terraform, Chef, Puppet, Ansible, etc.).
  • Relevant experience or certifications with a cloud provider (AWS preferred).
  • Experience implementing effective automated monitoring for mission critical applications (e.g. Datadog, New Relic, Circonus, Nagios).
  • Comfortable with system architecture models and designs.
  • Experience with software version control (git preferred).
  • Understanding of modern virtualized server hardware, ability to identify resource issues related to utilization of CPU’s, memory, disk IO, and network IO.

Desired Skills/Experience

  • Experience guaranteeing high availability of high volume internet services, especially email.
  • Experience with establishing, measuring, and maintaining service level objectives and error budgets.
  • Experience with managing auto-scaling container-based workloads (Docker, ECS preferred).
  • Experience with relational and/or column-oriented databases (PostgreSQL, Dynamo).
  • Experience developing production applications using modern programming languages (e.g. Ruby, Python, Go, Kotlin, etc.) in a cloud environment.
  • Experience with risk management, incident response, and postmortem/root cause analysis.
  • Experience with performance testing and capacity management.

#LI-NR1 #LI-Remote

What You’ll Gain

  • Work from anywhere 
  • Generous stock options for all Birds
  • WFH set-up budget
  • State-of-the-art work gear
  • Learn from hundreds of the best minds in the business
  • Collaborate with diverse colleagues from over 55 countries (and counting)


Life at MessageBird:

We call ourselves Birds!

We work fast, grow fast, build fast and focus on impact. We’re go-getters, industry leaders and roll-up-your-sleeves-and-make-it-happen kind of people. 

Ready To Fly?

Our cloud communications solutions make it possible for over 25,000 businesses to instantly connect with billions of devices worldwide, allowing them to speak with their customers in the same ways they talk to their friends.

Headquartered in Amsterdam, we’re proud to be a “Work Anywhere” company. Our unique and united culture is rooted in our team: a diverse flock of over 750 Birds who represent 55 nationalities and counting. We’re smart, fast, and hungry. Our potential for growth is limitless. 

We understand that “life happens” and give you the freedom to choose the best environment for you to “get sh*t done”. Our Birds choose where they work from in the region or country we’re hiring in, so long as it’s within the job’s complementary timezone as indicated in the Job descriptions — this could be from one of our MessageBird hubs (Amsterdam, Singapore or Bogota) or remotely.... Want to work from a rural retreat? Sure, no problem! How about a bustling city getaway for a few weeks? Go ahead!

MessageBird is committed to fostering a fair and equal environment based on trust and mutual respect. We believe that a diverse and inclusive workplace is paramount to our success and we are committed to building a team that represents a wide variety of backgrounds, perspectives, and skills. 

Recruitment Privacy Statement:

Want some tips on how to get an interview at Messagebird?

What is Messagebird looking for?
If this role looks interesting to you, a great first step is to understand what excites you about the team, product or mission. Take your time thinking about this and then tell the team! Get in touch and communicate that passion.
What are interviews for Site Reliability Engineer (SRE) - SparkPost like?
Interview processes vary by company, role and team. The best plan is to see what others have experienced and then plan accordingly.
How to land an interview at Site Reliability Engineer (SRE) - SparkPost?
A great first step is organizing your path to an offer. Check out Kiter for tools to get started!