Site Reliability Engineer
At imgix, we are building the "graphics card" for the Internet. We provide image and visual media processing on demand via a URL-based API. We connect to a customer's existing image storage (e.g., an S3 bucket) and then provide a domain they can use to access those images dynamically, creating any number of sizes, crops, adjustments, and other variants of those images. Think of it like Photoshop's "Save for Web" feature but for a million images at a time.
Skills: Python, Google Cloud, Linux, Amazon Web Services (AWS)
imgix is looking for a Site Reliability Engineer to join our Infrastructure Engineering team. In this role, you will work closely with other members of the team to build and operate the highly available and performant infrastructure that powers image delivery for some of the largest sites on the Internet.
The Infrastructure team is involved in building and operating the core rendering service and encouraging operational best practices throughout the entire engineering organization.
We interact with a variety of tools and services on a day-to-day basis, and we are looking for people who are excited about learning to operate them at scale. These include:
- High-performance caching services built on top of Varnish.
- Monitoring and data visualization tools such as Prometheus, Grafana, and BigQuery.
- Automation tooling such as Ansible, Terraform, and shell scripts.
- HAProxy, nginx, and a variety of Google Cloud Platform (GCP) Load Balancers.
- Google Kubernetes Engine along with a wide variety of other GCP services.
- In-house tools developed in Python, Go, and Lua.
We use these tools and services to:
- Build and operate highly reliable and performant distributed systems at scale.
- Observe and measure the imgix infrastructure for performance and reliability.
- Contribute expertise to architectural planning and ensure the company builds sustainable services that meet our customers' expectations while leveraging appropriate tools and frameworks.
- Foster and promote operational awareness throughout the larger engineering organization.
- Participate on an ongoing basis in the review and testing process for our production services.
imgix is building the future of visual media on the Internet. imgix operates the premier solution to deliver impactful, engaging, highly dynamic, and real-time imagery to eyeballs around the world. Our service consists of a top-tier image delivery platform tightly coupled with imgix's proprietary, on-demand image processing pipeline. It provides customers with great design flexibility while reducing the engineering investment required to serve state-of-the-art visual media. imgix enables our customers to greatly increase the value of their imagery and get back to building awesome things.
imgix is currently operating 100% remotely given concerns about COVID-19. This role will be remote, with the possibility of being based in our San Francisco office when it is appropriate and our office can safely reopen. Employee benefits are comprehensive (401k, medical, dental, and vision), perks are generous, vacation time is flexible, and salaries are in line with industry averages.
We maintain two separate technology stacks: one designed to perform all of the image processing we do, and one for configuring those image processing pipelines for customers. Our image processing infrastructure is built using technologies such as HAProxy, Varnish, Lua, Go, Objective-C, and C++. It handles >100K requests per second and >10K image renders per second. Our configuration and customer management stack is built on Python, MongoDB, MySQL, Node.js, and Angular.