Location This is a Remote position. Open to candidates in or willing to relocate to OR, WA, CA, ID, UT, AZ, TX USA.
Who We’re Looking For The Senior Site Reliability Engineer has wide latitude to automate and improve service reliability. The role is also responsible for diagnosing, investigating, and resolving service issues. The role will help teams build and adopt Service Level Objectives (SLOs) in order to broadly improve service reliability at the company. They will also have a hand in supporting and improving both our technical infrastructure and the platform services built on it.
What You’ll Accomplish
Maintain organization cloud presence in AWS
Automate and deploy infrastructure configurations using Infrastructure as Code (IAC)
Implement monitoring, as well as metric and log collection
Assist other teams with capacity planning and infrastructure budgeting
Participate in evening/night and weekend on-call rotation
Continuously improve infrastructure by automating away repetitive tasks and toil
Build out functionality of the clients Platform used by other engineering teams
Debug issues in application code and related services
Demonstrate a consistent commitment to clients core values and operating principles.
Work with your team on planning and completing department goals that align with overall business objectives.
Be an effective communicator by listening carefully, asking questions, and being transparent, timely, and diplomatic across all levels of the organization.
Stay informed on what is happening within the business and help others understand business decisions and the company direction by positively representing the company view.
Provide technical knowledge, coach and mentor others in the department and company.
Stay informed on new technologies or processes within your specialization and implement them when necessary.
Use analytic skills to communicate and drive decisions for your team based on available data.
In partnership with your manager, start to plan, evaluate, and improve the efficiency of your department to enhance speed, quality, efficiency, and output.
What You’ll Contribute
5+ years software development experience
3 years of experience with AWS
3 years of experience using Terraform to manage AWS resources
2 years of experience with Kubernetes (EKS preferred)
2 years of experience with a scripting language (e.g., Python, Bash)
1 year of experience with cloud network configuration
1 year of experience configuring CI/CD pipelines
1 year of experience supporting 24/7 web applications
Experience configuring monitoring and alerting
Experience with Kubernetes configuration tools like Helm and Kustomize
Nice to Have
Experience with systems programming languages like Rust or Go
Experience implementing observability through code instrumentation
Experience with GitOps (e.g., Argo CD, Flux)
Experience migrating applications from an on-prem environment to the cloud
Experience with Change Data Capture and Kafka
Bachelor's degree in computer science or related field
What We Provide
Competitive compensation: salary, plus performance-related bonus program
401(k) with employer match
100% company-paid medical and dental insurance benefits for you and your dependents
4 weeks paid vacation, increasing based on tenure
18 weeks paid leave for birth moms
8 weeks paid parental leave, including for adoption
Monthly wellness allowance
Annual professional and personal development allowance
Work from home office set-up and expense allowances
Flexible work location opportunities
Employer matching toward charitable contributions
What We Believe In Client’s mission is to serve the music fan in everyone. We represent a diverse and inclusive community, and we are committed to serving our community with innovative and creative solutions. We know that innovation happens best when varying perspectives are embraced and integrated. Our global team reflects our global community.