Site Reliability Engineer Job at Dstillery, Remote

  • Dstillery
  • Remote

Job Description

Dstillery is the leading AI ad targeting company. We empower brands and agencies to target their best prospects for high-performing programmatic advertising campaigns. Backed by our award-winning Data Science, Dstillery has earned 24 patents (and counting) for the AI technology that powers our precise, scalable audiences. Our newest technology, ID-free®, is patented, privacy-safe behavioral targeting that reaches 100% of ad impressions and can be used with any Dstillery product. Our premier user segment product, Custom AI Audiences, is a just-for-your-brand targeting solution that refreshes hundreds of millions of users every 24 hours to deliver the best performance.

We currently handle billions of events per day and are growing strong, so we need someone who can help us scale our systems to handle that volume of data. Our engineering team is still pretty small, and everyone gets their hands dirty and makes a real impact.

Our engineering culture is focused on shipping scalable, practical systems. We use small, agile teams that can touch any part of the system. We place a high value on maintaining a good work-life balance, avoiding grinds and focusing on getting things done rather than putting in long hours.

We are looking to hire a Site Reliability Engineer to join our team in supporting both our on-premises and cloud infrastructure. In this role, you will work on standalone tasks and collaborate with other SRE team members on larger, more complex projects.

Responsibilities

  • Contribute to initiatives aligned with the systems roadmap in a collaborative team environment.
  • Work cross-functionally with software engineers, ML engineers, and data scientists to build and support reliable systems.
  • Build and refine monitoring and alerting systems to ensure high availability and performance.
  • Lead incident response, conduct root cause analysis, and drive remediation to prevent recurrence.
  • Participate in design sessions, code reviews, and knowledge sharing.
  • Advocate for SRE principles and best practices, including infrastructure as code and automation.
  • Contribute to and enhance our evolving systems documentation.
  • Participate in a scheduled rotation to support production systems during office hours.

Qualifications

We are looking for a candidate who has:

  • Familiarity with security best practices and experience implementing security measures across infrastructure.
  • Experience in performance tuning and optimizing systems for scalability and efficiency.
  • Experience in designing and implementing disaster recovery and business continuity plans.
  • Strong communication skills to effectively collaborate with cross-functional teams.
  • Strong analytical and problem-solving skills to troubleshoot complex issues.
  • Ability to mentor junior team members and share knowledge to foster a collaborative learning environment.

and who also has experience with a significant subset of the following tools, and an interest in learning the rest:

  • Linux system administration on RHEL derivatives
  • Deployment and monitoring across bare metal, cloud VMs, cloud-native platforms, and Kubernetes
  • Configuration management tools, e.g. Salt, Ansible
  • Infrastructure as code, e.g. Terraform
  • Linux installation tools, e.g. Cobbler
  • VM image building tools, e.g. Packer
  • Open source networking, e.g. quagga/frr, keepalived, iptables
  • Cloud networking on AWS and GCP
  • Automation using Python
  • Source control management using Git

Job Tags

Remote job, Full time, Work at office
