You're using an older version of Internet Explorer that is no longer supported. Please update your browser.
AppNeta

Senior Site Reliability Engineer

Location
Vancouver, BC
Details
Full Time
4 days ago

AppNeta is a fast-growing global technology company that is taking advantage of the massive performance monitoring marketplace sized by Gartner as $2.2B in 2019. AppNeta has been named five times to the Inc. 5000 Fastest-growing Private Companies list, and has won numerous awards for company culture, including Inc. Magazine and BBJ’s Best Places to Work and BostInno’s Coolest Companies.

Overview:

As AppNeta continues to refine and evolve its well established DevOps practices, we have decided that it’s time to grow our dedicated Site Reliability team. Our primary goal is to streamline the 'as-a-Service' part of 'SaaS' by improving reliability for customers, minimizing operational overhead, and maximizing cost-effectiveness of our infrastructure.

This is an exciting time to be a Site Reliability Engineer at AppNeta as we are in the process of migrating our robust cloud-based platform to Microservices based architecture. As a consequence, this team acts as an advocate for automation, scalability, security, and fault-tolerance, working with other engineering teams at AppNeta to achieve continuous improvements in each of these areas. We are well on our way to implement on-demand deployments and achieve COMMIT == DEPLOY model.

We are expanding our cohesive team to include additional diverse experience and perspectives. We value transparent, respectful, and light-hearted communications, and people who are well-organized, collaborative, pragmatic, and empathetic. We want to work with team-players, not rock stars.

The ideal candidate does not tolerate toil, derives deep satisfaction from bringing order to chaos, dares to think big, has the technical skill to blaze new trails, demonstrates the emotional intelligence to effectively influence direction, and can balance tactical requirements against strategic advancement.

What you’ll do:

  • Lead the design and implementation of a new platform, addressing concerns such as builds, continuous integration & delivery, service discovery, network security, secrets management, monitoring & alerting, access control, and vulnerability management
  • Work closely with other engineering teams to evolve product/service architecture, and migrate services into our freshly-minted platform, and ensure that new services are designed with operability in mind
  • Automate all of the things, freeing yourself and others from the tyranny of manual tasks
  • Participate in an on-call rotation, along with other technical team members
  • Practice sustainable incident response and coordinate blameless postmortems
  • Mentor SRE team members, helping them reach their full potential
  • Assist in the definition, prioritization, and planning of work in a fast-paced environment

Highly desirable traits:

We’re open to people with a range of experience, but as a senior you should tick more than half of the following:

  • Strong interpersonal communication skills (listening, speaking, and writing)
  • Solid grasp of Linux systems, scripting, package management, and networking concepts
  • Strong familiarity with Amazon Web Services (especially computer, networking, storage), and interacting with them via API/CLI
  • Expertise with configuration management tools (e.g. Chef, Ansible, Puppet)
  • Proficiency in at least one programming language (Python, Java, Ruby, or Go preferred)
  • Experience implementing or maintaining CI/CD pipelines, either self-hosted (e.g. Jenkins, TeamCity), or managed (e.g. AWS CodeBuild, Codeship)
  • Experience developing effective operational monitoring and alerting (with tools such as Prometheus, CloudWatch, and Splunk)
  • Operational knowledge of virtualized and containerized environments (e.g. vSphere, KVM, Docker)
  • Demonstrable aptitude to learn new technologies, and apply that knowledge to solve real problems

Bonus points for:

  • Experience (or interest) in team lead/scrum master/project management responsibilities
  • Experience operating large-scale, distributed systems on top of cloud infrastructure such as Amazon Web Services, Google Compute Platform, or Microsoft Azure
  • Experience with infrastructure management/orchestration tooling (e.g. CloudFormation, Terraform, Rundeck)
  • Expertise with container deployment/orchestration technologies, especially Kubernetes
  • Experience with service discovery (e.g. Consul, SmartStack), and secrets management (e.g. Vault)
  • Professional Java coding experience, or operating Java-based applications
  • Skill in identifying performance bottlenecks, identifying anomalous system behavior, and determining the root cause of incidents

About AppNeta:

AppNeta is the leader in proactive end-user performance monitoring solutions built for the distributed enterprise. With AppNeta, IT and Network Ops teams can assure continual and exceptional delivery of business-critical applications. AppNeta’s SaaS-based solutions give IT teams essential application and network performance data, allowing them to continuously monitor user experience across any application, network, data center or cloud.

At AppNeta, we take application and network performance seriously without taking ourselves too seriously. We are big believers in a work hard, play hard culture. We offer everything from catered lunch and free snacks to commuter benefits and Maternity/Paternity leave, and pride ourselves on providing a challenging yet fun and collaborative environment in both our offices. For more on our company culture, perks, and benefits, check out our website: https://www.appneta.com/about/company-culture/.

About AppNeta’s office locations:

  • Boston:  Located just steps from South Station in the heart of the Innovation District, AppNeta’s Boston office is home to our Sales & Sales Engineering, Marketing, Customer Success, Product and G&A teams.
  • Vancouver: One block from Waterfront Station in the heart of historic Gastown, AppNeta’s Vancouver office is home to our Product, Engineering and Customer Success teams.

 

Category
Software and Programming Information Technology