Systems Engineer - AWS Messaging Services

Vancouver, BC
Full Time
2 days ago
Amazon Web Services (AWS) is the world leader in providing a highly reliable, scalable, low-cost infrastructure platform in the cloud that powers tens of thousands of businesses around the world. The messaging team owns and operates Simple Queue Service (SQS), which provides AWS customers with the cloud infrastructure for building highly scalable, asynchronous, and fault-tolerant cloud applications. It is a critical core architectural component for Amazon as well as for many leading global enterprises running on AWS.

The messaging service and the team are growing fast and innovating in big, brand-new feature areas. We are looking for a Systems Engineer who is obsessed with operational excellence, automation, and availability. How do you know if you are a good fit for us? You want to automate common and complex tasks in fault-tolerant systems that operate at scale. You love to dive deep to identify the root causes of latency and availability problems. You find data center build-outs, performance engineering, and other scaling activities to be a joy. Finally, you insist upon giving customers what they want: quality, highly usable, always-on services.

In this position you'll get to:
• Work with developers to build and manage massively scaled systems
• Automate all aspects of systems management
• Build in new data centers and regions, and add/manage capacity in existing regions as our usage grows
• Optimize performance by analyzing and deploying new hardware configurations
• Track the health of our services, identify problems, drive to root cause, and fix
• Collaborate with some of the leading minds in systems


• Bachelor's or Master's degree in Computer Science or a related field
• A minimum of 3 years building and running Internet-facing services
• A minimum of 3 years of experience in scripting (Perl or Shell) and automation
• Excellent written and verbal communication skills, sense of ownership, urgency and drive


• Experience with TCP/IP network troubleshooting and administration
• Experience in a 24x7 production environment, esp. one based on Linux
• Excellent troubleshooting skills at all levels, from application to network to host
• Experience with management and monitoring software (home-grown or commercially available)
• Experience with performance testing and tuning
• Automation or monitoring framework experience, deployment or development
• Experience with very large systems, such as multi-terabyte storage farms and/or horizontally scaled request processing fleets
• Experience with SQL scripts and database administration preferred
• Advanced degree in computer science, mathematics, or a related field
Information Technology