Senior Site Reliability Engineer
Who are you?
You love servers, systems and data bordering on the brink of obsession! But you also understand code and you leverage it when you need to automate away a repetitive task. You care deeply about building and running reliable systems and you want to share that passion with others. You enjoy a variety of work, mixing planning and operations of cloud infrastructure with building PC's. You excel in a collaborative and agile environment where we deploy continuously, measure everything and take responsibility to diagnose and fix what breaks.
Does this sound like you? Then read on.
Who are we?
Raygun’s cloud-based platform and tools enable software development organisations and teams to deliver exceptional value to businesses, and exceptional experiences to customers. Raygun helps developers stay in front of application development and performance issues before they impact on end users.
This could be you...
You will be involved in the day-to-day operation of the Raygun product suite, working directly with the CTO to support new initiatives, maintain and improve existing infrastructure and provide operation support for an established high volume system hosted in AWS which supports development teams around the world.
Our team is data driven, collaborative and very supportive. You’ll be given opportunities to learn and excel, but also be prepared to be challenged by people who care very deeply about the Raygun products and the value we are unlocking for our users.
- You will be responsible for the Raygun Amazon Web Services account management day-to-day
- Work with the team to assist in capacity planning for infrastructure, including managing instance reservations and other tasks for ensuring tight control on our cost-to-serve.
- Work with the team to provide operational support and quality focused improvements for internal and production systems
- Work with the CTO to plan, implement and review security of our internal and cloud infrastructure
- Work with the CTO to plan, implement and review automation of operations tasks
- Provide internal IT support for Raygun staff
A good candidate for this position:
Has strong commercial experience in cloud infrastructure (with a preference for AWS).
Has a solid background and commercial experience in operating and managing both Windows and Linux hosts.
Has experience in managing and operating in high uptime environments.
Has good communication skills and is self-motivated.
Can support the internal IT requirements of the business.
Has a strong security mindset and has relevant experience in securing cloud infrastructure.
Has proficiency in scripting languages, and a back end programming language
A great candidate for this position:
Has existing experience in an Site Reliability role.
Can pick up new tasks, technologies and systems quickly.
Can troubleshoot problems beyond the documentation.
Keeps up with new technologies and infrastructure trends.
Can work independently or in a small team to deliver features on time and to spec.
Helps mentor junior developers and teach good coding practices.
Has proficiency in scripting languages, and C#
You’ll work closely with a team of genuinely nice people, who will support you.
You’ll get to up-skill and learn new things from other team members, and share your knowledge.
Ability to influence growth and clearly see your impact.
Become part of a growing company where you can shape the future of our work processes.
Preference for working in the office, but flexibility about working from home as needed.
Your choice of set up. Mac or PC, the decision is yours.
Up-skill and learn new things from other team members and share your own knowledge.
The opportunity to join a fast-growing, fast-moving company where you have a direct impact is here – are you up for the challenge?
Applicants for this position must have NZ residency or a valid NZ work visa.
If you really want to know what tools and technologies we use, then here’s a great, big list for you: