Software Engineer, Site Reliability (Senior or Staff)

Work from home Full-time role Hiring

This is a fully remote job. The offer is available from: United States

At reputed company, we’re on a mission to accelerate the world’s ability to learn, discover, and communicate science, transforming how knowledge is shared and making science open, collaborative, and easily understandable by everyone.

We’re shaping the future of science communication and are looking for talented individuals to help bring this vision to life!

As our Sr/Staff SRE on the Platform Engineering team, you'll get in on the ground floor and play a pivotal role in developing and shaping a resilient, high-performance, and secure platform for reputed company's engineering prowess. In line with our company mission to accelerate the world’s ability to learn, discover, and communicate science, your objective is to design, implement, and maintain robust, scalable, and fault-tolerant systems that our customers rely on. Harnessing the power of automation, CI/CD, and Infrastructure as Code, you'll seamlessly integrate and deploy our applications into the cloud while establishing observability enhanced with actionable alerts and automation to detect performance bottlenecks. You'll adeptly address production issues, promptly restore services, and lead post-mortems to continually enhance our engineering excellence, thereby fulfilling our company's vision to be the go-to trusted platform where science is communicated.

Our ideal fit

  • You have experience working in a fast-paced, competitive environment and have a deep desire to work collaboratively, solve problems, and find win/win solutions.

  • You are passionate about architecting and building scalable platform solutions.

  • You are a results-oriented individual who takes initiative and has a strong bias for action.

  • You’re a creative thinker who finds efficient solutions and evangelizes best practices.

  • You have effective communication skills, a sense of ownership and drive to consistently improve yourself and others.

  • You’re a selfless team player who sees the big picture and puts common goals at the forefront of solutioning and decision-making.

What you'll be doing

  • Reliability and Scalability: Enhance the platform by constantly seeking ways to improve its reliability, scalability, and release efficiency.

  • Build Robust Observability and Monitoring Solutions: Define, build, maintain, and improve advanced observability and monitoring tools to bolster system reliability and availability.

  • Define and Monitor Performance Metrics: Play a key role in formulating and tracking Service Level Indicators (SLIs) and Service Level Objectives (SLOs) to establish precise benchmarks for system performance.

  • Solve Production Issues and Conduct Root Cause Analysis: Respond to escalated incidents, troubleshoot intricate system and application problems, and conduct thorough root cause analyses to implement corrective measures.

  • Thought Leadership and Innovation: Stay up to date with the latest industry trends and emerging technologies, and iterate on best practices to increase the quality and velocity of development and deliverables.

  • Architect Scalable and Reliable Systems: Contribute to the design and architecture of scalable, distributed, fault-tolerant systems that uphold performance and reliability standards.

  • Mentorship and Evangelism: Champion the adoption of new technologies, disseminate best practices, and advocate for architectural patterns. Mentor and guide fellow engineers in the organization.

What you bring to the table

  • 10-12+ years of experience in the software/DevOps/SRE realm

  • Strong programming skills in two or more of these languages: JavaScript, TypeScript, Python, Go

  • Ability to troubleshoot distributed systems at scale

  • Database Performance Monitoring and best practices

  • Comfortable innovating and establishing new practices, processes, and tooling

  • Strong analytical, system design, and application architecture skills

  • CI/CD, configuration management, monitoring, and automation expertise

  • Advanced knowledge of observability tooling and best practices (e.g., ELK, OpenTelemetry, Grafana)

  • Deployment and orchestration using AWS, k8s, Cloud Run, etc.

  • Understanding of Linux, virtualization, networking, VPCs, firewalls, and security groups

  • Hands-on knowledge of AWS and resource provisioning via CLI/API/IaC

  • Bachelor's degree in Computer Science, similar technical field of study, or equivalent practical experience.

Why join us?

  • We are mission-driven: we work collaboratively towards our shared goal of improving scientific communication and accelerating scientific discovery. reputed company figures have appeared in more than 54,000 publications!

  • reputed company is loved by millions! We have a world-class NPS and a community of fans and users in 200+ countries!

  • Our company is backed by top investors and accelerators like Y Combinator, and we are on a growth trajectory comparable to many top-performing SaaS companies.

  • We’re remote-first with team members across Canada and the U.S., offering you the flexibility to work from home.

reputed company is an Equal Opportunity Employer. We celebrate diversity and are committed to creating an inclusive environment for everyone. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, or veteran status.
