Senior Site Reliability Engineer
Astranis, San Francisco, CA, United States
As a team, we’ve launched five satellites into orbit, signed ten commercial deals worth over $1 billion in revenue, raised over $750 million from top global investors, and recruited a team of over 400 world-class engineers. We all work out of our (legendary) San Francisco office, which was once used to build ships during the World Wars.
Our satellites, which operate from geostationary orbit (GEO), weigh only 400 kg and utilize a proprietary software-defined radio payload. Each satellite can connect over two million people, and we’re very excited for the impact we’ll soon have in the Philippines, Peru, Mexico, and more!
Backed by substantial funding and a passionate, collaborative team, we offer a rewarding work environment where you'll learn and make a significant impact, no matter where you are in your career.
Senior Site Reliability Engineer - Ground Software
As a Senior Site Reliability Engineer, you will work with our Flight and Ground software teams to operate, scale, and automate operations for key software and mission control systems. These include command & control systems, telemetry databases, continuous integration systems and other software systems critical to our mission. You will be one of the first hires for this team, autonomously and cross-functionally leading our DevOps efforts as we continue to expand.
You will improve the reliability and maintainability of key software systems at Astranis as we expand to a fleet of satellites and their supporting services.
This role will contribute to both commercial and US Government programs.
Role
- Own and maintain multiple Kubernetes clusters
- Ensure the reliability and availability of services
- Implement monitoring and alerting systems
- Establish robust deployment practices and infrastructure
- Set up and manage sandbox/staging environments
- Manage enterprise vendor services within the corporate cluster (GitHub, Artifactory, etc.)
Requirements
- Bachelor of Science in a related discipline (e.g. Information Technology, Computer Science)
- 7+ years of experience as a Site Reliability Engineer or in a DevOps or DevSecOps role
- 7+ years of experience on Linux
- Experience with Kubernetes in a production environment
- Experience with shell programming (e.g. Bash)
- Strong written and oral communication skills
- Highly motivated, self-starting, and able to work autonomously with minimal supervision
Bonus
- Experience with Go or Rust
- Experience with Terraform
- Experience with GitHub Enterprise
- Experience with Ansible
- Experience with Bazel