Magic Mondayz
Founding Platform Engineer
Magic Mondayz, San Francisco, California, United States, 94199
At Recode HR, we are collaborating with a cutting-edge, YC-backed voice AI startup to find a Founding Senior Platform Engineer (Backend). This role is perfect for engineers who have a robust background in infrastructure or DevOps and are passionate about building scalable distributed systems. As an early and foundational team member, you will play a key role in constructing and expanding the company's real-time voice AI infrastructure.
About the Company:
This innovative startup is building the "Retool for Voice AI," enabling developers to embed voice technology across various industries. As major platforms integrate human-like voice assistants for billions of users, this startup's platform bridges the gap between raw AI models and ready-to-use voice applications. With a focus on sectors like SaaS, logistics, and telehealth, they are preparing businesses for a voice-driven future. Since launching in March, they have rapidly scaled revenue and secured Series A funding from top-tier investors. Joining now as one of the first 10 team members means you will directly shape the trajectory of both the product and the infrastructure.
Role Overview and Responsibilities:
As the Founding Senior Platform Engineer, you will take ownership of real-time conversational infrastructure, ensuring it can handle millions of concurrent calls with 99.9% reliability and sub-second response times. You will lead infrastructure scalability projects and design resilient systems that are built to last.
Key Responsibilities:
Lead end-to-end projects focused on scaling infrastructure to support millions of users while ensuring high availability.
Build and deploy comprehensive monitoring systems for real-time performance and reliability insights (e.g., Prometheus).
Develop and implement anti-fragility measures for resilient infrastructure that can adapt to and recover from unexpected events (e.g., multi-cluster rollovers).
Collaborate closely with the founding team to refine and enhance infrastructure for improved reliability and scalability.
Core Requirements:
5+ years of software engineering experience with a focus on infrastructure or DevOps.
2+ years of experience working with distributed systems and scaling infrastructure.
At least 1 year of experience at a startup with fewer than 100 employees, ideally in fast-scaling and high-ownership roles.
Proficiency with Kubernetes and Pulumi for managing infrastructure; experience with Terraform is a plus.
Demonstrated success in building scalable systems from the ground up and leading projects to ensure high availability.
Hands-on coding experience with infrastructure and container management systems.
Strong understanding of networking concepts, multi-cluster environments, and observability.
Ability to thrive in a fast-paced startup setting, driving features from concept to deployment and continuously iterating for improvements.
Nice to Haves:
Prior experience as a founder or early-stage team member in infrastructure/DevOps.
Work history with top infrastructure companies (e.g., Mux, Render, Supabase, Datadog, Snowflake).
Familiarity with Rust or Go and a passion for using modern tech tools.
A problem-solving, innovative mindset aimed at enhancing infrastructure efficiency and reliability.
Proven ability to simplify and refactor legacy systems for better performance and maintainability.
Expertise in network security, certificates, and multi-cluster configurations.
A strong engineering portfolio with contributions to open-source projects or an active GitHub profile.
What We Offer:
Equity: 0.10% - 0.60%.
A full-time, in-office position in San Francisco.
The chance to work closely with the founding team and have a significant impact on shaping the company’s infrastructure and growth.
A high-impact role with opportunities to lead major infrastructure decisions and development.
Key Milestones:
First 7 days: Complete a pre-scoped project, such as setting up high-availability Redis.
First 14 days: Deliver on set monitoring goals, such as implementing Prometheus rules.
First 30 days: Independently complete a project (e.g., developing a Custom Resource Definition (CRD) for managing worker pools) and enhance real-time alerting for latency spikes.
Implement proactive infrastructure enhancements to bolster system resilience (e.g., automated multi-cluster rollovers).
Candidate Process:
20-minute Zoom interview with the Chief of Staff for an initial chat and a brief technical discussion.
30-minute Technical Interview with the CTO focusing on architecture and system design.
In-office lunch with the Founders to discuss company vision and culture.
A paid 3- to 7-day work trial to collaborate with the team and assess mutual fit.
If you are a platform engineer ready for a high-impact role at a rapidly growing startup, this opportunity offers the chance to influence the future of real-time voice AI infrastructure. Apply now to join a team dedicated to advancing the next generation of developer tools.