Logo
Paradigm

Principal Infrastructure Engineer

Paradigm, Emory, Texas, United States, 75440


The role As a core member of our infrastructure team, you will build and maintain major features, through inception, design, implementation and launch, working closely with product and engineering disciplines across the company. You will spend the majority of your time on cross-functional self-contained feature teams focused on delivering value to the customer, while other projects will be more internally focused on integrations, scalability, and performance. Responsibilities Own the site reliability process and systems from design and implementation to deployment and maintenance Educate the platform software engineering team on reliability best practices and collaborate to evolve the software engineering process to accommodate reliability principles Provide service outage escalation response alongside software engineers Manage multiple Kubernetes clusters across multiple environments and regions Manage and build core services and infrastructure across the entire engineering organization Help build an adaptable, high-velocity team Things that we believe are critical Expertise in site reliability engineering in a multi-datacenter production cloud environment with demanding up-time, real-time performance, and security requirements Experience adopting and employing open-source, home-grown, and commercial technology products as appropriate in support of the Infra Engineering mission Strong familiarity with AWS and Kubernetes Background in Software Engineering Experience with leading teams and projects Comfort working with senior management to allocate and prioritize engineering energy in support of the Infra Engineering mission in a real-world resource-constrained environment Extra Credit Experience with cloud infrastructure and networking in a production context Experience building and/or using low-latency cross-region databases or high-volume trading applications Experience with HashiCorp tools (Vault, and Terraform) Experience with Kafka, Redis, and Postgres Experience with cloud providers beyond AWS (Azure, GCP, etc.) Expertise in cloud network security #J-18808-Ljbffr