Logo
BlueSkyClarity

Principal SRE and DevOps Engineer, Boston, 145k - 185k

BlueSkyClarity, Boston, MA


Principal Site Reliability and DevOps Engineer, Boston & Suburbs, 145k -185k base, small bonusCompensation Commensurate with experience, bonus, equity, benefits additional, EOECandidates must be a U.S. citizen or national, refugee, asylum, or lawful permanent resident.Seasoned H1b visa candidates will be considered, especially those with i140 status, Green Card sponsorship available. Our client, an iconic direct to consumer brand is seeking a Principal Site Reliability and DevOps Engineer, preferably one steeped in Kubernetes or other open-source container orchestration systems to continue to expand cloud platform capabilities that enable our product development teams to deploy, configure, manage, maintain and support production applications.ResponsibilitiesServe as a primary point person for the overall health, performance, and capacity of our Kubernetes-based cloud platform, evangelize, design, implement and automate security controls, governance processes, and compliance validation, work closely with our DevOps team and other consumers of the platform to develop SLAs and KPI targetsBuild an understanding of all aspects of the cloud platform including network ingress, monitoring, alerting, RBAC, and multi-region deployment, participate in solution design for new cloud platform features, open source technologies, and tool evaluation and selection, create and maintain runbooks and operational procedures to ensure that service availability requirements are achievedWork closely with engineering teams to conduct root cause analysis for production incidents, provide on-call support to triage and resolve issues in production platformsContinually improve processes, automation, documentation, monitoring and security, keep up-to-date on Kubernetes updates and tools, and plan and execute cloud platform updatesCreate contiguous improvements, drive "chaos testing" and stress tests to understand and improve overall resiliency to failures and loadDirectly impact our web to consumer customer experience by creating clean, maintainable, intuitive build and installation solutions using Kubernetes.Work alongside product managers, designers, and engineers.Automate as much work that makes sense so that we can ensure we deliver products and releases on time and within scope.Work directly with the engineering teams to assist with technical support within our environment and to develop, write, and ensure that all aspects of the code are tested in an efficient manner.QualificationsSite Reliability and DevOps Engineer with the following key technologies --> kubernetes | python | mysql | jenkins | puppet | amazon-web-services is the best match, NOT all technologies requiredExperience in software development or IT organizations, with direct experience with AWS or similar cloud infrastructure provider and production experience with Kubernetes or other open-source container orchestration systemsSoftware process automation with popular scripting languages (Python, Node.js, and/or Ruby) and knowledge of best practices and IT operations in an always-up, always-available mission critical serviceBS in Computer Science or equivalent technical domain, MS preferredBlueSkyClarity:BlueSkyClarity (a Delaware LLC) is a search firm focused on retaining smart, passionate and talented people within the marketing, creative, analytic, product, sales, software engineering and information technology domain disciplines for web-to-consumer, digital agency, consulting, start-up, or iconic brand clients in all industries.Posted by:Dom Costagliola, Principal, m 1-617-899-5094http://www.blueskyclarity.comType: direct hire