Cloud Infrastructure / Site Reliability Engineer
Signiminds Technologies Inc, San Francisco, CA, United States
Note: This position requires a security clearance. Candidates who already hold one are encouraged to apply.
Job Description:
As the Senior Software Engineer - Cloud Infrastructure, you will collaborate with development and quality engineering to build and maintain our continuous integration pipeline from development to production. You’ll bring a strong systems background and an eye toward automated software engineering and continuous delivery. Your deep understanding of SaaS and cloud technologies, combined with your leadership skills, will be vital in shaping the future of Client's Open NDR SaaS Platform.
Responsibilities:
Design, deploy, and maintain cloud infrastructure solutions on platforms such as AWS, Azure, or Google Cloud Platform (GCP).
Develop automation scripts and tools to streamline provisioning, configuration, and management of cloud resources.
Collaborate with software development teams to integrate cloud services into applications and workflows.
Implement monitoring and alerting systems to ensure the performance, availability, and security of cloud environments.
Improve resource utilization and cost efficiency through continuous monitoring, analysis, and tuning of cloud infrastructure.
Stay current with emerging technologies and best practices in cloud computing, DevOps, and infrastructure automation.
Participate in the resolution of production incidents and contribute to post-mortem analysis and improvement efforts.
Minimum Qualifications:
8+ years of professional experience in cloud infrastructure engineering or related roles.
Strong programming and scripting skills in languages such as Bash, Python, Go, or PowerShell.
Experience with infrastructure-as-code (IaC) tools such as Terraform or CloudFormation.
Experience with automation tools such as Jenkins, GitLab, and Ansible or Chef.
Understanding of networking concepts, security best practices, and cloud-native architectures.
Experience with cloud platforms such as AWS, Azure, or Google Cloud.
Strong communication and collaboration skills.
Experience with observability tools such as Prometheus, Grafana, the ELK stack, or similar.
Hands-on experience with Docker, Kubernetes, or similar technologies.
Knowledge of security practices and standards in cloud environments.
Experience with SLI, SLO, SLA, and error budget concepts.
Strong problem-solving skills and the ability to troubleshoot complex issues under pressure.
Familiarity with Agile methodologies and DevOps/SRE practices.
Excellent documentation skills.
Ability to work effectively in a fast-paced, collaborative environment.
Preferred Skills:
Certification in cloud computing (e.g., AWS Certified Solutions Architect, Azure Solutions Architect).
Experience with serverless computing platforms (e.g., AWS Lambda, Azure Functions).
Knowledge of infrastructure monitoring and observability tools (e.g., Prometheus, Grafana, ELK Stack).
Familiarity with configuration management tools (e.g., Ansible, Puppet, Chef).