Logo
Varonis

Site Reliability Engineering Team Leader

Varonis, Morrisville, North Carolina, United States, 27560


Site Reliability Engineering Team Leader

The Company:

Varonis (Nasdaq: VRNS) is a leader in data security, fighting a different battle than conventional cybersecurity companies. Our cloud-native Data Security Platform continuously discovers and classifies critical data, removes exposures, and detects advanced threats with AI-powered automation.

Thousands of organizations worldwide trust Varonis to defend their data wherever it lives — across SaaS, IaaS, and hybrid cloud environments. Customers use Varonis to automate a wide range of security outcomes, including data security posture management (DSPM), data classification, data access governance (DAG), data detection and response (DDR), data loss prevention (DLP), and insider risk management.

Varonis protects data first, not last. Learn more at www.varonis.com.

The Role:

We are seeking a driven and development-focused Site Reliability Engineer Team Leader to join our SRE department. This group ensures that our software applications and infrastructure are reliable, scalable, and performant. The Team Leader will have both managerial and technical responsibilities.

The Location:

We are considering candidates who are able to work by hybrid model, reporting twice weekly to our Morrisville, NC office.

The Requirements:

At least a bachelor's degree (computer science or related fields) or equivalent experience in building scalable solutions to improve high-availability Production service reliability and/or increase productivity and efficiency.

At least 2 years of experience in managing SRE / Production Team.

Experience in developing C#, Python, or Java applications.

Strong organizational and analytical skills.

Substantial experience in operating a high-availability cloud infrastructure.

Quick technology adaptation.

Good interpersonal skills.

Experience with Microsoft Azure or other cloud platforms (GCP, AWS).

Advantages:

In-depth understanding of the entire web development process (design, development, and deployment).

Experience with Agile development, including CI/CD, and coding for automated testing.

The Responsibilities:

Managing SRE team with Production service reliability and increasing productivity and efficiency.

Monitor, manage and operate our cloud services including incident management.

Scale our service with required monitoring and alerting capabilities.

Develop tools and automations based on C# .Net and Python to support our operation and growth.

Work closely with R&D to ensure new features are reliable, easily deployable, and support the requirements of the service in terms of scale and security.

Establish a regular operational feedback cycle into our engineering teams.

Manage the Service Operations team to operate with a culture of business and customer-centricity by maintaining Varonis SLA for each service, including incident response, problem management, and service upgrades.

Develop and drive, as the primary owner, the communication strategy for internal and external stakeholders (including customers) to convey service health, tracking against SLAs, current and historical incidents, upcoming events, or upgrades.

Ensure all technical procedures are documented, reviewed, and updated, and actively contribute to the maintenance of operational standards & policies.

Collaborate with the Varonis Support team to understand and improve user experience, performance, incident response, and the serviceability of our offerings.

Collaborate with the internal R&D team to automate infrastructure services and system administration tasks wherever possible and implement a monitoring strategy to provide rapid feedback and diagnostics in the event of a service disruption.

Create relationships with other departments, including Marketing, Product Management, Engineering, and Customer Success, to ensure we provide services with high availability and superior performance for all our customers.

#J-18808-Ljbffr