Databricks Inc.
Sr. Software Engineer - Performance Mountain View, California
Databricks Inc., Mountain View, California, us, 94039
P-97
At Databricks, we are passionate about enabling data teams to solve the world's toughest problems. We do this by building and running the world's best data and AI infrastructure platform so our customers can use deep data insights to improve their business. We constantly push the boundaries of data and AI technology, while simultaneously operating with the resilience, security and scale that is critical to making customers successful on our platform. Databricks develops and operates one of the largest scale software platforms; the fleet consists of millions of virtual machines, generating terabytes of logs and processing exabytes of data per day. At our scale, we regularly observe cloud hardware, network, and operating system faults, and our software must gracefully shield our customers from any of the above.
As a performance engineer, you will work closely with multiple teams across the company to evaluate the performance of products and features, identify performance bottlenecks, and partner with engineers to solve performance and scalability issues. This implies, among other teams, setting performance targets for various software releases, guiding teams to develop performance benchmarks, running competitive benchmark analysis for different Databricks products, doing deep dive analysis to identify performance issues and fix them.
The impact you will have:
Identify performance limitations of the entire stack based on telemetry, customer signals, PoCs, and competitive benchmarks, that will result in the best performing system across the industry, when resolved. Dimensions include latency, data and compute scalability, concurrency, cost, and price to performance ratio. Impact spans all cloud providers and all major areas.
Set the performance expectations for all cross-cutting efforts early on through specialized benchmarks capturing the intended customer user journeys, and make sure they are met before deployed to customers.
Understand the performance characteristics of the compute instance types, storage layers, and all cloud services Databricks depends on and deploy optimal solutions to meet the customer demand.
Work with customers to root cause and mitigate performance problems during production, previews, and POCs.
What We Look For:
BS (or higher degree) in Computer Science, or a related field
Experience in the performance analysis discipline. Ability to identify performance issues, root cause problems, and be able to come up with potential solutions.
Experience in software development, preferably in large scale distributed systems
Ability to measure and document the impact of performance features to existing customers, such as possible regressions for certain workloads, their extent, and which customers will be affected.
Ability to build strong working relationships with developers and field engineers to facilitate triaging and mitigation of performance problems.
Benefits
Comprehensive health coverage including medical, dental, and vision
401(k) Plan
Equity awards
Flexible time off
Paid parental leave
Family Planning
Gym reimbursement
Annual personal development fund
Work headphones reimbursement
Employee Assistance Program (EAP)
Business travel accident insurance
#J-18808-Ljbffr
At Databricks, we are passionate about enabling data teams to solve the world's toughest problems. We do this by building and running the world's best data and AI infrastructure platform so our customers can use deep data insights to improve their business. We constantly push the boundaries of data and AI technology, while simultaneously operating with the resilience, security and scale that is critical to making customers successful on our platform. Databricks develops and operates one of the largest scale software platforms; the fleet consists of millions of virtual machines, generating terabytes of logs and processing exabytes of data per day. At our scale, we regularly observe cloud hardware, network, and operating system faults, and our software must gracefully shield our customers from any of the above.
As a performance engineer, you will work closely with multiple teams across the company to evaluate the performance of products and features, identify performance bottlenecks, and partner with engineers to solve performance and scalability issues. This implies, among other teams, setting performance targets for various software releases, guiding teams to develop performance benchmarks, running competitive benchmark analysis for different Databricks products, doing deep dive analysis to identify performance issues and fix them.
The impact you will have:
Identify performance limitations of the entire stack based on telemetry, customer signals, PoCs, and competitive benchmarks, that will result in the best performing system across the industry, when resolved. Dimensions include latency, data and compute scalability, concurrency, cost, and price to performance ratio. Impact spans all cloud providers and all major areas.
Set the performance expectations for all cross-cutting efforts early on through specialized benchmarks capturing the intended customer user journeys, and make sure they are met before deployed to customers.
Understand the performance characteristics of the compute instance types, storage layers, and all cloud services Databricks depends on and deploy optimal solutions to meet the customer demand.
Work with customers to root cause and mitigate performance problems during production, previews, and POCs.
What We Look For:
BS (or higher degree) in Computer Science, or a related field
Experience in the performance analysis discipline. Ability to identify performance issues, root cause problems, and be able to come up with potential solutions.
Experience in software development, preferably in large scale distributed systems
Ability to measure and document the impact of performance features to existing customers, such as possible regressions for certain workloads, their extent, and which customers will be affected.
Ability to build strong working relationships with developers and field engineers to facilitate triaging and mitigation of performance problems.
Benefits
Comprehensive health coverage including medical, dental, and vision
401(k) Plan
Equity awards
Flexible time off
Paid parental leave
Family Planning
Gym reimbursement
Annual personal development fund
Work headphones reimbursement
Employee Assistance Program (EAP)
Business travel accident insurance
#J-18808-Ljbffr