Databricks
Database Engine Internals - Staff Software Engineer
Databricks, Bellevue, Washington, us, 98009
P-955
Our mission at Databricks is to radically simplify the whole data lifecycle from ingestion to ETL, BI, and all the way up to ML/AI with a unified platform. To achieve this goal, we believe the data warehouse architecture as we know it today will be replaced by a new architectural pattern, Lakehouse ( CIDR 2021 paper ), open platforms that unify data warehousing and advanced analytics. The new architecture will help address several major challenges, including data staleness, reliability, total cost of ownership, data lock-in, and limited use-case support.
A critical part of realizing this vision is the next generation (decoupled) query engine and structured storage system that can outperform specialized data warehouses in relational query performance, yet retain the expressiveness and of general purpose systems such as Apache Spark to support diverse workloads ranging from ETL to data science.
As part of this team, you will be working in one or more of the following areas to design and implement these next gen systems that leapfrog state-of-the-art:
Query compilation and optimization
Distributed query execution and scheduling
Vectorized execution engine
Data security
Resource management
Transaction coordination
Efficient storage structures (encodings, indexes)
Automatic physical data optimization
What we look for:
A passion for database systems, storage systems, distributed systems, language design, or performance optimization
Experience working towards a multi-year vision with incremental deliverables
Motivated by delivering customer value and impact
8+ years of experience working in a related system (preferred)
Optional: PhD in databases or distributed systems
Benefits
Comprehensive health coverage including medical, dental, and vision
401(k) Plan
Equity awards
Flexible time off
Paid parental leave
Family Planning
Gym reimbursement
Annual personal development fund
Work headphones reimbursement
Employee Assistance Program (EAP)
Business travel accident insurance
Pay Range Transparency
Databricks is committed to fair and equitable compensation practices. The pay range(s) for this role is listed below and represents base salary range for non-commissionable roles or on-target earnings for commissionable roles. Actual compensation packages are based on several factors that are unique to each candidate, including but not limited to job-related skills, depth of experience, relevant certifications and training, and specific work location. Based on the factors above, Databricks utilizes the full width of the range. The total compensation package for this position may also include eligibility for annual performance bonus, equity, and the benefits listed above.
Local Pay Range:
$182,400
—
$247,000 USD#J-18808-Ljbffr
Our mission at Databricks is to radically simplify the whole data lifecycle from ingestion to ETL, BI, and all the way up to ML/AI with a unified platform. To achieve this goal, we believe the data warehouse architecture as we know it today will be replaced by a new architectural pattern, Lakehouse ( CIDR 2021 paper ), open platforms that unify data warehousing and advanced analytics. The new architecture will help address several major challenges, including data staleness, reliability, total cost of ownership, data lock-in, and limited use-case support.
A critical part of realizing this vision is the next generation (decoupled) query engine and structured storage system that can outperform specialized data warehouses in relational query performance, yet retain the expressiveness and of general purpose systems such as Apache Spark to support diverse workloads ranging from ETL to data science.
As part of this team, you will be working in one or more of the following areas to design and implement these next gen systems that leapfrog state-of-the-art:
Query compilation and optimization
Distributed query execution and scheduling
Vectorized execution engine
Data security
Resource management
Transaction coordination
Efficient storage structures (encodings, indexes)
Automatic physical data optimization
What we look for:
A passion for database systems, storage systems, distributed systems, language design, or performance optimization
Experience working towards a multi-year vision with incremental deliverables
Motivated by delivering customer value and impact
8+ years of experience working in a related system (preferred)
Optional: PhD in databases or distributed systems
Benefits
Comprehensive health coverage including medical, dental, and vision
401(k) Plan
Equity awards
Flexible time off
Paid parental leave
Family Planning
Gym reimbursement
Annual personal development fund
Work headphones reimbursement
Employee Assistance Program (EAP)
Business travel accident insurance
Pay Range Transparency
Databricks is committed to fair and equitable compensation practices. The pay range(s) for this role is listed below and represents base salary range for non-commissionable roles or on-target earnings for commissionable roles. Actual compensation packages are based on several factors that are unique to each candidate, including but not limited to job-related skills, depth of experience, relevant certifications and training, and specific work location. Based on the factors above, Databricks utilizes the full width of the range. The total compensation package for this position may also include eligibility for annual performance bonus, equity, and the benefits listed above.
Local Pay Range:
$182,400
—
$247,000 USD#J-18808-Ljbffr