Logo
Inizio Partners

Databricks Data Engineer

Inizio Partners, New York, New York, us, 10261


About the job Databricks Data Engineer

OUR CLIENT

Our client provides data-driven, action-oriented solutions to business problems through statistical data mining, cutting-edge analytics techniques, and a consultative approach. Leveraging proprietary methodology and best-of-breed technology, our client's analytics team takes an industry-specific approach to transform decision-making and embed analytics more deeply into their business processes. They have a global footprint of 2,000+ data scientists and analysts who assist client organizations with complex risk minimization methods, advanced marketing, pricing and CRM strategies, internal cost analysis, and cost and resource optimization within the organization. They serve the insurance, healthcare, banking, capital markets, utilities, retail and e-commerce, travel, transportation and logistics industries.

ROLE

We are seeking a Databricks Data Engineer.

RESPONSIBILITIES:

Lead and/or assist in designing and developing data systems, tailoring solutions to meet client-specific requirementsDesign and implement databricks-based solutions with a focus on distributed data processing, data partitioning and optimization for parallelismEngage with client to evaluate their current and future needs, crafting bespoke solution architectures and providing strategic recommendationsDevelop comprehensive architecture solution roadmaps integrating client business processes and technologiesDefine and enforce coding standards for ETL processes, ensuring maintainability, reusability, and adherence to best practicesArchitect and implement CI/CD pipelines for Databricks notebooks and jobs, ensuring testing, versioning, and deploymentDisaster recovery strategies for Databricks environments, ensuring data resilience and minimal downtime in case of failureInnovate and expand solution offerings to address data challengesAdvise stakeholders on data cloud platform architecture optimization, focusing on performanceExperienced with Scrum and Agile Methodologies to coordinate global delivery teams, run scrum ceremonies, manage backlog items, and handle escalationsIntegrate data across different systems and platformsStrong verbal and written communication skills to manage client discussionsCANDIDATE PROFILE

5+ years experience in architecture, design, and implementation using DatabricksExperience in designing and implementing scalable, fault-tolerant systemsDeep understanding of one or more of the big data computing technologies such as Databricks, snowflakeDemonstrated experience with the deployment of Databricks on cloud platforms, including advanced configurationsIn-depth knowledge of spark internals, catalyst optimization, and Databricks runtime environmentMust have experience in implementing solutions using DatabricksExperience in Insurance (P&C) is good to haveProgramming Languages SQL, PythonTechnologies Databricks, Delta Lake storage, Spark (PySpark, Spark SQL).

Good to have - Airflow, Splunk, Kubernetes, Power BI, Git, Azure DevOps

Project Management using Agile, ScrumB.S. Degree in a data-centric field (Mathematics, Economics, Computer Science, Engineering, or other science field), Information Systems, Information Processing, or engineering.Excellent communication & leadership skills, with the ability to lead and motivate team membersAbility to work independently with some level of ambiguity and juggle multiple demands