NYC Health Hospitals
Data Warehouse Developer
NYC Health Hospitals, New York, New York, 10261
MetroPlus Health provides the highest quality healthcare services to residents of the Bronx, Brooklyn, Manhattan, Queens and Staten Island through a comprehensive list of products, including, but not limited to, New York State Medicaid Managed Care, Medicare, Child Health Plus, Exchange, Partnership in Care, MetroPlus Gold, and Essential Plan. As a wholly-owned subsidiary of NYC Health Hospitals, the largest public health system in the United States, the MetroPlus Health network includes over 27,000 primary care providers, specialists and participating clinics. For more than 30 years, MetroPlus Health has been committed to building strong relationships with its members and providers to enable New Yorkers to live their healthiest life.

Position Overview
The Data Warehouse Developer will be accountable for the successful design, development, and delivery of Azure Databricks Lakehouse solutions within a SQL Server Data Warehouse environment. The ideal candidate leads, develops, maintains, and integrates processes for a data warehouse that extracts data from standardized or varied data sources and, using SSIS, transforms it for storage in formats and structures suited to querying and analysis. This includes loading data into an operational data store (ODS) and a dimensional model (EDM).

Job Description
Design and develop data warehouses and data marts to support data analysis and reporting.
Interact with end users and business analysts to understand requirements.
Develop SQL Server objects including, but not limited to, stored procedures, functions, and views.
Design and implement technology best practices, guidelines, and repeatable processes.
Create data validation rules on source data to confirm the data has correct and/or expected values.
Profile source data to identify data quality issues and anomalies, surface business knowledge embedded in the data, and gather natural keys and metadata.
Leverage Databricks to optimize and accelerate processing and analytics tasks.
Collaborate with the Databricks Administrator to ensure optimal cluster configuration.
Implement data quality and data governance policies to ensure data accuracy and consistency.
Participate in the development and implementation of master data management (MDM) strategies and solutions within complex system landscapes.
Develop Azure Databricks data engineering, AI, and machine learning tasks.

Minimum Qualifications
Bachelor's degree in computer science or a related field.
10 years of relevant professional experience.
5 years of experience with the Databricks Lakehouse Platform.
Strong SQL skills for data manipulation and query optimization.
Knowledge of Python is required.
Proficiency in cloud platforms such as AWS, Azure, or Google Cloud.
Hands-on experience with Azure Databricks as a unified data analytics platform, including the Databricks Workspace user interface, managing notebooks, and Delta Lake with Python.
Strong experience using SSIS as the ETL tool in a data warehouse environment.
Experience with Power BI or other BI reporting tools is a plus.
Working experience with Salesforce and MuleSoft is a plus.
Knowledge of Visual Studio .NET development is a plus.
Experience in a health care related business is a plus.

Professional Competencies
Integrity and Trust
Customer Focus
Functional/Technical Skills
Written/Oral Communication
Ability to adapt to different projects and assignments quickly
Team player with strong communication skills
Excellent analytical and problem-solving skills

LI-Remote