CODAMETRIX
Senior Data Engineer
CODAMETRIX, Boston, Massachusetts, us, 02298
CodaMetrix is revolutionizing Revenue Cycle Management with its AI-powered autonomous coding solution, a multi-specialty AI-platform that translates clinical information into accurate sets of medical codes. CodaMetrix’s autonomous coding drives efficiency under fee-for-service and value-based care models and supports improved patient care. We are passionate about getting physicians and healthcare providers away from the keyboard and back to clinical care.
Overview
The Senior Data Engineer is a member of the Data & Analytics team, reporting to the VP, Data & Analytics. The Data & Analytics team is responsible for designing a data strategy and maintaining robust data architectures on the Databricks platform, which can efficiently handle large-scale, real-time data processing. They are also responsible for managing data pipelines to ensure seamless data flow from various source systems. The goal of the team i s to integrate data into a unified Data Lake, facilitating more insightful decision-making and analytical insights. This integration will enhance our data management capabilities, optimize our data pipeline, improve data quality, and boost operational efficiency.
The Data Engineer is responsible for the analytics data ecosystem, creating and maintaining performant data pipelines and repositories, providing the infrastructure to discover and consume data while continually evolving our data storage and analytic capabilities.
The Data Engineer supports our analytics and customer onboarding teams, data scientists and software engineers on data initiatives and will ensure optimal data access across the organization. As a team member, they will populate and maintain our data and data pipeline architecture, as well as optimizing data flow and collection for cross functional teams. They are an experienced data pipeline author and data wrangler who enjoys optimizing data systems and evolving them - and have a customer-centric approach towards the various teams who provide and consume data.
They are self-directed and comfortable supporting the data engineering and analytics needs of multiple stakeholders and systems and are relentless about data security. The right candidate will be excited by the prospect of optimizing our company’s data platform architecture to support deep-dive analytics to power our next generation of AI-driven products and solutions.
Responsibilities
Create, maintain, populate and optimize the CodaMetrix data platform and analytics architecture
Design, build and maintain robust and scalable infrastructure required for optimal extraction, transformation, and loading of data from a wide variety of data sources such as AWS RDS PostgreSQL DB, Salesforce..etc
Assemble large, complex data sets that meet functional / non-functional business requirements
Identify, design, and implement internal process improvements such as automating manual processes, optimizing data delivery, re-designing infrastructure for greater scalability, etc.
Collaborate with the Analytics, Machine Learning, and Product teams to address data-related technical issues and support their data infrastructure needs to enhance data availability and usability
Optimize existing data systems and pipelines for performance and scalability
Provide accurate and relevant data to standard and ad hoc data requests to incorporate into new and existing product dashboards and reports, proactive understanding of what needs to be communicated and when
Leverage advanced technical skills to support development of the CodaMetrix data lake, warehouse and business intelligence solutions
Foster a culture of continuous improvement and learning within the team
Evaluate and recommend new data technologies and methodologies to enhance data capabilities
Create and maintain comprehensive documentation of data processes, systems, and architectures
Provide regular updates to management on data engineering initiatives and project statuses
Participate and lead portions of team Agile-based ceremonial activities, including stand-ups, stakeholder demos, and reviews of other engineers designs and code
Provide technical consulting to users of the various data warehouse and dashboarding tools and advises users on optimizations, conflicts and appropriate and inappropriate data usage
Requirements
Required
Bachelor's or Master's degree in Computer Science, Data Science, Information Technology or a related field
5+ years of experience with Big Data technologies for data processing using Apache Spark, Kafka, Cassandra
5+ years of experience in AWS Cloud infrastructure & Databases services such as RDS, PostgreSQL, Aurora or Redshift
5+ years of programming experience in ingesting, processing and reading large volumes of data using pyspark or Scala
5+ years of experience in writing SQL and performance tuning
Posses an advanced understanding of various structured data in a healthcare setting and organize for visualization and consumption
Experience in building Data lake/Data Warehousing with both structured and unstructured dataset
Experience with ETL processes to implement data pipelines and workflows
Experience with Databricks platform using Unity catalog and DLT pipeline
Advanced Knowledge of data modeling, data architecture and data integration
Ability to manage multiple tasks or projects simultaneously
Proven analytical and problem-solving skills
Effective verbal and written communication skills with both management and peers
Preferred
Common BI Tools; Tableau is a huge plus
Knowledge of HIPAA compliance requirements as well as other security/compliance practices such as PII and SOC2 a big plus
Experience with Streaming workloads and integrating Spark with Apache Kafka
Experience with consuming or authoring REST and/or SOAP web service APIs
You understand what IaC means and have experience with common tools to implement it
What CodaMetrix can offer you:
Learn more about our full-time employee benefits and how we take care of our team.
Health Insurance: We cover 80% of the cost of medical and dental insurance and offer vision insurance
Retirement: We offer a 401(k) plan that eligible employees can contribute to one month after their first day
Flexibility: We have a generous Paid Time Off policy, which is managed but not limited, so you can take the time you need to relax and rejuvenate
Learning: All new hires complete our 7-week Onboarding Program where they learn about our company and each of our departments through live sessions hosted by a variety of our leaders
Development: We provide annual performance evaluations and prioritize working with employees on what their individual growth looks like
Recognition: We recognize the outstanding achievements of our team through annual company awards where employees have the opportunity to nominate their peers
Office Location: A modern open plan workspace located in the bustling Back Bay neighborhood of Boston
Additional Employer Paid Benefits: We offer employer-paid life insurance and short-term and long-term disability insurance
Background Check Notice
All candidates will be required to complete a background check upon acceptance of a job offer.
Equal Employment Opportunity
Our company, as well as our products, are made better because we embrace diverse skills, perspectives, and ideas. CodaMetrix is an Equal Employment Opportunity Employer and all qualified applicants will receive consideration for employment.
Don’t meet every requirement? We invite you to apply anyway. Studies have shown that women, communities of color and historically underrepresented talent are less likely to apply to jobs unless they meet every single qualification. At CodaMetrix we are committed to building a diverse, inclusive and authentic workplace and encourage you to consider joining us.
Powered by JazzHR
Overview
The Senior Data Engineer is a member of the Data & Analytics team, reporting to the VP, Data & Analytics. The Data & Analytics team is responsible for designing a data strategy and maintaining robust data architectures on the Databricks platform, which can efficiently handle large-scale, real-time data processing. They are also responsible for managing data pipelines to ensure seamless data flow from various source systems. The goal of the team i s to integrate data into a unified Data Lake, facilitating more insightful decision-making and analytical insights. This integration will enhance our data management capabilities, optimize our data pipeline, improve data quality, and boost operational efficiency.
The Data Engineer is responsible for the analytics data ecosystem, creating and maintaining performant data pipelines and repositories, providing the infrastructure to discover and consume data while continually evolving our data storage and analytic capabilities.
The Data Engineer supports our analytics and customer onboarding teams, data scientists and software engineers on data initiatives and will ensure optimal data access across the organization. As a team member, they will populate and maintain our data and data pipeline architecture, as well as optimizing data flow and collection for cross functional teams. They are an experienced data pipeline author and data wrangler who enjoys optimizing data systems and evolving them - and have a customer-centric approach towards the various teams who provide and consume data.
They are self-directed and comfortable supporting the data engineering and analytics needs of multiple stakeholders and systems and are relentless about data security. The right candidate will be excited by the prospect of optimizing our company’s data platform architecture to support deep-dive analytics to power our next generation of AI-driven products and solutions.
Responsibilities
Create, maintain, populate and optimize the CodaMetrix data platform and analytics architecture
Design, build and maintain robust and scalable infrastructure required for optimal extraction, transformation, and loading of data from a wide variety of data sources such as AWS RDS PostgreSQL DB, Salesforce..etc
Assemble large, complex data sets that meet functional / non-functional business requirements
Identify, design, and implement internal process improvements such as automating manual processes, optimizing data delivery, re-designing infrastructure for greater scalability, etc.
Collaborate with the Analytics, Machine Learning, and Product teams to address data-related technical issues and support their data infrastructure needs to enhance data availability and usability
Optimize existing data systems and pipelines for performance and scalability
Provide accurate and relevant data to standard and ad hoc data requests to incorporate into new and existing product dashboards and reports, proactive understanding of what needs to be communicated and when
Leverage advanced technical skills to support development of the CodaMetrix data lake, warehouse and business intelligence solutions
Foster a culture of continuous improvement and learning within the team
Evaluate and recommend new data technologies and methodologies to enhance data capabilities
Create and maintain comprehensive documentation of data processes, systems, and architectures
Provide regular updates to management on data engineering initiatives and project statuses
Participate and lead portions of team Agile-based ceremonial activities, including stand-ups, stakeholder demos, and reviews of other engineers designs and code
Provide technical consulting to users of the various data warehouse and dashboarding tools and advises users on optimizations, conflicts and appropriate and inappropriate data usage
Requirements
Required
Bachelor's or Master's degree in Computer Science, Data Science, Information Technology or a related field
5+ years of experience with Big Data technologies for data processing using Apache Spark, Kafka, Cassandra
5+ years of experience in AWS Cloud infrastructure & Databases services such as RDS, PostgreSQL, Aurora or Redshift
5+ years of programming experience in ingesting, processing and reading large volumes of data using pyspark or Scala
5+ years of experience in writing SQL and performance tuning
Posses an advanced understanding of various structured data in a healthcare setting and organize for visualization and consumption
Experience in building Data lake/Data Warehousing with both structured and unstructured dataset
Experience with ETL processes to implement data pipelines and workflows
Experience with Databricks platform using Unity catalog and DLT pipeline
Advanced Knowledge of data modeling, data architecture and data integration
Ability to manage multiple tasks or projects simultaneously
Proven analytical and problem-solving skills
Effective verbal and written communication skills with both management and peers
Preferred
Common BI Tools; Tableau is a huge plus
Knowledge of HIPAA compliance requirements as well as other security/compliance practices such as PII and SOC2 a big plus
Experience with Streaming workloads and integrating Spark with Apache Kafka
Experience with consuming or authoring REST and/or SOAP web service APIs
You understand what IaC means and have experience with common tools to implement it
What CodaMetrix can offer you:
Learn more about our full-time employee benefits and how we take care of our team.
Health Insurance: We cover 80% of the cost of medical and dental insurance and offer vision insurance
Retirement: We offer a 401(k) plan that eligible employees can contribute to one month after their first day
Flexibility: We have a generous Paid Time Off policy, which is managed but not limited, so you can take the time you need to relax and rejuvenate
Learning: All new hires complete our 7-week Onboarding Program where they learn about our company and each of our departments through live sessions hosted by a variety of our leaders
Development: We provide annual performance evaluations and prioritize working with employees on what their individual growth looks like
Recognition: We recognize the outstanding achievements of our team through annual company awards where employees have the opportunity to nominate their peers
Office Location: A modern open plan workspace located in the bustling Back Bay neighborhood of Boston
Additional Employer Paid Benefits: We offer employer-paid life insurance and short-term and long-term disability insurance
Background Check Notice
All candidates will be required to complete a background check upon acceptance of a job offer.
Equal Employment Opportunity
Our company, as well as our products, are made better because we embrace diverse skills, perspectives, and ideas. CodaMetrix is an Equal Employment Opportunity Employer and all qualified applicants will receive consideration for employment.
Don’t meet every requirement? We invite you to apply anyway. Studies have shown that women, communities of color and historically underrepresented talent are less likely to apply to jobs unless they meet every single qualification. At CodaMetrix we are committed to building a diverse, inclusive and authentic workplace and encourage you to consider joining us.
Powered by JazzHR