Healthix, Inc. is hiring: Data Analyst in New York
Healthix, Inc., New York, NY, US
Job Description
JOB TITLE: Data Analyst
REPORTS TO: Senior Manager of Data Strategy
DEPARTMENT: IT
HOURS: 40 Hours
JOB TYPE: Temp (6 months), Hybrid
RATE: $35 to $50 per hour
POSITION SUMMARY:
The Data Analyst provides the foundation for analytics: the collection, quality assurance, and availability of data. This IT role requires a significant set of technical skills, including deep knowledge of SQL, SQL database design, and multiple programming languages, as well as the communication skills to understand what data and analysis healthcare analysts want from Healthix data stores. The Data Analyst is responsible for the maintenance, improvement, cleansing, and manipulation of data in the organization’s operational and analytics databases. The Data Analyst also works with team members to define and build data pipelines that orchestrate the movement, transformation, validation, and loading of data from the source to the final destination: data stores defined and implemented based on system and data consumer requirements.
Typical tasks include working with the data architect on data warehouse design so that data loads quickly and is easily accessible and readily understood by analysts and users; extract, transform, and load (ETL) processing; thorough testing and validation of data transformations so that data is correct and reliable for downstream consumption; data bounds checking; and database tuning.
The Data Analyst strives to ensure proper data governance and quality for the Data Strategy team and the entire organization. The analyst examines complex data elements and systems, data flows, dependencies, and relationships to contribute to conceptual, logical, and physical data models, and works collaboratively within the Data Strategy team to support its data-centric needs. The candidate has strong working and conceptual knowledge of database queries, multidimensional query and index tuning, monitoring, disaster recovery, backup, automated testing, automated schema migration, and continuous deployment.
Essential Duties and Responsibilities:
- Expand and optimize data and data pipeline architecture, enhancing data flow and collection.
- Support software developers, database architects, and data analysts on data initiatives, ensuring consistent and optimal data delivery across projects.
- Work in a self-directed manner to meet the data needs of multiple teams, systems, and products.
- Optimize or redesign Healthix’s data architecture to support next-generation products and data initiatives.
- Develop and maintain scalable data pipelines and build new API integrations to handle increasing data volume and complexity.
- Collaborate with analytics and business teams to improve data models for business intelligence tools, enhancing data accessibility.
- Implement processes and systems to monitor data quality, ensuring production data accuracy and availability for key stakeholders and business processes.
- Write unit/integration tests, contribute to Confluence, and document work.
- Perform data analysis to troubleshoot and resolve data-related issues.
- Work closely with frontend and backend engineers, product managers, and analysts.
- Contribute to the definition and population of company data assets (data models).
- Design data integrations and data quality frameworks.
- Evaluate open-source and vendor tools for data lineage.
- Collaborate with business units and engineering teams to develop long-term data platform architecture strategies.
- Contribute to the development of an integrated data warehouse to provide a comprehensive view of the healthcare landscape.
Supervisory Responsibilities:
- This job has no supervisory responsibilities.
Qualifications:
- 5+ years of experience in a Data Analyst role, with a graduate degree in Computer Science, Statistics, Informatics, Information Systems, or a related field. IBM Certified Data Engineer or Google Certified Professional certification is preferred.
- Experience with healthcare data, including HL7, C-CDAs, InterSystems HealthShare, HealthShare Health Insight, and Caché databases.
- Advanced SQL knowledge and query-authoring experience with relational databases, plus familiarity with a variety of database systems.
- Proficient in database design principles, including relationships, normalization, structures, indexes, views, and analyzing database requirements.
- Experience in building and optimizing big data pipelines, architectures, and data sets.
- Strong analytical skills and experience in performing root cause analysis on data and processes to answer business questions and identify improvement opportunities.
- Experience in developing processes for data transformation, data structures, metadata, dependency management, and workload management.
- Working knowledge of message queuing, stream processing, and scalable big data stores.
- Proficient in big data tools such as Hadoop, Spark, and Kafka.
- Experience with relational (SQL) and NoSQL databases.
- Strong programming skills in languages such as Python, Java, C++, and Scala.
- Ability to architect end-to-end ETL pipelines.
- Experience with AWS services and BI tools like Tableau.
- Strong quantitative, analytical, process development, facilitation, and organizational skills.
- Excellent documentation and verbal communication skills, with the ability to clearly communicate technical concepts to both technical and non-technical stakeholders.