Chemical Abstracts Service
BN30P3 Data Engineer
Chemical Abstracts Service, Columbus, Ohio, United States, 43224
Description
CAS uses intuitive technology, unparalleled scientific content and unmatched human expertise to help companies create groundbreaking innovations that benefit the world. As the scientific information solutions division of the American Chemical Society, CAS manages the largest curated reservoir of scientific knowledge, and for 115 years, has helped innovators mine, assess and apply that information to keep businesses thriving. The CAS team is global, diverse, endlessly curious and strives to make scientific insights accessible to innovators worldwide.

Position Summary: You will be responsible for designing, developing, and maintaining our data infrastructure. You will work closely with data scientists, analysts, and other stakeholders to ensure data availability and quality. Your expertise in AWS, Python, and Spark will be crucial in building scalable data pipelines and processing large datasets efficiently. Experience with NoSQL databases, particularly MarkLogic, and full stack development is a plus.

CAS is currently seeking a Data Engineer. This position will be located in our headquarters in Columbus, Ohio.

Job Accountabilities:
- Design, develop, and maintain scalable data pipelines using AWS services.
- Implement data processing workflows using Python and Spark.
- Work with NoSQL databases, with a preference for experience in MarkLogic.
- Collaborate with data scientists and analysts to understand data requirements and deliver solutions.
- Ensure data quality and integrity through effective data governance practices.
- Optimize and troubleshoot data workflows for performance and reliability.
- Contribute to full stack development projects as needed.
- Stay updated with the latest industry trends and technologies to continuously improve our data infrastructure.
Qualifications:
- Bachelor's degree in Computer Science, Engineering, or a related field.
- 3-5 years of experience in data engineering or a related role.
- Proficiency in AWS services (e.g., S3, ECS, Lambda, EMR).
- Strong programming skills in Python.
- Experience with Apache Spark for data processing.
- Familiarity with NoSQL databases; experience with MarkLogic is a plus.
- Knowledge of full stack development is a plus.
- Strong problem-solving skills and attention to detail.
- Excellent communication and teamwork abilities.

Preferred Qualifications:
- Experience with content management and data warehousing solutions.
- Knowledge of data governance and best practices.
- Familiarity with other programming languages (e.g., Java, Scala).
CAS offers a competitive salary and comprehensive benefits package, including a generous vacation plan; medical, dental, and vision insurance plans; and employee savings and retirement plans. Candidates for this position must be authorized to work in the United States and must not require work authorization sponsorship by our company for this position now or in the future.

EEO/Minority/Female/Disabled/Veteran Equal Opportunity Employer/Protected Veterans/Individuals with Disabilities
The contractor will not discharge or in any other manner discriminate against employees or applicants because they have inquired about, discussed, or disclosed their own pay or the pay of another employee or applicant. However, employees who have access to the compensation information of other employees or applicants as a part of their essential job functions cannot disclose the pay of other employees or applicants to individuals who do not otherwise have access to compensation information, unless the disclosure is (a) in response to a formal complaint or charge, (b) in furtherance of an investigation, proceeding, hearing, or action, including an investigation conducted by the employer, or (c) consistent with the contractor's legal duty to furnish information. 41 CFR 60-1.35(c)