Cleo

Senior Data Engineer

Cleo, Trenton, New Jersey, US, 08628


Remote - US

Cleo is a cloud integration technology company focused on business outcomes. Every day, we help each of our 4,000+ customers realize their potential by delivering solutions that make it easy to discover and create value through the connection and integration of enterprise applications supporting critical workflows. By providing the industry’s most complete and flexible integration offerings, we help our clients build trusted relationships across their partner ecosystems today, while providing all the control and visibility they need to advance their business tomorrow. In a nutshell, Cleo is a category leader in ecosystem integration software that has grown rapidly in recent years.

The Senior Data Engineer is a hands-on leader responsible for designing, developing, and maintaining data pipelines and infrastructure at Cleo. This role involves setting the strategy for data systems, collaborating closely with cross-functional teams, and ensuring the scalability, reliability, and efficiency of data solutions. The Senior Data Engineer will focus on data infrastructure needs for AI/ML models, overseeing the creation of a data warehouse and associated systems from scratch, and ensuring data is properly transformed and optimized for machine learning and artificial intelligence applications. This role is integral to building the processes that support data transformation, data structures, metadata management, data quality controls, and workload management.

What You Will Be Doing

Lead the Design and Build of Data Pipelines: Develop and maintain scalable, reliable, and efficient data pipelines that collect, process, and store large datasets. Ensure these systems are optimized for AI/ML model training and inference.

Set Data Infrastructure Strategy: Define and execute the strategy for building and maintaining data warehouses, data lakes, and other data storage systems that support both operational and analytical needs. Ensure systems are optimized for AI/ML model workflows.

Hands-On Data Transformation for AI/ML Models: Design and implement data transformation processes, including feature engineering, data preprocessing, and data augmentation, to ensure data is in the right format for machine learning models.

Build Data Structures and Metadata Management: Build and manage the data structures, metadata repositories, and related systems that support data transformation, ensuring the organization has well-documented, accurate, and accessible data for AI/ML workflows.

Data Quality Controls and Risk Management: Establish and implement data quality controls to ensure that data used in machine learning models is clean, consistent, and accurate. Identify and raise risks at all stages of the data engineering process, including data ingestion, transformation, and storage.

Collaborate with Cross-Functional Teams: Work closely with data scientists, ML engineers, product managers, and business leaders to understand data requirements and ensure data systems meet the needs of AI/ML initiatives. Provide hands-on leadership to guide teams in transforming data for model training.

ETL Development and Optimization for AI/ML: Lead the development and optimization of ETL (Extract, Transform, Load) processes for ML/AI data. Focus on efficiently moving and transforming large datasets, ensuring data quality and readiness for model training and deployment.

Optimize Data for Model Training and Inference: Ensure data pipelines are designed to support the needs of AI/ML model training, such as handling large volumes of data, managing data quality, and enabling fast model iteration.

Data Governance for AI/ML: Define and implement data governance practices to ensure the secure, compliant, and ethical use of data in AI/ML workflows.

Stay Current: Keep up with emerging trends in data engineering, particularly those related to AI/ML model development, and implement best practices to optimize data systems for machine learning.

Your Qualifications

Experience: 5-7+ years of experience in data engineering, with a focus on transforming data for AI/ML models and optimizing data systems to support machine learning and artificial intelligence workflows.

Hands-On Expertise: Proven experience in hands-on data transformation and building data pipelines for AI/ML, including data preprocessing, feature engineering, and model-specific data handling.

Leadership: Experience leading or mentoring data engineering teams, providing hands-on guidance for AI/ML-related projects and collaborating with cross-functional teams.

Cloud and Big Data: Strong experience with cloud platforms and big data technologies, particularly in the context of AI/ML model development.

A few things we have to offer:

Competitive compensation

Great Healthcare + Dental + Vision

Flexible PTO

Culture of support, encouraging Life-Work balance

401k match

FSA and HSA options

Employee Assistance Program

Paid Parental Leave

Representing a company with 4,000+ clients and a 99% retention rate

Accelerated title and salary growth potential

A fun and energetic work environment that makes you excited to go to work every day

Cleo Communications, LLC is an equal opportunity/affirmative action employer and all qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, age, disability status, protected veteran status or any other characteristic protected by law.