Aquent Talent

Multimodal AI Engineer

Aquent Talent, Boston, MA, United States

We are seeking a highly skilled Multimodal AI Engineer to join our team at a global creative, design, and technical staffing company. In this role, you will be responsible for developing and deploying advanced AI-driven solutions focused on image processing and multi-modal foundation models. The platform will enable our internal teams to source candidates based on job descriptions, portfolios, and other relevant data. This is an exploratory role requiring creativity and innovation, as you will design cutting-edge solutions to address complex, real-world business challenges.

Responsibilities:

Design, prototype, develop, and deploy AI solutions for image processing, including the development of backend infrastructure, data pipelines, and ML Ops systems.
Leverage multi-modal foundation models to combine image and text data for various tasks such as image generation, image understanding, and image enhancement.
Develop, train, and optimize image processing algorithms to achieve high-quality results and improve system efficiency.
Collaborate closely with cross-functional teams (including product managers, engineers, and data scientists) to ensure seamless integration of AI solutions into the production environment.
Implement and manage data pipelines for efficient and scalable AI model training and inference.
Stay up-to-date with the latest research and advancements in generative AI, multi-modal learning, and image processing to continuously improve system capabilities and innovation.
Work closely with stakeholders to define project objectives and deliver AI solutions that align with business needs.
Document findings, experiments, and solutions for internal stakeholders, fostering knowledge sharing and collaboration.

Qualifications:

Degree in Computer Science, Software Engineering, Electrical Engineering, Machine Learning, or a related field.
Strong understanding of machine learning algorithms, deep learning architectures, and generative models (GANs, VAEs, etc.).
Demonstrated experience with multi-modal foundation models and image processing techniques.
Proficiency in Python and experience with machine learning libraries such as PyTorch, TensorFlow, or similar.
Familiarity with cloud platforms (AWS, GCP, Azure) for building and deploying AI models at scale.
Strong problem-solving and analytical skills, with the ability to tackle complex data challenges.
Proven ability to work both independently and collaboratively within cross-functional teams.
Excellent communication skills and the ability to present technical concepts clearly to non-technical stakeholders.

Preferred Skills & Experience:

Experience with image generation models, such as StyleGAN, DALL*E, or similar.
Familiarity with ML Ops practices and tools for automating model deployment and monitoring.
Knowledge of data pipelines, distributed computing, and cloud-native architecture.
Experience working with databases (SQL/NoSQL) and managing large datasets.
Ability to quickly prototype, experiment, and iterate on new AI-driven solutions.
Previous experience in the creative, design, or technical staffing industries is a plus.

What We Offer:

Opportunity to work with cutting-edge technologies in Generative AI and multi-modal learning.
A creative and innovative environment where your contributions will shape the future of the platform.
The ability to work on real-world business problems and make a tangible impact on global staffing processes.
Competitive compensation, benefits, and career development opportunities.

If you are passionate about AI, image processing, and building innovative solutions, we encourage you to apply and join our team in this exciting new venture!

PDN-9da41a3c-4cee-4338-bd90-d5ac17c35c4c