JPMorganChase

Senior Lead Software Engineering - AI/ML Platform

JPMorganChase, Jersey City, New Jersey, United States, 07390

JOB DESCRIPTION

Be an integral part of an agile team that's constantly pushing the envelope to enhance, build, and deliver ML technology products.

As a Senior Lead Software Engineer at JPMorgan Chase within the Corporate Sector, AI/ML Technology, you are an integral part of an agile team that works to enhance, build, and deliver trusted market-leading technology products in a secure, stable, and scalable way. Drive significant business impact through your capabilities and contributions, and apply deep technical expertise and problem-solving methodologies to tackle a diverse array of challenges that span multiple technologies and applications.

Job Responsibilities

Architects and implements distributed ML infrastructure, including inference, training, scheduling, orchestration, and storage.

Develops advanced monitoring and management tools for high reliability and scalability.

Optimizes system performance by identifying and resolving inefficiencies and bottlenecks.

Collaborates with product teams to deliver tailored, technology-driven solutions.

Drives decisions that influence the product design, application functionality, and technical operations and processes.

Integrates Generative AI within the ML Platform using state-of-the-art techniques.

Adds to the team culture of diversity, equity, inclusion, and respect.

Hands-on experience with the ability to analyze, write, develop, test, and release products using Python on AWS.

Adheres to changing organization policies for in-office presence 3 days a week as this is a Hybrid role.

Required Qualifications, Capabilities, and Skills

Formal training or certification on software engineering concepts and 5+ years applied experience.

Deep expertise in AWS / Azure and Kubernetes ecosystem, including EKS, Helm, Custom Operators and Terraform.

Advanced in Python programming language, Java is a plus.

Background in High Performance Computing, ML Hardware Acceleration (e.g., GPU, TPU, RDMA), or ML for Systems.

Strong coding skills and experience in developing large-scale ML systems.

Extensive hands-on experience with ML frameworks (TensorFlow, PyTorch, JAX, scikit-learn).

Proven track record in contributing to and optimizing open-source ML frameworks.

Strategic thinker with the ability to craft and drive a technical vision for maximum business impact.

Demonstrated leadership in working effectively with engineers, data scientists, and ML practitioners.

Proven ability to identify trade-offs, clarify project ambiguities, and drive decision-making.

Ability to tackle design and functionality problems independently with little to no oversight.

Preferred Qualifications, Capabilities, and Skills

Excellent problem-solving and analytical skills.

Ability to work independently and in a team.

Passion for innovations and continuous learning.

#J-18808-Ljbffr