Logo
Datadog

Manager I, Engineering - ML Observability

Datadog, Boston, Massachusetts, us, 02298


The ML Observability team is committed to empowering our customers with an advanced observability platform, specifically designed for applications that increasingly integrate machine learning components such as large language models and generative AI. We provide comprehensive monitoring and diagnostics for ML-based components, tracking model performance, drift, fairness, and system stability. Our platform also offers model prediction explainability and root-cause analysis, enhancing organizations' confidence in the reliability of their deployments.As the Engineering Manager, you will lead a team focused on enhancing and expanding Datadog's ML Observability product. Positioned at the forefront of R&D, you will emphasize rigor and experimentation to design, refine, and implement advanced techniques for evaluating and monitoring AI components - LLMs in particular - in our customers' applications. Your leadership and expertise in both engineering and applied science will be pivotal in shaping the direction of our product, ensuring Datadog remains a key player in this rapidly evolving field.What You'll Do:Manage and mentor a team of engineers, fostering a collaborative and innovative work environmentLeverage your technical expertise in software engineering and applied science to guide the team in building robust and scalable solutionsApply your experience with LLMs to enhance the product's capabilities in evaluating and monitoring LLM-based applicationsExplore and implement new techniques and tools to provide deeper insights into model behavior, drift, fairness, and interpretabilityEngage with senior management and executives, articulating complex technical concepts clearly and preciselyStay current with industry trends and advancements in machine learning and observability, driving innovation within the teamWho You Are:Proven experience in software engineering and applied science, with a focus on engineering LLM-based systems in productionDemonstrated experience managing small teams of software engineers and/or applied scientists, with a track record of delivering high-quality productsStrong software development skills and proficiency in Python and GoStrong understanding of machine learning theory, statistics, and fundamentalsExcellent communication abilities to convey complex technical concepts clearlyA collaborative mindset and proven experience in working in cross-functional teamsA proactive approach with a passion for continuous learning and innovationBenefits & Growth:New hire stock equity (RSUs) and employee stock purchase plan (ESPP)Continuous professional development, product training, and career pathingAn inclusive company culture, ability to join our Community Guilds (Datadog employee resource groups)Access to Inclusion Talks, our internal panel discussionsFree, global mental health benefits for employees and dependents age 6+Competitive global benefitsAbout Datadog:Datadog (NASDAQ: DDOG) is a global SaaS business, delivering a rare combination of growth and profitability. We are on a mission to break down silos and solve complexity in the cloud age by enabling digital transformation, cloud migration, and infrastructure monitoring of our customers' entire technology stacks. Built by engineers, for engineers, Datadog is used by organizations of all sizes across a wide range of industries.Equal Opportunity at Datadog:Datadog is an Affirmative Action and Equal Opportunity Employer and is proud to offer equal employment opportunity to everyone regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity, veteran status, and more.

#J-18808-Ljbffr