Microsoft

Senior Machine Learning Engineer

Microsoft, Redmond, Washington, United States, 98052

The Business Applications Group is a rapidly growing organization that is responsible for the Microsoft Dynamics 365 suite of products, Microsoft Flow, PowerApps, AI Builder, Power BI and more. Microsoft is a leader in Software as a Service, and this organization is at the heart of how business applications are designed and delivered.

Our group is making massive investments in AI to lead the industry to create smart, personalized, business applications leveraging a variety of models. We are looking for a Senior Machine Learning Engineer to join our team and help build the model training and inference tools to meet our ambitions. As part of the BAP Copilot AI Team, you will work directly with our product teams and our Data Scientists to design, develop, and deploy models that provide personalized, low-latency inference. You will contribute to our deployment of models ranging up to 70B parameter range including personalized, fine-tuned scenarios.

Microsoft’s mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond

Responsibilities

We are looking for an individual with demonstrated experience optimzing deep learning models for scale.

Responsibilities Include

Collaborate with data scientists and software engineers to develop and deploy machine learning models.Design and implement model optimizations given the hardware, latency, and model constraints of our features.Stay up to date on emerging tools and methods for more efficient, lower latency model inference (e.g. Triton, advanced attention mechanisms.)Design and implement model inference infrastructure for personalized, fine-tuned models using mLora or other techniques.Develop internal tools for customization and optimization of models for real-time inferenceImplements tests and telemetry to monitor and continuously improve our infrastructure.Monitor and maintain deployed models, ensuring reliability and low latency at scale.As part of the AI Architecture group, advise on model training best practices and project lifecycle.

Qualifications

Required/Minimum Qualifications:

Bachelor's Degree in Computer Science, or related technical discipline AND 4+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or PythonOR equivalent experience.At least 3 years experience building and deploying Machine Learning models in a production environment.At least 4 years building and deploying production software projectsAt least 1 year of experience with machine learning frameworks like PyTorch, Tensorflow, ONNX, or TensorRT in production environments.

Additional Or Preferred Qualifications

Bachelor's Degree in Computer Science or related technical field AND 8+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or PythonOR Master's Degree in Computer Science or related technical field AND 6+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or PythonOR equivalent experience.Experience profiling and optimizing runtime performance, particularly on GPUs.Experience optimizing deep models for performance: quantizing, compressing, distillation, pruning, or related techniquesDemonstrated engagement with the ML engineering communityExcellent written and spoken communication skills and ability to motivate a technical team to work together on an ambitious project.Have at least 1 year of hands on experience with PyTorch, Tensorflow, ONNX, TensorRT, or other Deep Learning librariesHave at least 1 year of experience building and deploying systems in a cloud environment like Azure, AWS, or GCP

Software Engineering IC4 - The typical base pay range for this role across the U.S. is USD $117,200 - $229,200 per year. There is a different range applicable to specific work locations, within the San Francisco Bay area and New York City metropolitan area, and the base pay range for this role in those locations is USD $153,600 - $250,200 per year.

Certain roles may be eligible for benefits and other compensation. Find additional benefits and pay information here: https://careers.microsoft.com/us/en/us-corporate-pay

Microsoft will accept applications for the role until June 14, 2024.

#BETJobs #BAPJobs

Microsoft is an equal opportunity employer. Consistent with applicable law, all qualified applicants will receive consideration for employment without regard to age, ancestry, citizenship, color, family or medical care leave, gender identity or expression, genetic information, immigration status, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran or military status, race, ethnicity, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable local laws, regulations and ordinances. If you need assistance and/or a reasonable accommodation due to a disability during the application process, read more about requesting accommodations.#J-18808-Ljbffr