Microsoft Corporation

Data Engineer

Microsoft Corporation, Washington, District of Columbia, US, 20022


Join the forefront of sustainable innovation at Microsoft's Cloud Operations & Innovation (CO+I) as a Data Engineer. In this pivotal role, you'll be instrumental in driving the sustainable evolution of our core infrastructure and foundational technologies that power Microsoft's leading online services, including Bing, Office 365, Xbox, OneDrive, and the Microsoft Azure platform.

Imagine harnessing the power of machine learning models, big data, and predictive analytics to transform the landscape of data center sustainability. With over 200 data centers spanning 32 countries and millions of servers, our global infrastructure is a playground for ambitious data scientists eager to make a tangible impact. Collaborating with a team of experts, you'll support services for over 1 billion customers and 20 million businesses across 90 countries, pushing the boundaries of what's possible in the world of cloud computing. With environmental sustainability and optimization at the forefront of our data center design and operations, we continue to grow and evolve to meet the ever-changing business demands that keep Microsoft a world-class cloud provider.

Microsoft’s mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.

In alignment with our Microsoft values, we are committed to cultivating an inclusive work environment for all employees to positively impact our culture every day.

This role can be based at any of the following hub locations: Atlanta, GA; Washington, D.C.; Redmond, WA; San Antonio, TX; or Phoenix, AZ.

Relocation support will be provided, and successful candidates must relocate or reside within 50 miles of the hub office location.

This role is eligible for hybrid or remote work, up to 100%.

Responsibilities

Collaborates with appropriate stakeholders across teams and escalates concerns around data requirements by assessing and conducting feature estimation. Assesses access, usage, use cases, dependencies across products, and availability for business or customer scenarios related to one or more product features.

Informs leadership on feasibility of data needs and suggests transformations or strategies to acquire data if requirements cannot be met.

Negotiates agreements with partners and system owners to align on project delivery, data ownership between both parties, and the shape and cadence of data extraction for one or more features.

Proposes new data metrics or measures to assess data across varied service lines.

Leads the design of a data model that is appropriate for the project and prepares design specification documents to model the flow and storage of data for a data pipeline.

Extracts data across a wide variety of hyper-scale data sources.

Implements a data validation framework from source to endpoints, ensuring data quality and integrity.

Qualifications

Required Qualifications:

Bachelor's Degree in Computer Science, Math, Software Engineering, Computer Engineering, or related field AND 2+ years experience in business analytics, data science, software development, data modeling, or data engineering work

OR Master's Degree in Computer Science, Math, Software Engineering, Computer Engineering or related field AND 1+ year(s) experience in business analytics, data science, software development, or data engineering work

OR equivalent experience.

2+ years experience in working with software engineering teams and data science teams.

2+ years experience in Python development and Apache Spark (PySpark) for production systems.

Other Requirements:

The ability to meet Microsoft, customer, and/or government security screening requirements is required for this role. These requirements include, but are not limited to, the following specialized security screenings: Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter.

Preferred Qualifications:

Production experience in cloud data technologies such as Azure Synapse, Azure Data Factory, Azure Data Explorer, Power BI, PowerApps, K8s, SQL, Trino, Kafka, and the Hadoop ecosystem (in particular, HDFS and YARN).

Excellent analytical skills with a systematic and structured approach to software design.

Data Engineering IC3 - The typical base pay range for this role across the U.S. is USD $98,300 - $193,200 per year. A different range applies to specific work locations within the San Francisco Bay area and New York City metropolitan area; the base pay range for this role in those locations is USD $127,200 - $208,800 per year.

Certain roles may be eligible for benefits and other compensation. Find additional benefits and pay information here: https://careers.microsoft.com/us/en/us-corporate-pay

Microsoft will accept applications for the role until November 30, 2024.

Microsoft is an equal opportunity employer. Consistent with applicable law, all qualified applicants will receive consideration for employment without regard to age, ancestry, citizenship, color, family or medical care leave, gender identity or expression, genetic information, immigration status, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran or military status, race, ethnicity, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable local laws, regulations and ordinances. If you need assistance and/or a reasonable accommodation due to a disability during the application process, read more about requesting accommodations (https://careers.microsoft.com/v2/global/en/accessibility.html).