Logo
Dice

Senior Data Engineer

Dice, Redmond, Washington, United States, 98052


Dice is the leading career destination for tech experts at every stage of their careers. Our client, Microsoft Corporation, is seeking the following. Apply via Dice today!

The 1ES Microsoft team is on the lookout for a Senior Data Engineer to join the Engineering Thrive initiative. This pivotal role involves crafting and refining metrics that capture the essence of engineering systems performance and productivity, in line with the SPACE framework where SPACE stands for Satisfaction, Performance, Activity, Communication, and Efficiency. The successful candidate will have a direct impact on Microsoft's engineering culture and strategy by contributing to dashboards surfacing productivity metric data to company leaders.

The ideal candidate will bring a wealth of experience from working on large-scale software development projects, whether in the commercial sector or within the open-source community. Analytical skills, data query experience, and effective communication skills are essential, as the role demands interaction with various engineering stakeholders and partners. The ability to articulate, justify, and validate the metrics, as well as to respond to feedback and queries, is crucial. Experience in software development engineering and a keen understanding of productivity are also key to driving success in this dynamic and influential position.

QualificationsRequired/Minimum QualificationsBachelor's Degree in Computer Science, Math, Software Engineering, Computer Engineering, or related field AND 4+ years experience in business analytics, data science, software development, data modeling or data engineeringOR Master's Degree in Computer Science, Math, Software Engineering, Computer Engineering, or related field AND 3+ year(s) experience in business analytics, data science, software development, or data engineeringOR equivalent experience4+ years of experience in data visualization using PowerBI.2+ years of experience in developing new products for companies or individuals to use, providing maintenance to current products to enhance use.Other Requirements:Ability to meet Microsoft, customer and/or government security screening requirements are required for this role. These requirements include, but are not limited to the following specialized security screenings:Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud Background Check upon hire/transfer and every two years thereafter.Additional Or Preferred QualificationsBachelor's Degree in Computer Science, Math, Software Engineering, Computer Engineering, or related field AND 8+ years experience in business analytics, data science, software development, data modeling or data engineeringOR Master's Degree in Computer Science, Math, Software Engineering, Computer Engineering, or related field AND 6+ years of business analytics, data science, software development, data modeling or data engineering work experienceOR equivalent experience.6+ years of experience in data visualization using PowerBI.Data Engineering IC4 - The typical base pay range for this role across the U.S. is USD $117,200 - $229,200 per year. There is a different range applicable to specific work locations, within the San Francisco Bay area and New York City metropolitan area, and the base pay range for this role in those locations is USD $153,600 - $250,200 per year.ResponsibilitiesCollaborates with appropriate stakeholders across teams and escalates concerns around data requirements by assessing and conducting feature estimation. Informs clients on feasibility of data needs and suggests transformations or strategies to acquire data if requirements cannot be met. Negotiates agreements with partners and system owners to align on project delivery, data ownership between both parties, and the shape and cadence of data extraction for one or more features. Proposes new data metrics or measures to assess data across varied service lines.Leads the design of a data model that is appropriate for the project and prepares design specification documents to model the flow and storage of data for a data pipeline. Designs assigned components of the data model for a functional area of a project. Partners with stakeholders (e.g., Data Science Specialists) to make iterative improvements to design specifications, data visualizations, data models, or data schemas. Considers tradeoffs between analytical requirements with compute/storage consumption for data and anticipates cost that could be influenced by the cadence of data extraction, transformation, and loading into moderately complex data products or datasets in cloud and local environments. Demonstrates an advanced understanding of costs associated with data that are used to assess the total cost of ownership (TOC).Identifies data sources and builds code to extract raw data from identified upstream sources using query languages while ensuring accuracy, validity, and reliability of the data across several pipeline components. Contributes to the code review process by providing feedback and suggestions for implementation.Plans and creates efficient techniques and operations (e.g., inserting, aggregating, joining) to transform raw data into a form that is compatible with downstream data sources, databases, and visualizations. Independently uses software, query languages, and computing tools to transform raw data across end-to-end pipelines. Evaluates data to ensure data quality and completeness using queries, data wrangling, and statistical techniques. Merges data into distributed systems, products, or tools for further processing.Writes code to implement performance monitoring protocols across data pipelines. Builds visualizations and smart aggregations. Develops and updates troubleshooting guides (TSGs) and operating procedures for reviewing, addressing, and/or fixing advanced problems/anomalies. Supports and monitors platforms.Performs root cause analysis in response to detected problems/anomalies to identify the reason for alerts or customer escalations and implement solutions that minimize points of failure. Implements and monitors self-healing processes across multiple product features to prevent issues from recurring in the future and retain data quality and optimal performance (e.g., latency, cost) throughout the data lifecycle.Documents the problem and solutions through postmortem reports and shares insights with team and the customer. Provides data-based insights into the health of data products owned by the team according to service level agreements (SLAs) across multiple features.Anticipates the need for data governance and designs data modeling and data handling procedures, with direct support and partnership with Corporate, External, and Legal Affairs (CELA), to ensure compliance with applicable laws and policies across all aspects of the data pipeline. Tags data based on categorization (e.g., personally identifiable information [PII], pseudo-anonymized, financial). Documents data type, classifications, and lineage to ensure traceability. Governs accessibility of data within assigned data pipelines. Provides guidance on contributions to the data glossary to document the origin, usage, and format of data for each program.Participate in 24/7 on-call rotation to support 1st party customers.Embody our Culture and Values.

#J-18808-Ljbffr