Logo
Walmart

Senior, Data Scientist

Walmart, California, Missouri, United States, 65018


Position Summary:This role is part of Walmart’s eCommerce Analytics Platform Enablement team. The position works on data and workflow management platform enablement, infrastructure, data integration, and supports the development of data products to enable eCommerce Analytics to achieve a broad range of objectives. Central to these efforts will be a passion for ensuring that the team’s technology and data solutions are well tailored to help drive the business and address the needs of stakeholders in our ever-changing customer environment. This position supports a broad range of internal constituents and external partners, and requires a combination of skills and experience, communication and relationship development, infrastructure management, and automation at scale in cloud-based computing environments, and a strong knowledge of data processing optimization techniques and distributed systems.Responsibilities:Work with Walmart’s Data Platform Enablement team.Responsible for Walmart’s data platform, data processing, data integrations and data solutions working with internal and external partners. The broader team is currently on a transformation path, and this role will be instrumental in enabling the broader team’s vision.System administration, security compliance, and internal tech audits.Responsible for operational excellence initiatives which include efficient use of data platform resources, identifying optimization opportunities, forecasting capacity, etc.Design and implement different flavors of architecture to deliver better system performance and resiliency.Identify opportunities to build automated processes and tools to improve efficiency.Develop capability requirements and transition plan for the next generation of data enablement technology, tools, and processes to enable Walmart to efficiently improve performance with scale.Drive best practices and standards around the usage of data platforms and tools.Implement data governance practices. Handle business and technology issues related to management of enterprise information assets and approaches related to data protection.Skills:Administering Dataproc and Airflow. Ability to create, maintain, scale, and debug production ephemeral and long-run Dataproc clusters as a Dataproc administrator.Deep understanding of data center architectures, networking, storage solutions, and scale system performance.Technical knowledge of big data analytics, optimization techniques, and data pipeline acceleration. Experience deploying and maintaining large-scale data pipeline in production. Experience deploying data science models and reporting solutions at scale, preferably with building Data tools from the ground up.Understanding of Cloud platforms such as GCP (preferred) and Azure and the difference between IaaS, CaaS, PaaS, etc.Strong experience with Apache ecosystem especially Spark, Hadoop, Hive, Kafka, Tez, Airflow and different data formats such as parquet, orc, avro, etc.Familiar with DevOps best practices and cloud native technologies.Programming experience in SQL, Python (preferred), R, Scala, Java, or Bash.Experience with BigQuery, Presto, CloudSQL, MSSQL, Cassandra, and Mongo DB is a plus.Experience with PySpark, SparkSQL, MLlib, and Spark Rapids on GPUs is a plus.Experience setting up logging and monitoring tools, and helping to debug complex data pipelines.Education & Experience:5+ years of relevant experience in roles with responsibility over data platforms and data operations dealing with large volumes of data in cloud based distributed computing environments.Graduate degree preferred in a quantitative discipline (e.g., engineering, economics, math, operations research).Proven ability to solve enterprise level data operations problems at scale which require cross-functional collaboration for solution development, implementation, and adoption.Minimum Qualifications:Option 1- Bachelor’s degree in Statistics, Economics, Analytics, Mathematics, Computer Science, Information Technology, or related field and 3 years' experience in an analytics related field.Option 2- Master’s degree in Statistics, Economics, Analytics, Mathematics, Computer Science, Information Technology, or related field and 1 years' experience in an analytics related field.Option 3 - 5 years' experience in an analytics or related field.Preferred Qualifications:Data science, machine learning, optimization models, Master’s degree in Machine Learning, Computer Science, Information Technology, Operations Research, Statistics, Applied Mathematics, Econometrics, Successful completion of one or more assessments in Python, Spark, Scala, or R.Using open source frameworks (for example, scikit learn, tensorflow, torch).We value candidates with a background in creating inclusive digital experiences, demonstrating knowledge in implementing Web Content Accessibility Guidelines (WCAG) 2.2 AA standards, assistive technologies, and integrating digital accessibility seamlessly.The ideal candidate would have knowledge of accessibility best practices and join us as we continue to create accessible products and services following Walmart’s accessibility standards and guidelines for supporting an inclusive culture.Primary Location:850 Cherry Avenue, San Bruno, CA 94066-3031, United States of America

#J-18808-Ljbffr