Data Architect - Industrials & Energy Sector - Senior - Consulting - Location Op
Ernst and Young, San Francisco, CA, United States
At EY, you’ll have the chance to build a career as unique as you are, with the global scale, support, inclusive culture and technology to become the best version of you. And we’re counting on your unique voice and perspective to help EY become even better. Join us and build an exceptional experience for yourself, and a better working world for all.
The exceptional EY experience. It's yours to build.
EY focuses on high-ethical standards and integrity among its employees and expects all candidates to demonstrate these qualities.
US Consulting - AI & Data - Data Architect, Industrials & Energy Sector – Senior
The opportunity
EY is seeking a Data Architect with strong technology and data understanding having proven delivery capability. Lead the design, development, and management of the organization’s data architecture, ensuring scalable, efficient, and secure data solutions that align with business goals and support enterprise-wide data initiatives.
In this role, you will create, maintain, and support the data platform and infrastructure that enables the analytics front-end; this includes the testing, maintenance, construction, and development of architectures such as high-volume, large-scale data processing and databases with proper verification and validation processes.
Your key responsibilities
- Design, develop, optimize, and maintain data architecture and pipelines that adheres to ETL principles and business goals
- Develop and maintain scalable data pipelines, build out new integrations using AWS native technologies to support continuing increases in data source, volume, and complexity
- Define data requirements, gather and mine large scale of structured and unstructured data, and validate data by running various data tools in the Big Data Environment
- Support standardization, customization and ad hoc data analysis and develop the mechanisms to ingest, analyse, validate, normalize, and clean data
- Write unit/integration/performance test scripts and perform data analysis required to troubleshoot data related issues and assist in the resolution of data issues
- Implement processes and systems to drive data reconciliation and monitor data quality, ensuring production data is always accurate and available for key stakeholders, downstream systems, and business processes
- Lead the evaluation, implementation and deployment of emerging tools and processes for analytic data engineering to improve productivity
- Develop and deliver communication and education plans on analytic data engineering capabilities, standards, and processes
- Learn about machine learning, data science, computer vision, artificial intelligence, statistics, and/or applied mathematics
- Solve complex data problems to deliver insights that help achieve business objectives
- Implement statistical data quality procedures on new data sources by applying rigorous iterative data analytics
- Strong understanding & familiarity with all Hadoop Ecosystem components and Hadoop Administrative Fundamentals
- Strong understanding of underlying Hadoop Architectural concepts and distributed computing paradigms
Skills and attributes for success
- Experience in the development of Hadoop APIs and MapReduce jobs for large scale data processing
- Hands-on programming experience in Apache Spark using SparkSQL and Spark Streaming or Apache Storm
- Hands on experience with major components like Hive, Spark, and MapReduce
- Experience working with NoSQL in at least one of the data stores - HBase, Cassandra, MongoDB
- Experienced in Hadoop clustering and Auto scaling
- Good knowledge in apache Kafka & Apache Flume
- Knowledge of Spark and Kafka integration with multiple Spark jobs to consume messages from multiple Kafka partitions
- Advanced experience and understanding of data/Big Data, data integration, data modelling, AWS, and cloud technologies
- Strong business acumen with knowledge of the Industrial Products sector is preferred, but not required
- Ability to build processes that support data transformation, workload management, data structures, dependency, and metadata
- Ability to build and optimize queries (SQL), data sets, 'Big Data' pipelines, and architectures for structured and unstructured data
- Experience with or knowledge of Agile Software Development methodologies.
- Demonstrated understanding and experience using:
- Data Engineering Programming Languages (i.e., Python)
- Distributed Data Technologies (e.g., Pyspark)
- Cloud platform deployment and tools (e.g., Kubernetes)
- Relational SQL databases
- DevOps and continuous integration
- AWS cloud services and technologies (i.e., Lambda, S3, DMS, Step Functions, Event Bridge, Cloud Watch, RDS)
- Databricks/ETL
- IICS/DMS
- GitHub
- Event Bridge, Tidal
To qualify for the role you must have
- Flexible and proactive/self-motivated working style with strong personal ownership of problem resolution
- Excellent communicator (written and verbal formal and informal)
- Ability to multi-task under pressure and work independently with minimal supervision.
- Partner with Business Analytics and Solution Architects to develop technical architectures for strategic enterprise projects and initiatives
- Coordinate with Data Scientists to understand data requirements, and design solutions that enable advanced analytics, machine learning, and predictive modelling
- Support Data Scientists in data sourcing and preparation to visualize data and synthesize insights of commercial value
- Collaborate with AI/ML engineers to create data products for analytics and data scientist team members to improve productivity
- Advise, consult, mentor and coach other data and analytic professionals on data standards and practices, promoting the values of learning and growth
- Foster a culture of sharing, re-use, design for scale stability, and operational efficiency of data and analytical solutions
Ideally, you’ll also have
- Experience in leading and influencing teams, with a focus on mentorship and professional development
- A passion for innovation and the strategic application of emerging technologies to solve real-world challenges
- The ability to foster an inclusive environment that values diverse perspectives and empowers team members
What we offer
We offer a comprehensive compensation and benefits package where you’ll be rewarded based on your performance and recognized for the value you bring to the business. The base salary range for this job in all geographic locations in the US is $105,800 to $174,800. The salary range for New York City Metro Area, Washington State and California (excluding Sacramento) is $127,100 to $198,600. Individual salaries within those ranges are determined through a wide variety of factors including but not limited to education, experience, knowledge, skills and geography. In addition, our Total Rewards package includes medical and dental coverage, pension and 401(k) plans, and a wide range of paid time off options. Join us in our team-led and leader-enabled hybrid model. Our expectation is for most people in external, client serving roles to work together in person 40-60% of the time over the course of an engagement, project or year. Under our flexible vacation policy, you’ll decide how much vacation time you need based on your own personal circumstances. You’ll also be granted time off for designated EY Paid Holidays, Winter/Summer breaks, Personal/Family Care, and other leaves of absence when needed to support your physical, financial, and emotional well-being.
EY accepts applications for this position on an on-going basis. If you can demonstrate that you meet the criteria above, please contact us as soon as possible.
EY exists to build a better working world, helping to create long-term value for clients, people and society and build trust in the capital markets.
Enabled by data and technology, diverse EY teams in over 150 countries
#J-18808-Ljbffr