Logo
OCLC

Senior Data Platform Architect

OCLC, Dublin, OH


You have a life. We like that about you.

At OCLC, we believe you'll do the best work of your life when you're living the best life possible.

We work hard to build the technology that connects thousands of today's libraries. But we also work hard to make a job at OCLC a meaningful part of a balanced life- not a substitute for one.

Technology with a Purpose. OCLC supports thousands of libraries in making information more accessible and more useful to people around the world. OCLC provides shared technology services, original research and community programs that help libraries meet the ever-evolving needs of their users, institutions, and communities. With office locations around the globe, OCLC employees are dedicated to offering premier services and software to help libraries.

The Job Details are as follows:
OCLC's Data Services provides the foundational platforms for OCLC's key data assets. Are you excited to solve problems with a global footprint and scale measured in petabytes, billions of records and thousands of updates per second? We are looking for a senior technology leader to join our team, working across the organization to understand business opportunities, our current solutions, and future technology opportunities for our core data platform.

The right candidate will have the following skills, experiences, and attitude:
You are driven to understand new technology, architecture and languages. You design solutions from first-hand knowledge built on proof of concepts, rapid prototyping and active collaboration with others. You understand the best technology solution solves a problem for a customer and understanding their needs is critical to success as a technologist. You have strong experience with the Hadoop ecosystem, including HBase, HDFS, Spark, Kafka, and others. You have provided architecture guidance for both on-premises and cloud solutions. Finally, you have experience developing within the Java/Spring ecosystem, especially for Search and ETL workflows.

The Sr. Data Platform Architect will be responsible for reviewing OCLC's big data platform and adjacent application stacks to build a strategic analysis and future roadmap. The candidate is expected to perform high level analysis and design of software and systems. This includes creating, analyzing, designing, modifying and testing of system components. They are expected to communicate both strategic level vision to upper management, and technical direction to development teams, building out a shared understanding and roadmap for the platform's future.

Responsibilities:
  • Evaluate the existing Hadoop and adjacent ecosystem, including its architecture, infrastructure, and performance, to identify challenges in migration and areas of improvement.
  • Develops comprehensive strategies for migration or risk mitigation, including a detailed roadmap, timeline, and resource requirements. Identifies technology alternatives where appropriate.
  • Provide leadership to organization in specific area of expertise.
  • Assumes the technical lead and is the architect for major OCLC systems or concepts. Participates in major architectural reviews and plans.
  • Ensures that information and trends within their area of expertise are effectively communicated to relevant business units within OCLC.
  • Prototyping systems in anticipation of new requirements.
  • Develops functional requirements from prototype systems.
  • Analyze and solve problems in existing systems.
  • Design, code and test multiple modules of a system in a timely manner.
  • Ensures that project teams plan and participate in load, capacity, and performance analysis and/or testing.
  • Represents OCLC's position by participating on or leading relevant standards committees such as ISO, NISO, ACM, and IEEE Leads the implementation of standards within OCLC systems.
  • Other tasks as assigned.


Qualifications:
  • Master's degree required plus 8 to 12 years experience at a high technical level of knowledge and experience. (Equivalent experience without a Master's degree is 10 to 14 years experience at a high technical level of knowledge and experience.)
  • Familiarity with alternative big data technologies, such as Apache Kafka, Apache Flink, Apache Cassandra, or cloud-based solutions like Amazon EMR or Google BigQuery.
  • 5+ years of leadership experience with Hadoop and adjacent technologies, such as Hive, Pig, and Spark.
  • Proficiency in programming languages commonly used in big data processing, such as Java, Scala, or Python.
  • Experience transitioning Hadoop workflows to alternative solutions strongly preferred.
  • Strong understanding of distributed computing principles, experience with large-scale data processing frameworks, and big data concepts in general.
  • Excellent communication and presentation skills.
  • Ability to work independently and as part of a team.
  • Several years' proven experience leading large-scale projects across multiple teams.
  • Experience with multiple complex, business critical systems is expected.
  • Experience with on-premise big data architectures preferred.
  • Experience with Graph data architectures and Graph databases preferred.


Working Conditions: Normal office environment.

ADA/EAA: The above statements cover what are generally believed to be the principal and essential functions of this job. Specific circumstances may allow or require some people assigned to the job to perform a somewhat different combination of duties.