Perpay Inc
Senior Data Engineer
Perpay Inc, Philadelphia, Pennsylvania, United States, 19117
About the Role:
As a Senior Data Engineer at Perpay, you will play a crucial role in building and optimizing data pipelines and architectures that drive our data products and insights. You will work closely with data scientists, analysts, and other stakeholders to accelerate Perpay’s mission of creating inclusive financial products that improve the lives of our members. With the launch of our new credit card, there is an increasing demand for advanced data engineering solutions to meet our modeling, reporting, and analytical needs.
In this role, you will be at the heart of almost every major release at Perpay, ensuring that our data infrastructure supports a wide array of business functions across risk, commerce, marketing, operations, and more. Your work will directly impact our customers by enabling automated and efficient data-driven services.
We are looking for a Senior Data Engineer who is a quantitative, critical thinker with a deep passion for data and a proven track record of delivering data solutions at scale. The ideal candidate has extensive experience in designing and maintaining robust data architectures, optimizing ETL processes, and implementing data governance practices. You should be comfortable working in a fast-paced, entrepreneurial environment and managing multiple projects with diverse stakeholders.
Why You’ll Love It Here:
Impactful Work:
Your contributions will directly affect the lives of our customers by creating data-driven solutions that support financial inclusivity.
Cutting-Edge Technology:
Work with the latest tools and technologies in data engineering.
Career Growth:
Opportunities for professional development and advancement within a rapidly growing company.
Collaborative Culture:
Join a supportive team that values diverse perspectives and innovative ideas.
Our greatest strength is our people and we’d love for you to be one of them!
Responsibilities:
Design, implement, and maintain complex ETL pipelines using tools such as Apache Spark, Apache Airflow, and AWS Glue for both batch and near real-time data processing
Collaborate with data producers to establish robust data contracts and develop advanced data models leveraging Redshift Spectrum and partitioned data lakes
Implement advanced data governance strategies, including metadata management, data lineage tracking with tools like OpenLineage, and comprehensive data cataloging using DataHub
Partner with cross-functional teams to deliver scalable, high-performance data solutions, optimizing performance and ensuring data quality
Analyze and resolve complex data engineering challenges, employing techniques such as indexing, partitioning, and query optimization to enhance performance
Serve as a subject matter expert for distributed data processing technologies, providing guidance on best practices and architectural design
Drive the development of a cutting-edge data architecture that ensures high availability, fault tolerance, and robust data quality
Mentor junior data engineers, providing guidance and support to foster their growth and development
Continuously learn and incorporate the latest industry trends and technological advancements to keep Perpay’s data infrastructure at the forefront of innovation
What You’ll Bring:
Bachelor’s degree or higher in a quantitative/technical field (Computer Science, Statistics, Engineering, Mathematics, Physics, Chemistry)
5+ years of experience in data engineering, with a deep understanding of data pipeline design and architecture
Proficiency in SQL and Python, and familiarity with cloud-native services on AWS, GCP, or Azure, including Redshift, BigQuery, or Synapse Analytics
Extensive experience with data warehousing solutions, ETL/ELT processes, and data orchestration tools such as Apache Airflow or Luigi
Strong understanding of data modeling principles, dimensional modeling, and schema design in both OLAP and OLTP contexts
Expert knowledge of distributed data processing frameworks like Apache Spark, Hadoop, or Flink
Experience with infrastructure as code tools such as Terraform and CI/CD pipelines using Travis or Jenkins
Strong communication skills to articulate complex technical concepts to both technical and non-technical stakeholders
Ability to mentor and guide junior data engineers, fostering a collaborative and supportive team environment
Hey, we know not everybody checks all the boxes, so if you’re interested, please apply because you could be just what we’re looking for!