PBS

Data Engineer

PBS, Arlington, Virginia, United States, 22201


Position Title: Data Engineer - A360

Department: Product Development

Corporate Area: Digital & Marketing

Status: Fixed Term, Full-time, Exempt

Manager Title: Director, Technology

Position Overview:

PBS is seeking a skilled Data Engineer to join our Data Team. The ideal candidate will design, build, and maintain scalable data pipelines and ensure the availability, reliability, and integrity of our data. The candidate will join our team of talented data engineers and data scientists, and will work alongside our product team and other stakeholders to improve the quality of PBS's data and support data-driven decision-making and analysis across the public media ecosystem.

Key responsibilities will include, but are not limited to:

- Work as part of the Data Engineering Team to design, code, and deploy cloud data solutions that extract, transform, and load data into our data architecture.
- Serve as primary point of contact for client-facing data requests from internal and external partners. Provide those partners with custom one-time or repeated exports from the data lake, assist them with navigating any technical challenges, and gather technical requirements when they seek to use or integrate with the data lake.
- Lead the building of dashboards and other tools that facilitate the monitoring of the volume, velocity, and veracity of data in the Enterprise Data Lakehouse.
- Build and maintain ingestion and transformation of digital analytics data into the data lake. Serve as the expert for this key data source and ensure it aligns with all Data Governance policies.
- Evaluate and test new data-processing technologies.
- Maintain and update existing data pipelines, data marts, and other key features of the data architecture.
- Participate in stand-ups and software development syncs to align and collaborate with our Data Engineering Team.

Requirements for success:

- 4+ years of experience building data products using cloud data tools.
- Proficiency in Python, with a deep understanding of data interface libraries.
- Proficiency in SQL (DML, DDL) and experience with an RDBMS, preferably PostgreSQL.
- A deep understanding of data object modeling and database design (normal forms, indexing, query optimization).
- Experience processing basic data file formats: CSV, JSON.
- Experience with development tools such as GitHub and Jira.

Preferred Skills:

- Familiarity with Big Data tools such as Spark (using PySpark).
- Familiarity with AWS data tools such as S3, Lambda, EMR, Glue, Athena, Managed Airflow, ECS, DMS, and DataSync.
- Familiarity with big data file formats: Parquet/Iceberg.
- Familiarity with Python data science libraries such as pandas and NumPy.
- Python visualization libraries such as Streamlit and Matplotlib.
- Snowflake / Metabase.
- dbt / dbt Cloud.
- Google Cloud tools/products: BigQuery, Cloud Functions, Cloud Storage, Google Analytics 4.

PBS is an Equal Opportunity Employer in accordance with the EEOC and the Commonwealth of Virginia.