Saxon Global
Lead Data Scientist
Saxon Global, Pittsburgh, Pennsylvania, us, 15289
This is a full time role with pnc bank. Hybrid - 2 days office 3 days remote. Candidate must be close to Pittsburgh, Cleveland, New Jersey, DC, Atlanta, Raileigh NC, Columbus Dallas.
Lead Data Scientist/ Engineer / Architect (8 + yrs)
Must have technical skills/experience (ask for alternative/tool/version):
Hadoop
Pyspark/ Python
Graph database (Neo4J)
Databricks (1 Azure, 2 AWS, 3 Google)
Flex Skills:• 2+ years of experience with a public cloud (AWS, Microsoft Azure)• 4+ years of experience with NoSQL implementation (Mongo, Cassandra)• 1+ year of experience with process orchestration including AirFlow, KubeFlow• Data lake and Delta lake experience• Familiarity with Metadata Management, Data Quality frameworks and Data as a Service concepts a big plus• Banking or financial services experience is a big plus
Key Experience:• 6+ years of financial solutions architecture, software development, data engineering, data science or business intelligence engineering experience with minimum 3 Years recent hands-on experience in PySpark• 3+ year of experience with Machine Learning code development• Deep knowledge of Hadoop ecosystem and Big Data technologies such as Spark, Hive, Hbase, Oozie, Kafka, YARN, SLURM• Spark query tuning and performance optimization• Experience and good understanding of Apache Spark Data sources API• Advanced experience in Python and common python libraries/ Scala/ Java• Strong analytical experience with database in writing complex queries, query optimization, debugging, user-defined functions, views, indexes, etc.• Strong working experience with source control systems such as Git, Bitbucket, and Jenkins build and continuous integration tools.• Experience working with Microservices, Rest API and Oauth• Experience working with one or more Agile development methods• proven consulting and delivery leadership in data transformation, data modeling, data analytics, data visualization and/or data science
Degrees or certifications for the candidate to be successful:• Bachelor's degree in Computer Science, Engineering, Statistics or other quantitative subject
Lead Data Scientist/ Engineer / Architect (8 + yrs)
Must have technical skills/experience (ask for alternative/tool/version):
Hadoop
Pyspark/ Python
Graph database (Neo4J)
Databricks (1 Azure, 2 AWS, 3 Google)
Flex Skills:• 2+ years of experience with a public cloud (AWS, Microsoft Azure)• 4+ years of experience with NoSQL implementation (Mongo, Cassandra)• 1+ year of experience with process orchestration including AirFlow, KubeFlow• Data lake and Delta lake experience• Familiarity with Metadata Management, Data Quality frameworks and Data as a Service concepts a big plus• Banking or financial services experience is a big plus
Key Experience:• 6+ years of financial solutions architecture, software development, data engineering, data science or business intelligence engineering experience with minimum 3 Years recent hands-on experience in PySpark• 3+ year of experience with Machine Learning code development• Deep knowledge of Hadoop ecosystem and Big Data technologies such as Spark, Hive, Hbase, Oozie, Kafka, YARN, SLURM• Spark query tuning and performance optimization• Experience and good understanding of Apache Spark Data sources API• Advanced experience in Python and common python libraries/ Scala/ Java• Strong analytical experience with database in writing complex queries, query optimization, debugging, user-defined functions, views, indexes, etc.• Strong working experience with source control systems such as Git, Bitbucket, and Jenkins build and continuous integration tools.• Experience working with Microservices, Rest API and Oauth• Experience working with one or more Agile development methods• proven consulting and delivery leadership in data transformation, data modeling, data analytics, data visualization and/or data science
Degrees or certifications for the candidate to be successful:• Bachelor's degree in Computer Science, Engineering, Statistics or other quantitative subject