Kumo
Data Engineer
Kumo, Mountain View, California, US, 94039
The global data management software market is set to reach $137.6 billion by 2026, and we're on a mission to make a significant impact. We're seeking intellectually curious and highly motivated Data Engineers to become foundational members of our Machine Learning and Data Platform team.
Required Qualifications for the Ideal Candidate
- 4+ years of professional experience in SaaS/Enterprise companies
- Strong experience with data ingestion and connectors
- Experience in building end-to-end production-grade data solutions on AWS or GCP
- Experience in building scalable ETL pipelines
- Ability to plan effective data storage, security, sharing, and publishing within an organization
- Experience in developing batch ingestion and data transformation routines using ETL tools
- Familiarity with AWS services such as S3, Kinesis, EMR, Lambda, Athena, Glue, IAM, RDS
- Proficiency in several programming languages (Python, Scala, Java)
- Familiarity with orchestration tools such as Temporal, Airflow, Luigi, etc.
- Self-starter, motivated, with the ability to structure complex problems and develop solutions
- Excellent communication skills and ability to explain data and analytics strengths and weaknesses to both technical and senior business stakeholders
Preferred Qualifications - good to have
- Deep familiarity with Spark and/or Hive
- Understanding of different storage formats like Parquet, Avro, Arrow, and JSON, and when to use each
- Understanding of schema design trade-offs such as normalization vs. denormalization
- Proficiency in Kubernetes and Terraform
- Azure, ADF, and/or Databricks skills
- Experience with integrating, transforming, and consolidating data from various data systems into analytics solutions
- Good understanding of databases, SQL, ETL tools/techniques, data profiling, and modeling
- Strong communication and client engagement skills
We are an equal opportunity employer and value diversity at our company. We do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status.