DESIGNLIBRO INC
Senior Data Engineer
DESIGNLIBRO INC, Santa Clara, California, US, 95053
Benefits:
401(k) matching
Bonus based on performance
Health insurance
Opportunity for advancement
Paid time off
Vision insurance
About Petlibro
Petlibro is a design thinking company creating products that nurture the intertwined lives of pets & their people. We launched with a philosophy that good design, in form & in function, can make a difference. Petlibro innovates with the latest technology to solve everyday problems for modern pet parents and revolutionize how we care for our pets.
Since 2019, Petlibro has grown into one of the best-selling pet tech brands globally. From smart feeders with app insights to ultra-filtered automatic fountains to pet-health-focused smart apps, its products are engineered to magnify the bond between your pet and you. We will be launching cutting-edge AI pet tech for modern pet care. Come join the team and be a leader in the AI pet tech space.
Job Summary
We continue to innovate in the pet technology space for pet owners & pet brands. The pet software & platform services industry is about 20 to 30 years behind the software industry that serves people, which signals a clear opportunity to disrupt the market.
The majority of Petlibro products being sold are smart IoT devices. We plan to develop new AI-based products and data-driven health care services in the coming years. We are seeking experienced senior software engineers with a background in big data.
Responsibilities
Build and maintain vector databases to store and retrieve embeddings from unstructured data (e.g., videos, images, text), enabling advanced similarity searches and AI-driven insights.
Build and maintain a well-structured OLAP database optimized for a unified data infrastructure across IoT platforms, ensuring data integrity and efficient retrieval.
Design and implement scalable data pipelines to collect, process, and store large volumes of video data, sensor readings, device status, user actions and other relevant data sources.
Develop robust ETL (Extract, Transform, Load) processes to ensure smooth data flow into model training environments and user applications.
Collaborate with computer vision scientists and animal behavior experts to translate data requirements into actionable data engineering solutions.
Preprocess and clean data, resolve inconsistencies, and handle missing or noisy data.
Implement data versioning and metadata management practices to track changes and ensure reproducibility.
Evangelize software engineering best practices and lead by example.
Collaborate with other engineers, product owners, and designers to identify and solve challenging problems.
Qualifications
Bachelor's degree in Computer Science, Data Science, Engineering, or a related field.
5+ years of proven experience in data engineering, with a strong focus on data warehousing and pipeline development.
Expertise in Python/Scala/Java and data manipulation libraries (Pandas, NumPy, Spark).
Expertise in data streaming technologies (Kafka, Spark Streaming).
Proficiency with SQL and with database and storage systems (MySQL, MongoDB, Elasticsearch, StarRocks, S3, or similar).
Knowledge of deploying systems into a production cloud-native environment (AWS or similar).
Experience with video data handling and image processing is highly preferred.
Excellence in communication with peers and non-technical people alike.
Excitement about designing, building, and maintaining distributed systems.
Knowledge of containers, serverless, microservices, functions, event-driven/reactive programming, or other cloud-native architectures.
Benefits
Silicon Valley standard: medical, dental, vision, PTO, 401(k), and an amazing office location in Santa Clara Square.