DESIGNLIBRO INC

Senior Data Engineer

DESIGNLIBRO INC, Santa Clara, California, us, 95053

Benefits:

401(k) matching

Bonus based on performance

Health insurance

Opportunity for advancement

Paid time off

Vision insurance

About Petlibro

Petlibro is a design thinking company creating products that nurture the intertwined lives of pets & their people. We launched with a philosophy that good design, in form & in function, can make a difference. Petlibro innovates with the latest technology to solve everyday problems for the modern pet parents and revolutionize how we care for our pets.

Since 2019, Petlibro has grown into one of the best-selling pet tech brands globally. From smart feeders with app insights to ultra-filtered automatic fountains, to pet health focused smart apps, its products are engineered to magnify the bond between your pet and you. We will be launching cutting edge AI pet tech for modern pet care, come join the team and be the leader in the AI pet tech space.

Job Summary

We are continuing to innovate in the pet technology space for pet owners & pet brands. Pet software & platform service industry is about 20~30 years behind the human's software industry and this yells out "opportunity to disrupt the market".

The majority of Petlibro products being sold are smart IoT devices. We plan to develop new AI-based products and data driven health care services in the coming years. We are seeking experience senior software engineers with background in big data

Responsibilities

Build and maintain vector databases to store and retrieve embeddings from unstructured data (e.g., videos, images, text), enabling advanced similarity searches and AI-driven insights.

Build and maintain a well-structured OLAP database optimized for unifed data infrastructure for IOT platforms ensuring data integrity and efficient retrieval.

Design and implement scalable data pipelines to collect, process, and store large volumes of video data, sensor readings, device status, user actions and other relevant data sources.

Develop robust ETL (Extract, Transform, Load) processes to ensure smooth data flow into model training environment and user applications.

Collaborate with computer vision scientists and animal behavior experts to translate data requirements into actionable data engineering solutions.

Preprocess and clean data, resolve inconsistencies, and handle missing or noisy data

Implement data versioning and metadata management practices to track changes and ensure reproducibility.

Evangelize software engineering best practices and lead by example.

Collaborate with other engineers, product owners, and designers to identify and solve challenging problems.

Qualifications

Bachelor's degree in Computer Science, Data Science, Engineering, or a related field.

5+ years of proven experience in data engineering, with a strong focus on data warehousing and pipeline development.

Expertise in Python/Scala/Java and data manipulation libraries (Pandas, NumPy, Spark).

Expertise in data streaming technologies (Kafka, Spark Streaming).

Proficiency with SQL and database systems (mySQL, MongoDB, or ElasticSearch, Star Rocks, S3, or similar).

Knowledge about deploying systems into a production Cloud Native Environment (AWS or similar).

Experience with video data handling and image processing is highly preferred.

Excellence in communication with peers and non-technical people alike.

Excited about designing, building, and maintaining of distributed systems.

Knowledge with Containers and/or Serverless, Microservices, Functions, Event-driven/reactive programming or other Cloud Native architectures

Benefits

Silicon Valley standard: Medical, Dental, Vision, PTO, 401K, and an amazing office location to work at in Santa Clara Square.