TikTok
Site Reliability Engineer, Product - USDS
TikTok, Seattle, WA
About TikTok U.S. Data Security
TikTok is the leading destination for short-form mobile video. Our mission is to inspire creativity and bring joy. U.S. Data Security ("USDS") is a subsidiary of TikTok in the U.S. This new, security-first division was created to bring heightened focus and governance to our data protection policies and content assurance protocols to keep U.S. users safe. Our focus is on providing oversight and protection of the TikTok platform and U.S. user data, so millions of Americans can continue turning to TikTok to learn something new, earn a living, express themselves creatively, or be entertained. The teams within USDS that deliver on this commitment daily span Trust & Safety, Security & Privacy, Engineering, User & Product Ops, Corporate Functions, and more.
Why Join Us
Creation is the core of TikTok's purpose. Our platform is built to help imaginations thrive. This is doubly true of the teams that make TikTok possible.
Together, we inspire creativity and bring joy - a mission we all believe in and aim towards achieving every day.
To us, every challenge, no matter how difficult, is an opportunity: to learn, to innovate, and to grow as one team. Status quo? Never. Courage? Always.
At TikTok, we create together and grow together. That's how we drive impact - for ourselves, our company, and the communities we serve.
Join us.
To enhance collaboration and cross-functional partnerships, among other things, our organization currently follows a hybrid work schedule that requires employees to work in the office 3 days a week, or as directed by their manager/department. We regularly review our hybrid work model, and the specific requirements may change at any time.
Responsibilities
We are seeking a highly motivated and experienced Site Reliability Engineer to join our growing team. You will be responsible for ensuring the reliability, performance, and scalability of our production systems. You will play a critical role in ensuring our systems are designed and operated with resiliency and high availability in mind.
In this role, you will:
- Collaborate with cross-functional teams to design, deploy, and operate large-scale, high-availability systems
- Develop and maintain automation tools and processes to improve the reliability and efficiency of our systems
- Act as a technical lead for SRE-related initiatives, providing guidance and mentorship to junior team members
- Work closely with software engineers to diagnose and resolve production issues
- Continuously monitor and evaluate the health of our systems, proactively identifying and addressing potential issues before they become problems
- Participate in an on-call rotation to provide 24/7 support for production systems
- Drive innovation and improvement in our infrastructure and processes through experimentation and research
- Participate in the design and implementation of disaster recovery plans
Qualifications
1. Bachelor's degree or higher in Computer Science or a related technical discipline
2. 5+ years of experience in Site Reliability Engineering, Production Engineering, or a similar role, working with large-scale distributed systems
3. Strong understanding of containers and container orchestration tools such as Docker and Kubernetes
4. In-depth knowledge of Unix/Linux systems administration, network fundamentals, and storage systems
5. Proficiency in one or more programming languages, such as C, C++, Java, Python, Go, Ruby, Rust, or JavaScript
6. Strong analytical and problem-solving skills
7. Excellent communication and collaboration skills, with the ability to work effectively with cross-functional teams
Candidates for this position must be legally authorized to work in the United States. This position is not eligible for visa sponsorship or support.
TikTok is committed to creating an inclusive space where employees are valued for their skills, experiences, and unique perspectives. Our platform connects people from across the globe and so does our workplace. At TikTok, our mission is to inspire creativity and bring joy. To achieve that goal, we are committed to celebrating our diverse voices and to creating an environment that reflects the many communities we reach. We are passionate about this and hope you are too.
TikTok is committed to providing reasonable accommodations in our recruitment processes for candidates with disabilities, pregnancy, sincerely held religious beliefs or other reasons protected by applicable laws. If you need assistance or a reasonable accommodation, please reach out to us at [redacted]
This role requires the ability to work with and support systems designed to protect sensitive data and information. As such, this role will be subject to strict national security-related screening.