SRE
<p><strong>About the Role</strong></p><p>As a Site Reliability Engineer (SRE) or Platform Engineer you'll be responsible for the reliability of our systems and developer experience.From day 1, you will build resilient, scalable, and highly reliable systems that solve operational challenges. This role is divided into two specializations:</p><ul><li>Core SRE: Implement SRE practices across the organization</li><li>Embedded SRE: Work deeply within specific development teams to improve team and product operations together</li></ul><p><strong>Key Responsibilities</strong></p><ul><li>Build and maintain tools and processes to automate system operations, monitoring, and management</li><li>Design and implement alerting and monitoring systems; participate in on-call rotation</li><li>Create scripts and build automation to reduce toil</li><li>Collaborate with development teams to identify and address performance bottlenecks, optimize system architecture, and respond to incidents</li><li>Design and implement solutions to improve system reliability and scalability</li></ul><p><strong>Requirements</strong></p><ul><li>3+ years of SRE experience</li><li>Web application or library development experience, OR experience developing automation tools and CI/CD tools using Golang, Python, or similar languages</li><li>Experience managing public cloud services (Google Cloud, AWS, or Azure) using IaC tools (Terraform, CloudFormation, Ansible, etc.)</li><li>Experience managing or operating container orchestration platforms (Kubernetes, AWS ECS, etc.)</li></ul><p><strong>Preferred Qualifications</strong></p><ul><li>Experience with IAM and organization management on public cloud platforms (Google Cloud, AWS, or Azure)</li><li>Experience building infrastructure and pipelines for generative AI or machine learning</li><li>Experience handling system incidents such as high server load or database throughput degradation</li><li>Practical experience with SRE concepts such as error budgets and postmortems</li><li>Knowledge of large-scale distributed systems challenges including scalability, fault tolerance, and consistency</li></ul><p><strong>What We Offer</strong></p><ul><li>Challenging and interesting work in a fast growing AI start-up</li><li>Collaborate with product, AI, and security teams across the organization</li><li>Shape SRE practices and platform engineering standards</li><li>Hybrid/remote-friendly work</li></ul><p><strong>Location </strong></p><p>Tokyo</p><p><strong>Salary</strong></p><p>9 - 15 million yen</p><p> </p><p>Reference Number: 06940-0013318774</p><p>-----</p><p><em>By clicking 'apply', you give your express consent that Robert Half may use your personal information to process your job application and to contact you from time to time for future employment opportunities. For further information on how Robert Half processes your personal information and how to access and correct your information, please read the Robert Half privacy notice <a href="https://www.roberthalf.com/jp/en/privacy">https://www.roberthalf.com/jp/en/privacy</a>. Please do not submit any sensitive personal data to us in your resume (such as such as race, beliefs, social status, medical history or criminal record) as we do not collect your sensitive personal data at this time.</em></p><hr /><p>お客様が「今すぐ応募」ボタンをクリックすることにより、ロバート・ハーフ(以下、当社)がお客様の応募内容を処理し、求人情報を今後随時ご連絡する目的で個人情報を使用することに明示的に同意ただいたこととなります。当社による個人情報の処理方法、またお客様自身の個人情報へのアクセスおよびその訂正に関する詳細については、プライバシー規約(<a href="https://www.roberthalf.com/jp/ja/privacy">https://www.roberthalf.com/jp/ja/privacy</a>)をお読みください。当社は、要配慮個人情報はお預かりしておりませんので人種、信条、社会的身分、病歴、犯罪の経歴など、取扱いに特に配慮を要する個人情報は、ご提出いただく職務経歴書・レジュメ等に含めないようお願いいたします。</p><img src="https://counter.adcourier.com/TmFuY3kuWXVhbi43NTM1NC4xMDg5OEByaGlqcC5hcGxpdHJhay5jb20.gif">
- Tokyo,
- remote
- Permanent
-
9.0M - 15.0M JPY / Yearly
- <p><strong>About the Role</strong></p><p>As a Site Reliability Engineer (SRE) or Platform Engineer you'll be responsible for the reliability of our systems and developer experience.From day 1, you will build resilient, scalable, and highly reliable systems that solve operational challenges. This role is divided into two specializations:</p><ul><li>Core SRE: Implement SRE practices across the organization</li><li>Embedded SRE: Work deeply within specific development teams to improve team and product operations together</li></ul><p><strong>Key Responsibilities</strong></p><ul><li>Build and maintain tools and processes to automate system operations, monitoring, and management</li><li>Design and implement alerting and monitoring systems; participate in on-call rotation</li><li>Create scripts and build automation to reduce toil</li><li>Collaborate with development teams to identify and address performance bottlenecks, optimize system architecture, and respond to incidents</li><li>Design and implement solutions to improve system reliability and scalability</li></ul><p><strong>Requirements</strong></p><ul><li>3+ years of SRE experience</li><li>Web application or library development experience, OR experience developing automation tools and CI/CD tools using Golang, Python, or similar languages</li><li>Experience managing public cloud services (Google Cloud, AWS, or Azure) using IaC tools (Terraform, CloudFormation, Ansible, etc.)</li><li>Experience managing or operating container orchestration platforms (Kubernetes, AWS ECS, etc.)</li></ul><p><strong>Preferred Qualifications</strong></p><ul><li>Experience with IAM and organization management on public cloud platforms (Google Cloud, AWS, or Azure)</li><li>Experience building infrastructure and pipelines for generative AI or machine learning</li><li>Experience handling system incidents such as high server load or database throughput degradation</li><li>Practical experience with SRE concepts such as error budgets and postmortems</li><li>Knowledge of large-scale distributed systems challenges including scalability, fault tolerance, and consistency</li></ul><p><strong>What We Offer</strong></p><ul><li>Challenging and interesting work in a fast growing AI start-up</li><li>Collaborate with product, AI, and security teams across the organization</li><li>Shape SRE practices and platform engineering standards</li><li>Hybrid/remote-friendly work</li></ul><p><strong>Location </strong></p><p>Tokyo</p><p><strong>Salary</strong></p><p>9 - 15 million yen</p><p> </p><p>Reference Number: 06940-0013318774</p><p>-----</p><p><em>By clicking 'apply', you give your express consent that Robert Half may use your personal information to process your job application and to contact you from time to time for future employment opportunities. For further information on how Robert Half processes your personal information and how to access and correct your information, please read the Robert Half privacy notice <a href="https://www.roberthalf.com/jp/en/privacy">https://www.roberthalf.com/jp/en/privacy</a>. Please do not submit any sensitive personal data to us in your resume (such as such as race, beliefs, social status, medical history or criminal record) as we do not collect your sensitive personal data at this time.</em></p><hr /><p>お客様が「今すぐ応募」ボタンをクリックすることにより、ロバート・ハーフ(以下、当社)がお客様の応募内容を処理し、求人情報を今後随時ご連絡する目的で個人情報を使用することに明示的に同意ただいたこととなります。当社による個人情報の処理方法、またお客様自身の個人情報へのアクセスおよびその訂正に関する詳細については、プライバシー規約(<a href="https://www.roberthalf.com/jp/ja/privacy">https://www.roberthalf.com/jp/ja/privacy</a>)をお読みください。当社は、要配慮個人情報はお預かりしておりませんので人種、信条、社会的身分、病歴、犯罪の経歴など、取扱いに特に配慮を要する個人情報は、ご提出いただく職務経歴書・レジュメ等に含めないようお願いいたします。</p><img src="https://counter.adcourier.com/TmFuY3kuWXVhbi43NTM1NC4xMDg5OEByaGlqcC5hcGxpdHJhay5jb20.gif">
- 2025-10-16T08:22:30Z