Senior Site Reliability Engineer - Team Lead
- Tokyo
- No Remote
- Full-time
- March 19, 2021
Job Description
We are developing the world’s first enterprise-level Platform-as-a-Service (PaaS) for robots, creating a rare opportunity for an experienced, product-focused engineering professional. The PaaS aims to aid and offer innovative features to handle every part of the product lifecycle required to support and deliver consumer-facing connected machines and services.
Site Reliability Engineering combines skills of software and systems engineering. Your key responsibility is to focus on optimizing existing systems, building infrastructure, and eliminating work through automation to make them more reliable and ensure the highest possible up-time for a cloud-based robotics system.
Your responsibilities will include the following but not limited to:
- Leading the SRE team, mentoring junior engineers and supporting delivery excellence
- Supporting services before they go live through activities such as system design consulting, capacity planning, and launch reviews
- Maintaining services once they are live by measuring and monitoring availability, latency, and overall system health
- Engaging in and improve the whole lifecycle of services—from inception and design, through deployment, operation, and refinement
- Scaling systems sustainably through mechanisms like automation, and evolving systems by pushing for changes that improve reliability and velocity
- Practicing sustainable incident response and postmortems
- Building and evolving the operations handbook
Requirements
- Bachelor’s degree in Computer Science or a similar technical field of study, or equivalent practical experience with an outstanding track record
- At least 5 years of experience in product development and/or supporting operations
- Mastery of one or more of the following programming languages including but not limited to Python, Golang, Ruby, Bash
- Expertise with Configuration Management, Docker, IaaS, PaaS, Continuous Delivery, Continuous Integration, DevOps, ChatOps
- Solid understanding of network fundamentals and practical experience troubleshooting networked services
- Demonstrated proficiency with: Linux systems, public cloud platforms, and associated tools/technologies
- Fluency in English
Preferred Qualifications
- Extremely organized, detail oriented and thorough in every undertaking
- Ability to balance multiple tasks and projects effectively and quickly adapt to new variables
- Experience in designing, analyzing and troubleshooting distributed systems
- Experience with team management
- Systematic problem-solving approach, coupled with strong communication skills and a sense of ownership and drive
- Ability to debug and optimize code and automate routine tasks
Benefits
- Competitive salary
- Stock options
- International working environment
- Bleeding edge technology
- Working with exceptionally talented engineers
- Relocation support
About Rapyuta Robotics
Rapyuta Roboticsは、世界的に見てもまだ黎明期であるロボティクスプラットフォーム及びロボットソリューションを創造・提供する企業でグローバルスタートアップです。
2014年にロボットのためのインターネットを先駆けたEU FP7プロジェクト「RoboEarth」を手掛けたチューリッヒ工科大学のメンバーが中心となってスピンアウトしました。現在は、日本およびインドにオフィスを構えています。
マシンとマシンを繋げ、人々の生活を豊かにする。我々の信念の一つである、「Empathy (共感)」に基づき、「きつい」「きたない」「危険」の仕事は自動化されるべきだと強く信じています。人々はより知的で創造的な仕事にチャレンジする選択肢を与えられるべきだ思っています。私たちは、円滑で、接続・調整された機械で自動化を可能にしたいと考えています。
ロボットをより身近なものにし、誰にとっても有用なものにするために、複数のロボット及び複数種類のロボットを賢く協調制御することを得意とする、ロボティクスプラットフォーム「rapyuta.io」を開発・サービス提供しており、特に倉庫物流の自動化に注力しています。
「rapyuta.io」は、ロボット間の協調連携機能のみならず、ロボットソリューションの効果計測シミュレーションや、ソフトウェア・アップデートを含めたリモートメンテナンス機能も有しています。これにより、プロジェクトの計画や実行・管理が煩雑な複数種類のロボットソリューションの導入を効果的に実行し、現場で使える品質を提供します。
我々は、ロボティクスが人を排除するのではなく、身近なパートナーとして人のために働き、新たな働き方や新たな収益機会が創造されることを期待しています。
Get Job Alerts
Sign up for our newsletter to get hand-picked tech jobs in Japan – straight to your inbox.