Site Reliability Engineer
- Tokyo
- Remote OK - Worldwide
- Full-time
- November 12, 2024
Job Description
TableCheck, Japan's leading restaurant reservation management platform, is seeking a Site Reliability Engineer. As a member of our SRE team you will own the technology stack and help support our demanding business and developer needs.
We run a robust and fault-tolerant infrastructure built on Amazon Web Services (AWS) with Terraform, Kubernetes, Helm, and an array of tools for CI/CD, logging, monitoring, and so on. We emphasize DevOps best practices such as agile, scrum, automation, and customer-centric improvements.
TableCheck has embraced remote work. As such, communication and documentation are in our blood. We look for and write about signals in the noise which enables us to constantly learn from mistakes and adapt, and we expect members of our teams to constantly follow up with questions and updates to keep everyone in the loop.
Responsibilities include
- Following SRE principles to maintain a 24/7 production environment running on Kubernetes
- Implementation of DevOps methodologies to improve IT team quality of life
- Proactive system monitoring and configuration
- Incident response
Mandatory Skills
- Must have at least 2 years experience with Amazon Web Services (AWS), with particular focus on EKS, EC2, RDS, Fargate, CloudFront, Lambda, and S3
- Must have extensive experience using AWS EKS
- Must have experience in direct software engineering following DevOps / SRE practices with at least 1 year as a technical lead
- Current ability in at least one of the following languages: Python, Ruby, Elixir, Go, Javascript, Rust
- Must understand container and hypervisor fundamentals
- Configuration management (YAML / Bash), experience with Helm and Terraform preferred
- Experience running production systems at large scale, and an understanding of the kinds of problems that can occur along with likely solutions
Recommended Skills
- Previous startup experience is highly desired
- Terraform, Pulumi
- ArgoCD
- Prometheus
- Grafana
- PostgreSQL
- MongoDB
- Kafka
- Security, PCI-DSS, GDPR, forensics, etc
Language Skills
- A native level of English is required. (No Japanese skill is required for this role.)
Evaluation Criteria
We will evaluate candidates based on the following stages:
- Initial interview - a one-on-one 30 minute chat over Google Meet to see if we're the right fit
- Technical interview - (virtually) meet the SRE team at TableCheck to evaluate your skills (no whiteboard or materials required)
- Take-home project - we will provide you with a 30-60 minute project, which will evaluate your dev and ops skills
About TableCheck
テーブルチェックは、「Dining Connected – 世界中のレストランとカスタマーの最良の架け橋になる」をミッションに事業を展開する日本発レストランテックカンパニーです。世界中のレストランとカスタマーを繋ぐプラットフォームを創造し、テクノロジーを活用した次世代の「おもてなし」を実現します。現在、展開している主なサービスは、飲食店向け予約・顧客管理システム「TableCheck」と、ユーザー向け飲食店検索・予約ポータルサイト「TableCheck」。24 時間 365日リアルタイムの空席情報を把握することで、飲食店にもユーザーにもより良いレストラン体験の実現をサポートしています。 社内公用語は英語、世界各国から優秀なメンバーが集まり(2020年8月現在、19 か国)、業界のイノベーターとしてマーケットをリードしています。 世界中に展開する大手グローバルホテルチェーンや星付きレストランを筆頭に、厳しい水準と高い信頼性を求める一流のレストラン・飲食企業を取引先として抱え、日本国内にとどまらない事業展開を実現しています。
We're remote-first, having an asynchronous style working, with employees spread throughout Asia and Europe working on the same team. As such, communication and documentation are in our blood. We look for and write about signals in the noise which enables us to constantly learn and adapt, and we expect members of our teams to constantly follow up with questions and updates to keep everyone in the loop.
Our engineering team communicates in English, and so we generally don't require Japanese skills. We also welcome applicants currently outside Japan. If you want to relocate here, we can sponsor your visa. We're also open to remote candidates who do not plan to relocate.
Get Job Alerts
Sign up for our newsletter to get hand-picked tech jobs in Japan – straight to your inbox.