Senior Site Reliability Engineer

  • Tokyo
  • Remote OK - Anywhere in Japan
  • Full-time
  • June 18, 2025
Conditions
location-icon
Apply from Japan Only
visa-icon
No relocation to Japan
(No visa sponsorship from overseas)
Requirements
language-icon
Language Requirements
Japanese: Not Required 👍
English: Business Level
career-icon
Minimum Experience
Senior or above

About KOMOJU

KOMOJU (by Degica) is the leading cross-border payment gateway for Japan. We power payments for companies like video game distribution platform Steam and the popular mobile app TikTok. Today we help thousands of merchants by providing them with the payment infrastructure they need through developer-friendly API’s to integrations on popular platforms like Shopify and Wix; we help our merchants grow in all markets they are expanding.

 

About the position

As a Site Reliability Engineer (SRE) at Degica, you will play a critical role at the intersection of software engineering and infrastructure operations. This position is ideal for engineers who are passionate about automation, systems design, and building scalable, reliable platforms.

In this role, you won’t be limited to just managing cloud infrastructure—you will take ownership of the platform's overall health, performance, and developer experience. Your work will span:

  • Cloud Infrastructure Management: Architect, implement, and maintain robust and secure infrastructure in a cloud-native environment using Terraform. You'll ensure high availability, scalability, and resilience of our systems.
  • CI/CD and Deployment Automation: Design and improve continuous integration and continuous delivery pipelines that empower development teams to ship software reliably and rapidly.
  • Observability & Monitoring: Implement end-to-end observability tooling—including metrics, logging, distributed tracing, and alerting—to provide real-time insight into platform performance and help reduce mean time to detection and resolution.
  • Platform Quality & Reliability: Champion best practices for reliability, scalability, and performance across engineering teams.

You’ll collaborate closely with developers, security engineers, and product stakeholders to ensure our systems meet both technical and business goals.

 

Responsibilities

  • Actively participate in improving and maintaining our AWS infrastructure
  • Continuously improving the system performance, reliability and secutiry
  • Design, implement, and maintain our observability stack (metrics, logging, tracing, dashboards).
  • Correspond with engineering teams to instrument applications for better observability.
  • Improving developer productivity with tooling
  • Securing the system and adhere to compliance
  • Be part of the teams on-call rotation

 

Requirements

  • 2+ years in SRE roles working with the AWS platform.
  • 2+ years experience in a software development role
  • Hands-on experience with observability tools, preferably Datadog.
  • Proficiency in Terraform.
  • Proficiency in at least one scripting or programming language (Ruby/Rails, Python, Go, Shell Script, etc.).
  • Experience on working with CI/CD tools such as GitHub Actions, Jenkins, Circle CI, etc.

 

Nice to have

  • Strong communication skills to work closely with outside companies and various departments inside the organization
  • Knowledge of TCP/IP and other networking protocols
  • Experience with AWS Direct connect

 

Benefits

  • At Degica, we embrace remote work while also offering office space for those who prefer in-person collaboration
  • 10 days regular vacation, additional 5 days summer and 5 days winter vacation
  • Paid birthday holiday
  • Budget for self-learning allowance, to ensure our employees’ skills remain current
  • Language training for Japanese
  • On-call duties come with an allowance.

KOMOJU (by Degica) is the leading cross-border payment gateway for Japan.

KOMOJU powers payments for leading companies, including the video game distribution platform Steam and the popular mobile app TikTok. Today, the company supports thousands of merchants by providing robust payment infrastructure—ranging from developer-friendly APIs to integrations with widely-used platforms like Shopify and Wix—enabling businesses to scale effectively in new markets.

In terms of engineering, KOMOJU fosters a developer-centric and inclusive culture. The team is committed to continuous improvement through regular self-evaluation and values individuality, recognizing the unique strengths that each engineer brings to the organization.

As KOMOJU grows, its development processes evolve accordingly. The engineering culture is largely self-organizing, with each engineer enjoying significant ownership over their projects. This environment empowers engineers to showcase their strengths while continuing to develop professionally.

KOMOJU's policies during COVID-19 are as follows:

  • During COVID-19 all of their members are working from home 100% of the time (previously they allowed up to one day a week).
  • Their current plan is to continue their full remote policy even after COVID-19 is no longer a concern.
  • They’re able to accept applications from overseas, but you may need to work abroad until it’s possible to move to Japan.
View KOMOJU's company page

↑ Back to top ↑

Senior Site Reliability Engineer at KOMOJU
APPLY NOW  ➜🇯🇵 Residents Only