Site Reliability Engineer

  • Partial Remote
  • Full-time
  • September 30, 2025
Conditions
¥6M ~ ¥12M /yr
Apply from Japan Only
(You must live in Japan to apply)
Requirements
Language Requirements
Japanese: Not Required 👍
English: Business Level
Minimum Experience
Mid-level or above

About Us

At Sales Marker, our mission is to create a world where all people and companies can challenge themselves beyond existing boundaries. We are one of the fastest-growing startups in Japan—scaling at more than twice the pace of typical unicorns. Our flagship product, Sales Marker, empowers sales teams to achieve 3 times greater efficiency. In just two years since launch, we’ve achieved 2,000% growth, and today our year-over-year business growth stands at 270%—and this is only the beginning.

Backed by strong financial growth, we’re expanding into a bold portfolio of new products that help businesses grow from every angle, including sales, marketing, recruiting, and AI agents.

 

The Team

Our co-founders come from leading global companies and were recognized on the Forbes 30 Under 30 Asia List (2023).

Our Product & Engineering team is proudly global, with members from 24+ countries and backgrounds at top tech companies such as Google, Microsoft, Indeed, Mercari, LINE, Yahoo, and SmartNews.

At Sales Marker, you’ll join a global, ambitious, and fast-moving team where your ideas truly shape the future. We’re building an engineering culture around:

  • Customer Obsession – solving real problems and exceeding expectations
  • Ownership – taking responsibility end to end, across roles and functions
  • 10x – aiming for bold impact, moving fast, and disrupting old standards

 

The Role

The Common Foundation team helps engineering teams move faster by providing scalable, reliable, and reusable systems that serve as the platform for product development. The Platform Foundation side focuses on reliability, performance, security, and developer productivity across our cloud infrastructure and Kubernetes platform. We build paved roads, automate operations, and ensure that application teams can ship safely at speed.

We're looking for a Site Reliability Engineer who can own the health and evolution of our platform. You’ll design and operate our AWS and Kubernetes environments, lead reliability initiatives, and partner with product engineers to embed best practices in availability, observability, and performance. You’ll turn complex infrastructure into simple, well-documented, self-service building blocks.

 

Responsibilities

  • Operate and improve our Kubernetes platform (EKS), including cluster lifecycle, upgrades, scaling, networking, and multi-tenant isolation.
  • Design, provision, and manage AWS infrastructure (VPC, RDS/Aurora, OpenSearch, S3, SQS, Lambda, API Gateway, Batch, Glue) with a strong focus on security, reliability, and developer experience.
  • Build infrastructure as code using Terraform and AWS CDK. Establish standards for modules, environments, and change management via GitOps.
  • Drive observability end to end: metrics, logs, traces, SLOs, error budgets, and actionable dashboards and alerts in Datadog.
  • Partner with backend engineers to improve service reliability, performance, and cost efficiency. Champion best practices in testing, rollout strategies, and production readiness.
  • Automate operations and repetitive work with tooling and pipelines. Reduce MTTR with improved runbooks, diagnostics, and incident tooling.
  • Lead incident response and post-incident reviews. Raise the operational bar through blameless retros, remediation plans, and reliability roadmaps.
  • Strengthen platform security through identity and access control, secrets management, network policies, patching, and vulnerability management.
  • Support data workloads and pipelines with robust, scalable infrastructure and monitoring.
  • Contribute to platform documentation, paved paths, and self-service developer workflows to accelerate delivery.

 

What We're Looking For

Required

  • 3+ years in SRE, Platform, or Infrastructure Engineering with production ownership of cloud-native systems.
  • Strong experience running Kubernetes in production, including upgrades, scaling, and workload reliability.
  • Deep hands-on expertise with AWS services (networking, compute, storage, databases, messaging) and secure-by-default architectures.
  • Proficiency with IaC (Terraform and/or AWS CDK), modularization, and environment management.
  • Solid observability fundamentals: metrics, logging, tracing, SLOs/error budgets, actionable alerting.
  • Proven track record improving reliability, performance, and developer experience in partnership with application teams.
  • Experience running incident response and driving post-incident improvements.

Nice to Have

  • Experience with identity and access management patterns, Cognito, JWT, and service-to-service auth.
  • Background in multi-tenant architectures, capacity planning, and cost optimization.
  • History of handling major incidents at scale and building tooling to reduce MTTR/MTTD.
  • Contributions to internal developer platforms, golden paths, or shared libraries.
  • Fluency in English or Japanese.

 

Our Tech Stack

Front-end

  • TypeScript, React, Next.js
  • Testing: Storybook, Jest, Playwright
  • Hosting: Amplify
  • Feature flags: Unleash

Server Side/Back-End

  • Infrastructure: AWS, EKS, Elastic Beanstalk
  • DB: Aurora, Elasticsearch, Redis
  • Languages: Go, TypeScript
  • Analysis environment: Athena, Superset
  • Monitoring: Datadog
  • Others: AWS Lambda, AWS Batch, AWS API Gateway, AWS Glue, AWS S3

 

Why Us?

  • One of the fastest-growing SaaS startups in Japan, with strong financial growth.
  • Innovative new product development and opportunity to build things from scratch.
  • Plenty of leadership and career development opportunities.
  • Hybrid work environment and fully flexible work schedules.
  • Global team and English-speaking environment.
  • Great benefits and perks, such as Resort Worx, book purchases, free weekly lunches, offsites, and more.

 

Working Style

Hybrid Work

  • We follow a hybrid work style, combining both office and remote work. Recommended in-office days vary by role. Even when working remotely, we maintain smooth collaboration and communication through tools like Zoom, Google Meet, and Gather.

Flex Work

  • You can customize your working hours to suit your day. For business and client-facing teams, schedules are often arranged around client meetings.

Global Environment

  • With team members from over 20 countries, we bring together diverse perspectives and ideas, driving projects forward across languages and cultures in an environment where English and Japanese blend naturally into daily communication.

Sales Marker is a SaaS startup that provides B2B solutions focused on "intent-based sales".

It uses a methodology designed to identify and engage customers who show signs of needing a product or service.

Sales Marker leverages a database of 5 million corporate entities combined with dynamic intent data to enable precise targeting and effective engagement with potential customers.
