SRE (Site Reliability Engineer)
キャディ株式会社
- Python
- Linux
- Docker
- TypeScript
- Kubernetes
- PostgreSQL
- SRE
- Datadog
- Node.js
- Google Cloud
- Rust
- GraphQL
- Istio
- Express
- NestJS
- Firestore
- Auth0
- Site Reliability Engineer
- REST
- WebAssembly
- axum
- Git/GitHub
- tonic
- Fastify
- Tokio
- Disel
- SeaORM
- Infrastructure-as-Code
-
¥7,000,000 - 10,000,000
-
Tokyo
-
101 to 1,000
-
Company Homepage
Company Info
CADDi is expanding two businesses, CADDi MANUFACTURING and CADDi DRAWER. CADDi MANUFACTURING is a consulting and production support partner for manufacturing, and supports the improvement of manufacturing operations. On the other hand, CADDi DRAWER supports the use of CAD data and realizes DX, and aims to improve the entire manufacturing process.Job Summary
Thinking about the assignment to the SRE team, ensuring the reliability of the function development and operation of the product, and providing the maximum value to the user. As specific activities, it includes the implementation and operation of SLO, the design and implementation of capacity planning, and the improvement of the reliability of product trust in the direction of the product.Duties
Assigned to the CADDi DRAWER Group Enabling SRE, it ensures the reliability of the function development and operation of the product, and provides the maximum value to the user. As specific business content, it includes Metrics & Monitoring (implementation, operation, and promotion of observability), Capacity Planning (design and implementation of capacity planning, practical application, and optimization of processing power), Change Management (implementation of reliability engineering, including widespread deployment, and emergency response), etc. It also includes activities that mainly improve business content in response to the company's strategy and business conditions, create a work organization that supports the growth of products, and promote the diffusion of culture and improve the reliability of product trust.Requirements
Basic knowledge of web application development and operation, basic knowledge of public cloud, basic knowledge of Linux, basic knowledge of technologies such as Docker, Git/GitHub, and the ability to communicate in Japanese (N1 or higher as a guide)
Welcomed Skills
Experience in designing, building, and operating architecture of cloud services, experience in practical work in SRE team, experience in backend development of web applications (preferably statically typed languages), basic knowledge of network, experience in practical application of technologies such as Kubernetes, reexamination of redundancy, practical experience in Infrastructure-as-Code, practical experience in using Google Cloud, design and practical experience of monitoring environment such as Datadog, experience in leading a team of 50 or more engineers and managing technology infrastructure and system stabilization, experience in designing distributed systems, practical experience in operation