Job Description
We are seeking a skilled professional to optimize our infrastructure and deployment processes, ensuring high performance and reliability of our systems. The ideal candidate will be responsible for implementing best practices in system monitoring, fault tolerance, and automation to enhance operational efficiency.
Key Responsibilities
- Optimize services and deployments to improve the performance of cluster machines
- Develop and enhance deployment and maintenance procedures to improve service quality
- Design and implement system monitoring solutions, fault tolerance strategies, and automation frameworks
- Deploy and maintain public blockchain nodes and blockchain explorers
- Configure internal network DNS resolution and implement high-availability systems
Additional Responsibilities
The role may also involve collaborating with development teams to troubleshoot complex system issues, documenting operational procedures, and keeping up to date with emerging technologies in the infrastructure and blockchain space. The candidate will play a crucial role in maintaining the stability of our systems while continuously seeking opportunities for improvement.
Preferred Qualifications
- Experience with containerization technologies (Docker, Kubernetes)
- Knowledge of blockchain technology and public chain operations
- Familiarity with infrastructure as code tools (Terraform, Ansible)
- Understanding of network protocols and security best practices
- Strong problem-solving skills and attention to detail