Job Description
Key Responsibilities
- Handle daily maintenance of servers on mainstream cloud platforms such as AWS, Azure, and Google Cloud, including automated deployment, configuration, optimization, backup, and troubleshooting
- Establish and optimize operations and maintenance standards, workflows, and emergency plans, and participate in building the operations system
- Write and maintain automation scripts and tools to improve operational efficiency and the level of automation
- Track new technologies and services on cloud platforms, stay current with cutting-edge practices, and apply them in day-to-day work
- Provide professional technical support to quickly resolve issues
- Manage and maintain hardware, including installation, configuration, troubleshooting, and upgrades
- Optimize hardware resource utilization to ensure efficient operation of the cluster
- Collaborate with software development teams to provide technical support and optimization suggestions for high-concurrency computing
 
Additional Requirements
The ideal candidate should have strong problem-solving skills and be able to work in a fast-paced environment. Experience with containerization technologies (Docker, Kubernetes) and infrastructure-as-code tools (Terraform, Ansible) would be advantageous.
This position requires excellent communication skills, as you will collaborate with multiple teams across the organization. A proactive approach to identifying potential issues and implementing preventive measures is highly valued.


