About Me
Experience
Backend Engineer
Alibaba Cloud - - Now
职位: Backend Engineer | 时间段: Not specified | 工作内容: Built PAAS Platform: Implemented CI/CD functionalities for more than 500 diverse applications such as Java, Node, iOS, and Android applications by collaborating with the test, security, operation, middleware teams and utilizing Kubernetes, Prometheus, Jenkins, etc..Forged Product-Focused CD Platform: Innovated a product-centric delivery platform based on Kubevela, resolving extensibility issues with Cue. This platform concealed complex Kubernetes details, enabling rapid product iterations with a 40% acceleration in delivery speed.Engineered Product Delivery Tools: Spearheaded the development of customer-side product delivery tools, encompassing deployment topology visualization, backup, upgrade, rollback, and log viewing capabilities, which reduced delivery time by 80% for the delivery engineers.Migrated Compilation to Tekton: Led the migration of app compilation from Jenkins to Tekton, delivering a visual pipeline platform. Integrated test coverage and security scanning, boosting Tekton's usage by 50% with significant reduction in compilation time compared to Jenkins.Introduced Pod CPU Monitoring Solution: Deployed a pod container CPU monitoring operator, automating Java thread stack printing during cpu spikes. This solution complemented Prometheus monitoring, resulting in a significant reduction in troubleshooting time.Automated Vertical Pod Autoscaler Operator: Developed Rust-based VPA generating operator. Extended VPA's functionality, leading to its widespread adoption in company Kubernetes clusters. This resulted in a significant increase in Node memory utilization, rising from ~70% to ~90%, bolstering cluster stability and eliminating the need for manual node scaling and eviction.Log Compression & Backup Solution: Developed a log compression and backup program, categorizing and storing business logs in Alibaba Cloud OSS. This reduced hot storage days for Alibaba Cloud SLS logs, translating into a cost saving of approximately ~50% for the company (across ~500 applications with daily traffic of ~10TB). Additionally, implemented log thawing and retrieval features, enabling developers to easily access application history logs.Monitoring System Contributions: Participated in the development of a comprehensive monitoring system, including error log detection, traffic anomaly detection, multi-source alarm convergence, and fault self-diagnosis.Work Order System Development: Contributed to the development of a work order system, automating the process of applying for and creating KVM virtual machines and maintenance work for applications, Alibaba Cloud ECS, SLB and other resources.