I am |

ZengYan Liang Senior AI Algorithm Programmer from China
Length of service : 10 Years
Education Degree : Master
Location : BeiJing
E-mail : Lzy_Lyx@163.com
PROJECT SHOWCASE

I am currently employed at Sina Weibo, bringing 10 years of deep expertise in the field of internet algorithms. I currently serve as an Algorithm Team Lead, managing a team of five. I possess a mastery of deep learning algorithms and am proficient across various domains, including large-scale models, recommendation systems, natural language processing, computer vision, and video processing. I specialize in the practical implementation of algorithmic models within real-world business scenarios. I have spearheaded numerous core algorithm projects, leading my team to execute algorithm optimizations and model iterations that resulted in significant improvements across key performance metrics. I possess comprehensive team management experience, excelling in Agile development methodologies to cultivate highly collaborative and efficient algorithm teams that effectively balance technical depth with business value. Throughout my career to date, I have filed a total of 21 patent applications for algorithmic inventions.

The main achievements from the three most recent positions are:

  • Led a project on a theme-based audio recommendation algorithm, resulting in the filing of three invention patents. Conducted a one-week A/B test (scaled to 50% of traffic), achieving a 12% increase in click PV, a 6.76% increase in click UV, a 5.51% increase in playback duration, a 0.52% increase in playback UV, and a 31.17% increase in playback VV. Following long-term optimization, the CTR rose from a monthly average of 1.3% in March to 1.9% in June—a year-over-year increase of 46%—thereby significantly enhancing the conversion efficiency of audio recommendations.
  • Led a project focused on audio recommendation algorithms for categorized content, resulting in the filing of two invention patents. Conducted a one-week A/B test (scaled to 50% of the user base), achieving improvements of 9.91% in click PV, 6.85% in click UV, 5.61% in playback duration, 0.64% in playback UV, and 19.89% in playback VV. In the second half of the year, assumed responsibility for the audio project's KR metrics, shifting the primary quantitative recommendation metric from CTR to "Audio Playback Page Duration"; in Q4, the average duration on the audio playback page reached 97,000 minutes, achieving an 82% completion rate for the metric and successfully meeting the core business objectives.
  • Led collaboration between the algorithms team and the Product, Engineering, and Business departments; drove the breakdown and implementation of algorithmic requirements; achieved deep integration between technology and business; and earned high recognition from business stakeholders.
  • During my tenure at Yuanfudao, the English essay grading system, the multimodal "You Draw, AI Guesses" project, and the homework beautification project—all of which I spearheaded—were successfully deployed; additionally, 12 algorithm-related invention patents were secured during this period.
  • During my tenure at Xiaomi, I achieved a groundbreaking breakthrough in mobile video super-resolution algorithms, shattering the technical barrier that previously restricted on-device super-resolution processing solely to hardware chips. My proprietary super-resolution algorithm achieved state-of-the-art performance on relevant benchmark datasets, and during this period, I secured six invention patents for these algorithms.

Professional Skills

  • LLMs & AI Agents
    Frameworks: Proficient in building multi-agent collaborative systems using AutoGen, LangChain, LlamaIndex, and LiteLLM;
    Training & Inference: Expert in fine-tuning with Transformers, TRL, Llama-factory, PEFT, DeepSpeed; Mastery of vLLM for high-performance inference acceleration;
    Frontier Protocols: Deep understanding of ReAct Chain-of-Thought, with hands-on experience in MCP and RAG.
  • Core Algorithms & Frameworks
    Deep Learning: Expert in PyTorch (Author of Zhihu column "PyTorch Tutorials") and TensorFlow;
    Domain Expertise: Recommender Systems (Recall/Ranking), NLP (NER, Translation, Semantics), CV (Video Super-resolution, Multi-modal, Object Detection, Segmentation, Tracking, OCR);
    Libraries: Proficient in OpenCV, Scikit-learn, NLTK, FFmpeg, etc.
  • Algorithm Engineering & Deployment
    High-performance Inference: Proficient in model quantization and production deployment using Triton and TensorRT; Kernel programming with CUDA and OpenCL;
    Backend & Systems: Full-stack capability with FastAPI, Flask, NextJS, Spring Boot, and RPC protocols; Expert in Make/CMake build systems;
    Data Storage: Large-scale search and storage using Milvus/Faiss (Vector DB), ES, Spark, Hive, MySQL, and Redis;
    DevOps & Middleware: Containerization via Docker; High-concurrency middleware including Kafka and Nginx; Task scheduling with RabbitMQ and Celery;
    Edge Computing: Extensive experience in model migration using ONNX, LibTorch, SNPE, NCNN, and MACE.
  • Programming Languages
    Proficient: Python, C/C++, Shell; Familiar: Java, JavaScript, C#
  • Qualifications & Impact
    Professional Title: Intermediate Professional Title in AI (Chinese Academy of Sciences);
    Achievements: 27 granted invention patents; Author of "AI Programming: PyTorch" column on Zhihu;
    Portfolio: liangzengyan.cn

Edu&Job Experience

Education

2012 - 2015

Computer science and technology

Guizhou University Master

2007 - 2011

Electronic information engineering

Chengdu University of Information Technology Bachelor

Job

2024.11 - Now

Senior Algorithm Specialist / Algorithm Lead

Sina Co., Ltd.

2022.11 - 2024.7

R&D of Advanced AI Application Algorithms

Feixiang Planet Technology Co., Ltd.

2022.5 - 2022.11

Expert Engineer

Shopee information Technology Co., Ltd

2020.4 - 2022.5

Intermediate Algorithm Engineer

XiaoMi Pinecone Electronics Co., Ltd

2019.2 - 2020.3

Senior Machine Perception Algorithm Development

MouShi Technology Co., Ltd

2018.1 - 2019.1

Algorithm Development

XinCheng Technology Co., Ltd

2016.7 - 2017.9

Algorithm Development

HuaShang High Technology Co., Ltd

Project Experience

Topic-based Audio Recommendation (LLM + RecSys)

Introduction: Led the end-to-end algorithm design and deployment of the Weibo audio recommendation system, optimizing precision recall, personalized ranking, and high-concurrency performance.
Responsibilities: Data engineering (Spark/Hive, LLM-based topic/keyword generation, Milvus vector storage), recall strategy (Jina V4 embedding, cosine similarity, trending keyword cold start), model optimization (Qwen3-14B with MoE-LoRA architecture), ranking optimization (ONN-Sparse with Expert Gating), and engineering (Triton, Redis Pipeline).
Achievements: Filed 3 algorithm patents; AB testing showed +12% Click PV, +31.17% Play VV, and a 46% increase in CTR.

Category-based Audio Recommendation (Strategy Optimization)

Introduction: Spearheaded audio classification and ranking strategies, focusing on dynamic slot adjustment to boost user retention and consumption.
Responsibilities: Audio classification (LLM tagging on ASR content, Crontab automation), ranking strategy (multi-factor weighting, segmented exposure slots, weekly leaderboard mechanism), preference matching (Jina V4 similarity matching), experience optimization (author de-duplication, block-based random perturbation), and dynamic optimization via LLM agents.
Achievements: Filed 2 patents; achieved +9.91% Click PV and 82% KPI completion for Q4 playback duration.

Precise Content Delivery (Multi-Agent System)

Introduction: Developed an innovative decision system based on the AutoGen framework and ReAct paradigm to match user intent with ads/content, driving GMV and retention.
Responsibilities: Multi-Agent architecture (thinking-action-observation loops), core skill implementation (intent recognition, content retrieval, ad theme matching), and recall strategy (Milvus-based two-step retrieval). Optimized deployment via MCP (SSE hosting) for automated skill triggering based on User ID.
Achievements: Core methodology adopted by the Search Department; filed 2 patents; significantly improved ad conversion and user retention.

Industrial Robot Private Knowledge Base (Multimodal RAG)

Introduction: Built a full-stack RAG-based retrieval platform for industrial robot documentation, supporting multimodal (text + image) search results.
Responsibilities: Platform development (React + FastAPI), multi-scale retrieval (BGE-M3 embedding, LlamaIndex for PDF/Docx parsing), and re-ranking (BGE-reranker-base). Innovated a multimodal link between Milvus slices and local image storage to render synchronized text-image results.
Achievements: Increased document retrieval efficiency by over 60%, significantly reducing internal information search costs.

English Essay Correction & Polishing (LLM Fine-tuning)

Introduction: A core product for the Yuanfudao student terminal, using LLMs to help students improve writing through automated grading and polishing.
Responsibilities: Data engineering (cleaning, GT production, JSON generation), Prompt engineering (7 iterations for scoring, error correction, and style polishing), and model fine-tuning (Llama2-7B-Chat, PEFT/LoRA, FP16). Optimized inference via vLLM with LoRA fusion, achieving QPS=2 on V100 clusters.
Achievements: Successfully deployed on student terminals with high user satisfaction; fully automated the feedback loop for student writing.

"You Draw, AI Guesses" (Multimodal Recognition)

Introduction: A flagship interactive product for Dual-Teacher Classrooms using multimodal AI to recognize student drawings in real-time.
Responsibilities: Data pipeline (Quick_Draw dataset filtering, image-text pair creation), model optimization (Clip-Vit-32, Wise-ft for linear layer fine-tuning), and performance tuning (vector database grouping, warmup mechanisms). Achieved QPS=19.97 for seamless real-time interaction.
Achievements: Filed 1 patent; became one of the most popular interactive features in the company's educational product line.

LLM Infrastructure Services (Cache & RAG)

Introduction: Developed centralized Cache and RAG services for the Feixiang AI Platform, supporting multiple products like problem-solving and teacher assistants.
Responsibilities: Cache service (Faiss-GPU, text2vec-chinese, multi-node synchronization, multi-round session management), and RAG service (LangChain framework, document chunking, Faiss retrieval acceleration). Engineered for high availability with dual-datacenter RPC deployment.
Achievements: Filed 2 patents; significantly reduced API costs for overseas LLMs while maintaining high-speed response (QPS=20 for RAG).

Educational Subject Entity Recognition (NER)

Introduction: Expanded the Subject Knowledge Graph by extracting fine-grained knowledge points from exam questions to support personalized reviews.
Responsibilities: Technical research and data engineering (question cleaning, GT creation), model fine-tuning (Aton-7B-Chat, LoRA, FP16), and inference acceleration (vLLM, QPS=2). Designed post-processing logic to ensure uniqueness and graph compatibility.
Achievements: Significantly improved knowledge graph coverage; automated the entire pipeline from question input to graph expansion.

Video Quality Analysis Service (CV/Vision Transformer)

Introduction: Developed an end-to-end video quality analysis SDK for Shopee Video to identify bad cases in enhanced videos and drive algorithm iteration.
Responsibilities: Framework development (C++ SDK), content understanding (Sandwich & Face detection integration), and quality assessment (Vision Transformer - MUSIQ model training). Reached a PLCC of 93.2% on quality metrics.
Achievements: Successfully deployed to analyze millions of daily videos, identifying 83% of non-clear video bad cases for the enhancement team.

Mobile Video Frame Interpolation (Algorithm & Porting)

Introduction: Developed a 2x frame interpolation plugin for Xiaomi video players to improve playback smoothness on mobile devices.
Responsibilities: Algorithm development (AdaCoF training, self-developed VFI-FMSMI network for multi-scale motion), and mobile porting (SNPE/MACE frameworks, custom OpenCL operators). Optimized for high-resolution stability and scene-cut detection.
Achievements: Self-developed algorithm reached State-of-the-art levels; successfully integrated into the Xiaomi ecosystem.

Xiaomi Magic Sky Replacement (Segmentation)

Introduction: Developed the "Magic Sky" feature for Xiaomi Photo Gallery, enabling precise real-time sky segmentation and replacement.
Responsibilities: Model training (U2Net, self-developed CLN-SOD network for context-aware segmentation), and mobile porting (SNPE optimization). Focused on reducing foreground/background misclassification to ensure natural edges.
Achievements: Successfully deployed in Xiaomi Gallery, becoming a flagship creative feature for mobile users.

Robot Vision (Pose Estimation & Gesture Control)

Introduction: Developed vision interactive modules for Xiaomi CyberDog and IoT Smart Speakers, including pose-based and gesture-based control.
Responsibilities: Human keypoint detection (HRNet training, TensorRT/ROS porting), and gesture recognition (Yolov4-based Hand Det+Rec framework). Developed the ROS-end SDK using CUDA and OpenCV for high-speed hardware coordination.
Achievements: Enabled real-time interactive control for smart hardware; SDK maintains stable operation across multiple device categories.

Automatic Meter Pointer Recognition (Industrial Inspection)

Introduction: A core vision project for inspection robots, automating meter detection, segmentation, and value reading in industrial gas stations.
Responsibilities: Model R&D (CorNet-Lite for detection, PSP for pointer segmentation, E2E for dial digit recognition), and system integration (ROS node deployment and on-site debugging).
Achievements: Achieved 98% accuracy in meter reading; successfully replaced manual inspection in high-risk industrial scenarios.

Smart Document Processing (Scanning, OCR & PDF Conv)

Introduction: Developed document digitization capabilities for the Alpha Note app, covering smart scanning, OCR, and PDF-to-Word conversion.
Responsibilities: Smart Scanning SDK (C++ cross-platform development, edge detection, multi-style filters), and PDF conversion pipeline (CTPN line detection, DPNet92 character recognition, layout restoration).
Achievements: Edge detection performance surpassed Microsoft Office Lens; reached 80% character accuracy for complex legal document digitization.