I am currently a senior AI algorithm programmer at Yuanfudao Company and have worked in this field for 8 years. My artificial intelligence project experience includes large language models, RAG, multimodal large models, computer vision, video algorithms, machine vision, natural language processing, visual perception, and AI engineering deployment. I first worked in this area as an undergraduate, on DQ QR code recognition, continued researching it during my graduate studies, and have a solid theoretical foundation.
The main achievements in recent years are:
- From 2022 to 2024: I delivered English essay correction and polishing (large language model), AI guessing (multimodal model), and homework beautification (visual). All are live in production and have been well received by teachers and students. I built a session cache service that reduces third-party model requests and lowers costs for the company, and independently completed the Chinese essay knowledge points and product manual multi-agent RAG services. I completed a total of 12 algorithm invention patents, including 2 large model-related patents, 1 multimodal large model-related patent, and 2 original algorithm invention patents. During this period, I also built an industrial multimodal large model platform and fine-tuned its model (website: http://112.245.58.16:8852/), and built an automatic data annotation platform using a sparse attention detection model (website: http://112.245.58.16:8851/).
- In 2022: I obtained the intermediate professional title in artificial intelligence issued by the Chinese Academy of Sciences and built a video quality analysis framework that supports static image quality analysis, bad-case classification for image quality enhancement, and intelligent image quality enhancement strategies. During this period, I completed 1 original algorithm invention patent.
- From 2020 to 2021: I completed a video frame interpolation algorithm for mobile phones, removing the limitation that phones could perform video frame interpolation only with dedicated chips and showing that it can be done by a software algorithm. I completed 6 original algorithm invention patents, and the original video frame interpolation algorithm reached state-of-the-art level in tests on the relevant datasets.
- In 2019: I completed automatic reading recognition of instrument pointers on a robot vehicle. This original technology exceeded the accuracy of human-eye meter reading for the first time, which brought economic benefits to our company's cooperation with Nanjing China Resources Gas Company and BASF. I completed 1 original algorithm invention patent.
- In 2018: I independently completed the intelligent scanning SDK of the Alpha Note app. The app has been launched and has brought economic benefits to our company. During this period, I completed 1 original algorithm invention patent. I also led the team that developed an OCR system for converting PDF documents to Microsoft Word documents.
Professional Skills
- Code language
c, c++, python, java, c#, shell, html
- Major skill
opencv, dlib, ffmpeg, libtorch, pillow, skimage
- AI
framework: pytorch, tensorflow; deep learning & llm: transformers, vllm, diffusers, deepspeed, faiss, pymilvus, openai, langchain, llamaindex, autogen
- Engineering
platform: linux, windows, android, ros; database: mysql, sqlite; compile: make, cmake; optimization & terminal: cuda, onnx, libtorch, opencl, tensorrt, snpe, ncnn, mace; deployment: http, rpc
- Research
Ability to quickly reproduce and optimize code from papers, complete technical invention patents, and write papers