cv | Peng Haoyuan

General Information

Full Name	Haoyuan Peng - 彭浩源
Email	phy_fdu@163.com
Homepage	haoyuanpeng.github.io

Education

2015 - 2018
Master's degree, Computer Science

Software School of Fudan University
- Under the supervision of Professor Zheng Xiaoqing, my main research areas included Parsing and Word Embeddings, with results published at AAAI-17 and AAAI-18.
- Recognized as an Outstanding Graduate of Shanghai in 2018.
2011 - 2015
Bachelor's degree, Computer Software Engineering

Software School of Fudan University
- Under the supervision of Professor Zheng Xiaoqing, my main research area was Parsing, with results published at IJCAI-15.
- Completed the Fudan University Undergraduate Research Project (Denghui Program) under the supervision of Professor Zheng Xiaoqing.

Experience

2024.04 - Present
Senior Algorithm Engineer

ByteDance
- Utilizing SFT, DPO and RAG of LLMs, I build AI avatars for creators on Douyin that possess similar personas, knowledge scopes and speaking styles.
- Design and develop a long-term memory framework that provides chatbots with memory capabilities beyond the conversational context window, including memory summarization, updating, retrieval, and evaluation.
2023.01 - 2024.04
Senior Algorithm Engineer

Learnable.AI, Shanghai, China
- Conducted research on enhancing the reasoning error detection capabilities of LLMs through Chain-of-Thought (CoT) technology, and co-first authored the research findings published at IJCAI-24.
- Trained large-scale models ranging from 7B to 70B parameters for real-world systems in the education domain. Applications include directly grading students' mathematical free-response answers and translating student responses into internally defined languages.
- Investigated OCR result correction algorithms for scenarios involving student responses, effectively addressing the challenge of distinguishing between student writing errors and OCR recognition errors.
- Evaluated as Excellent in the performance assessment of probation period.
2018.07 - 2022.12
Senior Researcher

Tencent, Shanghai, China
- Led the development of multiple video information extraction algorithms within the Yunzhi Media AI Platform, including key information extraction from video frames, video tagging, and error correction for ASR/OCR. The video tagging algorithm achieved the second place in the AIWIN 2021 Algorithm Technology Competition without using the competition's training data.
- Responsible for the development of NLP algorithms as part of the Public Opinion Analysis System, tailored for securities industry regulators.
- Implemented traditional ML algorithms and deep learning-based NLP algorithms on the Tencent TI-ONE ML Platform, enabling users to train models on their custom data.
- Rated as Five-Star Performance once and Four-Star Performance twice in performance assessments.
2014 - 2015

Data Analyst Intern

eBay, Shanghai, China

Skills

Proficient in conducting cutting-edge research in natural language processing, with a track record of publishing papers in top conferences.
Experienced in all aspects of the full life cycle of AI/ML projects, including training, inference, engineering, and integration.
Highly skilled in Python programming, capable of writing high-quality python codes.
Familiar with Docker, capable of effectively deploying applications, managing versions, and migrating to enhance development efficiency.

Service

Conference Reviewer: KDD-23, EMNLP-23, SDM-24, COLING-24, ACL-24, MM-24
External Reviewer: ACL-23, ECAI-23

Honors and Awards

2021
- Second Place of AIWIN 2021 Algorithm Technology Competition
2020
- Tencent New Code Culture Award - Award for outstanding internal open source code projects
2018
- Outstanding Graduate in Shanghai

Publications

2024

LLMs Can Find Mathematical Reasoning Mistakes by Pedagogical Chain-of-Thought

Zhuoxuan Jiang, Haoyuan Peng (Equal Contribution), Shanshan Feng, Fan Li, Dongsheng Li. Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence.
2023

VKIE: The Application of Key Information Extraction on Video Text

An, Siyu and Liu, Ye and Peng, Haoyuan and Yin, Di. Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing: Industry Track.
2023

OSAN: A One-Stage Alignment Network To Unify Multimodal Alignment and Unsupervised Domain Adaptation

Liu, Ye and Qiao, Lingfeng and Lu, Changchong and Yin, Di and Lin, Chen and Peng, Haoyuan and Ren, Bo. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).
2022

Grafting Pre-trained Models for Multimodal Headline Generation

Qiao, Lingfeng and Wu, Chen and Liu, Ye and Peng, Haoyuan and Yin, Di and Ren, Bo. Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing: Industry Track.
2019

Detecting Abnormal Start-Ups, Unusual Resource Consumptions of the Smart Phone: A Deep Learning Approach

ZHENG, Xiaoqing and LU, Yaping and PENG, Haoyuan and FENG, Jiangtao and ZHOU, Yi and JIANG, Min and MA, Li and ZHANG, Ji and JI, Jie. ZTE Communications.
2018

Attention-based belief or disbelief feature extraction for dependency parsing

Peng, Haoyuan and Liu, Lu and Zhou, Yi and Zhou, Junying and Zheng, Xiaoqing. Proceedings of the AAAI Conference on Artificial Intelligence.
2018

RNN-based sequence-preserved attention for dependency parsing

Zhou, Yi and Zhou, Junying and Liu, Lu and Feng, Jiangtao and Peng, Haoyuan and Zheng, Xiaoqing. Proceedings of the AAAI Conference on Artificial Intelligence.
2017

Learning context-specific word/character embeddings

Zheng, Xiaoqing and Feng, Jiangtao and Chen, Yi and Peng, Haoyuan and Zhang, Wenqing. Proceedings of the AAAI Conference on Artificial Intelligence.
2015

Character-based parsing with convolutional neural network

Zheng, Xiaoqing and Peng, Haoyuan and Chen, Yi and Zhang, Pengjing and Zhang, Wenqiang. Twenty-Fourth International Joint Conference on Artificial Intelligence.

cv

General Information

Education

Master's degree, Computer Science

Software School of Fudan University

Bachelor's degree, Computer Software Engineering

Software School of Fudan University

Experience

Senior Algorithm Engineer

ByteDance

Senior Algorithm Engineer

Learnable.AI, Shanghai, China

Senior Researcher

Tencent, Shanghai, China

Data Analyst Intern

eBay, Shanghai, China

Skills

Service

Honors and Awards

Publications

LLMs Can Find Mathematical Reasoning Mistakes by Pedagogical Chain-of-Thought

Zhuoxuan Jiang, Haoyuan Peng (Equal Contribution), Shanshan Feng, Fan Li, Dongsheng Li. Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence.

VKIE: The Application of Key Information Extraction on Video Text

An, Siyu and Liu, Ye and Peng, Haoyuan and Yin, Di. Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing: Industry Track.

OSAN: A One-Stage Alignment Network To Unify Multimodal Alignment and Unsupervised Domain Adaptation

Liu, Ye and Qiao, Lingfeng and Lu, Changchong and Yin, Di and Lin, Chen and Peng, Haoyuan and Ren, Bo. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

Grafting Pre-trained Models for Multimodal Headline Generation

Qiao, Lingfeng and Wu, Chen and Liu, Ye and Peng, Haoyuan and Yin, Di and Ren, Bo. Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing: Industry Track.

Detecting Abnormal Start-Ups, Unusual Resource Consumptions of the Smart Phone: A Deep Learning Approach

ZHENG, Xiaoqing and LU, Yaping and PENG, Haoyuan and FENG, Jiangtao and ZHOU, Yi and JIANG, Min and MA, Li and ZHANG, Ji and JI, Jie. ZTE Communications.

Attention-based belief or disbelief feature extraction for dependency parsing

Peng, Haoyuan and Liu, Lu and Zhou, Yi and Zhou, Junying and Zheng, Xiaoqing. Proceedings of the AAAI Conference on Artificial Intelligence.

RNN-based sequence-preserved attention for dependency parsing

Zhou, Yi and Zhou, Junying and Liu, Lu and Feng, Jiangtao and Peng, Haoyuan and Zheng, Xiaoqing. Proceedings of the AAAI Conference on Artificial Intelligence.

Learning context-specific word/character embeddings

Zheng, Xiaoqing and Feng, Jiangtao and Chen, Yi and Peng, Haoyuan and Zhang, Wenqing. Proceedings of the AAAI Conference on Artificial Intelligence.

Character-based parsing with convolutional neural network

Zheng, Xiaoqing and Peng, Haoyuan and Chen, Yi and Zhang, Pengjing and Zhang, Wenqiang. Twenty-Fourth International Joint Conference on Artificial Intelligence.