💡 About Me
I am currently in my senior year at Guangdong Polytechnic Normal University in China I am currently engaged in multimodal sentiment analysis, medical multimodal data fusion and other research. If you’re looking for an intern, please feel free to email me at yueshenghuang@stu.gpnu.edu.cn I am looking for an internship. I am enrolled in the Department of Internet of Things Engineering in GPNU’s School of Computer Science under the supervision of Assistant Professor Jiawen Li. Many of the topics I explored and experimented with myself. In 2024, I won the Guangdong Provincial Person of the Year (10 winners from Guangdong) and the Student representative of the People’s Daily National Scholarship (only 4 winners from Guangdong). My research interests include medical artificial intelligence, social computing, etc. As I have not yet deeply studied, my published papers are at a low level. I am honored to be one of the reviewers of the IJCNN International Conference.
🔥 News
- 2024.12: 🎉🎉 Yuesheng Huang was selected as the 2023 Guangdong Provincial Person of the Year. Guangdong Province only selected 10 people. He is the youngest winner in the same year and the only undergraduate student enrolled in the class of 2021.
- 2024.05: 🎉🎉 Yuesheng Huang was featured in the People’s Daily as a representative of 100 undergraduate national scholarship winners, only 4 of whom were from Guangdong Province.
📝 Publications

Can Generated Images Serve as a Viable Modality for Text-Centric Multimodal Learning?
Yuesheng Huang, Peng Zhang, Riliang Liu, Jiaqi Liang
- arXiv:2506.17623 Cite
title={Can Generated Images Serve as a Viable Modality for Text-Centric Multimodal Learning?},
author={Yuesheng Huang and Peng Zhang and Riliang Liu and Jiaqi Liang},
year={2025},
eprint={2506.17623},
archivePrefix={arXiv},
primaryClass={cs.MM},
url={https://arxiv.org/abs/2506.17623},
}

Jiawen Li, Yuesheng Huang, Yayi Lu, Leijun Wang*, Yongqi Ren and Rongjun Chen

Yuesheng Huang, Jiawen Li, Yushan Li, Routing Lin, Jingru Wu, Leijun Wang, and Rongjun Chen
ISCTIS 2024
A FinBERT Framework for Sentiment Analysis of Chinese Financial News EIECT 2023
An Ensemble Learning Approach for Wind Power Forecasting, Yuesheng Huang, Sida Chen, Qilin Wu, et al. 🏆 Honors and Awards
- 2025.02 First Prize in Information Technology, Healthcare, and Modern Service Tracks at the China University Student Technology Innovation and Entrepreneurship Competition.
- 2024.05 Finalist Award for the E problem of the American College Students Mathematical Contest in Modeling (Top 2% in the world), COMAP.
- 2023.12 Awarded the National scholarship.
- 2023.11 First prize in Guangdong division of China University Students Mathematical Contest in Modeling, Guangdong Provincial Department of Education.
- 2023.08 First prize of China College Student Computer Design Competition Guangdong Division, Guangdong Provincial Department of Education.
- 2023.08 Outstanding Award in International University Mathematical Contest in Modeling.
- 2023.08 The third prize of the National Final of Biomedical Engineering Innovation Design Competition for Chinese College students.
- 2021.12 Silver Medal in Kaggle Lux AI Competition.
🎓 Educations
- 2021.09 - 2025.06 (now), Bachelor of Engineering in Internet of Things Engineering(ESI TOP 1%), School of Computer Science, Guangdong Polytechnic Normal University.(GPA:91.9/100, Rank:1/112)
- Graduation Project: Design of an Intelligent Student Emotion Analysis and Monitoring System Empowered by Multimodal Data and Large Models
ABSTRACT
With the in-depth development of artificial intelligence and deep learning technologies, the application potential of multimodal sentiment analysis in the educational field is increasingly evident. Traditional unimodal emotion recognition methods have limitations in capturing students' complex emotional states, while multimodal analysis significantly improves the accuracy of emotion recognition by integrating facial expressions, speech information, and physiological signals. Under the current background of educational informatization, there is an urgent demand for student mental health monitoring. However, existing methods face challenges such as poor timeliness, strong subjectivity, and difficulties in scaling, which limit their widespread adoption in campus environments.
To address these challenges, this paper proposes and implements a student emotion intelligent analysis and monitoring system based on ESP32 and ESP32S3 hardware platforms, combined with lightweight multimodal fusion algorithms and large language models. The system aims to utilize low-cost, highly integrated embedded technology to fuse multi-source data including facial, speech, and heart rate information, providing educators, parents, and students with real-time, accurate, and convenient emotion monitoring and support tools.
At the hardware level, a distributed dual-mainboard architecture using ESP32 and ESP32S3 is adopted. The ESP32 mainboard integrates ESP32CAM and heart rate sensors to implement facial expression recognition, physiological data collection, and basic feedback. The ESP32S3 mainboard integrates a digital microphone, audio amplifier, display screen, etc., to achieve intelligent dialogue functions based on Baidu's ERNIE Bot API. At the software level, a Node.js-based server is constructed, SQLite is used for data storage, and multi-role web application interfaces for teachers, students, and parents are developed. On the algorithmic level, the system implements facial emotion recognition based on Deepface, speech emotion analysis using the ERNIE Bot API, designs a dynamic weight decision-level multimodal fusion algorithm, and introduces a data-volume-based multi-model emotion trend prediction method. Additionally, prompt optimization is employed to enhance the performance of large language models in emotional support dialogue tasks.
Finally, the system hardware platform was successfully built and debugged, with comprehensive functional testing and verification conducted on the software system, including white-box testing and black-box testing. The test results demonstrate stable operation of all system modules, compliance with design requirements, and effective integration of multimodal data for student emotion state analysis and monitoring, validating the feasibility and effectiveness of the design.
Keywords: Multimodal sentiment analysis; Student emotion monitoring; ESP32; Large language models; Data fusion - 2018.09 - 2021.06, Ordinary high school, Shaoguan City Wengyuan middle School
📖 Research topics
- 2023.05-2024.05, “Research and implementation of MIMO system detection algorithm based on Gaussian tree”, Chinese college students Innovation and Entrepreneurship plan project, Huang Yuesheng as host. (Project completed)
- 2024.01-2026-01, “Research on fine-grained sentiment analysis of multi-modal data fusion based on deep learning”, Guangdong Provincial Science and Technology Innovation Fund, 45,000CNY, Huang Yuesheng as host. (Project completed)
- 2024.05-2025.05, “Neurodetective: An interpretable multimodal contrastive learning Framework for the diagnosis of neurodegenerative diseases”, Chinese college students Innovation and Entrepreneurship plan project, Huang Yuesheng as host.(Project completed)
- 2024.05-2025.05, “Aquaponics, Ecological co-prosperity: A general agricultural visual large model for digital aquaponics fish pond system called DASAM”, Chinese college students Innovation and Entrepreneurship plan project, Second participant.(Project completed)
- 2025.05-2026.05, “Early Diagnosis System for Alzheimer’s Disease Based on Mixture of Experts (MoE) Multimodal Model”, Chinese college students Innovation and Entrepreneurship plan project, Second participant.
- 2025.05-2026.05, “Diffusion Model-Empowered Multimodal Decision Making: Breaking the Bottleneck of Rare Disease Medical Image Shortage for AI-Assisted Diagnosis Platform”, Chinese college students Innovation and Entrepreneurship plan project, Second participant.
©️ Patents and Copyrights
- 2024, “Flask based medical image segmentation platform V1.0”, Chinese software copyright, 2024SR0877362
- 2023, “Medical glucan information test system V1.0”, Chinese software copyright, 2023SR1635698
- 2023, “Multi-arm obstacle detection and motion planning software V1.0 based on sound wave sensing”, Chinese software copyright, 2023SR1657692