About me
I received my Ph.D. degree in Information and Communication Engineering from Northwestern Polytechnical University, Xi'an, China, in 2025. From 2023 to 2024, I was a visiting Ph.D. student at Nanyang Technological University, Singapore. Since 2018, I have been a co-founder of Xi'an Lianfeng Acoustic Technologies Co., Ltd., China. I am currently a lecturer at the Center for Image and Information Processing, School of Communications and Information Engineering, Xi'an University of Posts and Telecommunications, Xi'an, China. My research interests focus on deep learning and audio signal processing.
Work Experience
- 2025.03 - now Xi'an University of Posts & Telecommunications
- Lecturer
- 2018.09 - now Xi'an Lianfeng Acoustic Technologies Co., Ltd.
- Algorithm Engineer
Education
- 2020.03 - 2025.03 Northwestern Polytechnical University
- PhD in Information and Communication Engineering.
- 2017.09 - 2020.03 Northwestern Polytechnical University
- ME in Electronics and Communications Engineering.
- 2013.09 - 2017.06 North University of China
- BE in Detection Guidance and Control Technology.
Challenges
- 1st in DCASE 2023 Task 4B: Sound Event Detection with Soft Labels
- 1st in 2023 IEEE ICASSP Grand Challenge: L3DAS23-3D Sound Event Localization and Detection in Simulated Reverberant Environments
- 2nd in DCASE 2020 Task 5: Urban Sound Tagging with Spatiotemporal Context
- 4th in 2024 IEEE ICASSP Grand Challenge: Music demixing/remixing for hearing aids
- 5th in DCASE 2022 Challenge Task 3: Sound Event Localization and Detection in Real Spatial Sound Scenes
- 4th in 2022 IEEE ICASSP Grand Challenge: L3DAS22-3D Sound Event Localization and Detection
- 8th in DCASE 2022 Task 2: Unsupervised Anomalous Sound Detection for Machine Condition Monitoring Applying Domain Generalization Techniques
- 4th in DCASE 2019 Task 5: Urban Sound Tagging
- 6th in DCASE 2018 Task 3: Bird audio detection
Publications
Journal
- AudioSetCaps: An Enriched Audio-Caption Dataset Using Automated Generation Pipeline With Large Audio and Language Models. Accepted by IEEE/ACM Transactions on Audio, Speech, and Language Processing
- SSDPT: Self-Supervised Dual-Path Transformer for Anomalous Sound Detection in Machine Condition Monitoring. Accepted by Digital Signal Processing
- A Squeeze-and-Excitation and Transformer based Cross-task Model for Environmental Sound Recognition. Accepted by IEEE Transactions on Cognitive and Developmental Systems
- Multimodal Urban Sound Tagging with Spatiotemporal Context. Accepted by IEEE Transactions on Cognitive and Developmental Systems
Conference paper
- The APSIPA ASC 2025 Grand Challenge on City and Time-Aware Semi-supervised Acoustic Scene Classification: Summary and Results. Accepted by APSIPA ASC 2025
- AudioSetCaps: Enriched Audio Captioning Dataset Generation Using Large Audio Language Models. Accepted by NeurIPS 2024 Workshop AI-Driven Speech, Music, and Sound Generation
- AudioLog: LLMs-Powered Long Audio Logging with Hybrid Token-Semantic Contrastive Learning. Accepted by ICME 2024
- 3D audio signal processing systems for speech enhancement and sound localization and detection. Accepted by ICASSP 2023
- Dual-path transformer for machine condition monitoring. Accepted by APSIPA ASC 2021
- EnvSDD - Benchmarking Environmental Sound Deepfake Detection. Accepted by INTERSPEECH 2025
- Exploring text-queried sound event detection with audio source separation. Accepted by ICASSP 2025
Experience
- Organizer for ICASSP 2026 Grand Challenge: Environmental Sound Deepfake Detection.
- Organizer for APSIPA ASC 2025 Grand Challenge: City and Time-Aware Semi-supervised Acoustic Scene Classification.
- Track Chair for 2024 IEEE International Conference on Multimedia and Expo Workshop GC-ASC.
- Organizer for 2024 IEEE International Conference on Multimedia and Expo Grand Challenge: Semi-supervised Acoustic Scene Classification under Domain Shift.
- Speaker for 2nd Signal Processing Society Entrepreneurship Forum at ICASSP 2023.
- Visiting Ph.D. student (2023-2024) at Digital Signal Processing Lab/Smart Nation Trans Lab, School of Electrical and Electronic Engineering, Nanyang Technological University, Singapore.