About me 🙋🏼‍♂️

I received my Ph.D. degree in Information and Communication Engineering from Northwestern Polytechnical University, Xi’an, China, in 2025. From 2023 to 2024, I was a visiting Ph.D. student at Nanyang Technological University, Singapore. Since 2018, I have been a co-founder of Xi’an Lianfeng Acoustic Technologies Co., Ltd., China. I am currently a lecturer at the Center for Image and Information Processing, School of Communications and Information Engineering, Xi’an University of Posts and Telecommunications, Xi’an, China. My research interests focus on deep learning and audio signal processing.

Work Experience 💼

2025.03 - now Xi’an University of Posts & Telecommunications
- Lecturer
2018.09 - now Xi’an Lianfeng Acoustic Technologies Co., Ltd.
- Algorithm Engineer

Education 👨🏼‍🎓

2020.03 - 2025.03 Northwestern Polytechnical University
- PhD in Information and Communication Engineering.
2017.09 - 2020.03 Northwestern Polytechnical University
- ME in Electronics and Communications Engineering.
2013.09 - 2017.06 North University of China
- BE in Detection Guidance and Control Technology.

Challenges 👊🏼

1st in DCASE 2023 Task 4B: Sound Event Detection with Soft Labels
1st in 2023 IEEE ICASSP Grand Challenge: L3DAS23-3D Sound Event Localization and Detection in Simulated Reverberant Environments
2nd in DCASE 2020 Task 5: Urban Sound Tagging with Spatiotemporal Context
4th in 2024 IEEE ICASSP Grand Challenge: Music demixing/remixing for hearing aids
5th in DCASE 2022 Challenge Task 3: Sound Event Localization and Detection in Real Spatial Sound Scenes
4th in 2022 IEEE ICASSP Grand Challenge: L3DAS22-3D Sound Event Localization and Detection
8th in DCASE 2022 Task 2: Unsupervised Anomalous Sound Detection for Machine Condition Monitoring Applying Domain Generalization Techniques
4th in DCASE 2019 Task 5: Urban Sound Tagging
6th in DCASE 2018 Task 3: Bird audio detection

Publications 📃

Journal

AudioSetCaps: An Enriched Audio-Caption Dataset Using Automated Generation Pipeline With Large Audio and Language Models——Accepted by IEEE/ACM Transactions on Audio, Speech, and Language Processing
SSDPT: Self-Supervised Dual-Path Transformer for Anomalous Sound Detection in Machine Condition Monitoring——Accepted by Digital Signal Processing
A Squeeze-and-Excitation and Transformer based Cross-task Model for Environmental Sound Recognition——Accepted by IEEE Transactions on Cognitive and Developmental Systems
Multimodal Urban Sound Tagging with Spatiotemporal Context——Accepted by IEEE Transactions on Cognitive and Developmental Systems

Conference papers

The APSIPA ASC 2025 Grand Challenge on City and Time-Aware Semi-supervised Acoustic Scene Classification: Summary and Results——Accepted by APSIPA ASC 2025
AudioSetCaps: Enriched Audio Captioning Dataset Generation Using Large Audio Language Models——Accepted by NeurIPS 2024 Workshop AI-Driven Speech, Music, and Sound Generation
AudioLog: LLMs-Powered Long Audio Logging with Hybrid Token-Semantic Contrastive Learning——Accepted by ICME 2024
3D audio signal processing systems for speech enhancement and sound localization and detection——Accepted by ICASSP 2023
Dual-path transformer for machine condition monitoring——Accepted by APSIPA ASC 2021
EnvSDD - Benchmarking Environmental Sound Deepfake Detection——Accepted by INTERSPEECH 2025
Exploring text-queried sound event detection with audio source separation——Accepted by ICASSP 2025

Experience 📝

Organizer for ICASSP 2026 Grand Challenge: Environmental Sound Deepfake Detection.
Organizer for APSIPA ASC 2025 Grand Challenge: City and Time-Aware Semi-supervised Acoustic Scene Classification.
Track Chair for 2024 IEEE International Conference on Multimedia and Expo Workshop GC-ASC.
Organizer for 2024 IEEE International Conference on Multimedia and Expo Grand Challenge: Semi-supervised Acoustic Scene Classification under Domain Shift.
Speaker for 2nd Signal Processing Society Entrepreneurship Forum at ICASSP 2023.
Visiting Ph.D. student (2023-2024) at Digital Signal Processing Lab/Smart Nation Trans Lab, School of Electrical and Electronic Engineering, Nanyang Technological University, Singapore.

Bai Jisheng (白吉生)