0%

Yang Liu

Alt Text

Hi, I’m Yang Liu (刘阳)! I am a Senior Audio AI engineer in Zoom. Before that, I am a reseacher in Microsoft. I got my PhD degree from CVSSP, University of Surrey under the supervised by Prof Wenwu Wang and Prof Adrian Hilton. I am IEEE memeber, IEEE SPS member and IEEE Young Professionals and research fellow of CBAIA. My research interest is audio single processing on edge devices, audio-visual multi-modality fusion, and machine perception. The related topics include echo/noise cancellation, speech enhancement, model compression and acoustic scene classification.

Research Project

AI Echo Cancelation

  • 09/2020 - current | Zoom, UK

Face Relighting on Surface

  • 04/2020 - 09/2020 | Microsoft, UK
  • Propose unsupervised neural network Siamese VAE.

Audio-Visual Classification and Representation with GAN and VAE

  • 07/2019 - 09/2020 | Microsoft, UK
  • Propose a audio scene classification network based on GAN and VAE.
  • Train the model on Youtube BB (380,000 videos), Youtube 8M (10,000 videos), Place 365 and Dcase dataset.

S3A: Future Spatial Audio for An Immersive Listener Experience at Home

  • 10/2015 - 06/2019 | Centre for Vision, Speech and Signal Processing, Surrey, UK
  • Work with Prof. Wenwu Wang and Prof. Adrian Hilton and propose a mutli-speaker tracking framework with a microphone array and a camera using DOA, MUSIC, faster R-CNN and YOLO network.
  • The muli-sensor data are fused by sequential Monte Carlo, Probability Hypothesis Density (PHD) filter and particle flow.
  • Implement the methods by Matlab, C++ and Python on AV16.3 (135 videos), AVDIAR (23 videos) and CLEAR (50 videos).
  • Compared to the baseline lines, the tracking filter decrease 52% accuracy errors and 24% computational cost.
  • Nominated for Best student paper in International Conference on Acoustics, Speech, and Signal Processing (ICASSP) 2019.

IEEE-AASP Challenge on Acoustic Source Localization and Tracking

  • 06/2018 - 07/2018 | Centre for Vision, Speech and Signal Processing, Surrey, UK
  • Propose a mutli-speaker tracking framework based on particle flow with MUSIC method.
  • The method is evaluated on the LOCATA datasets which are recorded by four microphone arrays (20 recordings).
  • Compared to the baseline method, our proposed method increases 42% the probability of detection and decreases 71% the elevation error on the planar microphone array.

Teaching

  • Computer Algorithms and Architecture (80 student) and Computers and Programming 2 (80 student) about C++ with Dr. Jean-Yves Guillemaut. (Jan 2017 – Oct 2018, 1 yr 10 mos)
  • Al and Al Programming (40 students) and Advanced Signal Processing (40 students) about Matlab and machine learning with Dr. Terry Windeatt. (Jan 2017 – May 2017, 5 mos)
  • Web and Database Systems (70 students) about SQL, PHP and HTML with Prof. Shujun Li. (Oct 2016 – Feb 2017, 5 mos)
  • Computer and Digital Logic (80 students) about Python with Dr. Nikolaos Dikaios. (Sep 2016 – Dec 2016, 4 mos)

Publication

Journal

  • Labelled non-zero particle flow for Multi-speaker tracking, Yang Liu, Wenwu Wang, IEEE Transactions on Signal processing (under review).
  • Intensity Particle Flows for Sequential Monte Carlo Implementation of Probability Hypothesis Density Filter, Yang Liu, Wenwu Wang, IEEE Transactions on Signal processing (under review).
  • Audio-visual Zero Diffusion Particle Flow SMC-PHD Filter for Multi-speaker Tracking, Yang Liu, Volkan Kili̧c, Jian Guan, Wenwu Wang, IEEE Transactions on Multimedia, August 2019.
  • Texture features extraction method based on Worldview-II multi spectral remote sensing data, Zhenxing Zhang, Ning Li, Yang Liu, Systems Engineering and Electronics, 2013, 35(10): 2044-2049.

    Conference

  • Labelled Non-zero Particle flow for SMC-PHD filtering, Yang Liu, Qinghua Hu, Yuexian Zou, Wenwu Wang, International Conference on Acoustics, Speech, and Signal Processing, 2019.
  • Intensity Particle Flow SMC-PHD Filter For Audio Speaker Tracking, Yang Liu, Wenwu Wang, Volkan Kılıc, LOCATA challenge workshop, 2018.
  • Audio-visual SMC-PHD Filter with Non Zero Diffusion Particle Flow, Yang Liu, Wenwu Wang, Volkan Kılıc, International Conference on Acoustics, Speech, and Signal Processing, 2018.
  • Particle flow for sequential Monte Carlo implementation of probability hypothesis density, Yang Liu, Wenwu Wang, Yuxin Zhao, International Conference on Acoustics, Speech, and Signal Processing, 2017.
  • Particle Flow SMC-PHD Filter for Audio-Visual Multi-speaker Tracking, Yang Liu, Wenwu Wang, onathon Chambers, Adrian Hilton, International Conference on Latent Variable Analysis and Signal Separation, 2017.
  • Visual Mapping and Localization Using a Tree-structured Audio Model, Yuxin Zhao, Yang Liu, Wenwu Wang, International Navigation Conference, 2015.