# A-Convolutional-Recurrent-Neural-Network-for-Real-Time-Speech-Enhancement

**Repository Path**: chenyang918/A-Convolutional-Recurrent-Neural-Network-for-Real-Time-Speech-Enhancement

## Basic Information

- **Project Name**: A-Convolutional-Recurrent-Neural-Network-for-Real-Time-Speech-Enhancement
- **Description**: A minimum unofficial implementation of the "A Convolutional Recurrent Neural Network for Real-Time Speech Enhancement" (CRN) using PyTorch
- **Primary Language**: Unknown
- **License**: Not specified
- **Default Branch**: master
- **Homepage**: None
- **GVP Project**: No

## Statistics

- **Stars**: 2
- **Forks**: 1
- **Created**: 2021-06-01
- **Last Updated**: 2023-08-21

## Categories & Tags

**Categories**: Uncategorized

**Tags**: None

## README

# A Convolutional Recurrent Neural Network for Real-Time Speech Enhancement

A minimum unofficial implementation of the [A Convolutional Recurrent Neural Network for Real-Time Speech Enhancement (CRN)](https://www.isca-speech.org/archive/Interspeech_2018/pdfs/1405.pdf) using PyTorch.

## ToDo
- [x] Real-time version
- [x] Update trainer
- [x] Visualization of the spectrogram and the metrics (PESQ, STOI, SI-SDR) in the training
- [ ] More docs

## Usage

Training:

```
python train.py -C config/train/baseline_model.json5
```

Inference:

```
python inference.py \
    -C config/inference/basic.json5 \
    -cp ~/Experiments/CRN/baseline_model/checkpoints/latest_model.tar \
    -dist ./enhanced
```

Check out the README of [Wave-U-Net for SE](https://github.com/haoxiangsnr/Wave-U-Net-for-Speech-Enhancement) to learn more.

## Performance

PESQ, STOI, SI-SDR on DEMAND - Voice Bank test dataset, for reference only:

| Experiment | PESQ | SI-SDR | STOI |
| --- | --- | --- | --- |
|Noisy | 1.979 | 8.511| 0.9258|
|CRN | 2.528| 17.71| 0.9325|
|CRN signal approximation  |2.606 |17.84 |0.9382|

## Dependencies

- Python==3.\*.\*
- torch==1.\*
- librosa==0.7.0
- tensorboard
- pesq
- pystoi
- matplotlib
- tqdm

## References

- [CRNN_mapping_baseline](https://github.com/YangYang/CRNN_mapping_baseline)
- [A Convolutional Recurrent Neural Network for Real-Time Speech Enhancement](https://web.cse.ohio-state.edu/~wang.77/papers/Tan-Wang1.interspeech18.pdf)
- [EHNet](https://github.com/ododoyo/EHNet)
- [Convolutional-Recurrent Neural Networks for Speech Enhancement](https://arxiv.org/abs/1805.00579)