-
Notifications
You must be signed in to change notification settings - Fork 1.6k
Add triangularclassic filterbank window and snap_to_bins #2562
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: develop
Are you sure you want to change the base?
Conversation
Currently training I am not sure if I would want to keep the bin alignment option; kaldi/k2 doesn't seem to be doing it and it doesn't feel like it would be beneficial especially without high n_fft. |
It took us a long time due to many high priority things, and mostly because it does not seem to affect the results, but this is an important fix. Maybe @idruker-cerence wants to have a look. |
FWIW I have a stock RNN-T LibriSpeech Conformer being trained again with the |
Hello I am not a python expert so it's hard to me to review the code. However, I have a C++ implementation. We can do the following. You calculate the mel-spectrum with your new function and give me the numbers. Then I verify them with my C++ function. |
Current code, lines are respectively
New "
That's only one run with 100 epochs at that, but it doesn't seem like a significant accuracy difference (~1% relative difference for only one run). As for correctness, from my testing, I am tempted to turn this PR into only adding the "correct" filter as an option, but keep the default the same, and remove the At least in speech processing, correct me if I'm wrong, it doesn't seem so common to round the center frequencies to the closest bin in definitions of the mel filterbank (which is what I tried to implement as |
What does this PR do?
Fixes #2030
WIP, introduces:
triangularclassic
triangular filters (name is a WIP)No breaking changes expected. Should test model training to see if those make a difference.
Before submitting
PR review
Reviewer checklist