-
Here is a weird problem I discovered while trying to fix the bug mentioned here, which was reported by @Danstiv and introduced by the leading silence removal feature. The feature works by removing some silent audio data before the audio is sent to the output device. However, removing as little as 4 bytes from the start of the audio can sometimes make a segment sound like it was "chopped off" at the beginning. Not all segments are affected, only some of them. If you wait a few seconds before making the next sound, the next audio segment is more likely to be chopped off. To reproduce the issue: (requires modifying some code)

The chopping off is audible, although you may have to pay close attention. I checked the waveform of the output audio, and it is different too. Regular beeps look like a rectangle (actually sine waves when zoomed in), but the chopped-off beeps look as if a "fade in" effect was applied to them: the volume rises gradually from zero to the normal level. All of this happens just because a single wave sample was removed from the beginning. And if the change to

This weird problem is bugging me. I'm not an expert on audio processing. I couldn't find the cause, and I'm not sure whether the problem only exists on my system. Could someone help me test whether it happens on their system? Could someone give me some idea about what causes it? cc @jcsteh
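For anyone who wants to check their own recordings for the "fade in" shape described above, here is a rough sketch that computes a per-window peak envelope. It is not code from the project. It assumes the captured audio is raw 16-bit signed mono PCM in native byte order, and the name `peak_envelope` and the window size are made up for illustration.

```python
import array


def peak_envelope(pcm: bytes, window: int = 64) -> list:
    """Return the peak absolute amplitude of each window of samples.

    Assumes 16-bit signed mono PCM in native byte order (an assumption,
    not something confirmed for the actual output format).
    """
    samples = array.array("h")
    # Drop a trailing odd byte so the buffer is a whole number of samples.
    samples.frombytes(pcm[: len(pcm) - len(pcm) % 2])
    return [
        max(abs(s) for s in samples[i : i + window])
        for i in range(0, len(samples), window)
    ]
```

A normal beep should give a roughly flat envelope from the first window on, while a faded-in beep should give values that climb from near zero.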
Replies: 2 comments 3 replies
-
I don't have any real idea, but here's a hypothesis: could this be something the Windows mixer is doing? Does trimming 1 sample from the start result in a waveform that the Windows mixer somehow thinks might produce clicks, etc., so it fades it in? Could we solve that by leaving 1 sample of silence at the start of the audio?
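To make the "leave 1 sample of silence" idea concrete, here is a minimal sketch of what the trimming could look like. It is not the actual implementation. It assumes 16-bit signed mono PCM in native byte order, and `trim_leading_silence`, `keep_samples`, and `threshold` are hypothetical names used only for illustration.

```python
import array


def trim_leading_silence(pcm: bytes, keep_samples: int = 1, threshold: int = 0) -> bytes:
    """Trim leading silence but keep up to `keep_samples` silent samples.

    Assumes 16-bit signed mono PCM in native byte order (an assumption).
    A sample counts as silent when its absolute value is <= threshold.
    """
    samples = array.array("h")
    # Drop a trailing odd byte so the buffer is a whole number of samples.
    samples.frombytes(pcm[: len(pcm) - len(pcm) % 2])
    # Index of the first non-silent sample, or the end if everything is silent.
    first_audible = next(
        (i for i, s in enumerate(samples) if abs(s) > threshold),
        len(samples),
    )
    # Step back a few samples so a little silence stays in front of the audio.
    start = max(0, first_audible - keep_samples)
    return samples[start:].tobytes()
```

With `keep_samples=0` this drops all leading silence, which would make it easy to compare both variants on the same input. Whether keeping one silent sample actually avoids the fade-in is exactly what would need testing.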
-
@gexgd0419 I experience the premature cutting of speech on my system. I had to turn the feature off because I was struggling to tell letters apart (using eSpeak at rate boost 35). I haven't tried to reproduce the beeps issue.