AudioStreamPlayer with an AudioStreamMicrophone stutters for any pitch_scale not equal to 1.0 #99930

goatchurchprime · 2024-12-02T16:45:24Z

Tested versions

Reproducible on 4.3.stable and 4.4.dev5

System information

Godot v4.3.stable (77dcf97) - NixOS #1-NixOS SMP PREEMPT_DYNAMIC Thu Sep 12 09:13:13 UTC 2024 - X11 - GLES3 (Compatibility) - Mesa Intel(R) Graphics (ADL GT2) - 12th Gen Intel(R) Core(TM) i5-1240P (16 Threads)

Issue description

I encountered this issue while working on the twovoip plugin because I thought I could use this inline resampling feature to read the stream of samples at 48000Hz from a microphone that was recording at 44100Hz. (At the moment the twovoip plugin implements its own resampling on the chunks because the opus compression library doesn't handle 44100Hz audio.)

Here is a prerecorded example. record.zip I have the same on Windows.

This is not an accidental feature, since the AudioStreamPlaybackMicrophone class derives from the AudioStreamPlaybackResampled class, when if it derived directly from the AudioStreamPlayback it wouldn't have this capability.

I've not gone into the code far enough to find the bug, which is probably something to do with reading invalid samples out of the ring buffer. I am pretty sure there is no special case for pitch_scale=1.0 that skips the resampling algorithm.

This means the stuttering that happens on a Windows machine after running the AudioStreamMicrophone for more than 10 minutes under normal conditions (pitch_scale=1.0) might be related to the same bug due to some fractional slippage along the resampler over time.

On a design note, my twovoip plugin puts an AudioEffectOpusChunked class on the Audio Bus fed by the AudioStream carrying the microphone, and runs its own chunking buffer from which you can extract the Opus packets as they are filled. This is versatile because it means I could encode any stream or music from another bus into Opus packets, or apply a voice effect on the microphone before it gets processed.

However, it introduces a delay of an extra buffer as well as the potential bugs like this. So if I am trying to make a quicker response without these features (which don't apply to Opus compression since it is tuned for normal voice audio), should I write a plugin class to derive from directly from the AudioStreamPlaybackMicrophone instead so it can copy the samples directly out of the AudioDriver into its own chunking buffers without an intermediate buffer?

Steps to reproduce

Use the Audio Mic Record Demo from the Godot-demo-projects, and set the pitch_scale to 1.1
https://github.com/godotengine/godot-demo-projects/tree/master/audio/mic_record
Then record your voice and play it back.

Minimal reproduction project (MRP)

See above

The text was updated successfully, but these errors were encountered:

fire · 2024-12-04T12:24:43Z

Is this using AudioEffectCapture?

goatchurchprime · 2024-12-04T12:46:22Z

Same problem happens with AudioEffectCapture. But I'm referring to the demo project that uses AudioEffectRecord because it's better to reproduce issues on an official demo project.

fire · 2024-12-04T12:49:35Z

See also #99572 for sample rate modification

goatchurchprime · 2024-12-04T18:54:35Z

Not to get too far off topic, this is a straightforward bug at the moment, and I suspect it could have something to do with the degradation of the mic input that happens occasionally even when pitch_scale=1.0.

The quickest fix would be to remove the resampling capability on the microphone stream so that nothing can go wrong with it. It's certainly buggy enough that I don't believe anyone is using it. The twovoip plugin has its own internal 44.1kHz -> 48kHz resampler, for example.

The root problem with the microphone is we're treating it like it is just another audio stream when it is a very special case. For a start, any time you direct it into an AudioBus you have to set that bus to mute it to avoid amplifier feedback, which kind of means it's not doing much good going into the audio system in the first place. And because it's coming from a device with its own clock-cycle instead of a file, the system goes out of phase over time.

Secondly, it's main purpose is to record speech so it can be transmitted across a network to another player. We know how this works: the audio gets chunked into 20ms chunks and compressed by the Opus library. Resampling could more easily be done against these chunks rather than in a sophisticated continuous resampling filter.

And finally there's this new Audio To Expressions system in the Meta libraries that is replaces the OVRLipSync library. This is important because it means there is a second totally independent system listening to the microphone outside of Godot engine and inferring the inputs as if you had face tracking. This tells you that the microphone is all about spoken words, so it is reasonable to tune all of its functionality towards serving this purpose.

fire · 2024-12-04T22:34:27Z

Do you know of an independent recreation of Audio To Expressions system? I'm unsure how we can integrate other than through the "Unified Expression" system and AudioCapture

goatchurchprime mentioned this issue Dec 2, 2024

Make AudioStreamMicrophoneOpus instead of AudioEffectOpus goatchurchprime/two-voip-godot-4#37

Open

AThousandShips added bug topic:audio labels Dec 3, 2024

fire mentioned this issue Dec 4, 2024

Use libsamplerate on WAV loads #99572

Open

goatchurchprime mentioned this issue Dec 9, 2024

Allow GDExtension to directly read the AudioDriver input buffer used by the microphone godotengine/godot-proposals#11325

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

AudioStreamPlayer with an AudioStreamMicrophone stutters for any pitch_scale not equal to 1.0 #99930

AudioStreamPlayer with an AudioStreamMicrophone stutters for any pitch_scale not equal to 1.0 #99930

goatchurchprime commented Dec 2, 2024

fire commented Dec 4, 2024

goatchurchprime commented Dec 4, 2024

fire commented Dec 4, 2024

goatchurchprime commented Dec 4, 2024

fire commented Dec 4, 2024

AudioStreamPlayer with an AudioStreamMicrophone stutters for any pitch_scale not equal to 1.0 #99930

AudioStreamPlayer with an AudioStreamMicrophone stutters for any pitch_scale not equal to 1.0 #99930

Comments

goatchurchprime commented Dec 2, 2024

Tested versions

System information

Issue description

Steps to reproduce

Minimal reproduction project (MRP)

fire commented Dec 4, 2024

goatchurchprime commented Dec 4, 2024

fire commented Dec 4, 2024

goatchurchprime commented Dec 4, 2024

fire commented Dec 4, 2024