Page MenuHomeFreeBSD

sound: Implement AFMT_FLOAT support
ClosedPublic

Authored by christos on Nov 16 2024, 6:15 PM.
Tags
None
Referenced Files
F133271199: D47638.id149085.diff
Fri, Oct 24, 12:34 PM
F133269606: D47638.id152188.diff
Fri, Oct 24, 12:14 PM
F133259955: D47638.id152720.diff
Fri, Oct 24, 10:32 AM
F133256650: D47638.id152750.diff
Fri, Oct 24, 9:42 AM
F133229393: D47638.id148936.diff
Fri, Oct 24, 3:59 AM
Unknown Object (File)
Thu, Oct 23, 6:10 PM
Unknown Object (File)
Thu, Oct 23, 8:28 AM
Unknown Object (File)
Thu, Oct 23, 7:48 AM

Details

Summary

Even though the OSS manual [1] advises against using AFMT_FLOAT, there
are applications that expect the sound driver to support it, and might
not work properly without it.

This patch adds AFMT_F32_LE|BE (as well as AFMT_FLOAT for OSS
compatibility) in sys/soundcard.h and implements AFMT_F32_LE|BE <->
AFMT_S32_LE|BE conversion functions. As a result, applications can
write/read floats to/from sound(4), but internally, because sound(4)
works with integers, we convert floating point samples to integer ones,
before doing any processing.

[1] http://manuals.opensound.com/developer/AFMT_FLOAT.html

PR: 157050, 184380, 264973, 280612, 281390
Sponsored by: The FreeBSD Foundation
MFC after: 1 week

Test Plan

Apply D47639 to test sound.

Diff Detail

Repository
rG FreeBSD src repository
Lint
Lint Skipped
Unit
Tests Skipped
Build Status
Buildable 62889
Build 59773: arc lint + arc unit

Event Timeline

There are a very large number of changes, so older changes are hidden. Show Older Changes

Use __DECONST to silence const warning in pcm_sample_read(). A bit ugly...

  • Fix bugs.
  • Write unit tests.
  • Re-arrange fpu_kern(9) calls.

There are a few sound tests I haven't done yet, but I am posting as a WIP in
case someone wants to leave a comment.

sys/dev/sound/pcm/pcm.h
218

Fixed.

Fix big endian regressions introduced in last diff.

I tried recording and playback with real audio (both LE and BE) and things seem
to work fine.

@dev_submerge.ch The big endian tests fail, because on a little endian machine,
converting 0x3f000000 (0.5) to 0x0000003f will result in the float value being
0, so the pcm_sample_* functions return will return wrong values. For
example, pcm_sample_read() reads the input as float (i.e 0x3f000000 ->
0.5), then converts it to big endian (0x3f000000 -> 0x0000003f -> 0), and
lastly converts it to S32 (0 * PCM_S32_MAX = 0), in which case the return
value will be 0, hence the test failure. I am not sure yet how we should adapt
the test program to accomodate floating-point big endian tests.

@dev_submerge.ch The big endian tests fail, because on a little endian machine,
converting 0x3f000000 (0.5) to 0x0000003f will result in the float value being
0, so the pcm_sample_* functions return will return wrong values. For
example, pcm_sample_read() reads the input as float (i.e 0x3f000000 ->
0.5), then converts it to big endian (0x3f000000 -> 0x0000003f -> 0), and
lastly converts it to S32 (0 * PCM_S32_MAX = 0), in which case the return
value will be 0, hence the test failure. I am not sure yet how we should adapt
the test program to accomodate floating-point big endian tests.

Well, you do have the test data backwards for the big-endian floats? Just fill in the byte array in big-endian order and check for the correct sample values.

sys/dev/sound/pcm/pcm.h
199–210

Isn't this a bit complicated? Just read it as a 32bit integer from src, reinterpret that as a float, and then calculate the integer value from that float?

christos marked an inline comment as done.
  • Simplify read.
  • Make big-endian tests pass. Hopefully it will all work properly on actual big-endian architectures as well.

@Alexander88207_protonmail.com I remember you tested the patch when it was still a draft. Can you re-test please?

Thanks for the ping, sure!

Can you please update your patch so i can apply it cleanly?

error: patch failed: sys/dev/sound/pcm/pcm.h:124
error: sys/dev/sound/pcm/pcm.h: patch does not apply

Thanks.

Reflect minor changes in D47932 so that patch can apply cleanly.

@Alexander88207_protonmail.com now it should apply fine. Do not forget to apply D47932 and D48421 first.

[Creating objdir /usr/obj/usr/src/amd64.amd64/sys/GENERIC/modules/usr/src/sys/modules/sgx...]
--- machine ---
--- feeder_eq.o ---
In file included from /usr/src/sys/dev/sound/pcm/feeder_eq.c:43:
/usr/src/sys/dev/sound/pcm/pcm.h:124:24: error: duplicate 'inline' declaration specifier [-Werror,-Wduplicate-decl-specifier]
--- modules-all ---
machine -> /usr/src/sys/amd64/include
--- feeder_eq.o ---
  124 | static __always_inline __inline intpcm_t
      |                        ^
/usr/src/sys/dev/sound/pcm/pcm.h:220:24: error: duplicate 'inline' declaration specifier [-Werror,-Wduplicate-decl-specifier]
  220 | static __always_inline __inline intpcm_t
      |                        ^
/usr/src/sys/dev/sound/pcm/pcm.h:229:24: error: duplicate 'inline' declaration specifier [-Werror,-Wduplicate-decl-specifier]
  229 | static __always_inline __inline intpcm_t
      |                        ^
/usr/src/sys/dev/sound/pcm/pcm.h:256:24: error: duplicate 'inline' declaration specifier [-Werror,-Wduplicate-decl-specifier]
  256 | static __always_inline __inline void
      |                        ^
/usr/src/sys/dev/sound/pcm/pcm.h:361:24: error: duplicate 'inline' declaration specifier [-Werror,-Wduplicate-decl-specifier]
  361 | static __always_inline __inline void
      |                        ^
/usr/src/sys/dev/sound/pcm/pcm.h:370:24: error: duplicate 'inline' declaration specifier [-Werror,-Wduplicate-decl-specifier]
  370 | static __always_inline __inline void
      |                        ^
--- modules-all ---
--- x86 ---
x86 -> /usr/src/sys/x86/include
--- i386 ---
i386 -> /usr/src/sys/i386/include
--- feeder_eq.o ---
6 errors generated.
--- modules-all ---
--- opt_hwpmc_hooks.h ---
ln -sf /usr/obj/usr/src/amd64.amd64/sys/GENERIC/opt_hwpmc_hooks.h opt_hwpmc_hooks.h
--- opt_kstack_pages.h ---
--- all_subdir_sge ---
ctfconvert -L VERSION -g if_sge.o
--- all_subdir_sgx ---
ln -sf /usr/obj/usr/src/amd64.amd64/sys/GENERIC/opt_kstack_pages.h opt_kstack_pages.h
--- feeder_eq.o ---
*** [feeder_eq.o] Error code 1

make[2]: stopped in /usr/obj/usr/src/amd64.amd64/sys/GENERIC
--- modules-all ---
*** [modules-all] Error code 6

make[2]: stopped in /usr/obj/usr/src/amd64.amd64/sys/GENERIC
--- hdaa.o ---
ctfconvert -L VERSION -g hdaa.o
2 errors
[...]
  124 | static __always_inline __inline intpcm_t
      |                        ^
/usr/src/sys/dev/sound/pcm/pcm.h:220:24: error: duplicate 'inline' declaration specifier [-Werror,-Wduplicate-decl-specifier]
  220 | static __always_inline __inline intpcm_t
      |                        ^
[...]

Interesting, it builds fine for me. @markj do you have any clue as to why that is the case?

[...]
  124 | static __always_inline __inline intpcm_t
      |                        ^
/usr/src/sys/dev/sound/pcm/pcm.h:220:24: error: duplicate 'inline' declaration specifier [-Werror,-Wduplicate-decl-specifier]
  220 | static __always_inline __inline intpcm_t
      |                        ^
[...]

Interesting, it builds fine for me. @markj do you have any clue as to why that is the case?

I'm not sure. __always_inline apparently includes the __inline qualifier as well, but that's not enough to suppress unused warnings with clang. See commit d25f0bdceb3ab for the justification of adding that qualifier in the first place. Perhaps it's a bad idea to use __always_inline for functions defined in a header file. I can't see any other examples of that in the kernel.

[...]
  124 | static __always_inline __inline intpcm_t
      |                        ^
/usr/src/sys/dev/sound/pcm/pcm.h:220:24: error: duplicate 'inline' declaration specifier [-Werror,-Wduplicate-decl-specifier]
  220 | static __always_inline __inline intpcm_t
      |                        ^
[...]

Interesting, it builds fine for me. @markj do you have any clue as to why that is the case?

I'm not sure. __always_inline apparently includes the __inline qualifier as well, but that's not enough to suppress unused warnings with clang. See commit d25f0bdceb3ab for the justification of adding that qualifier in the first place. Perhaps it's a bad idea to use __always_inline for functions defined in a header file. I can't see any other examples of that in the kernel.

Do you think defining it as just __inline will suffice?

[...]
  124 | static __always_inline __inline intpcm_t
      |                        ^
/usr/src/sys/dev/sound/pcm/pcm.h:220:24: error: duplicate 'inline' declaration specifier [-Werror,-Wduplicate-decl-specifier]
  220 | static __always_inline __inline intpcm_t
      |                        ^
[...]

Interesting, it builds fine for me. @markj do you have any clue as to why that is the case?

I'm not sure. __always_inline apparently includes the __inline qualifier as well, but that's not enough to suppress unused warnings with clang. See commit d25f0bdceb3ab for the justification of adding that qualifier in the first place. Perhaps it's a bad idea to use __always_inline for functions defined in a header file. I can't see any other examples of that in the kernel.

Do you think defining it as just __inline will suffice?

It compiles fine.

[...]
  124 | static __always_inline __inline intpcm_t
      |                        ^
/usr/src/sys/dev/sound/pcm/pcm.h:220:24: error: duplicate 'inline' declaration specifier [-Werror,-Wduplicate-decl-specifier]
  220 | static __always_inline __inline intpcm_t
      |                        ^
[...]

Interesting, it builds fine for me. @markj do you have any clue as to why that is the case?

I'm not sure. __always_inline apparently includes the __inline qualifier as well, but that's not enough to suppress unused warnings with clang. See commit d25f0bdceb3ab for the justification of adding that qualifier in the first place. Perhaps it's a bad idea to use __always_inline for functions defined in a header file. I can't see any other examples of that in the kernel.

Do you think defining it as just __inline will suffice?

It compiles fine.

I think that's reasonable.

@Alexander88207_protonmail.com could you apply D47932, D48421 and D47638 again and see if everything works fine?

theron.tarigo_gmail.com added inline comments.
sys/dev/sound/pcm/pcm.h
207

Rounding toward zero introduces a crossover distortion bias. Although quantization error itself is unavoidable here, we shouldn't do that.
lrintf or roundf or equivalent is needed.

Tested values -> results -> errors
{F111904629}

Error with INTPCM_T(fv * (float)PCM_S32_MAX);
{F111904009}

Error with INTPCM_T(roundf(fv * (float)PCM_S32_MAX));
{F111904015}

Test programs
{F111903997}
{F111904003}

sys/dev/sound/pcm/pcm.h
207

That's very useful information, which I admittedly didn't take into consideration. However, I cannot access those files. Can you share them somehow else?

christos added inline comments.
sys/dev/sound/pcm/pcm.h
207

Thank you very much for this. :-)

Updating the diff in a moment.

sys/dev/sound/pcm/pcm.h
207

Hmm, while we can call roundf() in userland, it cannot be used in the kernel, which where this code is executed in real-world scenarios.

sys/dev/sound/pcm/pcm.h
207

Considering that use of AFMT_FLOAT is discouraged in new programs and that the motivation for this review is to support certain closed libraries (please correct me if I am mistaken), is it reasonable to only support this on CPUs with SSE2, which introduced the efficient rounding instructions?

Otherwise, I can provide a slower though still reasonably efficient non-FPU conversion routine. Even if fpu_kern_enter()/fpu_kern_leave() is moved outside the per-sample loop, avoiding FPU may still be a win depending on write size.

sys/dev/sound/pcm/pcm.h
207

Feel free to share the non-FPU solution and we can definitely discuss it. I have no objection towards not using fpu_kern(9), in fact, in the crude benchmarks I've posted above you can see that using it makes things around 10 times more expensive for each sample.

It looks like the patch fails to apply against main:

$ git apply D47638.diff
error: patch failed: sys/dev/sound/pcm/feeder_mixer.c:73
error: sys/dev/sound/pcm/feeder_mixer.c: patch does not apply

It looks like the patch fails to apply against main:

$ git apply D47638.diff
error: patch failed: sys/dev/sound/pcm/feeder_mixer.c:73
error: sys/dev/sound/pcm/feeder_mixer.c: patch does not apply

I forgot to rebase it since I committed some patches. Try again. :-)

I forgot to rebase it since I committed some patches. Try again. :-)

Thanks! I can confirm that the patches fix game crashing and sound issues in multiple games on LSU Steam + FreeBSD Proton, such as Tell Me Why, Dishonored 2, and Republique.

However, Cyberpunk 2077 still has broken, static sound output, even with the patch.

I forgot to rebase it since I committed some patches. Try again. :-)

Thanks! I can confirm that the patches fix game crashing and sound issues in multiple games on LSU Steam + FreeBSD Proton, such as Tell Me Why, Dishonored 2, and Republique.

Glad to hear that! :-)

However, Cyberpunk 2077 still has broken, static sound output, even with the patch.

Is this related to the sound system? If yes, please open a bug report on Bugzilla and we can investigate it there.

I forgot to rebase it since I committed some patches. Try again. :-)

Thanks! I can confirm that the patches fix game crashing and sound issues in multiple games on LSU Steam + FreeBSD Proton, such as Tell Me Why, Dishonored 2, and Republique.

Glad to hear that! :-)

However, Cyberpunk 2077 still has broken, static sound output, even with the patch.

Is this related to the sound system? If yes, please open a bug report on Bugzilla and we can investigate it there.

Well, audio works correctly in the game if I choose PulseAudio or Alsa as the Wine sound backends (via winetricks) in the game. So it's only by using the OSS audio backend that sound is all static. Other games with fixed audio (with the patch) are using the OSS backend.

I forgot to rebase it since I committed some patches. Try again. :-)

Thanks! I can confirm that the patches fix game crashing and sound issues in multiple games on LSU Steam + FreeBSD Proton, such as Tell Me Why, Dishonored 2, and Republique.

Glad to hear that! :-)

However, Cyberpunk 2077 still has broken, static sound output, even with the patch.

Is this related to the sound system? If yes, please open a bug report on Bugzilla and we can investigate it there.

Well, audio works correctly in the game if I choose PulseAudio or Alsa as the Wine sound backends (via winetricks) in the game. So it's only by using the OSS audio backend that sound is all static. Other games with fixed audio (with the patch) are using the OSS backend.

That could also be a fault in Cyberpunks OSS code though. But you could try setting hw.snd.verbose=2 and seeing if there any sound(4) warnings.

jrtc27 added inline comments.
sys/dev/sound/pcm/pcm.h
204

Not all architectures support FPU use in the kernel.

sys/dev/sound/pcm/pcm.h
204

According to the man page:

The fpu_kern is currently implemented only for the i386, amd64, arm64,
and powerpc architectures.

And after a quick grep for ^fpu_kern_enter():

sys/i386/i386/npx.c:fpu_kern_enter(struct thread *td, struct fpu_kern_ctx *ctx, u_int flags)
sys/amd64/amd64/fpu.c:fpu_kern_enter(struct thread *td, struct fpu_kern_ctx *ctx, u_int flags)
sys/powerpc/powerpc/fpu.c:fpu_kern_enter(struct thread *td, struct fpu_kern_ctx *ctx, u_int flags)
sys/arm64/arm64/vfp.c:fpu_kern_enter(struct thread *td, struct fpu_kern_ctx *ctx, u_int flags)
sys/arm/arm/vfp.c:fpu_kern_enter(struct thread *td, struct fpu_kern_ctx *ctx, u_int flags)

So it seems like all the main architecture are supported. Which ones are not, except riscv? I guess we could add and #ifdef to support AFMT_FLOAT only for supported architectures.

sys/dev/sound/pcm/pcm.h
204

Also, I have no objections towards a non-FPU solution to this, but I think it's better to (for now) push the patch as-is so that some ports that need this patch can work again ASAP.

sys/dev/sound/pcm/pcm.h
204

At a minimum we can't break building kernels on other architectures

sys/dev/sound/pcm/pcm.h
204

I agree. We can go ahead with an #ifdef until a non-FPU solution is implemented.

christos marked 3 inline comments as done.

Use IEEE754 instead of the FPU, since some platforms do not support this. The
code is based on Ariff's AFMT_FLOAT draft [1]. Unit and audio tests pass
normally. That should also give us a performance boost.

[1] https://people.freebsd.org/~ariff/utils/ieee754.c

Use IEEE754 instead of the FPU

IEEE754 is the representation for floating point numbers, also used by the FPU and it's not something we're using "instead of" the FPU. Maybe "integer fixed point conversion"

sys/dev/sound/pcm/pcm.h
132

s could be bool
and maybe expand it to sign?

209

with s being bool this can just be s = v & 0x80000000U

Use IEEE754 instead of the FPU

IEEE754 is the representation for floating point numbers, also used by the FPU and it's not something we're using "instead of" the FPU. Maybe "integer fixed point conversion"

Yeah, that was bad wording. It'd be more clear to say "instead of fpu_kern(9)".

sys/dev/sound/pcm/pcm.h
132

I agree with changing to bool. As for expanding the name, I don't have a strong objection, but I just kept it like this for uniformity with the other variables. After all the meaning of this variable is quite clear from the code IMHO.

christos marked an inline comment as done.

Address Ed's comments.

OK. Please include a reference to the source of the float conversion routines in the commit message.

This revision is now accepted and ready to land.Mar 30 2025, 3:20 PM
This revision was automatically updated to reflect the committed changes.