Page MenuHomeFreeBSD

libdtrace: Implement inline function tracing for kinst
Needs RevisionPublic

Authored by christos on Feb 28 2023, 5:10 PM.
Tags
Referenced Files
F133126523: D38825.id121407.diff
Thu, Oct 23, 4:44 AM
F133108741: D38825.id119058.diff
Thu, Oct 23, 1:27 AM
Unknown Object (File)
Wed, Oct 22, 5:19 AM
Unknown Object (File)
Wed, Oct 22, 4:30 AM
Unknown Object (File)
Tue, Oct 21, 11:57 PM
Unknown Object (File)
Mon, Oct 20, 7:46 PM
Unknown Object (File)
Mon, Oct 20, 12:31 PM
Unknown Object (File)
Mon, Oct 20, 12:22 AM

Details

Summary

The following code implements inline tracing for the kinst DTrace
provider.

Briefly explained, what the code does is, for each probe of the form:

kinst::<inline_func>:<entry/return>

libdtrace checks to see if <inline_func> is indeed an inline function,
where in this case it finds all inline copies of it and creates new
probes for each one of them. Otherwise, it converts the probe to an FBT
one so that we don't duplicate FBT's functionality in kinst.

Notes:

Unlike FBT, leaving the probe name empty won't trace both entry and
return. kinst goes into "inline-tracing mode" only when it sees
entry/return in the probe name, otherwise it thinks it's a regular kinst
probe, in which case the dtrace(1) invocation will fail if we try to
trace an inline function with an empty probe name.

Sponsored by: The FreeBSD Foundation
MFC after: 1 week

Diff Detail

Repository
rG FreeBSD src repository
Lint
Lint Skipped
Unit
Tests Skipped
Build Status
Buildable 61600
Build 58484: arc lint + arc unit

Event Timeline

There are a very large number of changes, so older changes are hidden. Show Older Changes

Got rid of deep copying, appending to the main clause is simpler.

Handle multiple dn_pdescs properly, move handling away from
dt_sugar_visit_all().

christos edited the summary of this revision. (Show Details)

Do not always go back one instruction.

Improve example in man page.

Get rid of push %rbp check (depends on D39166).

christos edited the summary of this revision. (Show Details)

Parse all loaded modules and their debug files instead of just kernel.
Mention new bug in summary.

christos edited the summary of this revision. (Show Details)

Fix bugs, surround amd64-specific code in an #ifdef, merge flags into
dtsp_flags, handle dt_sugar_elf_init() and dwarf_elf_init() errors, use
longjmp() for dt_alloc() failures.

Fix per-module parsing. Create an additional FBT probe if a non-inline definition is
found along an inline definition in the same module.

More bug fixes. I always keep finding them after I submit the patch.

Do not exclude inline copies that have DW_AT_subprogram instead of
DW_AT_inlined_subroutine.

Add tests. kinst_test_inline() needs modification so that it can be splitted
into DW_AT_ranges.

Implement dt_sugar_elf_verify_debuglink() to make sure the debug files we're
reading are up to date, and if they're not, skip them. Also stop leaking
elf_info structs in dt_sugar_do_kinst_inline().

Fix accidental dangling brace.

Surround amd64-specific code in an ifdef.

Use dtrace_instr_size() and get rid of machine-dependent code (depends on
D39489).

Fix Makefile conflict, check if dtracetest is already loaded before attempting to load it.

Add libdwarf in Makefile.inc1 (Thanks Mark).

Bail out if kinst doesn't exist (i.e the architecture doesn't support it).

Fix return offset handling.

Fix off->val type causing a build failure.

sys/cddl/dev/dtrace/dtrace_test.c
107

@markj do you have a good idea for this?

domagoj.stolfa_gmail.com added inline comments.
cddl/contrib/opensolaris/lib/libdtrace/common/dt_sugar.c
884

I don't see any other uses of cont, perhaps this can be done without using a goto?

cddl/contrib/opensolaris/lib/libdtrace/common/dt_sugar.c
883–887

I don't see any other uses of cont, perhaps this can be done without using a goto?

I guess it could be rewritten like this.

christos retitled this revision from libdtrace: implement inline function tracing for kinst to libdtrace: Implement inline function tracing for kinst.Jan 8 2025, 2:04 PM
christos edited the summary of this revision. (Show Details)
christos marked an inline comment as done.
  • Remove unncessary goto.
  • Rebase on top of main.

tinderbox build on the cluster done with no errors.

After some testing, this generally works fine for me. As per our chat on IRC, one thing I'd quite like before this lands are two additional tests in dtrace_test, one for a simple tail call and one for tail recursion which is unrolled into a loop (both should be achievable via __attribute__((musttail)), though I'm not 100% clear on how to ensure that you get the desired loop unrolling behaviour for recursion. See https://clang.llvm.org/docs/AttributeReference.html#musttail for more details). Even if they don't necessarily pass, I think it's OK to land this and simply fix the tail calls later as the state of things with this change would still improve over what's currently there.

This revision now requires changes to proceed.Apr 15 2025, 3:08 PM

I will spend some time testing this again within a week. My recollection from last time is that there were some edge cases where probes didn't fire when I expected. One thing I didn't test yet but am a bit suspicious will not work properly is stack unwinding -- did you test that the stack() action prints reasonable output when used within an inline function probe?

There is no documentation for the new functionality; dtrace_kinst.4 needs to be updated.

There is no documented "theory of operation" for this functionality, either in the commit log or in the comments (the latter would be much better). There should be a high-level description of how this actually works, as in, how do we actually find the entry and return instructions. As far as I understand, the algorithm is basically just a generalization of some examples drawn from the kernel. As such, I suspect that over time we'll find examples where it doesn't work, and have to add more complexity to handle them. That will be quite difficult without documentation and comprehensive test cases. That maintenance burden is part of why I'm quite hesitant about adding this feature as it is.

This functionality will be much more useful if we can get inline function arguments somehow, but I don't know how far away that is. What would be involved in adding support for that?

I will spend some time testing this again within a week. My recollection from last time is that there were some edge cases where probes didn't fire when I expected. One thing I didn't test yet but am a bit suspicious will not work properly is stack unwinding -- did you test that the stack() action prints reasonable output when used within an inline function probe?

I will test it in the following days.

There is no documentation for the new functionality; dtrace_kinst.4 needs to be updated.

There is. D40874

There is no documented "theory of operation" for this functionality, either in the commit log or in the comments (the latter would be much better). There should be a high-level description of how this actually works, as in, how do we actually find the entry and return instructions. As far as I understand, the algorithm is basically just a generalization of some examples drawn from the kernel. As such, I suspect that over time we'll find examples where it doesn't work, and have to add more complexity to handle them. That will be quite difficult without documentation and comprehensive test cases. That maintenance burden is part of why I'm quite hesitant about adding this feature as it is.

I've written 2 articles on that:

As well as the AsiaBSDCon 2024 paper which explains gives a description of inline tracing:

Would it make sense to just include those links either in the man page or in the comments?

This functionality will be much more useful if we can get inline function arguments somehow, but I don't know how far away that is. What would be involved in adding support for that?

This is in my TODO list, but I haven't worked on it yet because it seems to be quite involved and we probably need to use more of DWARF metadata. In any case I do not think not having this feature is a reason not to commit inline tracing in its current form. IMHO it's better to have it out there and let people play around with it, than leaving it here until it's perfected.

I will spend some time testing this again within a week. My recollection from last time is that there were some edge cases where probes didn't fire when I expected. One thing I didn't test yet but am a bit suspicious will not work properly is stack unwinding -- did you test that the stack() action prints reasonable output when used within an inline function probe?

I will test it in the following days.

@markj I actually needed that today when I was working on D50488. In the commit message you can see a stack() call from a kinst inline probe.

root@freebsd:~ # dtrace -n 'kinst::chn_abort:entry { stack(); }'
dtrace: description 'kinst::chn_abort:entry ' matched 5 probes
dtrace: buffer size lowered to 1m
CPU     ID                    FUNCTION:NAME
  1  72580                  chn_notify:1192
              sound.ko`0xffffffff8296cab4
              sound.ko`vchan_trigger+0x105
              sound.ko`chn_trigger+0xb4
              sound.ko`chn_flush+0x2a
              sound.ko`dsp_close+0x330
              kernel`devfs_destroy_cdevpriv+0xab
              kernel`devfs_close_f+0x63
              kernel`_fdrop+0x1a
              kernel`closef+0x1e3
              kernel`closefp_impl+0x76
              kernel`amd64_syscall+0x151
              kernel`0xffffffff8103841b

I have been testing this again recently, and it works fine, at least with the tests I ran. I think we should go ahead and get it into the tree, it's been sitting in review for 3 years almost. releng/15.0 is branched so there is no fear of breaking the release, even though this shouldn't break anything in the first place, it's a kinst-specific change and at worst it won't always yield 100% correct results. Once we have it in tree we can work on remaining issues, but I do not see a point in keeping it here anymore.