
tests: convert forwarding tests to python
Needs Review · Public

Authored by melifaro on Apr 6 2023, 10:16 AM.

Details

Reviewers
markj
kp
Group Reviewers
network

Diff Detail

Repository
rG FreeBSD src repository
Lint
Lint Errors
Unit
No Test Coverage
Build Status
Buildable 50784
Build 47675: arc lint + arc unit

Event Timeline

If this is just a PoC for the discussion in D39420, please ignore. Otherwise, I believe this will need an ObsoleteFiles.inc entry?
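
(For reference, entries of this kind pair a dated comment with the removed installed paths; the date and path below are illustrative assumptions only, not the actual entry:)

# 20230406: IPv6 forwarding tests converted to python
OLD_FILES+=usr/tests/sys/netinet6/forward6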
The only drawback I can find to python tests is that they run a bit slower (for now [1]), which is allegedly compensated by reduced development time :-))

# kyua test sys/netinet6/forward6
sys/netinet6/forward6:fwd_ip6_gu_icmp_gw_gu_fast_success  ->  passed  [5.536s]
sys/netinet6/forward6:fwd_ip6_gu_icmp_gw_gu_slow_success  ->  passed  [5.296s]
sys/netinet6/forward6:fwd_ip6_gu_icmp_gw_ll_fast_success  ->  passed  [5.493s]
sys/netinet6/forward6:fwd_ip6_gu_icmp_gw_ll_slow_success  ->  passed  [5.587s]
sys/netinet6/forward6:fwd_ip6_gu_icmp_iface_fast_success  ->  passed  [5.056s]
sys/netinet6/forward6:fwd_ip6_gu_icmp_iface_slow_success  ->  passed  [4.729s]
# kyua test sys/netinet6/test_ip6_forward.py
sys/netinet6/test_ip6_forward.py:TestIP6Forward::test_success[gu-fast]  ->  passed  [7.736s]
sys/netinet6/test_ip6_forward.py:TestIP6Forward::test_success[gu-slow]  ->  passed  [7.825s]
sys/netinet6/test_ip6_forward.py:TestIP6Forward::test_success[if-fast]  ->  passed  [7.847s]
sys/netinet6/test_ip6_forward.py:TestIP6Forward::test_success[if-slow]  ->  passed  [7.861s]
sys/netinet6/test_ip6_forward.py:TestIP6Forward::test_success[ll-fast]  ->  passed  [7.927s]
sys/netinet6/test_ip6_forward.py:TestIP6Forward::test_success[ll-slow]  ->  passed  [7.841s]

[1]: I'm investigating setup and teardown.

Anything you can do to improve python test case runtimes would be appreciated. I run the regression tests in qemu quite frequently (partially because that can be done without any special system privileges) and python tests are very slow there. Parallelization helps somewhat.

tests/sys/netinet6/test_ip6_forward.py
line 15

To me this needs a comment explaining the high-level idea behind the test. It is not very easy to tell what this is actually doing.

line 136

This is a bit magical. Why not send the command you want the handler to execute, instead of having some apparently hard-coded list of messages? Or have some dictionary mapping message names to actions in the handler, so that it's a bit easier to see what's going on.
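
(Sketching the dictionary-dispatch idea in Python; all message names and handler bodies below are made up for illustration, not the actual test's API:)

def start_sniffer():
    print("sniffer started")

def send_reply():
    print("reply sent")

# Map message names to actions, so the control flow is explicit.
ACTIONS = {
    "start_sniffer": start_sniffer,
    "send_reply": send_reply,
}

def handle_message(name):
    try:
        action = ACTIONS[name]
    except KeyError:
        raise ValueError("unknown message: {}".format(name))
    action()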

This is neat. It took me some time to understand what's going on, but certainly this framework is better than atf-sh for writing complex or heavily parameterized tests.

I'm currently trying to write a carp test based on this example, and ... well, I can sort of see it making sense when I need scapy to parse packets (which is what I want to do in the test I'm writing), but there's a lot of magic here, and it's taken me quite a bit of time already just to get the setup going (a vnet jail with one epair interface with carp configured on it), and it still doesn't do things that are trivial in atf-sh (How do I skip the test if carp.ko is not loaded?).

In D39445#897963, @kp wrote:

This is neat. It took me some time to understand what's going on, but certainly this framework is better than atf-sh for writing complex or heavily parameterized tests.

I'm currently trying to write a carp test based on this example, and ... well, I can sort of see it making sense when I need scapy to parse packets (which is what I want to do in the test I'm writing), but there's a lot of magic here, and it's taken me quite a bit of time already just to get the setup going (a vnet jail with one epair interface with carp configured on it), and it still doesn't do things that are trivial in atf-sh (How do I skip the test if carp.ko is not loaded?).

Having more or less finished that test (D39454), I think my view remains that python is great for tests that do things like use scapy to parse packets, but I don't think they'd be suitable for tests such as pfsync:bulk (and indeed the majority of pf tests) where 99% of the test is setup.
There's a fair criticism of the atf-sh tests that there's a lot of boilerplate, but I find it makes the test more obvious about what's going on. That is, I can read the test as if it were a very simple shell script. Nothing is hidden, and it's clear what setup is being created.

In D39445#898034, @kp wrote:

Having more or less finished that test (D39454) I think my view remains that python is great for tests that do things like use scapy to parse packets, but I don't think they'd be suitable for tests such as pfsync:bulk (and indeed the majority of pf tests) where 99% of the test is setup.
There's a fair criticism of the atf-sh tests that there's a lot of boilerplate, but I find it makes the test more obvious about what's going on. That is, I can read the test as if it were a very simple shell script. Nothing is hidden, and it's clear what setup is being created.

I believe that in some cases, complex templates can be created, for example:

  outside
     |
 +---+---+
fw1 --- fw2
 +---+---+
     |
   inside

can become a class named SimplePFSync that you inherit in your tests. As a newcomer, you would first need to understand what SimplePFSync does and returns, and it will likely live in some common file elsewhere (whereas, as I understand it, you prefer to see the setup explicitly in the test), but it should allow you to focus on what is being tested. It is a matter of personal preference, of course.
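
(A purely hypothetical sketch of that idea; SimplePFSync and every name in it are made up for illustration, not classes that exist in the tree:)

class SimplePFSync:
    """Template building the outside -- fw1/fw2 -- inside topology."""

    def setup_method(self, method):
        # A real template would create the vnets, epair interfaces and
        # pf/pfsync configuration for the two-firewall topology above.
        self.fw1 = {"name": "fw1", "pfsync": True}
        self.fw2 = {"name": "fw2", "pfsync": True}

class TestBulkUpdate(SimplePFSync):
    def test_bulk(self):
        # The test body focuses on behaviour; topology comes from the base.
        assert self.fw1["pfsync"] and self.fw2["pfsync"]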

In D39445#897963, @kp wrote:

I'm currently trying to write a carp test based on this example, and ... well, I can sort of see it making sense when I need scapy to parse packets (which is what I want to do in the test I'm writing), but there's a lot of magic here, and it's taken me quite a bit of time already just to get the setup going (a vnet jail with one epair interface with carp configured on it), and it still doesn't do things that are trivial in atf-sh (How do I skip the test if carp.ko is not loaded?).

You can use the REQUIRED_MODULES list in the test class. Please see an example here: https://github.com/freebsd/freebsd-src/blob/main/tests/examples/test_examples.py
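
(A minimal sketch modeled on that example; the base class choice and the test body are assumptions for illustration, REQUIRED_MODULES is the relevant part:)

from atf_python.sys.net.vnet import SingleVnetTestTemplate

class TestCarpSkeleton(SingleVnetTestTemplate):
    # The framework checks this list during setup and skips the test
    # if carp.ko is not loaded.
    REQUIRED_MODULES = ["carp"]

    def test_placeholder(self):
        pass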

This is neat. It took me some time to understand what's going on, but certainly this framework is better than atf-sh for writing complex or heavily parameterized tests.

Do you have any suggestions on the approach to reduce the "understanding" phase? There are some framework examples in tests/examples, but apparently that's not enough. Do you think that, for example, atf-python(7) may help in that regard?

Interesting, the ~7.8s timings above look quite high. On my VM:

19:43 [1] m@devel2 s kyua test -k /usr/tests/sys/netinet6/Kyuafile test_ip6_forward.py
test_ip6_forward.py:TestIP6Forward::test_success[gu-fast]  ->  passed  [1.658s]
test_ip6_forward.py:TestIP6Forward::test_success[gu-slow]  ->  passed  [1.599s]
test_ip6_forward.py:TestIP6Forward::test_success[if-fast]  ->  passed  [1.571s]
test_ip6_forward.py:TestIP6Forward::test_success[if-slow]  ->  passed  [1.688s]
test_ip6_forward.py:TestIP6Forward::test_success[ll-fast]  ->  passed  [1.744s]
test_ip6_forward.py:TestIP6Forward::test_success[ll-slow]  ->  passed  [1.685s]

Something can be improved here (e.g. avoiding IPv6 tentative-address delays by default), which should bring this down by 300-400ms more or less easily.
I guess the biggest remaining cost is scapy initialisation (not sure if we actually need scapy.all).
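
(One possible direction, untested: import only the layers a test needs instead of scapy.all, which loads every protocol module at import time. For a forwarding test that might be roughly:)

# Narrow imports instead of "from scapy.all import *":
from scapy.layers.inet6 import IPv6, ICMPv6EchoRequest
from scapy.sendrecv import sendp, sniff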


In D39445#898034, @kp wrote:

Having more or less finished that test (D39454) I think my view remains that python is great for tests that do things like use scapy to parse packets, but I don't think they'd be suitable for tests such as pfsync:bulk (and indeed the majority of pf tests) where 99% of the test is setup.

Thanks for trying it out, I really appreciate it. pfsync:bulk is indeed mostly test-specific setup/business logic, and using python for just one such test won't bring much benefit. Some cumulative benefit may appear if one writes helpers for the most often used pf operations, but that's a matter of personal preference.

There's a fair criticism of the atf-sh tests that there's a lot of boilerplate, but I find it makes the test more obvious about what's going on. That is, I can read the test as if it were a very simple shell script. Nothing is hidden, and it's clear what setup is being created.

Yep, "what's going on" is a good argument. In the end, I guess we all want to be able to easily write (and debug) the tests if/when something goes wrong. I'd also love to hear what you think should be improved in the python part - I'd really prefer to make it more user-friendly.

tests/sys/netinet6/test_ip6_forward.py
line 15

Absolutely. Sorry, I didn't have enough time to fully finish it, so I pushed the code to the review as is.
I totally agree that it should have a description of what the test/test group is about.

line 136

Originally I thought of IPC between the vnets to serve 2 purposes:

  1. information exchange (request/reply or just push models).
  2. synchronisation - letting the vnet signal that it is ready to process a certain event, or that it is fine to proceed to the next code stage.

I copied the flow logic from some other test and it's indeed a bit over-complicated.

I'll probably try adding some pre-defined command handlers and keep the ability to have custom commands.
That'll reduce the _amount_ of code here, but may make it look even more magical :-) Anyway, I'll try it out, thank you for the suggestion!
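
(To make the two purposes above concrete, a generic sketch using a plain multiprocessing pipe; the real framework plumbing differs, and the message strings are invented:)

from multiprocessing import Pipe, Process

def vnet_handler(conn):
    conn.send("ready")             # 2: signal that vnet setup is done
    request = conn.recv()          # 1: request/reply information exchange
    conn.send("handled: " + request)

if __name__ == "__main__":
    parent, child = Pipe()
    proc = Process(target=vnet_handler, args=(child,))
    proc.start()
    assert parent.recv() == "ready"    # block until the vnet side is ready
    parent.send("start_sniffer")
    print(parent.recv())
    proc.join()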

Yep, "what's going on" is a good argument. In the end I guess we all want to be able to easily write (and debug) the tests if/when something goes wrong. I'd also love to hear on what you think should be improved in the python part - I'd really prefer to make it more user-friendly.

Which class methods get called when is a bit non-obvious. Perhaps it'd be easier to follow if that was driven by annotations rather than function names. So something like

@atf_py.test_case
def test_success(self):
    pass

@atf_py.setup_jail("vnet1")
def make_router(self, vnet):
    pass

I also find the way to access interfaces a bit clunky. self.vnet.iface_alias_map["if1"] doesn't quite roll off the tongue, and "if1" confused me initially because I'm used to thinking in terms of epairs. It's very magical, but it should be possible to teach the vnet object to just give me if1 if I do self.vnet.if1 or even self.if1, for arbitrary names (https://stackoverflow.com/questions/31177629/can-i-create-an-object-that-receives-arbitrary-method-invocation-in-python).
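
(Along the lines of that stackoverflow link, __getattr__ can resolve arbitrary alias names; a toy sketch, not the actual vnet object:)

class Vnet:
    def __init__(self):
        self.iface_alias_map = {"if1": "epair0a", "if2": "epair0b"}

    def __getattr__(self, name):
        # Only called when normal attribute lookup fails, so
        # iface_alias_map itself is still found the usual way.
        try:
            return self.iface_alias_map[name]
        except KeyError:
            raise AttributeError(name)

vnet = Vnet()
print(vnet.if1)  # -> epair0a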

I should also be clear that using Python for vnet tests absolutely has its place for certain types of tests. The carp example is clearly better expressed in Python than it would have been in shell, because I'd either have had to write a separate python script to parse the packet, or resort to parsing tcpdump output with grep.