Page MenuHomeFreeBSD

Leave HYP mode upon startup
ClosedPublic

Authored by jpa-semihalf.com on Feb 9 2015, 11:52 AM.
Tags
None
Referenced Files
Unknown Object (File)
Sun, Jan 19, 6:25 PM
Unknown Object (File)
Sat, Jan 18, 8:40 AM
Unknown Object (File)
Tue, Jan 14, 9:36 PM
Unknown Object (File)
Tue, Jan 14, 8:59 PM
Unknown Object (File)
Fri, Jan 10, 2:31 AM
Unknown Object (File)
Fri, Jan 10, 2:03 AM
Unknown Object (File)
Mon, Jan 6, 8:21 PM
Unknown Object (File)
Sun, Jan 5, 6:18 PM

Details

Summary

If ARMv7 boots in HYP mode, switch to SVC32.

Submitted by: Wojciech Macek <wma@semihalf.com>
Obtained from: Semihalf

Diff Detail

Lint
Lint Skipped
Unit
Tests Skipped

Event Timeline

zbb retitled this revision from to Leave HYP mode upon startup.
zbb updated this object.
zbb edited the test plan for this revision. (Show Details)
zbb added reviewers: ian, andrew.
zbb added a subscriber: Unknown Object (MLST).

This seems reasonable, but I'm not knowledgeable enough about hyper mode to know if this is really good or not.

sys/arm/arm/locore-v6.S
94

Do we want to note this fact in a variable that can be exported as a sysctl? Here and below?

What about when we're ready to do something with hyper mode? The Zen guys bug us every once in a while to check out their patches because they have Dom0 support for us now (not sure how complete it is), is this just going to make their lives even harder? (I guess if I had ever even glanced at their patches I might know the answer to that.)

sys/arm/arm/locore-v6.S
94

Note what fact? That the loader launched us in hyper mode? I'm not sure what use that would have.

95

ERET loads CPSR from SPSR_hyp, how do we know what mode that is going to return to? It seems like this code should be setting up the value and doing msr spsr_hyp, rN.

zbb edited edge metadata.
sys/arm/arm/locore-v6.S
95

What is the value in spsr at this point? Previous debugging experience tells me it's random bits, at least on some cortex-a processors. I think the right sequence would be to just manipulate the state of cpsr that's already in r0 and save it as spsr.

Except... once that occurred to me, it then popped into my head that we don't need this entire sequence (from #if through #endif), instead we should replace the very first instruction above with:

cpsid ifa, #(PSR_SVC32_MODE)

And the same in mpentry below. That should work on all armv6/v7 no matter what the entry-time processor mode is, right?

Replies from Wojciech Macek (who doesn't have direct write access here)

  1. The write barrier.

Yep, it might be not necessary. I was testing it on the revision which did not have br_prod_tail atomic. I've just removed it and run the test again, but it will take about 24h, cause the reproduction rate is really low.

  1. Read barrier

I wouldn't say that it prevents reordering here. It rather guarantees observability of the valid data. Im my scenario I'm running drdb_dequeue_sc/buf_ring_dequeue_sc few times in the loop. It's inline, so the compiler puts them inside the loop's body. I suspect, that the issue is possible only on cortex-a15, due to its long pipeline and advanced branch predictors. In normal case, where the ring is empty, the dequeue_sc returns NULL. But the core wants to optimize memmory accessess, so there is a chance that it preloads its readbuffers with a data that can be used in further code execution. The cons_head does not change within the function, so there is no reason why the core shouldn't early-load the br->br_ring[cons_head] pointer, which might be outdated. The core just has no knowledge that the data is valid only when prod_tail changes. Nevertheless, without rmb the buffer did not work properly - dequeue function returned some "old" mbuf which left there from the previous pass. This is basically how I found this bug.

sys/arm/arm/locore-v6.S
95

Michal pointed out that cpsid can't be used to leave Hyp mode, only return-from-exception is allowed to transition hyp->other. So nevermind on the cpsid suggestion, but I still think constructing spsr from the hyp mode cpsr is the right thing to do.

You probably additionally should install a stub vector allowing to go back to HYP mode if necessary and deal with the timers. FYI, the corresponding Bitrig change is: https://github.com/Bluerise/bitrig/commit/e1f661110a9a2e38ae0c00c4e394194ee0491a78

From W. Macek:

I don't like to mess with hyp vector here from two reasons. This patch is intended to be simple doing as few changes as it could. On the other hand, some uboots provide an interface for safe hw peripheral acces using smc instruction. I think that would require more talks to find a proper solution, which is not the part of the current fix.

sys/arm/arm/locore-v6.S
76

This will not be pushed

sys/arm/include/asm.h
232 ↗(On Diff #3856)

If clang doesn't support the eret and msr instructions we should file a bug upstream and assign it to Renato Golin.

For binutils we can add them locally if needed as we won't be updating it to a any later versions.

It looks like there is support for eret in llvm http://lists.cs.uiuc.edu/pipermail/llvm-commits/Week-of-Mon-20141201/246665.html. Can you test with this, if it works we can pull it into FreeBSD.

ian edited edge metadata.

This looks good to me.

This revision is now accepted and ready to land.Feb 21 2015, 5:05 PM
In D1810#16, @andrew wrote:

It looks like there is support for eret in llvm http://lists.cs.uiuc.edu/pipermail/llvm-commits/Week-of-Mon-20141201/246665.html. Can you test with this, if it works we can pull it into FreeBSD.

This patch doesn't work (Toolchain will not compile).
I'm posting the patch as it is.

Can you wait for a few days. The change is part of llvm 3.6 which should be imported soon.

We would still need to either stop supporting gcc on armv6 or fix binutils to also handle the instructions.

emaste added inline comments.
sys/arm/include/asm.h
232 ↗(On Diff #3856)

Personally I think the hack is acceptable as long as it is clearly a short-term workaround. But I would like to see a comment explaining why it's done like this and what steps need to be taken before it can be replaced with the proper instruction, ideally with links to patches, PRs etc. for those steps.

Here we need the Clang 3.6 import and either a local patch to gas (or a deprecated binutils)

zbb edited edge metadata.

With comment

This revision now requires review to proceed.Mar 2 2015, 12:42 PM
sys/arm/include/asm.h
234 ↗(On Diff #4064)

This is not quite correct, we are still missing support in binutils so will also fail to build with gcc.

zbb edited edge metadata.

Any response to the newest revision?

We now have clang 3.6 in the tree so you should be able to use the instructions directly.

Do you have any concerns regarding these changes? If so, please, let me know.
There is one issue still not solved - current gcc version probably will not be able to handle that. Do we really care?

This is

Do you have any concerns regarding these changes? If so, please, let me know.
There is one issue still not solved - current gcc version probably will not be able to handle that. Do we really care?

Yes. We still care. While it may not be for long, we still support building arm with gcc in the tree.

Does anyone has objections to this version?

This is a polished version of previous commit:

  • the check for binutils is based on clang variable which is more reliable.
  • code duplication removal
  • ensuring that interrupts are disabled on mode change
sys/arm/arm/locore-v6.S
51

Isn't this a binutils issue too?

74

This is a binutils issue, no? Not a compiler issue?

sys/arm/include/asm.h
246–248 ↗(On Diff #7602)

It would be better to always use these #defines in the code, and have a work around for binutils / clang that doesn't support them instead.

sys/arm/arm/locore-v6.S
51

From what I've learned because of licensing issues there is no way gcc >= 4.9 will be available. The initial version was for people's internal use. Here we drop it. And we use llvm 3.6 and higher by default. The description is of little relevance to the code but gives hints for further works.

74

For clang we have a built-in solution. gcc < 4.9needs workaround, and because of the previous comment, it is not considered as a actual option.

sys/arm/include/asm.h
246–248 ↗(On Diff #7602)

You mean moving the code closer to the macro? Clang supports it natively.

sys/arm/arm/locore-v6.S
51

What does gcc have to do with a .S file? You still haven't adequately answered that question.

74

We've typically used a macro to handle this sort of thing so as to not littler the code with #ifdefs. When support for the old stuff goes away, we expand the macro in the code that uses it. This code, and what I'm objecting to, does the opposite: it puts the #ifdefs inline. Just define the macros differently and use them here.

sys/arm/include/asm.h
246–248 ↗(On Diff #7602)

Yes, I mean exactly that. The clang #define will be simpler.

This is yet another revision of this patch, after out-of-band discussion with Warner, so should be much closer to a final version.

This fell off my todo list to review... Sorry.

This looks good apart from the weird #ifdef nesting you have there.

sys/arm/arm/locore-v6.S
70

You can have ifdefs inside of #defines? I think it needs to be the other way around.

This revision was automatically updated to reflect the committed changes.