This remove the extra overhead when defining KLD_MODULE. The
performance gains from inline atomic(9) and mutex(9) functions.
However the modules built with kernel is not supposed to be used
with other kernel. In practical, this is unlikely happen.
The behavior of building a module standalone is not changed. This
maintain the compactibility for the out-of-tree modules as before.