Previously this code simply wasn't compiled since it tests arm64
instead of aarch64. It was also wrong since it tried to use the
broadcast variant of "ic", which can't be executed in EL0.
Use builtin_clear_cache() instead. It clears the virtual address
range to point of unification and invalidates the corresponding range in
the icache.