Performance-portable, length-agnostic SIMD with runtime dispatch
Fix asan/msan for older clang, finish f16 conversions, fix warnings.