The standard library builds the HKDF label in hkdfExpandLabel with cryptobyte.Builder, which is comparatively slow. Replacing it with an optimized implementation speeds up the initialization of the AEAD used for the Initial encryption level by ~15%.
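For illustration, here is a minimal sketch of what a Builder-free HKDF-Expand-Label could look like. It is not the code from this change: the function names, the argument order, and the fixed choice of SHA-256 are assumptions made for the example, and HKDF-Expand is written out by hand only so that the sketch depends on nothing beyond the standard library. The point it shows is that the HkdfLabel structure from RFC 8446 (a 2-byte length, a length-prefixed "tls13 " label, and a length-prefixed context) is simple enough to append into a single pre-sized slice.

```go
package main

import (
	"crypto/hmac"
	"crypto/sha256"
	"fmt"
)

// hkdfExpand is a plain HKDF-Expand (RFC 5869) over SHA-256.
// Assumption: length is small enough that the one-byte counter
// does not overflow (at most 255 * 32 bytes); this is not checked.
func hkdfExpand(prk, info []byte, length int) []byte {
	mac := hmac.New(sha256.New, prk)
	out := make([]byte, 0, length+sha256.Size)
	var prev []byte
	for counter := byte(1); len(out) < length; counter++ {
		// T(i) = HMAC(PRK, T(i-1) | info | i)
		mac.Reset()
		mac.Write(prev)
		mac.Write(info)
		mac.Write([]byte{counter})
		prev = mac.Sum(nil)
		out = append(out, prev...)
	}
	return out[:length]
}

// buildHKDFLabel (hypothetical helper) encodes the RFC 8446 HkdfLabel
// structure with plain appends into one exactly sized slice.
// Assumption: label and context are short enough for their
// one-byte length prefixes; this is not checked.
func buildHKDFLabel(label string, context []byte, length int) []byte {
	const prefix = "tls13 "
	b := make([]byte, 0, 2+1+len(prefix)+len(label)+1+len(context))
	b = append(b, byte(length>>8), byte(length)) // uint16 output length
	b = append(b, byte(len(prefix)+len(label)))  // label length prefix
	b = append(b, prefix...)
	b = append(b, label...)
	b = append(b, byte(len(context))) // context length prefix
	b = append(b, context...)
	return b
}

// hkdfExpandLabel (hypothetical signature) derives length bytes from
// secret for the given label and context.
func hkdfExpandLabel(secret, context []byte, label string, length int) []byte {
	return hkdfExpand(secret, buildHKDFLabel(label, context, length), length)
}

func main() {
	// All-zero placeholder secret, for illustration only.
	secret := make([]byte, sha256.Size)
	key := hkdfExpandLabel(secret, nil, "quic key", 16)
	fmt.Printf("label: %x\n", buildHKDFLabel("quic key", nil, 16))
	fmt.Printf("derived %d bytes\n", len(key))
}
```

Because the total size of the label is known up front, the slice is allocated once and never grows, which is one plausible way to avoid the overhead of a general-purpose builder.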