Code inspection: Use UTF-8 string literal

UTF-8 is one of the most commonly used character encodings, particularly on the internet. However, in .NET, the char and string types use UTF-16 to represent their values. Obtaining the UTF-8 representation of a string therefore takes an additional step, such as calling System.Text.Encoding.UTF8.GetBytes(), which performs the conversion at runtime. To avoid this runtime cost, some developers encode the string in advance and embed the resulting byte array in the source code as follows:

// "HTTP/1.1 "
private static ReadOnlySpan<byte> HttpVersion11Bytes =>
  new byte[] { 0x48, 0x54, 0x54, 0x50, 0x2f, 0x31, 0x2e, 0x31, 0x20 };

C# 11 and later support UTF-8 string literals, which the compiler encodes at compile time. This inspection reports hard-coded byte arrays like the one above and suggests replacing them with the equivalent literal:

// Notice the 'u8' suffix after the string literal
private static ReadOnlySpan<byte> HttpVersion11Bytes => "HTTP/1.1 "u8;

The inspection also detects calls to Encoding.UTF8.GetBytes() with a string literal argument and helps convert them to UTF-8 string literals. This improves readability and removes the runtime encoding step.
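
The equivalence between the runtime call and the literal can be checked with a short program. This is a minimal sketch, assuming C# 11 or later and top-level statements; the variable names are illustrative and do not come from the inspection itself.

```csharp
using System;
using System.Text;

// Runtime conversion: the string literal is stored as UTF-16 and
// encoded to UTF-8 when this call runs, allocating a new array.
byte[] encodedAtRuntime = Encoding.UTF8.GetBytes("HTTP/1.1 ");

// UTF-8 string literal (C# 11 or later): the compiler emits the bytes
// into the assembly, so no encoding happens at runtime.
ReadOnlySpan<byte> encodedAtCompileTime = "HTTP/1.1 "u8;

// Both forms should yield the same nine bytes.
bool sameBytes = encodedAtCompileTime.SequenceEqual(encodedAtRuntime);
Console.WriteLine(sameBytes);                    // True
Console.WriteLine(encodedAtCompileTime.Length);  // 9
```

The literal has type ReadOnlySpan<byte>, so code that needs a byte[] would have to call ToArray() on it, which reintroduces an allocation.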
