Unicode Inspector

Unicode Inspector

Inspect code points, bytes, and character metrics in text.

Input

Unicode Report

About The Unicode Inspector

Unicode Inspector breaks pasted text into code units, code points, UTF-8 bytes, lines, words, and a preview of individual Unicode values. It is useful for debugging emoji, accented characters, hidden whitespace, encoding surprises, and strings copied from mixed sources.

Inspection runs locally in your browser. The report shows the first code points with their U+XXXX values so you can compare what is actually present in the text.

How to Inspect Unicode Online

  1. Paste text that contains symbols, emoji, accents, or suspicious whitespace.
  2. Click Analyze Unicode.
  3. Compare code units, code points, byte count, line count, and word count.
  4. Review the first code points to identify the exact characters in the input.

Understanding the Report

Code units vs code points:
- JavaScript strings use UTF-16 code units.
- Some characters, such as many emoji, use two UTF-16 code units but count as one code point.

UTF-8 bytes:
- The byte count shows how large the text is when encoded as UTF-8.
- Accented characters and emoji usually use more than one byte.

Common Use Cases

  • Debug why two strings that look the same do not compare equal.
  • Inspect emoji and symbols before storing or transmitting text.
  • Check copied content for unexpected characters.
  • Estimate UTF-8 byte length for payloads or database fields.

Quick FAQ

Does pasted text leave my browser?
No. Unicode inspection is designed to run in the browser.

Why does an emoji count as two code units?
Many emoji are outside the basic multilingual plane and use surrogate pairs in UTF-16.

Why are only the first code points listed?
The output may cap long text to keep the page readable and responsive.

What is this useful for?
Use it to spot hidden characters, combining marks, unusual spaces, emoji sequences, and lookalike Unicode characters.