The Statistics View

The statistics view shows you statistics about the currently open document.

[Image: the statistics view (statistics-view.png)]

There are five main groups of statistics:

  • Word Statistics
  • HSK Statistics
  • TOCFL Statistics
  • Character Statistics
  • File Statistics

You can expand or collapse a section by clicking on its header.

You can also copy information from the statistics view to the clipboard by selecting the rows of interest and then copying from the main menu (Edit->Copy) or by using the standard ‘Copy’ keyboard shortcut.

Word Statistics

This section displays the total number of words and the number of unique words in the currently open document. This includes:

  • Total

    The total number of words in the document.

  • Total Known

    The total number of words in the document that you know (based on the currently active vocabulary profile).

  • Total Percent Known

    The percentage of known words in the document compared to the total number of words.

  • Total Unknown

    The total number of unknown words that appear in the document.

  • Total Percent Unknown

    The percentage of unknown words, compared to the total number of words in the document.

  • Unique

    The total number of unique words in the document.

  • Unique Known

    The total number of known unique words.

  • Unique Percent Known

    The percentage of known unique words, compared to the total number of unique words in the document.

  • Unique Unknown

    The number of unknown unique words.

  • Unique Percent Unknown

    The percentage of unknown unique words, compared to the total number of unique words in the document.
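The relationship between these ten figures can be sketched in a few lines of Python. This is only an illustration of the arithmetic, not the application's own code: the function name `word_statistics`, the dictionary keys, and the sample word list and known-word set are all hypothetical, and the text is assumed to have already been segmented into words.

```python
from collections import Counter


def word_statistics(words, known_words):
    """Compute total and unique word counts, split into known and unknown.

    `words` is an already segmented list of words; `known_words` is a set
    standing in for the active vocabulary profile (both are assumptions).
    """
    counts = Counter(words)
    total = sum(counts.values())
    unique = len(counts)
    total_known = sum(n for word, n in counts.items() if word in known_words)
    unique_known = sum(1 for word in counts if word in known_words)

    def percent(part, whole):
        # Avoid dividing by zero for an empty document.
        return 100.0 * part / whole if whole else 0.0

    return {
        "total": total,
        "total_known": total_known,
        "total_percent_known": percent(total_known, total),
        "total_unknown": total - total_known,
        "total_percent_unknown": percent(total - total_known, total),
        "unique": unique,
        "unique_known": unique_known,
        "unique_percent_known": percent(unique_known, unique),
        "unique_unknown": unique - unique_known,
        "unique_percent_unknown": percent(unique - unique_known, unique),
    }


# Hypothetical sample: seven words in total, five of them distinct.
words = ["我", "喜欢", "学习", "中文", "我", "喜欢", "茶"]
known = {"我", "喜欢", "中文"}
stats = word_statistics(words, known)
```

In this sample, a repeated word such as 我 counts twice towards the total figures but only once towards the unique figures, which is why the two known percentages can differ.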

HSK Statistics

For each HSK level (1-6), this section shows the percentage of the document's total words and the percentage of its unique words that belong to that level. It also shows the percentage of total and unique words that are not defined in the HSK vocabulary lists.

Levels 2-6 also include the cumulative percentage of all preceding levels, displayed in brackets after the raw percentage for that level.
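As a rough illustration of how a raw and a cumulative percentage relate, the sketch below computes both for each level. It is not the application's implementation. The function `level_percentages`, the placeholder words, and their level assignments are invented for the example, and it assumes that the cumulative figure for a level is the sum of the raw percentages for that level and all levels before it.

```python
def level_percentages(words, word_levels, max_level):
    """Return per-level (level, raw %, cumulative %) rows and the unlisted %.

    `word_levels` maps a word to its level; words missing from the map are
    counted as not defined in the vocabulary lists.
    """
    total = len(words)
    per_level = {level: 0 for level in range(1, max_level + 1)}
    not_listed = 0
    for word in words:
        level = word_levels.get(word)
        if level in per_level:
            per_level[level] += 1
        else:
            not_listed += 1

    def percent(part):
        return 100.0 * part / total if total else 0.0

    rows = []
    running = 0
    for level in range(1, max_level + 1):
        running += per_level[level]
        # Assumption: cumulative = this level plus all preceding levels.
        rows.append((level, percent(per_level[level]), percent(running)))
    return rows, percent(not_listed)


# Placeholder words and made-up level assignments, not real HSK data.
words = ["a", "b", "a", "c", "d", "e", "a", "b", "f", "z"]
levels = {"a": 1, "b": 1, "c": 2, "d": 3, "e": 3, "f": 6}

total_rows, total_unlisted = level_percentages(words, levels, max_level=6)
unique_rows, unique_unlisted = level_percentages(sorted(set(words)), levels, max_level=6)
```

Passing the full word list gives the figures for total words, and passing the deduplicated list gives the figures for unique words. The same calculation would apply to a five-level list by setting `max_level=5`.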

TOCFL Statistics

For each TOCFL level (1-5), this section shows the percentage of the document's total words and the percentage of its unique words that belong to that level. It also shows the percentage of total and unique words that are not defined in the TOCFL vocabulary lists.

Levels 2-5 also include the cumulative percentage of all preceding levels, displayed in brackets after the raw percentage for that level.

Character Statistics

This section shows the total number of Chinese characters in the entire document, as well as the number of unique Chinese characters.

These counts do not include punctuation or whitespace.
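One way to picture this count is sketched below. It is a simplified illustration rather than the application's actual rule: the helper names are hypothetical, and "Chinese character" is assumed here to mean a code point in a few of the main CJK ideograph blocks, which may not match how the application classifies characters.

```python
# Assumed definition of a Chinese character: selected Unicode ideograph blocks.
HAN_RANGES = [
    (0x3400, 0x4DBF),    # CJK Unified Ideographs Extension A
    (0x4E00, 0x9FFF),    # CJK Unified Ideographs
    (0x20000, 0x2A6DF),  # CJK Unified Ideographs Extension B
]


def is_han(ch):
    """Return True if the character falls in one of the assumed ranges."""
    code_point = ord(ch)
    return any(low <= code_point <= high for low, high in HAN_RANGES)


def character_statistics(text):
    """Return (total, unique) counts of Chinese characters in the text."""
    han = [ch for ch in text if is_han(ch)]
    return len(han), len(set(han))


# The full-width comma, full stop, and newline are skipped by is_han.
total_chars, unique_chars = character_statistics("我喜欢茶，我也喜欢咖啡。\n")
```

Because punctuation and whitespace fall outside the assumed ranges, they are excluded from both counts without any special handling.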

File Statistics

This section shows general information about the currently open text document, including:

  • The total number of bytes in the text file.
  • The total number of Unicode codepoints in the file.
  • The total number of lines in the file.
  • The total time it took to segment and analyse the entire file.
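The first three figures differ because one Chinese character usually occupies several bytes but is a single codepoint. The sketch below shows the distinction under some assumptions: the file is taken to be UTF-8 encoded, a line is taken to end at a newline character, and the function name `file_statistics` is hypothetical.

```python
def file_statistics(data, encoding="utf-8"):
    """Return byte, codepoint, and line counts for raw file contents."""
    text = data.decode(encoding)
    # Assumption: lines end at "\n"; a final line without one still counts.
    lines = text.count("\n")
    if text and not text.endswith("\n"):
        lines += 1
    return {
        "bytes": len(data),        # size of the encoded file
        "codepoints": len(text),   # a Python str is a sequence of codepoints
        "lines": lines,
    }


# Five 3-byte characters, a newline, three ASCII letters, a newline.
data = "我喜欢茶。\nabc\n".encode("utf-8")
file_stats = file_statistics(data)
```

For this sample the byte count is twice the codepoint count, since each of the five Chinese characters (including the full stop) takes three bytes in UTF-8.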