Corpus tools

On this page, you'll find free online tools I developed and maintain for corpus linguists.

  • Lexical diversity Measurements. Calculate a number of measures of lexical diversity (LD) by simply copy-pasting the text you want to analyse. For each measurement, a reference is given.
  • N-gram generator (with probabilities). Generate n-grams (bi-grams, tr-grams, 4- and 5-grams) for your texts, including measures of probability and strength.
  • 𝛘² (chi-square) & Cramér's V calculator. Perform the chi-square test easily, including a measure of association strength and all steps explained (including things like expected frequency calculations in easy terms (Dutch only).
  • 𝛘² to p calculator. Convert a chi-square value to an exact p value. You only need a chi-square value and degrees of freedom (df).
  • Wordlist generator. Generate a wordlist (frequency list) of a text.
  • Fisher's Exact calculator. Perform the Fisher's Exact test easily (Dutch only).
  • Keyword analysis. Extract keywords from a Dutch or English texts. Keywords are extracted by comparison two large corpora.