n-grams-and-association-measures

 

The current issue of IRW should reach subscribers inboxes during the day.

This is Part Two of the series on statistical analysis of n-grams. This is a text mining analysis technique widely used in information retrieval and data mining in general. In this issue we cover the implementation of association measures derived from contingency tables.

The QA section explains how to conduct a Chi Square Test for tables with many items; i.e., beyond the usual 2 x 2 contingency tables.

Enjoy it.

Advertisements