Here is Eq 5.6 in the book:

It is stated that "C stands for the number of classes". I think the Nc, which represents the number of cases in the cth class, should be replaced by C. Nc is simply irrelevant here since softmax is calucated per individual.
Please correct me if I am wrong. Thank you.
Patrick Wen